Cayley transform

Last updated June 10, 2023

In mathematics, the Cayley transform, named after Arthur Cayley, is any of several related mappings. As originally described by Cayley (1846), the Cayley transform is a mapping between skew-symmetric matrices and special orthogonal matrices. The transform is a homography used in real analysis, complex analysis, and quaternionic analysis. In the theory of Hilbert spaces, the Cayley transform is a mapping between linear operators (Nikolski 1988).

Real homography

A simple example of a Cayley transform can be done on the real projective line. The Cayley transform here will permute the elements of {1, 0, −1, ∞} in sequence. For example, it maps the positive real numbers to the open interval (−1, 1), with the endpoints 0 and ∞ going to −1 and 1. Thus the Cayley transform is used to adapt Legendre polynomials to functions on the positive real numbers, which yields the Legendre rational functions.

As a real homography, points are described with projective coordinates, and the mapping is

[y,\ 1]=\left[{\frac {x-1}{x+1}},\ 1\right]\thicksim [x-1,\ x+1]=[x,\ 1]{\begin{pmatrix}1&1\\-1&1\end{pmatrix}}.
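
The permutation of {1, 0, −1, ∞} can be checked directly in projective coordinates. The following is a minimal sketch using exact rational arithmetic; the names `apply_homography` and `to_number` are illustrative helpers, not part of any library.

```python
from fractions import Fraction

# A point of the real projective line as a homogeneous pair [a, b];
# [a, b] with b != 0 stands for a/b, and [a, 0] stands for infinity.
CAYLEY = ((1, 1), (-1, 1))  # the matrix from the formula above

def apply_homography(point, matrix=CAYLEY):
    """Right-multiply the row vector [a, b] by a 2x2 matrix."""
    a, b = point
    (m11, m12), (m21, m22) = matrix
    return (a * m11 + b * m21, a * m12 + b * m22)

def to_number(point):
    """Convert [a, b] to a/b, or to the string 'inf' when b == 0."""
    a, b = point
    return "inf" if b == 0 else Fraction(a, b)

# Follow the orbit of the point 1 under repeated application.
orbit = []
p = (1, 1)
for _ in range(4):
    orbit.append(to_number(p))
    p = apply_homography(p)

print(" -> ".join(str(v) for v in orbit))  # 1 -> 0 -> -1 -> inf
```

After four applications the point returns to 1, so the transform has order 4 on this set.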

Complex homography

On the Riemann sphere, the Cayley transform is:^[1]^[2]

f(z)={\frac {z-i}{z+i}}.

Since $\{\infty ,1,-1\}$ is mapped to $\{1,-i,i\}$ , and Möbius transformations permute the generalised circles in the complex plane, $f$ maps the real line to the unit circle. Furthermore, since $f$ is continuous and $i$ is taken to 0 by $f$ , the upper half-plane is mapped to the unit disk.
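
A quick numerical check of these claims, as a sketch with Python's built-in complex numbers (the sample points are arbitrary choices made for illustration):

```python
import math

def cayley(z):
    """f(z) = (z - i) / (z + i); undefined at z = -i."""
    return (z - 1j) / (z + 1j)

# Real inputs should land on the unit circle.
real_samples = [-3.0, -1.0, 0.0, 1.0, 2.5]
moduli = [abs(cayley(x)) for x in real_samples]

# Points with positive imaginary part should land inside the unit disk.
upper_samples = [1j, 2 + 0.5j, -4 + 3j]
inside = [abs(cayley(z)) < 1 for z in upper_samples]

print(all(math.isclose(m, 1.0) for m in moduli))  # True
print(all(inside))                                # True
```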

In terms of the models of hyperbolic geometry, this Cayley transform relates the Poincaré half-plane model to the Poincaré disk model. In electrical engineering the Cayley transform has been used to map a reactance half-plane to the Smith chart used for impedance matching of transmission lines.

Quaternion homography

In the four-dimensional space of quaternions $a+b{\vec {i}}+c{\vec {j}}+d{\vec {k}}$ , the versors

u(\theta ,r)=\cos \theta +r\sin \theta

form the unit 3-sphere, where $r$ ranges over the unit vector quaternions (so that $r^{2}=-1$).

Since quaternions are non-commutative, elements of its projective line have homogeneous coordinates written $U(a,b)$ to indicate that the homogeneous factor multiplies on the left. The quaternion transform is

f(u,q)=U[q,1]{\begin{pmatrix}1&1\\-u&u\end{pmatrix}}=U[q-u,\ q+u]\sim U[(q+u)^{-1}(q-u),\ 1].

The real and complex homographies described above are instances of the quaternion homography where $\theta$ is zero or $\pi /2$ , respectively. Evidently the transform takes $u\to 0\to -1$ and takes $-u\to \infty \to 1$ .

Evaluating this homography at $q=1$ maps the versor $u$ into its axis:

f(u,1)=(1+u)^{-1}(1-u)=(1+u)^{*}(1-u)/|1+u|^{2}.

But $|1+u|^{2}=(1+u)(1+u^{*})=2+2\cos \theta ,\quad {\text{and}}\quad (1+u^{*})(1-u)=-2r\sin \theta .$

Thus $f(u,1)=-r{\frac {\sin \theta }{1+\cos \theta }}=-r\tan {\frac {\theta }{2}}.$
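
The closed form can be compared against a direct quaternion computation. This is a minimal sketch: quaternions are stored as `(w, x, y, z)` tuples, and `qmul`, `qinv`, `versor` and `f_at_one` are illustrative helper names.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions stored as (w, x, y, z) tuples."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (
        pw * qw - px * qx - py * qy - pz * qz,
        pw * qx + px * qw + py * qz - pz * qy,
        pw * qy - px * qz + py * qw + pz * qx,
        pw * qz + px * qy - py * qx + pz * qw,
    )

def qinv(q):
    """Inverse q* / |q|^2 of a nonzero quaternion."""
    w, x, y, z = q
    n2 = w * w + x * x + y * y + z * z
    return (w / n2, -x / n2, -y / n2, -z / n2)

def versor(theta, r):
    """u(theta, r) = cos(theta) + r sin(theta) for a unit 3-vector r."""
    s = math.sin(theta)
    return (math.cos(theta), r[0] * s, r[1] * s, r[2] * s)

def f_at_one(u):
    """f(u, 1) = (1 + u)^{-1} (1 - u)."""
    one_plus = (1 + u[0], u[1], u[2], u[3])
    one_minus = (1 - u[0], -u[1], -u[2], -u[3])
    return qmul(qinv(one_plus), one_minus)

theta = 0.7
r = (2 / 3, -1 / 3, 2 / 3)  # unit vector: 4/9 + 1/9 + 4/9 = 1
result = f_at_one(versor(theta, r))
expected = (0.0,) + tuple(-c * math.tan(theta / 2) for c in r)
```

Here `result` and `expected` should agree up to floating-point rounding, with a vanishing real part.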

In this form the Cayley transform has been described as a rational parametrization of rotation: Let $t=\tan(\varphi /2)$ in the complex number identity^[3]

e^{-i\varphi }={\frac {1-ti}{1+ti}}

where the right hand side is the transform of $ti$ and the left hand side represents the rotation of the plane by negative $\varphi$ radians.
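
Multiplying numerator and denominator by $1-ti$ gives real part $(1-t^{2})/(1+t^{2})$ and imaginary part $-2t/(1+t^{2})$, so a rational $t$ yields a rational point on the unit circle. A small sketch of this, with `rotation_point` as an illustrative name:

```python
from fractions import Fraction

def rotation_point(t):
    """Real and imaginary parts of (1 - ti)/(1 + ti), computed exactly."""
    t = Fraction(t)
    denom = 1 + t * t
    return ((1 - t * t) / denom, -2 * t / denom)

c, s = rotation_point(Fraction(1, 2))
print(c, s, c * c + s * s)  # 3/5 -4/5 1
```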

Inverse

Let $u^{*}=\cos \theta -r\sin \theta =u^{-1}.$ Since

{\begin{pmatrix}1&1\\-u&u\end{pmatrix}}\ {\begin{pmatrix}1&-u^{*}\\1&u^{*}\end{pmatrix}}\ =\ {\begin{pmatrix}2&0\\0&2\end{pmatrix}}\ \sim \ {\begin{pmatrix}1&0\\0&1\end{pmatrix}}\ ,

where the equivalence is in the projective linear group over quaternions, the inverse of $f(u,1)$ is

U[p,1]{\begin{pmatrix}1&-u^{*}\\1&u^{*}\end{pmatrix}}\ =\ U[p+1,\ (1-p)u^{*}]\sim U[u(1-p)^{-1}(p+1),\ 1].

Since homographies are bijections, $f^{-1}(u,1)$ maps the vector quaternions to the 3-sphere of versors. As versors represent rotations in 3-space, the homography $f^{-1}$ produces rotations from the ball in $\mathbb {R} ^{3}$ .
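
The inverse formula can be tested numerically. The sketch below checks two things: that $q\mapsto u(1-p)^{-1}(p+1)$ undoes $q\mapsto f(u,q)$ for a fixed versor $u$, and that a vector quaternion $p$ is sent to a versor. For the second check it uses $u=(1-p)(1+p)^{-1}$, which is obtained here by solving $p=(1+u)^{-1}(1-u)$ for $u$; that rearrangement and all helper names are assumptions of this example rather than formulas quoted from the text.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions stored as (w, x, y, z) tuples."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (
        pw * qw - px * qx - py * qy - pz * qz,
        pw * qx + px * qw + py * qz - pz * qy,
        pw * qy - px * qz + py * qw + pz * qx,
        pw * qz + px * qy - py * qx + pz * qw,
    )

def qinv(q):
    """Inverse q* / |q|^2 of a nonzero quaternion."""
    w, x, y, z = q
    n2 = w * w + x * x + y * y + z * z
    return (w / n2, -x / n2, -y / n2, -z / n2)

def qadd(p, q):
    return tuple(a + b for a, b in zip(p, q))

def qsub(p, q):
    return tuple(a - b for a, b in zip(p, q))

ONE = (1.0, 0.0, 0.0, 0.0)

def forward(u, q):
    """f(u, q) = (q + u)^{-1} (q - u)."""
    return qmul(qinv(qadd(q, u)), qsub(q, u))

def backward(u, p):
    """u (1 - p)^{-1} (p + 1), the inverse of q -> f(u, q)."""
    return qmul(qmul(u, qinv(qsub(ONE, p))), qadd(p, ONE))

def versor_from_vector(p):
    """u = (1 - p)(1 + p)^{-1}, solving p = (1 + u)^{-1}(1 - u) for u."""
    return qmul(qsub(ONE, p), qinv(qadd(ONE, p)))

u = (0.6, 0.0, 0.8, 0.0)   # a versor: 0.36 + 0.64 = 1
q = (0.3, -1.2, 0.5, 2.0)  # an arbitrary test quaternion
round_trip = backward(u, forward(u, q))

p = (0.0, 0.5, -2.0, 1.5)  # a vector quaternion (zero real part)
v = versor_from_vector(p)
norm = math.sqrt(sum(c * c for c in v))
```

Up to rounding, `round_trip` reproduces `q`, `norm` equals 1, and `forward(v, ONE)` returns `p`.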

Matrix map

Among n×n square matrices over the reals, with I the identity matrix, let A be any skew-symmetric matrix (so that A^T = −A).

Then I + A is invertible, and the Cayley transform

Q=(I-A)(I+A)^{-1}\,\!

produces an orthogonal matrix, Q (so that Q^TQ = I). The matrix multiplication in the definition of Q above is commutative, so Q can be alternatively defined as $Q=(I+A)^{-1}(I-A)$ . In fact, Q must have determinant +1, so is special orthogonal.
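
Because the map is rational, it can be evaluated exactly. The following sketch computes Q for a sample 3×3 skew-symmetric matrix with `fractions.Fraction`; the Gauss-Jordan routine `inverse` and the other helper names are written for this illustration only.

```python
from fractions import Fraction

def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(row) for row in zip(*X)]

def inverse(X):
    """Gauss-Jordan inverse over the rationals; ValueError if singular."""
    n = len(X)
    aug = [list(map(Fraction, row)) + identity(n)[i]
           for i, row in enumerate(X)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def cayley(A):
    """Q = (I - A)(I + A)^{-1} for a square matrix A."""
    n = len(A)
    I = identity(n)
    minus = [[I[i][j] - A[i][j] for j in range(n)] for i in range(n)]
    plus = [[I[i][j] + A[i][j] for j in range(n)] for i in range(n)]
    return matmul(minus, inverse(plus))

A = [[0, 1, -2], [-1, 0, 3], [2, -3, 0]]  # skew-symmetric: A^T = -A
Q = cayley(A)
print(matmul(transpose(Q), Q) == identity(3))  # True
```

Exact arithmetic means the orthogonality check is an equality rather than an approximate comparison.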

Conversely, let Q be any orthogonal matrix which does not have −1 as an eigenvalue; then

A=(I-Q)(I+Q)^{-1}\,\!

is a skew-symmetric matrix. (See also: Involution.) The condition on Q automatically excludes matrices with determinant −1, but also excludes certain special orthogonal matrices.
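
Since the same formula is used in both directions, applying it twice should return the starting matrix. A 2×2 sketch with exact fractions illustrates this, and also shows the failure when −1 is an eigenvalue; `inv2`, `mul2` and `cayley2` are illustrative names.

```python
from fractions import Fraction

def inv2(M):
    """Inverse of a 2x2 matrix; raises ZeroDivisionError when singular."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ZeroDivisionError("singular matrix")
    return [[d / det, -b / det], [-c / det, a / det]]

def mul2(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def cayley2(X):
    """(I - X)(I + X)^{-1} for a 2x2 matrix X."""
    (a, b), (c, d) = X
    minus = [[1 - a, -b], [-c, 1 - d]]
    plus = [[1 + a, b], [c, 1 + d]]
    return mul2(minus, inv2(plus))

# A rotation with rational entries; its eigenvalues are not -1.
Q = [[Fraction(3, 5), Fraction(-4, 5)], [Fraction(4, 5), Fraction(3, 5)]]
A = cayley2(Q)            # skew-symmetric: [[0, 1/2], [-1/2, 0]]
Q_again = cayley2(A)      # recovers Q

# Q = -I has eigenvalue -1, so I + Q is singular and the map is undefined.
try:
    cayley2([[-1, 0], [0, -1]])
    excluded = False
except ZeroDivisionError:
    excluded = True
```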

However, any rotation (special orthogonal) matrix Q can be written as

Q={\bigl (}(I-A)(I+A)^{-1}{\bigr )}^{2}

for some skew-symmetric matrix A; more generally any orthogonal matrix Q can be written as

Q=E(I-A)(I+A)^{-1}

for some skew-symmetric matrix A and some diagonal matrix E with ±1 as entries.^[4]

A slightly different form is also seen,^[5]^[6] requiring different mappings in each direction,

{\begin{aligned}Q&=(I-A)^{-1}(I+A),\\[5mu]A&=(Q-I)(Q+I)^{-1}.\end{aligned}}

The mappings may also be written with the order of the factors reversed;^[7]^[8] however, A always commutes with (μI ± A)⁻¹, so the reordering does not affect the definition.

Examples

In the 2×2 case, we have

{\begin{bmatrix}0&\tan {\frac {\theta }{2}}\\-\tan {\frac {\theta }{2}}&0\end{bmatrix}}\leftrightarrow {\begin{bmatrix}\cos \theta &-\sin \theta \\\sin \theta &\cos \theta \end{bmatrix}}.

The 180° rotation matrix, −I, is excluded, though it is the limit as tan(θ/2) goes to infinity.
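
Writing t = tan(θ/2) and multiplying out (I − A)(I + A)⁻¹ for the 2×2 matrix A = [[0, t], [−t, 0]] gives entries (1 − t²)/(1 + t²) and ±2t/(1 + t²). The sketch below uses that closed form (derived for this example) to compare against cos θ and sin θ and to show the approach to −I; `rotation_from_t` is an illustrative name.

```python
import math

def rotation_from_t(t):
    """Cayley transform of [[0, t], [-t, 0]] in closed form."""
    d = 1 + t * t
    return [[(1 - t * t) / d, -2 * t / d],
            [2 * t / d, (1 - t * t) / d]]

theta = math.radians(40)
Q = rotation_from_t(math.tan(theta / 2))
# Q should agree with [[cos, -sin], [sin, cos]] for this theta.

# As t grows, the diagonal entry approaches -1 without reaching it.
for t in (1e1, 1e3, 1e6):
    print(t, rotation_from_t(t)[0][0])
```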

In the 3×3 case, we have

{\begin{bmatrix}0&z&-y\\-z&0&x\\y&-x&0\end{bmatrix}}\leftrightarrow {\frac {1}{K}}{\begin{bmatrix}w^{2}+x^{2}-y^{2}-z^{2}&2(xy-wz)&2(wy+xz)\\2(xy+wz)&w^{2}-x^{2}+y^{2}-z^{2}&2(yz-wx)\\2(xz-wy)&2(wx+yz)&w^{2}-x^{2}-y^{2}+z^{2}\end{bmatrix}},

where K = w² + x² + y² + z², and where w = 1. This we recognize as the rotation matrix corresponding to quaternion

w+\mathbf {i} x+\mathbf {j} y+\mathbf {k} z\,\!

(by a formula Cayley had published the year before), except scaled so that w = 1 instead of the usual scaling so that w² + x² + y² + z² = 1. Thus vector (x,y,z) is the unit axis of rotation scaled by tan(θ/2). Again excluded are 180° rotations, which in this case are all Q which are symmetric (so that Q^T = Q).
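
The 3×3 correspondence can be verified without inverting anything: Q = (I − A)(I + A)⁻¹ is equivalent to Q(I + A) = I − A. The sketch below checks that identity exactly for one sample (x, y, z); the function names and sample values are chosen only for illustration.

```python
from fractions import Fraction

def skew(x, y, z):
    """The skew-symmetric matrix on the left of the correspondence."""
    return [[0, z, -y], [-z, 0, x], [y, -x, 0]]

def quaternion_rotation(w, x, y, z):
    """The matrix on the right, with K = w^2 + x^2 + y^2 + z^2."""
    K = w * w + x * x + y * y + z * z
    rows = [
        [w*w + x*x - y*y - z*z, 2*(x*y - w*z), 2*(w*y + x*z)],
        [2*(x*y + w*z), w*w - x*x + y*y - z*z, 2*(y*z - w*x)],
        [2*(x*z - w*y), 2*(w*x + y*z), w*w - x*x - y*y + z*z],
    ]
    return [[Fraction(v) / K for v in row] for row in rows]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

x, y, z = Fraction(1, 2), Fraction(-1, 3), Fraction(2)
A = skew(x, y, z)
Q = quaternion_rotation(1, x, y, z)  # w = 1, as in the text

I_plus_A = [[int(i == j) + A[i][j] for j in range(3)] for i in range(3)]
I_minus_A = [[int(i == j) - A[i][j] for j in range(3)] for i in range(3)]

print(matmul(Q, I_plus_A) == I_minus_A)  # True
```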

Other matrices

One can extend the mapping to complex matrices by substituting "unitary" for "orthogonal" and "skew-Hermitian" for "skew-symmetric", the difference being that the transpose (·^T) is replaced by the conjugate transpose (·^H). This is consistent with replacing the standard real inner product with the standard complex inner product. In fact, one may extend the definition further with choices of adjoint other than transpose or conjugate transpose.

Formally, the definition only requires some invertibility, so one can substitute for Q any matrix M whose eigenvalues do not include −1. For example,

{\begin{bmatrix}0&-a&ab-c\\0&0&-b\\0&0&0\end{bmatrix}}\leftrightarrow {\begin{bmatrix}1&2a&2c\\0&1&2b\\0&0&1\end{bmatrix}}.
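
This pair can be checked with integer arithmetic alone, since A = (I − M)(I + M)⁻¹ is equivalent to A(I + M) = I − M. A brief sketch, with `example_pair` and `satisfies_cayley` as illustrative names and (a, b, c) = (2, 3, 5) as an arbitrary sample:

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def example_pair(a, b, c):
    """Return (A, M) from the correspondence above for given a, b, c."""
    A = [[0, -a, a * b - c], [0, 0, -b], [0, 0, 0]]
    M = [[1, 2 * a, 2 * c], [0, 1, 2 * b], [0, 0, 1]]
    return A, M

def satisfies_cayley(A, M):
    """Check A (I + M) == I - M, i.e. A = (I - M)(I + M)^{-1}."""
    n = len(M)
    plus = [[int(i == j) + M[i][j] for j in range(n)] for i in range(n)]
    minus = [[int(i == j) - M[i][j] for j in range(n)] for i in range(n)]
    return matmul(A, plus) == minus

A, M = example_pair(2, 3, 5)
print(satisfies_cayley(A, M))  # True
```

Here M is upper triangular with all eigenvalues equal to 1, so it is neither orthogonal nor excluded by the eigenvalue condition.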

Note that A is skew-symmetric (respectively, skew-Hermitian) if and only if Q is orthogonal (respectively, unitary) with no eigenvalue −1.

Operator map

An infinite-dimensional version of an inner product space is a Hilbert space, and one can no longer speak of matrices. However, matrices are merely representations of linear operators, and these can be used. So, generalizing both the matrix mapping and the complex plane mapping, one may define a Cayley transform of operators.^[9]

{\begin{aligned}U&{}=(A-\mathbf {i} I)(A+\mathbf {i} I)^{-1}\\A&{}=\mathbf {i} (I+U)(I-U)^{-1}\end{aligned}}

Here the domain of U, dom U, is (A+iI) dom A. See self-adjoint operator for further details.
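
In finite dimensions the operator formulas reduce to matrix identities, which gives a simple sanity check. The sketch below uses a 2×2 Hermitian matrix as a stand-in for a self-adjoint operator; this ignores the domain questions that matter in the infinite-dimensional setting, and the helper names are illustrative.

```python
def mul2(X, Y):
    """Product of two 2x2 matrices with complex entries."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inv2(M):
    """Inverse of a 2x2 complex matrix (assumes a nonzero determinant)."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def dagger(M):
    """Conjugate transpose."""
    return [[complex(M[j][i]).conjugate() for j in range(2)]
            for i in range(2)]

def shift(M, s):
    """Return M + s*I."""
    return [[M[i][j] + (s if i == j else 0) for j in range(2)]
            for i in range(2)]

A = [[2, 1 - 1j], [1 + 1j, -1]]  # Hermitian: equal to its own dagger

# U = (A - iI)(A + iI)^{-1}
U = mul2(shift(A, -1j), inv2(shift(A, 1j)))

# A = i (I + U)(I - U)^{-1}
neg_U = [[-v for v in row] for row in U]
A_back = [[1j * v for v in row]
          for row in mul2(shift(U, 1), inv2(shift(neg_U, 1)))]

UhU = mul2(dagger(U), U)  # should be close to the identity
```

Up to rounding, `UhU` is the identity (so `U` is unitary) and `A_back` reproduces `A`.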

References

[1] Robert Everist Greene & Steven G. Krantz (2006) Function Theory of One Complex Variable, page 189, Graduate Studies in Mathematics #40, American Mathematical Society ISBN 9780821839621
[2] Erwin Kreyszig (1983) Advanced Engineering Mathematics, 5th edition, page 611, Wiley ISBN 0471862517
[3] See Tangent half-angle formula
[4] Gallier, Jean (2006). "Remarks on the Cayley Representation of Orthogonal Matrices and on Perturbing the Diagonal of a Matrix to Make it Invertible". arXiv:math/0606320.
    As described by Gallier, the first of these results is a sharpened variant of Weyl, Hermann (1946). The Classical Groups (2nd ed.). Princeton University Press. Lemma 2.10.D, p. 60.
    The second appeared as an exercise in Bellman, Richard (1960). Introduction to Matrix Analysis. SIAM Publications. §6.4 exercise 11, pp. 91–92.
[5] Golub, Gene H.; Van Loan, Charles F. (1996), Matrix Computations (3rd ed.), Johns Hopkins University Press, ISBN 978-0-8018-5414-9
[6] F. Chong (1971) "A Geometric Note on the Cayley Transform", pages 84–85 in A Spectrum of Mathematics: Essays Presented to H. G. Forder, John C. Butcher editor, Auckland University Press
[7] Courant, Richard; Hilbert, David (1989), Methods of Mathematical Physics, vol. 1 (1st English ed.), New York: Wiley-Interscience, pp. 536–537, ISBN 978-0-471-50447-4 Ch. VII, §7.2
[8] Howard Eves (1966) Elementary Matrix Theory, § 5.4A Cayley’s Construction of Real Orthogonal Matrices, pages 365–367, Allyn & Bacon
[9] Rudin 1991, pp. 356–357, §13.17.

Sterling K. Berberian (1974) Lectures in Functional Analysis and Operator Theory, Graduate Texts in Mathematics #15, pages 278, 281, Springer-Verlag ISBN 978-0-387-90081-0
Cayley, Arthur (1846), "Sur quelques propriétés des déterminants gauches", Journal für die reine und angewandte Mathematik, 32 (2): 119–123, doi:10.1515/crll.1846.32.119, ISSN 0075-4102; reprinted as article 52 (pp. 332–336) in Cayley, Arthur (1889), The collected mathematical papers of Arthur Cayley, vol. I (1841–1853), Cambridge University Press, pp. 332–336
Lokenath Debnath & Piotr Mikusiński (1990) Introduction to Hilbert Spaces with Applications, page 213, Academic Press ISBN 0-12-208435-7
Gilbert Helmberg (1969) Introduction to Spectral Theory in Hilbert Space, page 288, § 38: The Cayley Transform, Applied Mathematics and Mechanics #6, North Holland
Nikolski, Nikolai Kapitonovich (1988), "Cayley transform", in Hazewinkel, Michiel (ed.), Encyclopaedia of Mathematics, vol. 2, Kluwer, p. 80, doi:10.1007/978-94-009-6000-8, ISBN 978-94-009-6002-2; translated from the Russian Vinogradov, Ivan Matveevich, ed. (1977), Matematicheskaya Entsiklopediya, Moscow: Sovetskaya Entsiklopediya
Henry Ricardo (2010) A Modern Introduction to Linear Algebra, page 504, CRC Press ISBN 978-1-4398-0040-9.
Rudin, Walter (1991). Functional Analysis. International Series in Pure and Applied Mathematics. Vol. 8 (Second ed.). New York, NY: McGraw-Hill Science/Engineering/Math. ISBN 978-0-07-054236-5. OCLC 21163277.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
