Polar decomposition

Last updated

In mathematics, the polar decomposition of a square real or complex matrix is a factorization of the form , where is a unitary matrix and is a positive semi-definite Hermitian matrix ( is an orthogonal matrix and is a positive semi-definite symmetric matrix in the real case), both square and of the same size. [1]

Contents

If a real matrix is interpreted as a linear transformation of -dimensional space , the polar decomposition separates it into a rotation or reflection of , and a scaling of the space along a set of orthogonal axes.

The polar decomposition of a square matrix always exists. If is invertible, the decomposition is unique, and the factor will be positive-definite. In that case, can be written uniquely in the form , where is unitary and is the unique self-adjoint logarithm of the matrix . [2] This decomposition is useful in computing the fundamental group of (matrix) Lie groups. [3]

The polar decomposition can also be defined as where is a symmetric positive-definite matrix with the same eigenvalues as but different eigenvectors.

The polar decomposition of a matrix can be seen as the matrix analog of the polar form of a complex number as , where is its absolute value (a non-negative real number), and is a complex number with unit norm (an element of the circle group).

The definition may be extended to rectangular matrices by requiring to be a semi-unitary matrix and to be a positive-semidefinite Hermitian matrix. The decomposition always exists and is always unique. The matrix is unique if and only if has full rank. [4]

Geometric interpretation

A real square matrix can be interpreted as the linear transformation of that takes a column vector to . Then, in the polar decomposition , the factor is an real orthonormal matrix. The polar decomposition then can be seen as expressing the linear transformation defined by into a scaling of the space along each eigenvector of by a scale factor (the action of ), followed by a rotation of (the action of ).

Alternatively, the decomposition expresses the transformation defined by as a rotation () followed by a scaling () along certain orthogonal directions. The scale factors are the same, but the directions are different.

Properties

The polar decomposition of the complex conjugate of is given by Note thatgives the corresponding polar decomposition of the determinant of A, since and . In particular, if has determinant 1 then both and have determinant 1.

The positive-semidefinite matrix P is always unique, even if A is singular, and is denoted aswhere denotes the conjugate transpose of . The uniqueness of P ensures that this expression is well-defined. The uniqueness is guaranteed by the fact that is a positive-semidefinite Hermitian matrix and, therefore, has a unique positive-semidefinite Hermitian square root. [5] If A is invertible, then P is positive-definite, thus also invertible and the matrix U is uniquely determined by

Relation to the SVD

In terms of the singular value decomposition (SVD) of , , one haswhere , , and are unitary matrices (orthogonal if the field is the reals ). This confirms that is positive-definite and is unitary. Thus, the existence of the SVD is equivalent to the existence of polar decomposition.

One can also decompose in the formHere is the same as before and is given byThis is known as the left polar decomposition, whereas the previous decomposition is known as the right polar decomposition. Left polar decomposition is also known as reverse polar decomposition.

The polar decomposition of a square invertible real matrix is of the form where is a positive-definite matrix and is an orthogonal matrix.

Relation to normal matrices

The matrix with polar decomposition is normal if and only if and commute: , or equivalently, they are simultaneously diagonalizable.

Construction and proofs of existence

The core idea behind the construction of the polar decomposition is similar to that used to compute the singular-value decomposition.

Derivation for normal matrices

If is normal, then it is unitarily equivalent to a diagonal matrix: for some unitary matrix and some diagonal matrix . This makes the derivation of its polar decomposition particularly straightforward, as we can then write where is a diagonal matrix containing the phases of the elements of , that is, when , and when .

The polar decomposition is thus , with and diagonal in the eigenbasis of and having eigenvalues equal to the phases and absolute values of those of , respectively.

Derivation for invertible matrices

From the singular-value decomposition, it can be shown that a matrix is invertible if and only if (equivalently, ) is. Moreover, this is true if and only if the eigenvalues of are all not zero. [6]

In this case, the polar decomposition is directly obtained by writing and observing that is unitary. To see this, we can exploit the spectral decomposition of to write .

In this expression, is unitary because is. To show that also is unitary, we can use the SVD to write , so that where again is unitary by construction.

Yet another way to directly show the unitarity of is to note that, writing the SVD of in terms of rank-1 matrices as , where are the singular values of , we have which directly implies the unitarity of because a matrix is unitary if and only if its singular values have unitary absolute value.

Note how, from the above construction, it follows that the unitary matrix in the polar decomposition of an invertible matrix is uniquely defined.

General derivation

The SVD of a square matrix reads , with unitary matrices, and a diagonal, positive semi-definite matrix. By simply inserting an additional pair of s or s, we obtain the two forms of the polar decomposition of :More generally, if is some rectangular matrix, its SVD can be written as where now and are isometries with dimensions and , respectively, where , and is again a diagonal positive semi-definite square matrix with dimensions . We can now apply the same reasoning used in the above equation to write , but now is not in general unitary. Nonetheless, has the same support and range as , and it satisfies and . This makes into an isometry when its action is restricted onto the support of , that is, it means that is a partial isometry.

As an explicit example of this more general case, consider the SVD of the following matrix:We then havewhich is an isometry, but not unitary. On the other hand, if we consider the decomposition ofwe findwhich is a partial isometry (but not an isometry).

Bounded operators on Hilbert space

The polar decomposition of any bounded linear operator A between complex Hilbert spaces is a canonical factorization as the product of a partial isometry and a non-negative operator.

The polar decomposition for matrices generalizes as follows: if A is a bounded linear operator then there is a unique factorization of A as a product A = UP where U is a partial isometry, P is a non-negative self-adjoint operator and the initial space of U is the closure of the range of P.

The operator U must be weakened to a partial isometry, rather than unitary, because of the following issues. If A is the one-sided shift on l2(N), then |A| = {A*A}1/2 = I. So if A = U |A|, U must be A, which is not unitary.

The existence of a polar decomposition is a consequence of Douglas' lemma:

Lemma  If A, B are bounded operators on a Hilbert space H, and A*AB*B, then there exists a contraction C such that A = CB. Furthermore, C is unique if ker(B*) ker(C).

The operator C can be defined by C(Bh) := Ah for all h in H, extended by continuity to the closure of Ran(B), and by zero on the orthogonal complement to all of H. The lemma then follows since A*AB*B implies ker(B) ⊂ ker(A).

In particular. If A*A = B*B, then C is a partial isometry, which is unique if ker(B*) ⊂ ker(C). In general, for any bounded operator A, where (A*A)1/2 is the unique positive square root of A*A given by the usual functional calculus. So by the lemma, we have for some partial isometry U, which is unique if ker(A*) ⊂ ker(U). Take P to be (A*A)1/2 and one obtains the polar decomposition A = UP. Notice that an analogous argument can be used to show A = P'U', where P' is positive and U' a partial isometry.

When H is finite-dimensional, U can be extended to a unitary operator; this is not true in general (see example above). Alternatively, the polar decomposition can be shown using the operator version of singular value decomposition.

By property of the continuous functional calculus, |A| is in the C*-algebra generated by A. A similar but weaker statement holds for the partial isometry: U is in the von Neumann algebra generated by A. If A is invertible, the polar part U will be in the C*-algebra as well.

Unbounded operators

If A is a closed, densely defined unbounded operator between complex Hilbert spaces then it still has a (unique) polar decomposition where |A| is a (possibly unbounded) non-negative self adjoint operator with the same domain as A, and U is a partial isometry vanishing on the orthogonal complement of the range ran(|A|).

The proof uses the same lemma as above, which goes through for unbounded operators in general. If dom(A*A) = dom(B*B) and A*Ah = B*Bh for all h ∈ dom(A*A), then there exists a partial isometry U such that A = UB. U is unique if ran(B) ⊂ ker(U). The operator A being closed and densely defined ensures that the operator A*A is self-adjoint (with dense domain) and therefore allows one to define (A*A)1/2. Applying the lemma gives polar decomposition.

If an unbounded operator A is affiliated to a von Neumann algebra M, and A = UP is its polar decomposition, then U is in M and so is the spectral projection of P, 1B(P), for any Borel set B in [0, ∞).

Quaternion polar decomposition

The polar decomposition of quaternions with orthonormal basis quaternions depends on the unit 2-dimensional sphere of square roots of minus one, known as right versors . Given any on this sphere, and an angle π < aπ , the versor is on the unit 3-sphere of For a = 0 and a = π , the versor is 1 or −1, regardless of which r is selected. The norm t of a quaternion q is the Euclidean distance from the origin to q. When a quaternion is not just a real number, then there is a unique polar decomposition:

Here r, a, t are all uniquely determined such that r is a right versor ( r2 = –1 ),a satisfies 0 < a < π , and t > 0 .

Alternative planar decompositions

In the Cartesian plane, alternative planar ring decompositions arise as follows:

Numerical determination of the matrix polar decomposition

To compute an approximation of the polar decomposition A = UP, usually the unitary factor U is approximated. [8] [9] The iteration is based on Heron's method for the square root of 1 and computes, starting from , the sequence

The combination of inversion and Hermite conjugation is chosen so that in the singular value decomposition, the unitary factors remain the same and the iteration reduces to Heron's method on the singular values.

This basic iteration may be refined to speed up the process:

See also

Related Research Articles

In mathematics, and more specifically in linear algebra, a linear map is a mapping between two vector spaces that preserves the operations of vector addition and scalar multiplication. The same names and the same definition are also used for the more general case of modules over a ring; see Module homomorphism.

<span class="mw-page-title-main">Fourier transform</span> Mathematical transform that expresses a function of time as a function of frequency

In mathematics, the Fourier transform (FT) is an integral transform that takes a function as input and outputs another function that describes the extent to which various frequencies are present in the original function. The output of the transform is a complex-valued function of frequency. The term Fourier transform refers to both this complex-valued function and the mathematical operation. When a distinction needs to be made, the output of the operation is sometimes called the frequency domain representation of the original function. The Fourier transform is analogous to decomposing the sound of a musical chord into the intensities of its constituent pitches.

In linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized. This is extremely useful because computations involving a diagonalizable matrix can often be reduced to much simpler computations involving the corresponding diagonal matrix. The concept of diagonalization is relatively straightforward for operators on finite-dimensional vector spaces but requires some modification for operators on infinite-dimensional spaces. In general, the spectral theorem identifies a class of linear operators that can be modeled by multiplication operators, which are as simple as one can hope to find. In more abstract language, the spectral theorem is a statement about commutative C*-algebras. See also spectral theory for a historical perspective.

<span class="mw-page-title-main">Symmetric matrix</span> Matrix equal to its transpose

In linear algebra, a symmetric matrix is a square matrix that is equal to its transpose. Formally,

<span class="mw-page-title-main">Singular value decomposition</span> Matrix decomposition

In linear algebra, the singular value decomposition (SVD) is a factorization of a real or complex matrix into a rotation, followed by a rescaling followed by another rotation. It generalizes the eigendecomposition of a square normal matrix with an orthonormal eigenbasis to any matrix. It is related to the polar decomposition.

In mathematics, a complex square matrix A is normal if it commutes with its conjugate transpose A*:

<span class="mw-page-title-main">Special unitary group</span> Group of unitary matrices with determinant of 1

In mathematics, the special unitary group of degree n, denoted SU(n), is the Lie group of n × n unitary matrices with determinant 1.

In mathematics, a Hermitian matrix is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the i-th row and j-th column is equal to the complex conjugate of the element in the j-th row and i-th column, for all indices i and j:

In mathematics, the spectral radius of a square matrix is the maximum of the absolute values of its eigenvalues. More generally, the spectral radius of a bounded linear operator is the supremum of the absolute values of the elements of its spectrum. The spectral radius is often denoted by ρ(·).

<span class="mw-page-title-main">Singular value</span> Square roots of the eigenvalues of the self-adjoint operator

In mathematics, in particular functional analysis, the singular values of a compact operator acting between Hilbert spaces and , are the square roots of the eigenvalues of the self-adjoint operator .

In mathematics, operator theory is the study of linear operators on function spaces, beginning with differential operators and integral operators. The operators may be presented abstractly by their characteristics, such as bounded linear operators or closed operators, and consideration may be given to nonlinear operators. The study, which depends heavily on the topology of function spaces, is a branch of functional analysis.

In mathematics, the Iwasawa decomposition of a semisimple Lie group generalises the way a square real matrix can be written as a product of an orthogonal matrix and an upper triangular matrix. It is named after Kenkichi Iwasawa, the Japanese mathematician who developed this method.

In mathematical functional analysis a partial isometry is a linear map between Hilbert spaces such that it is an isometry on the orthogonal complement of its kernel.

<span class="mw-page-title-main">Representation theory of the Lorentz group</span> Representation of the symmetry group of spacetime in special relativity

The Lorentz group is a Lie group of symmetries of the spacetime of special relativity. This group can be realized as a collection of matrices, linear transformations, or unitary operators on some Hilbert space; it has a variety of representations. This group is significant because special relativity together with quantum mechanics are the two physical theories that are most thoroughly established, and the conjunction of these two theories is the study of the infinite-dimensional unitary representations of the Lorentz group. These have both historical importance in mainstream physics, as well as connections to more speculative present-day theories.

In mathematical physics, the gamma matrices, also called the Dirac matrices, are a set of conventional matrices with specific anticommutation relations that ensure they generate a matrix representation of the Clifford algebra It is also possible to define higher-dimensional gamma matrices. When interpreted as the matrices of the action of a set of orthogonal basis vectors for contravariant vectors in Minkowski space, the column vectors on which the matrices act become a space of spinors, on which the Clifford algebra of spacetime acts. This in turn makes it possible to represent infinitesimal spatial rotations and Lorentz boosts. Spinors facilitate spacetime computations in general, and in particular are fundamental to the Dirac equation for relativistic spin particles. Gamma matrices were introduced by Paul Dirac in 1928.

In mathematics, the square root of a matrix extends the notion of square root from numbers to matrices. A matrix B is said to be a square root of A if the matrix product BB is equal to A.

In statistics, principal component regression (PCR) is a regression analysis technique that is based on principal component analysis (PCA). PCR is a form of reduced rank regression. More specifically, PCR is used for estimating the unknown regression coefficients in a standard linear regression model.

Given a Hilbert space with a tensor product structure a product numerical range is defined as a numerical range with respect to the subset of product vectors. In some situations, especially in the context of quantum mechanics product numerical range is known as local numerical range

<span class="mw-page-title-main">Symmetry in quantum mechanics</span> Properties underlying modern physics

Symmetries in quantum mechanics describe features of spacetime and particles which are unchanged under some transformation, in the context of quantum mechanics, relativistic quantum mechanics and quantum field theory, and with applications in the mathematical formulation of the standard model and condensed matter physics. In general, symmetry in physics, invariance, and conservation laws, are fundamentally important constraints for formulating physical theories and models. In practice, they are powerful methods for solving problems and predicting what can happen. While conservation laws do not always give the answer to the problem directly, they form the correct constraints and the first steps to solving a multitude of problems. In application, understanding symmetries can also provide insights on the eigenstates that can be expected. For example, the existence of degenerate states can be inferred by the presence of non commuting symmetry operators or that the non degenerate states are also eigenvectors of symmetry operators.

In machine learning, the kernel embedding of distributions comprises a class of nonparametric methods in which a probability distribution is represented as an element of a reproducing kernel Hilbert space (RKHS). A generalization of the individual data-point feature mapping done in classical kernel methods, the embedding of distributions into infinite-dimensional feature spaces can preserve all of the statistical features of arbitrary distributions, while allowing one to compare and manipulate distributions using Hilbert space operations such as inner products, distances, projections, linear transformations, and spectral analysis. This learning framework is very general and can be applied to distributions over any space on which a sensible kernel function may be defined. For example, various kernels have been proposed for learning from data which are: vectors in , discrete classes/categories, strings, graphs/networks, images, time series, manifolds, dynamical systems, and other structured objects. The theory behind kernel embeddings of distributions has been primarily developed by Alex Smola, Le Song , Arthur Gretton, and Bernhard Schölkopf. A review of recent works on kernel embedding of distributions can be found in.

References

  1. Hall 2015 Section 2.5
  2. Hall 2015 Theorem 2.17
  3. Hall 2015 Section 13.3
  4. Higham, Nicholas J.; Schreiber, Robert S. (1990). "Fast polar decomposition of an arbitrary matrix". SIAM J. Sci. Stat. Comput. 11 (4). Philadelphia, PA, USA: Society for Industrial and Applied Mathematics: 648–655. CiteSeerX   10.1.1.111.9239 . doi:10.1137/0911038. ISSN   0196-5204. S2CID   14268409.
  5. Hall 2015 Lemma 2.18
  6. Note how this implies, by the positivity of , that the eigenvalues are all real and strictly positive.
  7. Sobczyk, G.(1995) "Hyperbolic Number Plane", College Mathematics Journal 26:268–80
  8. Higham, Nicholas J. (1986). "Computing the polar decomposition with applications". SIAM J. Sci. Stat. Comput. 7 (4). Philadelphia, PA, USA: Society for Industrial and Applied Mathematics: 1160–1174. CiteSeerX   10.1.1.137.7354 . doi:10.1137/0907079. ISSN   0196-5204.
  9. Byers, Ralph; Hongguo Xu (2008). "A New Scaling for Newton's Iteration for the Polar Decomposition and its Backward Stability". SIAM J. Matrix Anal. Appl. 30 (2). Philadelphia, PA, USA: Society for Industrial and Applied Mathematics: 822–843. CiteSeerX   10.1.1.378.6737 . doi:10.1137/070699895. ISSN   0895-4798.