Weyr canonical form

[Figure: WeyrMatrixExample.jpg. A general Weyr matrix consisting of two blocks, each of which is a basic Weyr matrix. The basic Weyr matrix in the top-left corner has the structure (4, 2, 1) and the other one has the structure (2, 2, 1, 1).]

In mathematics, in linear algebra, a Weyr canonical form (or, Weyr form or Weyr matrix) is a square matrix which (in some sense) induces "nice" properties in the matrices that commute with it. It also has a particularly simple structure, and the conditions for possessing a Weyr form are fairly weak, making it a suitable tool for studying classes of commuting matrices. A square matrix is said to be in Weyr canonical form if it has the structure described in the definition below. The Weyr form was discovered by the Czech mathematician Eduard Weyr in 1885. [1] [2] [3] The Weyr form did not become popular among mathematicians, and it was overshadowed by the closely related, but distinct, canonical form known as the Jordan canonical form. [3] The Weyr form has been rediscovered several times since Weyr's original discovery in 1885. [4] This form has been variously called the modified Jordan form, reordered Jordan form, second Jordan form, and H-form. [4] The current terminology is credited to Shapiro, who introduced it in a paper published in The American Mathematical Monthly in 1999. [4] [5]


Recently, several applications have been found for the Weyr matrix. Of particular interest is its application to the study of phylogenetic invariants in biomathematics.

Definitions

Basic Weyr matrix

Definition

A basic Weyr matrix with eigenvalue $\lambda$ is an $n \times n$ matrix $W$ of the following form: there is an integer partition

$n_1 + n_2 + \cdots + n_r = n$

of $n$ with $n_1 \geq n_2 \geq \cdots \geq n_r \geq 1$,

such that, when $W$ is viewed as an $r \times r$ block matrix $(W_{ij})$, where the $(i, j)$ block $W_{ij}$ is an $n_i \times n_j$ matrix, the following three features are present:

  1. The main diagonal blocks $W_{ii}$ are the $n_i \times n_i$ scalar matrices $\lambda I$ for $i = 1, \ldots, r$.
  2. The first superdiagonal blocks $W_{i, i+1}$ are full column rank $n_i \times n_{i+1}$ matrices in reduced row-echelon form (that is, an identity matrix followed by zero rows) for $i = 1, \ldots, r - 1$.
  3. All other blocks of $W$ are zero (that is, $W_{ij} = 0$ when $j \neq i, i + 1$).

In this case, we say that $W$ has Weyr structure $(n_1, n_2, \ldots, n_r)$.
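
Schematically (this display is not in the original text, but it follows directly from the three conditions above), a basic Weyr matrix with eigenvalue $\lambda$ and Weyr structure $(n_1, n_2, \ldots, n_r)$ has the block form

$$
W = \begin{pmatrix}
\lambda I_{n_1} & W_{12} & 0 & \cdots & 0 \\
0 & \lambda I_{n_2} & W_{23} & \cdots & 0 \\
\vdots & & \ddots & \ddots & \vdots \\
0 & 0 & \cdots & \lambda I_{n_{r-1}} & W_{r-1, r} \\
0 & 0 & \cdots & 0 & \lambda I_{n_r}
\end{pmatrix},
\qquad
W_{i, i+1} = \begin{pmatrix} I_{n_{i+1}} \\ 0 \end{pmatrix},
$$

where $I_m$ denotes the $m \times m$ identity matrix and the zero block in $W_{i, i+1}$ has $n_i - n_{i+1}$ rows.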

Example

The following is an example of a basic Weyr matrix.

[Figure: BasicWeyrMatrix.jpg, showing a basic Weyr matrix $W$ with eigenvalue $\lambda$; its Weyr structure is given by the sizes of its diagonal blocks.]
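
Since the figure itself is not reproduced here, the following is an illustrative basic Weyr matrix (an instance chosen for concreteness, not necessarily the matrix shown in the original figure): it has eigenvalue $\lambda$, order $n = 7$, and Weyr structure $(4, 2, 1)$.

$$
W = \left(\begin{array}{cccc|cc|c}
\lambda & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & \lambda & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & \lambda & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & \lambda & 0 & 0 & 0 \\ \hline
0 & 0 & 0 & 0 & \lambda & 0 & 1 \\
0 & 0 & 0 & 0 & 0 & \lambda & 0 \\ \hline
0 & 0 & 0 & 0 & 0 & 0 & \lambda
\end{array}\right).
$$

Its diagonal blocks are $\lambda I_4$, $\lambda I_2$ and $\lambda I_1$, and its first superdiagonal blocks are the full column rank reduced row-echelon matrices $\begin{pmatrix} I_2 \\ 0 \end{pmatrix}$ (of size $4 \times 2$) and $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ (of size $2 \times 1$).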

General Weyr matrix

Definition

Let $W$ be a square matrix and let $\lambda_1, \ldots, \lambda_k$ be the distinct eigenvalues of $W$. We say that $W$ is in Weyr form (or is a Weyr matrix) if $W$ has the following form:

$$
W = \begin{pmatrix}
W_1 & & & \\
& W_2 & & \\
& & \ddots & \\
& & & W_k
\end{pmatrix},
$$

where $W_i$ is a basic Weyr matrix with eigenvalue $\lambda_i$ for $i = 1, \ldots, k$.

Example

The following image shows an example of a general Weyr matrix consisting of three basic Weyr matrix blocks. The basic Weyr matrix in the top-left corner has the structure (4, 2, 1) with eigenvalue 4, the middle block has the structure (2, 2, 1, 1) with eigenvalue −3, and the one in the lower-right corner has the structure (3, 2) with eigenvalue 0.

[Figure: WeyrMatrixExample02.jpg]

Relation between Weyr and Jordan forms

The Weyr canonical form is related to the Jordan form by a simple permutation of basis vectors within each basic Weyr block, as follows: the first index of each Weyr subblock forms the largest Jordan chain. After crossing out these rows and columns, the first index of each new subblock forms the second largest Jordan chain, and so forth. [6]
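
In terms of partitions, this correspondence says that, for each eigenvalue, the Weyr structure and the list of Jordan block sizes are conjugate (transposed) partitions of one another; this is a standard reformulation rather than a statement made explicitly above. A minimal Python sketch (the helper name conjugate_partition is an illustrative choice):

    def conjugate_partition(p):
        """Conjugate (transpose) of a partition given as a non-increasing list.

        If p is the Weyr structure for one eigenvalue, the conjugate partition
        lists the sizes of the Jordan blocks for that eigenvalue, and vice versa.
        """
        return [sum(1 for part in p if part > i) for i in range(max(p))]

    # Weyr structure (4, 2, 1)  <->  Jordan block sizes (3, 2, 1, 1)
    print(conjugate_partition([4, 2, 1]))      # [3, 2, 1, 1]
    print(conjugate_partition([3, 2, 1, 1]))   # [4, 2, 1]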

The Weyr form is canonical

That the Weyr form is a canonical form of a matrix is a consequence of the following result: [3] each square matrix $A$ over an algebraically closed field is similar to a Weyr matrix $W$ which is unique up to permutation of its basic blocks. The matrix $W$ is called the Weyr (canonical) form of $A$.

Computation of the Weyr canonical form

Reduction to the nilpotent case

Let $A$ be a square matrix of order $n$ over an algebraically closed field and let the distinct eigenvalues of $A$ be $\lambda_1, \lambda_2, \ldots, \lambda_k$. The Jordan–Chevalley decomposition theorem states that $A$ is similar to a block diagonal matrix of the form

$$
\begin{pmatrix}
\lambda_1 I + N_1 & & & \\
& \lambda_2 I + N_2 & & \\
& & \ddots & \\
& & & \lambda_k I + N_k
\end{pmatrix}
= D + N,
$$

where $D$ is a diagonal matrix, $N$ is a nilpotent matrix, and $DN = ND$, justifying the reduction of $A$ into the diagonal blocks $\lambda_i I + N_i$ with each $N_i$ nilpotent. So the problem of reducing $A$ to the Weyr form reduces to the problem of reducing the nilpotent matrices $N_i$ to the Weyr form. This reduction relies on the generalized eigenspace decomposition theorem.

Reduction of a nilpotent matrix to the Weyr form

Given a nilpotent square matrix $A$ of order $n$ over an algebraically closed field $F$, the following algorithm produces an invertible matrix $C$ and a Weyr matrix $W$ such that $W = C^{-1} A C$.

Step 1

Let $A_1 = A$.

Step 2

  1. Compute a basis for the null space of $A_1$.
  2. Extend the basis for the null space of $A_1$ to a basis for the $n$-dimensional vector space $F^n$.
  3. Form the matrix $P_1$ consisting of these basis vectors (as columns).
  4. Compute $P_1^{-1} A_1 P_1$. Its first $\operatorname{nullity}(A_1)$ columns are zero, and its lower-right block $A_2$ is a square matrix of size $n - \operatorname{nullity}(A_1)$. (A numerical sketch of this step is given below.)
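
A minimal numerical sketch of this step in Python (using NumPy and SciPy) follows; the function name step2 and the tolerance-based rank tests are assumptions of the sketch, and floating-point arithmetic only approximates the exact arithmetic over an algebraically closed field assumed by the algorithm.

    import numpy as np
    from scipy.linalg import null_space

    def step2(A1, tol=1e-10):
        """One pass of Step 2: change basis so that the null space of A1 comes first.

        Returns (P1, A2) with P1 invertible, P1^{-1} A1 P1 of the block form
        [[0, *], [0, A2]], and A2 square of size n - nullity(A1).
        """
        n = A1.shape[0]
        K = null_space(A1, rcond=tol)           # columns form a basis of the null space
        n1 = K.shape[1]                         # n1 = nullity(A1)
        # Extend the null-space basis to a basis of the whole space by appending
        # standard basis vectors that keep the columns linearly independent.
        P1 = K
        for e in np.eye(n):
            candidate = np.column_stack([P1, e])
            if np.linalg.matrix_rank(candidate, tol=tol) > P1.shape[1]:
                P1 = candidate
            if P1.shape[1] == n:
                break
        M = np.linalg.solve(P1, A1 @ P1)        # M = P1^{-1} A1 P1
        A2 = M[n1:, n1:]                        # lower-right block
        return P1, A2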

Step 3

If $A_2$ is nonzero, repeat Step 2 on $A_2$.

  1. Compute a basis for the null space of $A_2$.
  2. Extend the basis for the null space of $A_2$ to a basis for the vector space having dimension $n - \operatorname{nullity}(A_1)$.
  3. Form the matrix $P_2$ consisting of these basis vectors.
  4. Compute $P_2^{-1} A_2 P_2$. Its lower-right block $A_3$ is a square matrix of size $n - \operatorname{nullity}(A_1) - \operatorname{nullity}(A_2)$.

Step 4

Continue the processes of Steps 1 and 2 to obtain increasingly smaller square matrices $A_1, A_2, A_3, \ldots$ and associated invertible matrices $P_1, P_2, P_3, \ldots$ until the first zero matrix is obtained.

Step 5

The Weyr structure of $A$ is $(n_1, n_2, \ldots, n_r)$ where $n_i = \operatorname{nullity}(A_i)$.
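
Equivalently (a standard fact about nilpotent matrices, stated here for convenience rather than taken from the algorithm itself), the Weyr structure can be read off from the ranks of the powers of $A$:

$$
n_i = \operatorname{nullity}(A^i) - \operatorname{nullity}(A^{i-1}) = \operatorname{rank}(A^{i-1}) - \operatorname{rank}(A^i), \qquad i = 1, \ldots, r,
$$

where $A^0 = I$.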

Step 6

  1. Compute the matrix $P = P_1 \operatorname{diag}(I, P_2) \operatorname{diag}(I, I, P_3) \cdots$ (here the $I$'s are appropriately sized identity matrices).
  2. Compute $X = P^{-1} A P$. $X$ is a matrix of the following form:

$$
X = \begin{pmatrix}
0 & X_{12} & X_{13} & \cdots & X_{1r} \\
0 & 0 & X_{23} & \cdots & X_{2r} \\
\vdots & & \ddots & \ddots & \vdots \\
0 & 0 & \cdots & 0 & X_{r-1, r} \\
0 & 0 & \cdots & 0 & 0
\end{pmatrix},
$$

where the $(i, j)$ block $X_{ij}$ has size $n_i \times n_j$.
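
Continuing the sketch above, the following illustrative function (the name steps_1_to_6 and the stopping test are assumptions of the sketch, and it reuses step2 from the sketch under Step 2) iterates Step 2 as prescribed by Steps 3 and 4, records the Weyr structure of Step 5, and accumulates the matrix $P$ of Step 6.

    import numpy as np
    from scipy.linalg import block_diag

    def steps_1_to_6(A, tol=1e-10):
        """Iterate step2 on a nilpotent matrix A and accumulate
        P = P1 diag(I, P2) diag(I, I, P3) ... so that X = P^{-1} A P
        is block strictly upper triangular (Step 6)."""
        n = A.shape[0]
        Ai = A                                   # Step 1: A1 = A
        P = np.eye(n)
        structure = []                           # Weyr structure (n1, n2, ...)
        done = 0                                 # total size of the blocks handled so far
        while Ai.shape[0] > 0 and np.any(np.abs(Ai) > tol):
            Pi, Anext = step2(Ai, tol)           # step2 from the sketch under Step 2
            structure.append(Ai.shape[0] - Anext.shape[0])   # n_i = nullity(A_i)
            pad = Pi if done == 0 else block_diag(np.eye(done), Pi)
            P = P @ pad
            done += structure[-1]
            Ai = Anext
        if Ai.shape[0] > 0:                      # a final zero matrix contributes n_r
            structure.append(Ai.shape[0])
        X = np.linalg.solve(P, A @ P)            # X = P^{-1} A P  (Step 6, item 2)
        return structure, P, X

For a nilpotent input, the returned structure is the Weyr structure of Step 5 and $X$ has the block strictly upper triangular shape displayed above; Steps 7 to 12 then normalize the superdiagonal blocks and clear the blocks above them.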

Step 7

Use elementary row operations to find an invertible matrix $Y$ of appropriate size such that the product $Y X_{r-1, r}$ is a matrix of the form $\begin{pmatrix} I \\ 0 \end{pmatrix}$.

Step 8

Set $Q_1 = \operatorname{diag}(I, \ldots, I, Y^{-1}, I)$ and compute $X_1 = Q_1^{-1} X Q_1$. In this matrix, the $(r - 1, r)$ block is $\begin{pmatrix} I \\ 0 \end{pmatrix}$.

Step 9

Find a matrix $R_1$ formed as a product of elementary matrices such that $R_1^{-1} X_1 R_1$ is a matrix in which all the blocks above the block $\begin{pmatrix} I \\ 0 \end{pmatrix}$ contain only $0$'s.

Step 10

Repeat Steps 8 and 9 on column $r - 1$, converting the $(r - 2, r - 1)$ block to $\begin{pmatrix} I \\ 0 \end{pmatrix}$ via conjugation by some invertible matrix $Q_2$. Use this block to clear out the blocks above it, via conjugation by a product $R_2$ of elementary matrices.

Step 11

Repeat these processes on the remaining columns, using conjugations by $Q_3, R_3, Q_4, R_4, \ldots$. The resulting matrix $W$ is now in Weyr form.

Step 12

Let $C = P Q_1 R_1 Q_2 R_2 \cdots$. Then $W = C^{-1} A C$.

Applications of the Weyr form

Some well-known applications of the Weyr form are listed below: [3]

  1. The Weyr form can be used to simplify the proof of Gerstenhaber's theorem, which asserts that the subalgebra generated by two commuting $n \times n$ matrices has dimension at most $n$.
  2. A finite set of matrices is said to be approximately simultaneously diagonalizable if the matrices can be perturbed to simultaneously diagonalizable matrices. The Weyr form is used to prove approximate simultaneous diagonalizability of various classes of matrices. The approximate simultaneous diagonalizability property has applications in the study of phylogenetic invariants in biomathematics.
  3. The Weyr form can be used to simplify the proofs of the irreducibility of the variety of all k-tuples of commuting complex matrices.


References

  1. Eduard Weyr (1885). "Répartition des matrices en espèces et formation de toutes les espèces" (PDF). Comptes Rendus de l'Académie des Sciences de Paris. 100: 966–969. Retrieved 10 December 2013.
  2. Eduard Weyr (1890). "Zur Theorie der bilinearen Formen". Monatshefte für Mathematik und Physik. 1: 163–236.
  3. Kevin C. O'Meara; John Clark; Charles I. Vinsonhaler (2011). Advanced Topics in Linear Algebra: Weaving Matrix Problems through the Weyr Form. Oxford University Press.
  4. Kevin C. O'Meara; John Clark; Charles I. Vinsonhaler (2011). Advanced Topics in Linear Algebra: Weaving Matrix Problems through the Weyr Form. Oxford University Press. pp. 44, 81–82.
  5. Shapiro, H. (1999). "The Weyr characteristic" (PDF). The American Mathematical Monthly. 106 (10): 919–929. doi:10.2307/2589746. JSTOR 2589746. S2CID 56072601.
  6. Sergeichuk, V. V. (2007). "Canonical matrices for linear matrix problems". arXiv:0709.2485 [math.RT].