Square matrix

A square matrix of order 4. The entries a_{ii} form the main diagonal of a square matrix. For instance, the main diagonal of the 4×4 matrix above contains the elements a_{11} = 9, a_{22} = 11, a_{33} = 4, a_{44} = 10.

In mathematics, a square matrix is a matrix with the same number of rows and columns. An n-by-n matrix is known as a square matrix of order n. Any two square matrices of the same order can be added and multiplied.


Square matrices are often used to represent simple linear transformations, such as shearing or rotation. For example, if R is a square matrix representing a rotation (rotation matrix) and v is a column vector describing the position of a point in space, the product Rv yields another column vector describing the position of that point after that rotation. If v is a row vector, the same transformation can be obtained using vR^T, where R^T is the transpose of R.
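As a minimal illustrative sketch (not part of the original article), the following NumPy snippet applies a rotation matrix to a point in the plane; the angle and the point are chosen arbitrarily for illustration:

```python
import numpy as np

# Rotation matrix for a 90-degree counterclockwise rotation of the plane.
theta = np.pi / 2
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])

v = np.array([[1.0],      # column vector: the point (1, 0)
              [0.0]])

print(R @ v)              # approximately [[0.], [1.]]: the rotated point (0, 1)

# The same transformation applied to a row vector uses the transpose of R:
print(v.T @ R.T)          # approximately [[0., 1.]]
```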

Main diagonal

The entries a_{ii} (i = 1, ..., n) form the main diagonal of a square matrix. They lie on the imaginary line which runs from the top left corner to the bottom right corner of the matrix. For instance, the main diagonal of the 4×4 matrix above contains the elements a_{11} = 9, a_{22} = 11, a_{33} = 4, a_{44} = 10.

The diagonal of a square matrix from the top right to the bottom left corner is called the antidiagonal or counterdiagonal.
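As a short sketch (not from the original article), both diagonals can be extracted with NumPy; the matrix below reuses the main-diagonal entries 9, 11, 4, 10 from the example above, while its off-diagonal entries are made up for illustration:

```python
import numpy as np

A = np.array([[9, 13, 5,  2],
              [1, 11, 7,  6],
              [3,  7, 4,  1],
              [6,  0, 7, 10]])

print(np.diag(A))             # main diagonal: [ 9 11  4 10]
print(np.diag(np.fliplr(A)))  # antidiagonal, read from top right to bottom left
```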

Special kinds

Name (example with n = 3):

Diagonal matrix: \(\begin{bmatrix} a_{11} & 0 & 0 \\ 0 & a_{22} & 0 \\ 0 & 0 & a_{33} \end{bmatrix}\)

Lower triangular matrix: \(\begin{bmatrix} a_{11} & 0 & 0 \\ a_{21} & a_{22} & 0 \\ a_{31} & a_{32} & a_{33} \end{bmatrix}\)

Upper triangular matrix: \(\begin{bmatrix} a_{11} & a_{12} & a_{13} \\ 0 & a_{22} & a_{23} \\ 0 & 0 & a_{33} \end{bmatrix}\)

Diagonal or triangular matrix

If all entries of A outside the main diagonal are zero, A is called a diagonal matrix. If all entries above (respectively below) the main diagonal are zero, A is called a lower (respectively upper) triangular matrix.
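As an illustrative sketch (not from the original article), NumPy provides helpers that produce each of these special forms from a generic matrix; the input values are arbitrary:

```python
import numpy as np

A = np.arange(1, 10).reshape(3, 3)   # a generic 3x3 matrix

print(np.diag(np.diag(A)))  # diagonal matrix: all off-diagonal entries zeroed
print(np.tril(A))           # lower triangular: entries above the diagonal zeroed
print(np.triu(A))           # upper triangular: entries below the diagonal zeroed
```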

Identity matrix

The identity matrix I_n of size n is the n×n matrix in which all the elements on the main diagonal are equal to 1 and all other elements are equal to 0, e.g.

\[ I_1 = \begin{bmatrix} 1 \end{bmatrix}, \quad I_2 = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \quad \ldots, \quad I_n = \begin{bmatrix} 1 & 0 & \cdots & 0 \\ 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 1 \end{bmatrix}. \]
It is a square matrix of order n, and also a special kind of diagonal matrix. The term identity matrix refers to the property of matrix multiplication that

\[ A I_n = I_n A = A \]

for any n×n matrix A.
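A quick sketch of this identity property in NumPy (not from the original article; the random matrix is an arbitrary stand-in):

```python
import numpy as np

I3 = np.eye(3)                 # the 3x3 identity matrix
A = np.random.rand(3, 3)       # any 3x3 matrix

assert np.allclose(A @ I3, A)  # A I = A
assert np.allclose(I3 @ A, A)  # I A = A
```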

Invertible matrix and its inverse

A square matrix A is called invertible or non-singular if there exists a matrix B such that [1] [2]

\[ AB = BA = I_n. \]

If B exists, it is unique and is called the inverse matrix of A, denoted A^{-1}.
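As a minimal sketch (not from the original article), NumPy computes the inverse when it exists; the matrix below is an arbitrary invertible example:

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 1.0]])

B = np.linalg.inv(A)                  # raises LinAlgError if A is singular
assert np.allclose(A @ B, np.eye(2))  # AB = I
assert np.allclose(B @ A, np.eye(2))  # BA = I
```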

Symmetric or skew-symmetric matrix

A square matrix A that is equal to its transpose, i.e., A^T = A, is a symmetric matrix. If instead A^T = −A, then A is called a skew-symmetric matrix.

For a complex square matrix A, often the appropriate analogue of the transpose is the conjugate transpose A^*, defined as the transpose of the complex conjugate of A. A complex square matrix A satisfying A^* = A is called a Hermitian matrix. If instead A^* = −A, then A is called a skew-Hermitian matrix.

By the spectral theorem, real symmetric (or complex Hermitian) matrices have an orthogonal (or unitary) eigenbasis; i.e., every vector is expressible as a linear combination of eigenvectors. In both cases, all eigenvalues are real. [3]
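As an illustrative sketch of the spectral theorem (not from the original article), NumPy's dedicated symmetric/Hermitian eigensolver returns real eigenvalues and an orthonormal eigenbasis; the matrix entries are arbitrary:

```python
import numpy as np

S = np.array([[2.0, 1.0],
              [1.0, 3.0]])           # symmetric: S.T == S

eigenvalues, eigenvectors = np.linalg.eigh(S)  # solver for symmetric/Hermitian input
print(eigenvalues)                   # all real, as the spectral theorem guarantees

# The eigenvectors form an orthonormal basis: V^T V = I.
assert np.allclose(eigenvectors.T @ eigenvectors, np.eye(2))
```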

Definite matrix

Positive definite: Q(x, y) = 1/4 x² + y²; the points such that Q(x, y) = 1 form an ellipse.
Indefinite: Q(x, y) = 1/4 x² − 1/4 y²; the points such that Q(x, y) = 1 form a hyperbola.

A symmetric n×n-matrix A is called positive-definite (respectively negative-definite; indefinite), if for all nonzero vectors x ∈ R^n the associated quadratic form given by

\[ Q(x) = x^\mathsf{T} A x \]

takes only positive values (respectively only negative values; both some negative and some positive values). [4] If the quadratic form takes only non-negative (respectively only non-positive) values, the symmetric matrix is called positive-semidefinite (respectively negative-semidefinite); hence the matrix is indefinite precisely when it is neither positive-semidefinite nor negative-semidefinite.

A symmetric matrix is positive-definite if and only if all its eigenvalues are positive. [5] The table above shows two possibilities for 2×2 matrices.

Allowing as input two different vectors instead yields the bilinear form associated to A: [6]

\[ B_A(x, y) = x^\mathsf{T} A y. \]
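As a sketch of the eigenvalue criterion above (not from the original article), the helper below is a hypothetical name; the two matrices realize the quadratic forms from the table:

```python
import numpy as np

def is_positive_definite(A: np.ndarray) -> bool:
    """Check positive definiteness of a symmetric matrix via its eigenvalues."""
    return bool(np.all(np.linalg.eigvalsh(A) > 0))

P = np.array([[0.25, 0.0],   # quadratic form Q(x, y) = 1/4 x^2 + y^2
              [0.0,  1.0]])
N = np.array([[0.25, 0.0],   # quadratic form Q(x, y) = 1/4 x^2 - 1/4 y^2
              [0.0, -0.25]])

print(is_positive_definite(P))  # True  (ellipse case)
print(is_positive_definite(N))  # False (indefinite, hyperbola case)
```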

Orthogonal matrix

An orthogonal matrix is a square matrix with real entries whose columns and rows are orthogonal unit vectors (i.e., orthonormal vectors). Equivalently, a matrix A is orthogonal if its transpose is equal to its inverse:

\[ A^\mathsf{T} = A^{-1}, \]

which entails

\[ A^\mathsf{T} A = A A^\mathsf{T} = I, \]

where I is the identity matrix.

An orthogonal matrix A is necessarily invertible (with inverse A−1 = AT), unitary (A−1 = A*), and normal (A*A = AA*). The determinant of any orthogonal matrix is either +1 or −1. The special orthogonal group consists of the n × n orthogonal matrices with determinant +1.

The complex analogue of an orthogonal matrix is a unitary matrix.
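A short sketch of these properties (not from the original article), using a rotation matrix as a standard example of an orthogonal matrix:

```python
import numpy as np

theta = 0.7                                      # an arbitrary rotation angle
Q = np.array([[np.cos(theta), -np.sin(theta)],   # rotation matrices are orthogonal
              [np.sin(theta),  np.cos(theta)]])

assert np.allclose(Q.T @ Q, np.eye(2))           # Q^T Q = I
assert np.allclose(Q.T, np.linalg.inv(Q))        # Q^T = Q^{-1}
print(np.linalg.det(Q))                          # +1: Q lies in the special orthogonal group
```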

Normal matrix

A real or complex square matrix A is called normal if A^*A = AA^*. If a real square matrix is symmetric, skew-symmetric, or orthogonal, then it is normal. If a complex square matrix is Hermitian, skew-Hermitian, or unitary, then it is normal. Normal matrices are of interest mainly because they include the types of matrices just listed and form the broadest class of matrices for which the spectral theorem holds. [7]
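As a minimal sketch (not from the original article), normality can be checked directly from the definition; the example matrix is a scaled rotation, hence normal:

```python
import numpy as np

A = np.array([[ 1.0, 1.0],
              [-1.0, 1.0]])          # a scaled rotation matrix, hence normal

A_star = A.conj().T                  # conjugate transpose (for real A, just the transpose)
print(np.allclose(A_star @ A, A @ A_star))  # True: A is normal
```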

Operations

Trace

The trace, tr(A), of a square matrix A is the sum of its diagonal entries. While matrix multiplication is not commutative, the trace of the product of two matrices is independent of the order of the factors:

\[ \operatorname{tr}(AB) = \operatorname{tr}(BA). \]

This is immediate from the definition of matrix multiplication:

\[ \operatorname{tr}(AB) = \sum_{i=1}^{m} \sum_{j=1}^{n} a_{ij} b_{ji} = \operatorname{tr}(BA). \]

Also, the trace of a matrix is equal to that of its transpose, i.e.,

\[ \operatorname{tr}(A) = \operatorname{tr}(A^\mathsf{T}). \]
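A quick numerical sketch of both trace identities (not from the original article; the matrices are random stand-ins):

```python
import numpy as np

A = np.random.rand(3, 3)
B = np.random.rand(3, 3)

print(np.isclose(np.trace(A @ B), np.trace(B @ A)))  # True, even though A @ B != B @ A
print(np.isclose(np.trace(A), np.trace(A.T)))        # True: tr(A) = tr(A^T)
```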

Determinant

A linear transformation on R^2 given by the indicated matrix. The determinant of this matrix is −1, as the area of the green parallelogram at the right is 1, but the map reverses the orientation, since it turns the counterclockwise orientation of the vectors to a clockwise one.

The determinant det(A) or |A| of a square matrix A is a number encoding certain properties of the matrix. A matrix is invertible if and only if its determinant is nonzero. Its absolute value equals the area (in R^2) or volume (in R^3) of the image of the unit square (or cube), while its sign corresponds to the orientation of the corresponding linear map: the determinant is positive if and only if the orientation is preserved.

The determinant of 2×2 matrices is given by

\[ \det\begin{bmatrix} a & b \\ c & d \end{bmatrix} = ad - bc. \]
The determinant of 3×3 matrices involves 6 terms (rule of Sarrus). The more lengthy Leibniz formula generalizes these two formulae to all dimensions. [8]

The determinant of a product of square matrices equals the product of their determinants: [9]

\[ \det(AB) = \det(A) \cdot \det(B). \]
Adding a multiple of any row to another row, or a multiple of any column to another column, does not change the determinant. Interchanging two rows or two columns affects the determinant by multiplying it by −1. [10] Using these operations, any matrix can be transformed to a lower (or upper) triangular matrix, and for such matrices the determinant equals the product of the entries on the main diagonal; this provides a method to calculate the determinant of any matrix.

Finally, the Laplace expansion expresses the determinant in terms of minors, i.e., determinants of smaller matrices. [11] This expansion can be used for a recursive definition of determinants (taking as starting case the determinant of a 1×1 matrix, which is its unique entry, or even the determinant of a 0×0 matrix, which is 1), which can be seen to be equivalent to the Leibniz formula. Determinants can be used to solve linear systems using Cramer's rule, where the quotient of the determinants of two related square matrices equals the value of each of the system's variables. [12]
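A short numerical sketch of the 2×2 formula and the product rule (not from the original article; the matrices are arbitrary examples):

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [3.0, 4.0]])
B = np.array([[0.0, 1.0],
              [1.0, 0.0]])

print(np.linalg.det(A))              # ad - bc = 1*4 - 2*3 = -2
print(np.isclose(np.linalg.det(A @ B),
                 np.linalg.det(A) * np.linalg.det(B)))  # True: det(AB) = det(A) det(B)
```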

Eigenvalues and eigenvectors

A number λ and a non-zero vector v satisfying

\[ Av = \lambda v \]

are called an eigenvalue and an eigenvector of A, respectively. [13] [14] The number λ is an eigenvalue of an n×n-matrix A if and only if A − λI_n is not invertible, which is equivalent to [15]

\[ \det(A - \lambda I_n) = 0. \]
The polynomial pA in an indeterminate X given by evaluation of the determinant det(XInA) is called the characteristic polynomial of A. It is a monic polynomial of degree n. Therefore the polynomial equation pA(λ) = 0 has at most n different solutions, i.e., eigenvalues of the matrix. [16] They may be complex even if the entries of A are real. According to the Cayley–Hamilton theorem, pA(A) = 0, that is, the result of substituting the matrix itself into its own characteristic polynomial yields the zero matrix.
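As an illustrative sketch (not from the original article), the snippet below computes an eigenpair numerically and verifies the Cayley–Hamilton theorem for a 2×2 matrix, whose characteristic polynomial is p_A(X) = X² − tr(A)X + det(A); the matrix entries are arbitrary:

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

eigenvalues, eigenvectors = np.linalg.eig(A)
print(eigenvalues)                           # eigenvalues of A: 3 and 1

# Verify A v = lambda v for the first eigenpair:
v = eigenvectors[:, 0]
assert np.allclose(A @ v, eigenvalues[0] * v)

# Cayley-Hamilton for 2x2 matrices: p_A(A) = A^2 - tr(A) A + det(A) I = 0.
p_of_A = A @ A - np.trace(A) * A + np.linalg.det(A) * np.eye(2)
assert np.allclose(p_of_A, np.zeros((2, 2)))
```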


Notes

  1. Brown 1991, Definition I.2.28
  2. Brown 1991, Definition I.5.13
  3. Horn & Johnson 1985, Theorem 2.5.6
  4. Horn & Johnson 1985, Chapter 7
  5. Horn & Johnson 1985, Theorem 7.2.1
  6. Horn & Johnson 1985, Example 4.0.6, p. 169
  7. Artin, Algebra, 2nd edition, Pearson, 2018, Section 8.6
  8. Brown 1991, Definition III.2.1
  9. Brown 1991, Theorem III.2.12
  10. Brown 1991, Corollary III.2.16
  11. Mirsky 1990, Theorem 1.4.1
  12. Brown 1991, Theorem III.3.18
  13. Eigen means "own" in German and in Dutch.
  14. Brown 1991, Definition III.4.1
  15. Brown 1991, Definition III.4.9
  16. Brown 1991, Corollary III.4.10

References

Brown, William C. (1991), Matrices and Vector Spaces, New York: Marcel Dekker.
Horn, Roger A.; Johnson, Charles R. (1985), Matrix Analysis, Cambridge University Press.
Mirsky, Leonid (1990), An Introduction to Linear Algebra, Dover Publications.