Matrix differential equation

A differential equation is a mathematical equation for an unknown function of one or several variables that relates the values of the function itself and its derivatives of various orders. A matrix differential equation contains more than one function stacked into vector form with a matrix relating the functions to their derivatives.

For example, a first-order matrix ordinary differential equation is

$$\dot{\mathbf{x}}(t) = \mathbf{A}(t)\,\mathbf{x}(t),$$

where $\mathbf{x}(t)$ is an $n \times 1$ vector of functions of an underlying variable $t$, $\dot{\mathbf{x}}(t)$ is the vector of first derivatives of these functions, and $\mathbf{A}(t)$ is an $n \times n$ matrix of coefficients.

In the case where $\mathbf{A}$ is constant and has $n$ linearly independent eigenvectors, this differential equation has the following general solution,

$$\mathbf{x}(t) = c_1 e^{\lambda_1 t} \mathbf{u}_1 + c_2 e^{\lambda_2 t} \mathbf{u}_2 + \cdots + c_n e^{\lambda_n t} \mathbf{u}_n,$$

where $\lambda_1, \lambda_2, \dots, \lambda_n$ are the eigenvalues of $\mathbf{A}$; $\mathbf{u}_1, \mathbf{u}_2, \dots, \mathbf{u}_n$ are the respective eigenvectors of $\mathbf{A}$; and $c_1, c_2, \dots, c_n$ are constants.
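
When a concrete matrix is at hand, this solution is straightforward to evaluate numerically. The following is a minimal sketch using NumPy; the matrix $\mathbf{A}$ and initial condition here are illustrative choices (the same matrix as in the worked example later in this article):

```python
import numpy as np

# Solve x'(t) = A x(t) for constant A with linearly independent
# eigenvectors, via x(t) = sum_i c_i exp(lambda_i t) u_i.
A = np.array([[3.0, -4.0],
              [4.0, -7.0]])
x0 = np.array([1.0, 1.0])  # initial condition x(0)

lam, U = np.linalg.eig(A)   # eigenvalues lambda_i, eigenvectors as columns u_i
c = np.linalg.solve(U, x0)  # constants c_i from x(0) = sum_i c_i u_i

def x(t):
    """Evaluate x(t) = sum_i c_i exp(lambda_i t) u_i."""
    return (U * np.exp(lam * t)) @ c

print(x(0.0))  # reproduces x0
```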

More generally, if $\mathbf{A}(t)$ commutes with its integral $\int_a^t \mathbf{A}(s)\,ds$, then the Magnus expansion reduces to leading order, and the general solution to the differential equation is

$$\mathbf{x}(t) = e^{\int_a^t \mathbf{A}(s)\,ds}\,\mathbf{c},$$

where $\mathbf{c}$ is an $n \times 1$ constant vector.

By use of the Cayley–Hamilton theorem and Vandermonde-type matrices, this formal matrix exponential solution may be reduced to a simple form.[1] Below, this solution is displayed in terms of Putzer's algorithm.[2]

Stability and steady state of the matrix system

The matrix equation

$$\dot{\mathbf{x}}(t) = \mathbf{A}\mathbf{x}(t) + \mathbf{b},$$

with $n \times 1$ parameter constant vector $\mathbf{b}$, is stable if and only if all eigenvalues of the constant matrix $\mathbf{A}$ have a negative real part.

The steady state $\mathbf{x}^*$ to which it converges if stable is found by setting

$$\dot{\mathbf{x}}(t) = \mathbf{0},$$

thus yielding

$$\mathbf{x}^* = -\mathbf{A}^{-1}\mathbf{b},$$

assuming $\mathbf{A}$ is invertible.

Thus, the original equation can be written in homogeneous form in terms of deviations from the steady state,

$$\dot{\mathbf{x}}(t) = \mathbf{A}\left[\mathbf{x}(t) - \mathbf{x}^*\right].$$

An equivalent way of expressing this is that $\mathbf{x}^*$ is a particular solution to the inhomogeneous equation, while all solutions are of the form

$$\mathbf{x}_h + \mathbf{x}^*,$$

with $\mathbf{x}_h$ a solution to the homogeneous equation ($\mathbf{b} = \mathbf{0}$).
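
As a numerical illustration of the stability test and the steady-state formula, here is a minimal NumPy sketch; the matrix $\mathbf{A}$ and vector $\mathbf{b}$ are illustrative choices, not taken from the article:

```python
import numpy as np

# The system x'(t) = A x(t) + b is stable iff every eigenvalue of A
# has negative real part, and the steady state is x* = -A^{-1} b.
A = np.array([[-2.0,  1.0],
              [ 0.0, -3.0]])
b = np.array([1.0, 6.0])

eigenvalues = np.linalg.eigvals(A)
is_stable = bool(np.all(eigenvalues.real < 0))  # all Re(lambda) < 0?

x_star = -np.linalg.solve(A, b)  # steady state, assuming A is invertible

print(is_stable)  # True for this choice of A
print(x_star)     # the steady state the system converges to
```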

Stability of the two-state-variable case

In the n = 2 case (with two state variables), the stability conditions that the two eigenvalues of the transition matrix A each have a negative real part are equivalent to the conditions that the trace of A be negative and its determinant be positive.
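
A quick sketch comparing the two equivalent tests, with an illustrative 2×2 matrix whose eigenvalues are complex:

```python
import numpy as np

# Trace/determinant test versus the direct eigenvalue test for n = 2.
A = np.array([[-1.0,  2.0],
              [-2.0, -1.0]])  # eigenvalues -1 +/- 2i

by_trace_det = np.trace(A) < 0 and np.linalg.det(A) > 0
by_eigenvalues = np.all(np.linalg.eigvals(A).real < 0)
print(by_trace_det, by_eigenvalues)  # True True: the tests agree
```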

Solution in matrix form

The formal solution of $\dot{\mathbf{x}}(t) = \mathbf{A}\mathbf{x}(t)$ has the matrix exponential form

$$\mathbf{x}(t) = e^{\mathbf{A}t}\mathbf{x}(0),$$

evaluated using any of a multitude of techniques.

Putzer's algorithm for computing $e^{\mathbf{A}t}$

Given a matrix $\mathbf{A}$ with eigenvalues $\lambda_1, \lambda_2, \dots, \lambda_n$,

$$e^{\mathbf{A}t} = \sum_{j=0}^{n-1} r_{j+1}(t)\,\mathbf{P}_j,$$

where

$$\mathbf{P}_0 = \mathbf{I}, \qquad \mathbf{P}_j = \prod_{k=1}^{j} \left(\mathbf{A} - \lambda_k \mathbf{I}\right) = \mathbf{P}_{j-1}\left(\mathbf{A} - \lambda_j \mathbf{I}\right), \qquad j = 1, 2, \dots, n-1,$$

$$\dot{r}_1(t) = \lambda_1 r_1(t), \qquad r_1(0) = 1,$$

$$\dot{r}_{j+1}(t) = \lambda_{j+1} r_{j+1}(t) + r_j(t), \qquad r_{j+1}(0) = 0.$$

The equations for $r_j(t)$ are simple first-order inhomogeneous ODEs.

Note that the algorithm does not require that the matrix $\mathbf{A}$ be diagonalizable, and it bypasses the complexities of the Jordan canonical form normally used.
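
The following is a minimal numerical sketch of Putzer's algorithm, under the simplifying assumptions that the eigenvalues are computed numerically and that the triangular chain of scalar ODEs for the $r_j(t)$ is integrated with a generic solver rather than solved in closed form:

```python
import numpy as np
from scipy.integrate import solve_ivp

def putzer_expm(A, t):
    """Approximate e^{A t} via Putzer's algorithm."""
    n = A.shape[0]
    lam = np.linalg.eigvals(A)  # any fixed ordering of eigenvalues works

    # P_0 = I and P_j = P_{j-1} (A - lambda_j I), j = 1, ..., n-1
    P = [np.eye(n)]
    for j in range(n - 1):
        P.append(P[-1] @ (A - lam[j] * np.eye(n)))

    # r_1' = lambda_1 r_1 with r_1(0) = 1, and
    # r_{j+1}' = lambda_{j+1} r_{j+1} + r_j with r_{j+1}(0) = 0
    def rhs(_, r):
        dr = lam * r
        dr[1:] += r[:-1]
        return dr

    r0 = np.zeros(n, dtype=complex)
    r0[0] = 1.0
    r = solve_ivp(rhs, (0.0, t), r0, rtol=1e-10, atol=1e-12).y[:, -1]

    # e^{A t} = sum_{j=0}^{n-1} r_{j+1}(t) P_j
    return sum(r[j] * P[j] for j in range(n))

A = np.array([[3.0, -4.0],
              [4.0, -7.0]])
print(putzer_expm(A, 1.0).real)  # should agree with scipy.linalg.expm(A)
```

By construction the method uses only the eigenvalues and the matrices $\mathbf{P}_j$, so it applies equally to defective (non-diagonalizable) matrices.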

Deconstructed example of a matrix ordinary differential equation

A first-order homogeneous matrix ordinary differential equation in two functions $x(t)$ and $y(t)$, when taken out of matrix form, has the following form:

$$\frac{dx}{dt} = a_1 x + b_1 y, \qquad \frac{dy}{dt} = a_2 x + b_2 y,$$

where $a_1$, $a_2$, $b_1$, and $b_2$ may be any arbitrary scalars.

Higher-order matrix ODEs may possess a much more complicated form.

Solving deconstructed matrix ordinary differential equations

The process of solving the above equations and finding the required functions of this particular order and form consists of 3 main steps. Brief descriptions of each of these steps are listed below:

  1. Finding the eigenvalues of the coefficient matrix.
  2. Finding the eigenvectors corresponding to those eigenvalues.
  3. Constructing the sought functions from the eigenvalues and eigenvectors found.

The final, third, step in solving these sorts of ordinary differential equations is usually done by plugging the values calculated in the two previous steps into a specialized general form equation, mentioned later in this article.

Solved example of a matrix ODE

To solve a matrix ODE according to the three steps detailed above, using simple matrices in the process, let us find, say, a function $x$ and a function $y$, both in terms of the single independent variable $t$, in the following homogeneous linear differential equation of the first order,

$$\frac{dx}{dt} = 3x - 4y, \qquad \frac{dy}{dt} = 4x - 7y.$$

To solve this particular ordinary differential equation system, at some point in the solution process we shall need a set of two initial values (corresponding to the two state variables at the starting point). In this case, let us pick $x(0) = y(0) = 1$.

First step

The first step, already mentioned above, is finding the eigenvalues of $\mathbf{A}$ in

$$\begin{bmatrix} x' \\ y' \end{bmatrix} = \begin{bmatrix} 3 & -4 \\ 4 & -7 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix}.$$

The derivative notation $x'$ etc. seen in one of the vectors above is known as Lagrange's notation (first introduced by Joseph Louis Lagrange). It is equivalent to the derivative notation $dx/dt$ used in the previous equation, known as Leibniz's notation, honoring the name of Gottfried Leibniz.

Once the coefficients of the two variables have been written in the matrix form $\mathbf{A}$ displayed above, one may evaluate the eigenvalues. To that end, one finds the determinant of the matrix that is formed when an identity matrix $\mathbf{I}$, multiplied by some constant $\lambda$, is subtracted from the above coefficient matrix, to yield the characteristic polynomial of it,

$$\det\!\left( \begin{bmatrix} 3 & -4 \\ 4 & -7 \end{bmatrix} - \lambda \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} \right),$$

and solve for its zeroes.

Applying further simplification and basic rules of matrix addition yields

$$\det \begin{bmatrix} 3 - \lambda & -4 \\ 4 & -7 - \lambda \end{bmatrix}.$$

Applying the rules of finding the determinant of a single 2×2 matrix yields the following elementary quadratic equation,

$$(3 - \lambda)(-7 - \lambda) + 16 = 0,$$

which may be reduced further to get a simpler version of the above,

$$\lambda^2 + 4\lambda - 5 = 0.$$

Now finding the two roots, $\lambda_1$ and $\lambda_2$, of the given quadratic equation by applying the factorization method yields

$$(\lambda - 1)(\lambda + 5) = 0, \qquad \lambda_1 = 1, \quad \lambda_2 = -5.$$

The values $\lambda_1 = 1$ and $\lambda_2 = -5$ calculated above are the required eigenvalues of $\mathbf{A}$. In some cases, say other matrix ODEs, the eigenvalues may be complex, in which case the following step of the solving process, as well as the final form and the solution, may dramatically change.
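
These eigenvalues can be cross-checked numerically; a short sketch with NumPy (the expected output assumes the ordering NumPy happens to return):

```python
import numpy as np

# Numerical cross-check of the first step:
A = np.array([[3.0, -4.0],
              [4.0, -7.0]])
print(np.linalg.eigvals(A))  # expected: [ 1. -5.] (possibly in another order)
```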

Second step

As mentioned above, this step involves finding the eigenvectors of A from the information originally provided.

For each of the eigenvalues calculated, we have an individual eigenvector. For the first eigenvalue, which is $\lambda_1 = 1$, we have

$$\begin{bmatrix} 3 & -4 \\ 4 & -7 \end{bmatrix} \begin{bmatrix} \alpha \\ \beta \end{bmatrix} = 1 \begin{bmatrix} \alpha \\ \beta \end{bmatrix}.$$

Simplifying the above expression by applying basic matrix multiplication rules yields

$$3\alpha - 4\beta = \alpha, \qquad 4\alpha - 7\beta = \beta.$$

All of these calculations have been done only to obtain the last expression, which in our case is $\alpha = 2\beta$. Now taking some arbitrary value, preferably a small value that is easy to work with, for either $\alpha$ or $\beta$ (in most cases it does not really matter which), we substitute it into $\alpha = 2\beta$. Doing so produces a simple vector, which is the required eigenvector for this particular eigenvalue. In our case, we pick $\alpha = 2$, which in turn determines that $\beta = 1$, and, using the standard vector notation, our vector looks like

$$\hat{\mathbf{v}}_1 = \begin{bmatrix} 2 \\ 1 \end{bmatrix}.$$

Performing the same operation using the second eigenvalue we calculated, which is $\lambda_2 = -5$, we obtain our second eigenvector. The process of working out this vector is not shown, but the final result is

$$\hat{\mathbf{v}}_2 = \begin{bmatrix} 1 \\ 2 \end{bmatrix}.$$
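
As with the eigenvalues, the eigenvectors can be cross-checked numerically. Note that NumPy normalizes eigenvectors to unit length, so they agree with the hand-derived vectors only up to scaling:

```python
import numpy as np

# Numerical cross-check of the second step; the column order follows
# whatever eigenvalue ordering NumPy happens to return.
A = np.array([[3.0, -4.0],
              [4.0, -7.0]])
lam, U = np.linalg.eig(A)
print(lam)                # e.g. [ 1. -5.]
print(U[:, 0] / U[1, 0])  # rescaled first column: expected [2. 1.]
print(U[:, 1] / U[0, 1])  # rescaled second column: expected [1. 2.]
```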

Third step

This final step finds the required functions that are 'hidden' behind the derivatives given to us originally. There are two functions, because our differential equations deal with two variables.

The equation which involves all the pieces of information that we have previously found has the following form:

$$\begin{bmatrix} x \\ y \end{bmatrix} = A e^{\lambda_1 t} \hat{\mathbf{v}}_1 + B e^{\lambda_2 t} \hat{\mathbf{v}}_2.$$

Substituting the values of the eigenvalues and eigenvectors yields

$$\begin{bmatrix} x \\ y \end{bmatrix} = A e^{t} \begin{bmatrix} 2 \\ 1 \end{bmatrix} + B e^{-5t} \begin{bmatrix} 1 \\ 2 \end{bmatrix}.$$

Applying further simplification,

$$\begin{bmatrix} x \\ y \end{bmatrix} = \begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix} \begin{bmatrix} A e^{t} \\ B e^{-5t} \end{bmatrix}.$$

Simplifying further and writing the equations for the functions $x$ and $y$ separately,

$$x = 2A e^{t} + B e^{-5t}, \qquad y = A e^{t} + 2B e^{-5t}.$$

The above equations are, in fact, the general functions sought, but they are in their general form (with unspecified values of $A$ and $B$), whilst we want to actually find their exact forms and solutions. So now we consider the problem's given initial conditions (a problem with given initial conditions is a so-called initial value problem). Suppose we are given $x(0) = y(0) = 1$, which plays the role of a starting point for our ordinary differential equation; application of these conditions specifies the constants $A$ and $B$. As we see from the conditions, when $t = 0$, the left sides of the above equations equal 1. Thus we may construct the following system of linear equations,

$$1 = 2A + B, \qquad 1 = A + 2B.$$

Solving these equations, we find that both constants $A$ and $B$ equal $1/3$. Therefore, substituting these values into the general form of these two functions specifies their exact forms,

$$x(t) = \tfrac{2}{3} e^{t} + \tfrac{1}{3} e^{-5t}, \qquad y(t) = \tfrac{1}{3} e^{t} + \tfrac{2}{3} e^{-5t},$$

the two functions sought.
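
The worked example can also be verified symbolically; here is a sketch using SymPy's dsolve (the exact printed form may vary between SymPy versions):

```python
import sympy as sp

# Symbolic cross-check of the worked example as an initial value problem.
t = sp.symbols('t')
x, y = sp.Function('x'), sp.Function('y')

eqs = [sp.Eq(x(t).diff(t), 3*x(t) - 4*y(t)),
       sp.Eq(y(t).diff(t), 4*x(t) - 7*y(t))]

sol = sp.dsolve(eqs, [x(t), y(t)], ics={x(0): 1, y(0): 1})
print(sol)
# expected: x(t) = 2*exp(t)/3 + exp(-5*t)/3,
#           y(t) = exp(t)/3 + 2*exp(-5*t)/3
```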

Using matrix exponentiation

The above problem could have been solved with a direct application of the matrix exponential. That is, we can say that

$$\begin{bmatrix} x(t) \\ y(t) \end{bmatrix} = e^{\mathbf{A}t} \begin{bmatrix} x(0) \\ y(0) \end{bmatrix}.$$

Given that

$$e^{\mathbf{A}t} = \frac{1}{3} \begin{bmatrix} 4e^{t} - e^{-5t} & 2e^{-5t} - 2e^{t} \\ 2e^{t} - 2e^{-5t} & 4e^{-5t} - e^{t} \end{bmatrix}$$

(which can be computed using any suitable tool, such as MATLAB's expm tool, or by performing matrix diagonalisation and leveraging the property that the matrix exponential of a diagonal matrix is the same as element-wise exponentiation of its elements),

the final result is

$$\begin{bmatrix} x(t) \\ y(t) \end{bmatrix} = \frac{1}{3} \begin{bmatrix} 2e^{t} + e^{-5t} \\ e^{t} + 2e^{-5t} \end{bmatrix}.$$

This is the same as the eigenvector approach shown before.
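
A short numerical sketch of this matrix-exponential route, using SciPy's expm in place of the MATLAB tool mentioned above, and checked against the closed form found with the eigenvector method:

```python
import numpy as np
from scipy.linalg import expm

# x(t) = e^{At} x(0), evaluated at an arbitrary time t.
A = np.array([[3.0, -4.0],
              [4.0, -7.0]])
x0 = np.array([1.0, 1.0])

t = 1.0
print(expm(A * t) @ x0)
print(np.array([2/3 * np.exp(t) + 1/3 * np.exp(-5*t),    # x(t) closed form
                1/3 * np.exp(t) + 2/3 * np.exp(-5*t)]))  # y(t) closed form
```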

References

  1. Moya-Cessa, H.; Soto-Eguibar, F. (2011). Differential Equations: An Operational Approach. New Jersey: Rinton Press. ISBN 978-1-58949-060-4.
  2. Putzer, E. J. (1966). "Avoiding the Jordan Canonical Form in the Discussion of Linear Systems with Constant Coefficients". The American Mathematical Monthly. 73 (1): 2–7. doi:10.1080/00029890.1966.11970714. JSTOR 2313914.