Variation of parameters

Last updated

In mathematics, variation of parameters, also known as variation of constants, is a general method to solve inhomogeneous linear ordinary differential equations.

Contents

For first-order inhomogeneous linear differential equations it is usually possible to find solutions via integrating factors or undetermined coefficients with considerably less effort, although those methods leverage heuristics that involve guessing and do not work for all inhomogeneous linear differential equations.

Variation of parameters extends to linear partial differential equations as well, specifically to inhomogeneous problems for linear evolution equations like the heat equation, wave equation, and vibrating plate equation. In this setting, the method is more often known as Duhamel's principle, named after Jean-Marie Duhamel (1797–1872) who first applied the method to solve the inhomogeneous heat equation. Sometimes variation of parameters itself is called Duhamel's principle and vice versa.

History

The method of variation of parameters was first sketched by the Swiss mathematician Leonhard Euler (1707–1783), and later completed by the Italian-French mathematician Joseph-Louis Lagrange (1736–1813). [1]

A forerunner of the method of variation of a celestial body's orbital elements appeared in Euler's work in 1748, while he was studying the mutual perturbations of Jupiter and Saturn. [2] In his 1749 study of the motions of the earth, Euler obtained differential equations for the orbital elements. [3] In 1753, he applied the method to his study of the motions of the moon. [4]

Lagrange first used the method in 1766. [5] Between 1778 and 1783, he further developed the method in two series of memoirs: one on variations in the motions of the planets [6] and another on determining the orbit of a comet from three observations. [7] During 1808–1810, Lagrange gave the method of variation of parameters its final form in a third series of papers. [8]

Description of method

Given an ordinary non-homogeneous linear differential equation of order n

 

 

 

 

(i)

Let be a basis of the vector space of solutions of the corresponding homogeneous equation

 

 

 

 

(ii)

Then a particular solution to the non-homogeneous equation is given by

 

 

 

 

(iii)

where the are differentiable functions which are assumed to satisfy the conditions

 

 

 

 

(iv)

Starting with ( iii ), repeated differentiation combined with repeated use of ( iv ) gives

 

 

 

 

(v)

One last differentiation gives

 

 

 

 

(vi)

By substituting ( iii ) into ( i ) and applying ( v ) and ( vi ) it follows that

 

 

 

 

(vii)

The linear system ( iv and vii ) of n equations can then be solved using Cramer's rule yielding

where is the Wronskian determinant of the basis and is the Wronskian determinant of the basis with the i-th column replaced by

The particular solution to the non-homogeneous equation can then be written as

Intuitive explanation

Consider the equation of the forced dispersionless spring, in suitable units:

Here x is the displacement of the spring from the equilibrium x = 0, and F(t) is an external applied force that depends on time. When the external force is zero, this is the homogeneous equation (whose solutions are linear combinations of sines and cosines, corresponding to the spring oscillating with constant total energy).

We can construct the solution physically, as follows. Between times and , the momentum corresponding to the solution has a net change (see: Impulse (physics)). A solution to the inhomogeneous equation, at the present time t > 0, is obtained by linearly superposing the solutions obtained in this manner, for s going between 0 and t.

The homogeneous initial-value problem, representing a small impulse being added to the solution at time , is

The unique solution to this problem is easily seen to be . The linear superposition of all of these solutions is given by the integral:

To verify that this satisfies the required equation:

as required (see: Leibniz integral rule).

The general method of variation of parameters allows for solving an inhomogeneous linear equation

by means of considering the second-order linear differential operator L to be the net force, thus the total impulse imparted to a solution between time s and s+ds is F(s)ds. Denote by the solution of the homogeneous initial value problem

Then a particular solution of the inhomogeneous equation is

the result of linearly superposing the infinitesimal homogeneous solutions. There are generalizations to higher order linear differential operators.

In practice, variation of parameters usually involves the fundamental solution of the homogeneous problem, the infinitesimal solutions then being given in terms of explicit linear combinations of linearly independent fundamental solutions. In the case of the forced dispersionless spring, the kernel is the associated decomposition into fundamental solutions.

Examples

First-order equation

The complementary solution to our original (inhomogeneous) equation is the general solution of the corresponding homogeneous equation (written below):

This homogeneous differential equation can be solved by different methods, for example separation of variables:

The complementary solution to our original equation is therefore:

Now we return to solving the non-homogeneous equation:

Using the method variation of parameters, the particular solution is formed by multiplying the complementary solution by an unknown function C(x):

By substituting the particular solution into the non-homogeneous equation, we can find C(x):

We only need a single particular solution, so we arbitrarily select for simplicity. Therefore the particular solution is:

The final solution of the differential equation is:

This recreates the method of integrating factors.

Specific second-order equation

Let us solve

We want to find the general solution to the differential equation, that is, we want to find solutions to the homogeneous differential equation

The characteristic equation is:

Since is a repeated root, we have to introduce a factor of x for one solution to ensure linear independence: and . The Wronskian of these two functions is

Because the Wronskian is non-zero, the two functions are linearly independent, so this is in fact the general solution for the homogeneous differential equation (and not a mere subset of it).

We seek functions A(x) and B(x) so A(x)u1 + B(x)u2 is a particular solution of the non-homogeneous equation. We need only calculate the integrals

Recall that for this example

That is,

where and are constants of integration.

General second-order equation

We have a differential equation of the form

and we define the linear operator

where D represents the differential operator. We therefore have to solve the equation for , where and are known.

We must solve first the corresponding homogeneous equation:

by the technique of our choice. Once we've obtained two linearly independent solutions to this homogeneous differential equation (because this ODE is second-order) — call them u1 and u2 — we can proceed with variation of parameters.

Now, we seek the general solution to the differential equation which we assume to be of the form

Here, and are unknown and and are the solutions to the homogeneous equation. (Observe that if and are constants, then .) Since the above is only one equation and we have two unknown functions, it is reasonable to impose a second condition. We choose the following:

Now,

Differentiating again (omitting intermediary steps)

Now we can write the action of L upon uG as

Since u1 and u2 are solutions, then

We have the system of equations

Expanding,

So the above system determines precisely the conditions

We seek A(x) and B(x) from these conditions, so, given

we can solve for (A′(x), B′(x))T, so

where W denotes the Wronskian of u1 and u2. (We know that W is nonzero, from the assumption that u1 and u2 are linearly independent.) So,

While homogeneous equations are relatively easy to solve, this method allows the calculation of the coefficients of the general solution of the inhomogeneous equation, and thus the complete general solution of the inhomogeneous equation can be determined.

Note that and are each determined only up to an arbitrary additive constant (the constant of integration). Adding a constant to or does not change the value of because the extra term is just a linear combination of u1 and u2, which is a solution of by definition.

Notes

  1. See:
  2. Euler, L. (1748) "Recherches sur la question des inégalités du mouvement de Saturne et de Jupiter, sujet proposé pour le prix de l'année 1748, par l’Académie Royale des Sciences de Paris" [Investigations on the question of the differences in the movement of Saturn and Jupiter; this subject proposed for the prize of 1748 by the Royal Academy of Sciences (Paris)] (Paris, France: G. Martin, J.B. Coignard, & H.L. Guerin, 1749).
  3. Euler, L. (1749) "Recherches sur la précession des équinoxes, et sur la nutation de l’axe de la terre," Histoire [or Mémoires ] de l'Académie Royale des Sciences et Belles-lettres (Berlin), pages 289–325 [published in 1751].
  4. Euler, L. (1753) Theoria motus lunae: exhibens omnes ejus inaequalitates ... [The theory of the motion of the moon: demonstrating all of its inequalities ... ] (Saint Petersburg, Russia: Academia Imperialis Scientiarum Petropolitanae [Imperial Academy of Science (St. Petersburg)], 1753).
  5. Lagrange, J.-L. (1766) “Solution de différens problèmes du calcul integral,” Mélanges de philosophie et de mathématique de la Société royale de Turin, vol. 3, pages 179–380.
  6. See:
  7. See:
  8. See:
    • Lagrange, J.-L. (1808) “Sur la théorie des variations des éléments des planètes et en particulier des variations des grands axes de leurs orbites,” Mémoires de la première Classe de l’Institut de France. Reprinted in: Joseph-Louis Lagrange with Joseph-Alfred Serret, ed., Oeuvres de Lagrange (Paris, France: Gauthier-Villars, 1873), vol. 6, pages 713–768.
    • Lagrange, J.-L. (1809) “Sur la théorie générale de la variation des constantes arbitraires dans tous les problèmes de la méchanique,” Mémoires de la première Classe de l’Institut de France. Reprinted in: Joseph-Louis Lagrange with Joseph-Alfred Serret, ed., Oeuvres de Lagrange (Paris, France: Gauthier-Villars, 1873), vol. 6, pages 771–805.
    • Lagrange, J.-L. (1810) “Second mémoire sur la théorie générale de la variation des constantes arbitraires dans tous les problèmes de la méchanique, ... ,” Mémoires de la première Classe de l’Institut de France. Reprinted in: Joseph-Louis Lagrange with Joseph-Alfred Serret, ed., Oeuvres de Lagrange (Paris, France: Gauthier-Villars, 1873), vol. 6, pages 809–816.

Related Research Articles

<span class="mw-page-title-main">Wave equation</span> Differential equation important in physics

The (two-way) wave equation is a second-order linear partial differential equation for the description of waves or standing wave fields – as they occur in classical physics – such as mechanical waves or electromagnetic waves. It arises in fields like acoustics, electromagnetism, and fluid dynamics. Single mechanical or electromagnetic waves propagating in a pre-defined direction can also be described with the first-order one-way wave equation, which is much easier to solve and also valid for inhomogeneous media.

The calculus of variations is a field of mathematical analysis that uses variations, which are small changes in functions and functionals, to find maxima and minima of functionals: mappings from a set of functions to the real numbers. Functionals are often expressed as definite integrals involving functions and their derivatives. Functions that maximize or minimize functionals may be found using the Euler–Lagrange equation of the calculus of variations.

In mathematics, the Laplace operator or Laplacian is a differential operator given by the divergence of the gradient of a scalar function on Euclidean space. It is usually denoted by the symbols , (where is the nabla operator), or . In a Cartesian coordinate system, the Laplacian is given by the sum of second partial derivatives of the function with respect to each independent variable. In other coordinate systems, such as cylindrical and spherical coordinates, the Laplacian also has a useful form. Informally, the Laplacian Δf (p) of a function f at a point p measures by how much the average value of f over small spheres or balls centered at p deviates from f (p).

In calculus, integration by substitution, also known as u-substitution, reverse chain rule or change of variables, is a method for evaluating integrals and antiderivatives. It is the counterpart to the chain rule for differentiation, and can loosely be thought of as using the chain rule "backwards."

<span class="mw-page-title-main">Green's function</span> Impulse response of an inhomogeneous linear differential operator

In mathematics, a Green's function is the impulse response of an inhomogeneous linear differential operator defined on a domain with specified initial conditions or boundary conditions.

In mathematics, a linear differential equation is a differential equation that is defined by a linear polynomial in the unknown function and its derivatives, that is an equation of the form

<span class="mw-page-title-main">Separation of variables</span> Technique for solving differential equations

In mathematics, separation of variables is any of several methods for solving ordinary and partial differential equations, in which algebra allows one to rewrite an equation so that each of two variables occurs on a different side of the equation.

In mathematics, integral equations are equations in which an unknown function appears under an integral sign. In mathematical notation, integral equations may thus be expressed as being of the form:

In mathematics and its applications, a Sturm–Liouville problem is a second-order linear ordinary differential equation of the form:

In mathematics, the matrix exponential is a matrix function on square matrices analogous to the ordinary exponential function. It is used to solve systems of linear differential equations. In the theory of Lie groups, the matrix exponential gives the exponential map between a matrix Lie algebra and the corresponding Lie group.

In mathematics, the method of characteristics is a technique for solving partial differential equations. Typically, it applies to first-order equations, although more generally the method of characteristics is valid for any hyperbolic partial differential equation. The method is to reduce a partial differential equation to a family of ordinary differential equations along which the solution can be integrated from some initial data given on a suitable hypersurface.

In mathematics, the kernel of a linear map, also known as the null space or nullspace, is the linear subspace of the domain of the map which is mapped to the zero vector. That is, given a linear map L : VW between two vector spaces V and W, the kernel of L is the vector space of all elements v of V such that L(v) = 0, where 0 denotes the zero vector in W, or more symbolically:

<span class="mw-page-title-main">Differential equation</span> Type of functional equation (mathematics)

In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, and the differential equation defines a relationship between the two. Such relations are common; therefore, differential equations play a prominent role in many disciplines including engineering, physics, economics, and biology.

In mathematics, an integrating factor is a function that is chosen to facilitate the solving of a given equation involving differentials. It is commonly used to solve ordinary differential equations, but is also used within multivariable calculus when multiplying through by an integrating factor allows an inexact differential to be made into an exact differential. This is especially useful in thermodynamics where temperature becomes the integrating factor that makes entropy an exact differential.

In mathematics, a change of variables is a basic technique used to simplify problems in which the original variables are replaced with functions of other variables. The intent is that when expressed in new variables, the problem may become simpler, or equivalent to a better understood problem.

A differential equation can be homogeneous in either of two respects.

In mathematics, an integro-differential equation is an equation that involves both integrals and derivatives of a function.

A differential equation is a mathematical equation for an unknown function of one or several variables that relates the values of the function itself and its derivatives of various orders. A matrix differential equation contains more than one function stacked into vector form with a matrix relating the functions to their derivatives.

In the study of ordinary differential equations and their associated boundary value problems, Lagrange's identity, named after Joseph Louis Lagrange, gives the boundary terms arising from integration by parts of a self-adjoint linear differential operator. Lagrange's identity is fundamental in Sturm–Liouville theory. In more than one independent variable, Lagrange's identity is generalized by Green's second identity.

In mathematics, the exponential response formula (ERF), also known as exponential response and complex replacement, is a method used to find a particular solution of a non-homogeneous linear ordinary differential equation of any order. The exponential response formula is applicable to non-homogeneous linear ordinary differential equations with constant coefficients if the function is polynomial, sinusoidal, exponential or the combination of the three. The general solution of a non-homogeneous linear ordinary differential equation is a superposition of the general solution of the associated homogeneous ODE and a particular solution to the non-homogeneous ODE. Alternative methods for solving ordinary differential equations of higher order are method of undetermined coefficients and method of variation of parameters.

References

See also