Method of characteristics

Last updated September 23, 2023

In mathematics, the method of characteristics is a technique for solving partial differential equations. Typically, it applies to first-order equations, although more generally the method of characteristics is valid for any hyperbolic partial differential equation. The method is to reduce a partial differential equation to a family of ordinary differential equations along which the solution can be integrated from some initial data given on a suitable hypersurface.

Characteristics of first-order partial differential equation

For a first-order PDE (partial differential equation), the method of characteristics discovers curves (called characteristic curves or just characteristics) along which the PDE becomes an ordinary differential equation (ODE).^[1] Once the ODE is found, it can be solved along the characteristic curves and transformed into a solution for the original PDE.

For the sake of simplicity, we confine our attention to the case of a function of two independent variables x and y for the moment. Consider a quasilinear PDE of the form

a(x,y,z){\frac {\partial z}{\partial x}}+b(x,y,z){\frac {\partial z}{\partial y}}=c(x,y,z).

(1)

Suppose that a solution z is known, and consider the surface graph z = z(x,y) in R³. A normal vector to this surface is given by

\left({\frac {\partial z}{\partial x}}(x,y),{\frac {\partial z}{\partial y}}(x,y),-1\right).\,

As a result,^[2] equation ( 1 ) is equivalent to the geometrical statement that the vector field

(a(x,y,z),b(x,y,z),c(x,y,z))\,

is tangent to the surface z = z(x,y) at every point, for the dot product of this vector field with the above normal vector is zero. In other words, the graph of the solution must be a union of integral curves of this vector field. These integral curves are called the characteristic curves of the original partial differential equation and are given by the Lagrange–Charpit equations^[3]

{\begin{array}{rcl}{\frac {dx}{dt}}&=&a(x,y,z),\\{\frac {dy}{dt}}&=&b(x,y,z),\\{\frac {dz}{dt}}&=&c(x,y,z).\end{array}}

A parametrization invariant form of the Lagrange–Charpit equations^[3] is:

{\frac {dx}{a(x,y,z)}}={\frac {dy}{b(x,y,z)}}={\frac {dz}{c(x,y,z)}}.

Linear and quasilinear cases

Consider now a PDE of the form

\sum _{i=1}^{n}a_{i}(x_{1},\dots ,x_{n},u){\frac {\partial u}{\partial x_{i}}}=c(x_{1},\dots ,x_{n},u).

For this PDE to be linear, the coefficients a_i may be functions of the spatial variables only, and independent of u. For it to be quasilinear,^[4]a_i may also depend on the value of the function, but not on any derivatives. The distinction between these two cases is inessential for the discussion here.

For a linear or quasilinear PDE, the characteristic curves are given parametrically by

(x_{1},\dots ,x_{n},u)=(x_{1}(s),\dots ,x_{n}(s),u(s))

u(\mathbf {X} (s))=U(s)

such that the following system of ODEs is satisfied

{\frac {dx_{i}}{ds}}=a_{i}(x_{1},\dots ,x_{n},u)

(2)

{\frac {du}{ds}}=c(x_{1},\dots ,x_{n},u).

(3)

Equations ( 2 ) and ( 3 ) give the characteristics of the PDE.

Proof for quasilinear Case

In the quasilinear case, the use of the method of characteristics is justified by Grönwall's inequality. The above equation may be written as

\mathbf {a} (\mathbf {x} ,u)\cdot \nabla u(\mathbf {x} )=c(\mathbf {x} ,u)

We must distinguish between the solutions to the ODE and the solutions to the PDE, which we do not know are equal a priori. Letting capital letters be the solutions to the ODE we find

\mathbf {X} '(s)=\mathbf {a} (\mathbf {X} (s),U(s))

U'(s)=c(\mathbf {X} (s),U(s))

Examining $\Delta (s)=|u(\mathbf {X} (s))-U(s)|^{2}$ , we find, upon differentiating that

\Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}\mathbf {X} '(s)\cdot \nabla u(\mathbf {X} (s))-U'(s){\Big )}

which is the same as

\Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}\mathbf {a} (\mathbf {X} (s),U(s))\cdot \nabla u(\mathbf {X} (s))-c(\mathbf {X} (s),U(s)){\Big )}

We cannot conclude the above is 0 as we would like, since the PDE only guarantees us that this relationship is satisfied for $u(\mathbf {x} )$ , $\mathbf {a} (\mathbf {x} ,u)\cdot \nabla u(\mathbf {x} )=c(\mathbf {x} ,u)$ , and we do not yet know that $U(s)=u(\mathbf {X} (s))$ .

However, we can see that

\Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}\mathbf {a} (\mathbf {X} (s),U(s))\cdot \nabla u(\mathbf {X} (s))-c(\mathbf {X} (s),U(s))-{\big (}\mathbf {a} (\mathbf {X} (s),u(\mathbf {X} (s)))\cdot \nabla u(\mathbf {X} (s))-c(\mathbf {X} (s),u(\mathbf {X} (s))){\big )}{\Big )}

since by the PDE, the last term is 0. This equals

\Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}{\big (}\mathbf {a} (\mathbf {X} (s),U(s))-\mathbf {a} (\mathbf {X} (s),u(\mathbf {X} (s))){\big )}\cdot \nabla u(\mathbf {X} (s))-{\big (}c(\mathbf {X} (s),U(s))-c(\mathbf {X} (s),u(\mathbf {X} (s))){\big )}{\Big )}

By the triangle inequality, we have

|\Delta '(s)|\leq 2{\big |}u(\mathbf {X} (s))-U(s){\big |}{\Big (}{\big \|}\mathbf {a} (\mathbf {X} (s),U(s))-\mathbf {a} (\mathbf {X} (s),u(\mathbf {X} (s))){\big \|}\ \|\nabla u(\mathbf {X} (s))\|+{\big |}c(\mathbf {X} (s),U(s))-c(\mathbf {X} (s),u(\mathbf {X} (s))){\big |}{\Big )}

Assuming $\mathbf {a} ,c$ are at least $C^{1}$ , we can bound this for small times. Choose a neighborhood $\Omega$ around $\mathbf {X} (0),U(0)$ small enough such that $\mathbf {a} ,c$ are locally Lipschitz. By continuity, $(\mathbf {X} (s),U(s))$ will remain in $\Omega$ for small enough $s$ . Since $U(0)=u(\mathbf {X} (0))$ , we also have that $(\mathbf {X} (s),u(\mathbf {X} (s)))$ will be in $\Omega$ for small enough $s$ by continuity. So, $(\mathbf {X} (s),U(s))\in \Omega$ and $(\mathbf {X} (s),u(\mathbf {X} (s)))\in \Omega$ for $s\in [0,s_{0}]$ . Additionally, $\|\nabla u(\mathbf {X} (s))\|\leq M$ for some $M\in \mathbb {R}$ for $s\in [0,s_{0}]$ by compactness. From this, we find the above is bounded as

|\Delta '(s)|\leq C|u(\mathbf {X} (s))-U(s)|^{2}=C|\Delta (s)|

for some $C\in \mathbb {R}$ . It is a straightforward application of Grönwall's Inequality to show that since $\Delta (0)=0$ we have $\Delta (s)=0$ for as long as this inequality holds. We have some interval $[0,\epsilon )$ such that $u(X(s))=U(s)$ in this interval. Choose the largest $\epsilon$ such that this is true. Then, by continuity, $U(\epsilon )=u(\mathbf {X} (\epsilon ))$ . Provided the ODE still has a solution in some interval after $\epsilon$ , we can repeat the argument above to find that $u(X(s))=U(s)$ in a larger interval. Thus, so long as the ODE has a solution, we have $u(X(s))=U(s)$ .

Fully nonlinear case

Consider the partial differential equation

F(x_{1},\dots ,x_{n},u,p_{1},\dots ,p_{n})=0

(4)

where the variables p_i are shorthand for the partial derivatives

p_{i}={\frac {\partial u}{\partial x_{i}}}.

Let (x_i(s),u(s),p_i(s)) be a curve in R²ⁿ⁺¹. Suppose that u is any solution, and that

u(s)=u(x_{1}(s),\dots ,x_{n}(s)).

Along a solution, differentiating ( 4 ) with respect to s gives

\sum _{i}(F_{x_{i}}+F_{u}p_{i}){\dot {x}}_{i}+\sum _{i}F_{p_{i}}{\dot {p}}_{i}=0

{\dot {u}}-\sum _{i}p_{i}{\dot {x}}_{i}=0

\sum _{i}({\dot {x}}_{i}dp_{i}-{\dot {p}}_{i}dx_{i})=0.

The second equation follows from applying the chain rule to a solution u, and the third follows by taking an exterior derivative of the relation $du-\sum _{i}p_{i}\,dx_{i}=0$ . Manipulating these equations gives

{\dot {x}}_{i}=\lambda F_{p_{i}},\quad {\dot {p}}_{i}=-\lambda (F_{x_{i}}+F_{u}p_{i}),\quad {\dot {u}}=\lambda \sum _{i}p_{i}F_{p_{i}}

where λ is a constant. Writing these equations more symmetrically, one obtains the Lagrange–Charpit equations for the characteristic

{\frac {{\dot {x}}_{i}}{F_{p_{i}}}}=-{\frac {{\dot {p}}_{i}}{F_{x_{i}}+F_{u}p_{i}}}={\frac {\dot {u}}{\sum p_{i}F_{p_{i}}}}.

Geometrically, the method of characteristics in the fully nonlinear case can be interpreted as requiring that the Monge cone of the differential equation should everywhere be tangent to the graph of the solution. The second order partial differential equation is solved with Charpit method .

Example

As an example, consider the advection equation (this example assumes familiarity with PDE notation, and solutions to basic ODEs).

a{\frac {\partial u}{\partial x}}+{\frac {\partial u}{\partial t}}=0

where $a$ is constant and $u$ is a function of $x$ and $t$ . We want to transform this linear first-order PDE into an ODE along the appropriate curve; i.e. something of the form

{\frac {d}{ds}}u(x(s),t(s))=F(u,x(s),t(s)),

where $(x(s),t(s))$ is a characteristic line. First, we find

{\frac {d}{ds}}u(x(s),t(s))={\frac {\partial u}{\partial x}}{\frac {dx}{ds}}+{\frac {\partial u}{\partial t}}{\frac {dt}{ds}}

by the chain rule. Now, if we set ${\frac {dx}{ds}}=a$ and ${\frac {dt}{ds}}=1$ we get

a{\frac {\partial u}{\partial x}}+{\frac {\partial u}{\partial t}}

which is the left hand side of the PDE we started with. Thus

{\frac {d}{ds}}u=a{\frac {\partial u}{\partial x}}+{\frac {\partial u}{\partial t}}=0.

So, along the characteristic line $(x(s),t(s))$ , the original PDE becomes the ODE $u_{s}=F(u,x(s),t(s))=0$ . That is to say that along the characteristics, the solution is constant. Thus, $u(x_{s},t_{s})=u(x_{0},0)$ where $(x_{s},t_{s})\,$ and $(x_{0},0)$ lie on the same characteristic. Therefore, to determine the general solution, it is enough to find the characteristics by solving the characteristic system of ODEs:

${\frac {dt}{ds}}=1$ , letting $t(0)=0$ we know $t=s$ ,
${\frac {dx}{ds}}=a$ , letting $x(0)=x_{0}$ we know $x=as+x_{0}=at+x_{0}$ ,
${\frac {du}{ds}}=0$ , letting $u(0)=f(x_{0})$ we know $u(x(t),t)=f(x_{0})=f(x-at)$ .

In this case, the characteristic lines are straight lines with slope $a$ , and the value of $u$ remains constant along any characteristic line.

Characteristics of linear differential operators

Let X be a differentiable manifold and P a linear differential operator

P:C^{\infty }(X)\to C^{\infty }(X)

of order k. In a local coordinate system xⁱ,

P=\sum _{|\alpha |\leq k}P^{\alpha }(x){\frac {\partial }{\partial x^{\alpha }}}

in which α denotes a multi-index. The principal symbol of P, denoted σ_P, is the function on the cotangent bundle T^∗X defined in these local coordinates by

\sigma _{P}(x,\xi )=\sum _{|\alpha |=k}P^{\alpha }(x)\xi _{\alpha }

where the ξ_i are the fiber coordinates on the cotangent bundle induced by the coordinate differentials dxⁱ. Although this is defined using a particular coordinate system, the transformation law relating the ξ_i and the xⁱ ensures that σ_P is a well-defined function on the cotangent bundle.

The function σ_P is homogeneous of degree k in the ξ variable. The zeros of σ_P, away from the zero section of T^∗X, are the characteristics of P. A hypersurface of X defined by the equation F(x) = c is called a characteristic hypersurface at x if

\sigma _{P}(x,dF(x))=0.

Invariantly, a characteristic hypersurface is a hypersurface whose conormal bundle is in the characteristic set of P.

Qualitative analysis of characteristics

Characteristics are also a powerful tool for gaining qualitative insight into a PDE.

One can use the crossings of the characteristics to find shock waves for potential flow in a compressible fluid. Intuitively, we can think of each characteristic line implying a solution to $u$ along itself. Thus, when two characteristics cross, the function becomes multi-valued resulting in a non-physical solution. Physically, this contradiction is removed by the formation of a shock wave, a tangential discontinuity or a weak discontinuity and can result in non-potential flow, violating the initial assumptions.^[5]

Characteristics may fail to cover part of the domain of the PDE. This is called a rarefaction, and indicates the solution typically exists only in a weak, i.e. integral equation, sense.

The direction of the characteristic lines indicates the flow of values through the solution, as the example above demonstrates. This kind of knowledge is useful when solving PDEs numerically as it can indicate which finite difference scheme is best for the problem.

Notes

↑ Zachmanoglou, E. C.; Thoe, Dale W. (1976), "Linear Partial Differential Equations : Characteristics, Classification, and Canonical Forms", Introduction to Partial Differential Equations with Applications, Baltimore: Williams & Wilkins, pp. 112–152, ISBN 0-486-65251-3
↑ John, Fritz (1991), Partial differential equations (4th ed.), Springer, ISBN 978-0-387-90609-6
1 2 Delgado, Manuel (1997), "The Lagrange-Charpit Method", SIAM Review, 39 (2): 298–304, Bibcode:1997SIAMR..39..298D, doi:10.1137/S0036144595293534, JSTOR 2133111
↑ "Partial Differential Equations (PDEs)—Wolfram Language Documentation".
↑ Debnath, Lokenath (2005), "Conservation Laws and Shock Waves", Nonlinear Partial Differential Equations for Scientists and Engineers (2nd ed.), Boston: Birkhäuser, pp. 251–276, ISBN 0-8176-4323-0

Related Research Articles

<span class="mw-page-title-main">Gradient</span> Multivariate derivative (mathematics)

In vector calculus, the gradient of a scalar-valued differentiable function $of several variables is the vector field whose value at a point is the "direction and rate of fastest increase". If the gradient of a function is non-zero at a point, the direction of the gradient is the direction in which the function increases most quickly from, and the magnitude of the gradient is the rate of increase in that direction, the greatest absolute directional derivative. Further, a point where the gradient is the zero vector is known as a stationary point. The gradient thus plays a fundamental role in optimization theory, where it is used to maximize a function by gradient ascent. In coordinate-free terms, the gradient of a function may be defined by:$

In physics, the Lorentz force is the combination of electric and magnetic force on a point charge due to electromagnetic fields. A particle of charge $q$ moving with a velocity $v$ in an electric field $E$ and a magnetic field $B$ experiences a force of

<span class="mw-page-title-main">Wave equation</span> Differential equation important in physics

The (two-way) wave equation is a second-order linear partial differential equation for the description of waves or standing wave fields – as they occur in classical physics – such as mechanical waves or electromagnetic waves. It arises in fields like acoustics, electromagnetism, and fluid dynamics. Single mechanical or electromagnetic waves propagating in a pre-defined direction can also be described with the first-order one-way wave equation, which is much easier to solve and also valid for inhomogeneous media.

In mathematics and physics, Laplace's equation is a second-order partial differential equation named after Pierre-Simon Laplace, who first studied its properties. This is often written as

<span class="mw-page-title-main">Navier–Stokes equations</span> Equations describing the motion of viscous fluid substances

The Navier–Stokes equations are partial differential equations which describe the motion of viscous fluid substances, named after French engineer and physicist Claude-Louis Navier and Irish physicist and mathematician George Gabriel Stokes. They were developed over several decades of progressively building the theories, from 1822 (Navier) to 1842-1850 (Stokes).

<span class="mw-page-title-main">Partial differential equation</span> Type of differential equation

In mathematics, a partial differential equation (PDE) is an equation which computes a function between various partial derivatives of a multivariable function.

<span class="mw-page-title-main">Heat equation</span> Partial differential equation describing the evolution of temperature in a region

In mathematics and physics, the heat equation is a certain partial differential equation. Solutions of the heat equation are sometimes known as caloric functions. The theory of the heat equation was first developed by Joseph Fourier in 1822 for the purpose of modeling how a quantity such as heat diffuses through a given region.

<span class="mw-page-title-main">Fokker–Planck equation</span> Partial differential equation

In statistical mechanics and information theory, the Fokker–Planck equation is a partial differential equation that describes the time evolution of the probability density function of the velocity of a particle under the influence of drag forces and random forces, as in Brownian motion. The equation can be generalized to other observables as well. The Fokker-Planck equation has multiple applications in information theory, graph theory, data science, finance, economics etc.

In mathematics, the Laplace operator or Laplacian is a differential operator given by the divergence of the gradient of a scalar function on Euclidean space. It is usually denoted by the symbols $, (where is the nabla operator), or . In a Cartesian coordinate system, the Laplacian is given by the sum of second partial derivatives of the function with respect to each independent variable. In other coordinate systems, such as cylindrical and spherical coordinates, the Laplacian also has a useful form. Informally, the Laplacian Δ f (p) of a function f at a point p measures by how much the average value of f over small spheres or balls centered at p deviates from f (p) .$

In fluid dynamics, the Euler equations are a set of quasilinear partial differential equations governing adiabatic and inviscid flow. They are named after Leonhard Euler. In particular, they correspond to the Navier–Stokes equations with zero viscosity and zero thermal conductivity.

<span class="mw-page-title-main">Envelope (mathematics)</span> Family of curves in geometry

In geometry, an envelope of a planar family of curves is a curve that is tangent to each member of the family at some point, and these points of tangency together form the whole envelope. Classically, a point on the envelope can be thought of as the intersection of two "infinitesimally adjacent" curves, meaning the limit of intersections of nearby curves. This idea can be generalized to an envelope of surfaces in space, and so on to higher dimensions.

In physics, the Hamilton–Jacobi equation, named after William Rowan Hamilton and Carl Gustav Jacob Jacobi, is an alternative formulation of classical mechanics, equivalent to other formulations such as Newton's laws of motion, Lagrangian mechanics and Hamiltonian mechanics.

In mathematics, the Helmholtz equation is the eigenvalue problem for the Laplace operator. It corresponds to the linear partial differential equation

An eikonal equation is a non-linear first-order partial differential equation that is encountered in problems of wave propagation.

In mathematics, a first-order partial differential equation is a partial differential equation that involves only first derivatives of the unknown function of n variables. The equation takes the form

The Navier–Stokes existence and smoothness problem concerns the mathematical properties of solutions to the Navier–Stokes equations, a system of partial differential equations that describe the motion of a fluid in space. Solutions to the Navier–Stokes equations are used in many practical applications. However, theoretical understanding of the solutions to these equations is incomplete. In particular, solutions of the Navier–Stokes equations often include turbulence, which remains one of the greatest unsolved problems in physics, despite its immense importance in science and engineering.

There are various mathematical descriptions of the electromagnetic field that are used in the study of electromagnetism, one of the four fundamental interactions of nature. In this article, several approaches are discussed, although the equations are in terms of electric and magnetic fields, potentials, and charges with currents, generally speaking.

In differential calculus, there is no single uniform notation for differentiation. Instead, various notations for the derivative of a function or variable have been proposed by various mathematicians. The usefulness of each notation varies with the context, and it is sometimes advantageous to use more than one notation in a given context. The most common notations for differentiation are listed below.

The Cauchy momentum equation is a vector partial differential equation put forth by Cauchy that describes the non-relativistic momentum transport in any continuum.

In the finite element method for the numerical solution of elliptic partial differential equations, the stiffness matrix is a matrix that represents the system of linear equations that must be solved in order to ascertain an approximate solution to the differential equation.

References

Courant, Richard; Hilbert, David (1962), Methods of Mathematical Physics, Volume II, Wiley-Interscience
Evans, Lawrence C. (1998), Partial Differential Equations, Providence: American Mathematical Society, ISBN 0-8218-0772-2
Polyanin, A. D.; Zaitsev, V. F.; Moussiaux, A. (2002), Handbook of First Order Partial Differential Equations, London: Taylor & Francis, ISBN 0-415-27267-X
Polyanin, A. D. (2002), Handbook of Linear Partial Differential Equations for Engineers and Scientists, Boca Raton: Chapman & Hall/CRC Press, ISBN 1-58488-299-9
Sarra, Scott (2003), "The Method of Characteristics with applications to Conservation Laws", Journal of Online Mathematics and Its Applications.
Streeter, VL; Wylie, EB (1998), Fluid mechanics (International 9th Revised ed.), McGraw-Hill Higher Education

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Zachmanoglou, E. C.; Thoe, Dale W. (1976), "Linear Partial Differential Equations : Characteristics, Classification, and Canonical Forms", Introduction to Partial Differential Equations with Applications, Baltimore: Williams & Wilkins, pp. 112–152, ISBN 0-486-65251-3

[John1991-2] John, Fritz (1991), Partial differential equations (4th ed.), Springer, ISBN 978-0-387-90609-6

[:0-3] 1 2 Delgado, Manuel (1997), "The Lagrange-Charpit Method", SIAM Review, 39 (2): 298–304, Bibcode:1997SIAMR..39..298D, doi:10.1137/S0036144595293534, JSTOR 2133111

[quasilinear-4] "Partial Differential Equations (PDEs)—Wolfram Language Documentation".

[5] Debnath, Lokenath (2005), "Conservation Laws and Shock Waves", Nonlinear Partial Differential Equations for Scientists and Engineers (2nd ed.), Boston: Birkhäuser, pp. 251–276, ISBN 0-8176-4323-0

[1]

[2]

[3]

[4]

[5]