In mathematics, the method of steepest descent or saddle-point method is an extension of Laplace's method for approximating an integral, where one deforms a contour integral in the complex plane to pass near a stationary point (saddle point), in roughly the direction of steepest descent or stationary phase. The saddle-point approximation is used with integrals in the complex plane, whereas Laplace’s method is used with real integrals.
The integral to be estimated is often of the form
where C is a contour, and λ is large. One version of the method of steepest descent deforms the contour of integration C into a new path integration C′ so that the following conditions hold:
The method of steepest descent was first published by Debye (1909), who used it to estimate Bessel functions and pointed out that it occurred in the unpublished note by Riemann (1863) about hypergeometric functions. The contour of steepest descent has a minimax property, see Fedoryuk (2001). Siegel (1932) described some other unpublished notes of Riemann, where he used this method to derive the Riemann–Siegel formula.
The method of steepest descent is a method to approximate a complex integral of the form
for large , where and are analytic functions of . Because the integrand is analytic, the contour can be deformed into a new contour without changing the integral. In particular, one seeks a new contour on which the imaginary part, denoted , of is constant ( denotes the real part). Then
and the remaining integral can be approximated with other methods like Laplace's method. [1]
The method is called the method of steepest descent because for analytic , constant phase contours are equivalent to steepest descent contours.
If is an analytic function of , it satisfies the Cauchy–Riemann equations
Then
so contours of constant phase are also contours of steepest descent.
Let f, S : Cn → C and C ⊂ Cn. If
where denotes the real part, and there exists a positive real number λ0 such that
then the following estimate holds: [2]
Proof of the simple estimate:
Let x be a complex n-dimensional vector, and
denote the Hessian matrix for a function S(x). If
is a vector function, then its Jacobian matrix is defined as
A non-degenerate saddle point, z0 ∈ Cn, of a holomorphic function S(z) is a critical point of the function (i.e., ∇S(z0) = 0) where the function's Hessian matrix has a non-vanishing determinant (i.e., ).
The following is the main tool for constructing the asymptotics of integrals in the case of a non-degenerate saddle point:
The Morse lemma for real-valued functions generalizes as follows [3] for holomorphic functions: near a non-degenerate saddle point z0 of a holomorphic function S(z), there exist coordinates in terms of which S(z) − S(z0) is exactly quadratic. To make this precise, let S be a holomorphic function with domain W ⊂ Cn, and let z0 in W be a non-degenerate saddle point of S, that is, ∇S(z0) = 0 and . Then there exist neighborhoods U ⊂ W of z0 and V ⊂ Cn of w = 0, and a bijective holomorphic function φ : V → U with φ(0) = z0 such that
Here, the μj are the eigenvalues of the matrix .
The following proof is a straightforward generalization of the proof of the real Morse Lemma, which can be found in. [4] We begin by demonstrating
From the identity
we conclude that
and
Without loss of generality, we translate the origin to z0, such that z0 = 0 and S(0) = 0. Using the Auxiliary Statement, we have
Since the origin is a saddle point,
we can also apply the Auxiliary Statement to the functions gi(z) and obtain
| (1) |
Recall that an arbitrary matrix A can be represented as a sum of symmetric A(s) and anti-symmetric A(a) matrices,
The contraction of any symmetric matrix B with an arbitrary matrix A is
| (2) |
i.e., the anti-symmetric component of A does not contribute because
Thus, hij(z) in equation (1) can be assumed to be symmetric with respect to the interchange of the indices i and j. Note that
hence, det(hij(0)) ≠ 0 because the origin is a non-degenerate saddle point.
Let us show by induction that there are local coordinates u = (u1, ... un), z = ψ(u), 0 = ψ(0), such that
| (3) |
First, assume that there exist local coordinates y = (y1, ... yn), z = φ(y), 0 = φ(0), such that
| (4) |
where Hij is symmetric due to equation (2). By a linear change of the variables (yr, ... yn), we can assure that Hrr(0) ≠ 0. From the chain rule, we have
Therefore:
whence,
The matrix (Hij(0)) can be recast in the Jordan normal form: (Hij(0)) = LJL−1, were L gives the desired non-singular linear transformation and the diagonal of J contains non-zero eigenvalues of (Hij(0)). If Hij(0) ≠ 0 then, due to continuity of Hij(y), it must be also non-vanishing in some neighborhood of the origin. Having introduced , we write
Motivated by the last expression, we introduce new coordinates z = η(x), 0 = η(0),
The change of the variables y ↔ x is locally invertible since the corresponding Jacobian is non-zero,
Therefore,
| (5) |
Comparing equations (4) and (5), we conclude that equation (3) is verified. Denoting the eigenvalues of by μj, equation (3) can be rewritten as
| (6) |
Therefore,
| (7) |
From equation (6), it follows that . The Jordan normal form of reads , where Jz is an upper diagonal matrix containing the eigenvalues and det P ≠ 0; hence, . We obtain from equation (7)
If , then interchanging two variables assures that .
Assume
Then, the following asymptotic holds
| (8) |
where μj are eigenvalues of the Hessian and are defined with arguments
| (9) |
This statement is a special case of more general results presented in Fedoryuk (1987). [5]
First, we deform the contour Ix into a new contour passing through the saddle point x0 and sharing the boundary with Ix. This deformation does not change the value of the integral I(λ). We employ the Complex Morse Lemma to change the variables of integration. According to the lemma, the function φ(w) maps a neighborhood x0 ∈ U ⊂ Ωx onto a neighborhood Ωw containing the origin. The integral I(λ) can be split into two: I(λ) = I0(λ) + I1(λ), where I0(λ) is the integral over , while I1(λ) is over (i.e., the remaining part of the contour I′x). Since the latter region does not contain the saddle point x0, the value of I1(λ) is exponentially smaller than I0(λ) as λ → ∞; [6] thus, I1(λ) is ignored. Introducing the contour Iw such that , we have
| (10) |
Recalling that x0 = φ(0) as well as , we expand the pre-exponential function into a Taylor series and keep just the leading zero-order term
| (11) |
Here, we have substituted the integration region Iw by Rn because both contain the origin, which is a saddle point, hence they are equal up to an exponentially small term. [7] The integrals in the r.h.s. of equation (11) can be expressed as
| (12) |
From this representation, we conclude that condition (9) must be satisfied in order for the r.h.s. and l.h.s. of equation (12) to coincide. According to assumption 2, is a negatively defined quadratic form (viz., ) implying the existence of the integral , which is readily calculated
Equation (8) can also be written as
| (13) |
where the branch of
is selected as follows
Consider important special cases:
If the function S(x) has multiple isolated non-degenerate saddle points, i.e.,
where
is an open cover of Ωx, then the calculation of the integral asymptotic is reduced to the case of a single saddle point by employing the partition of unity. The partition of unity allows us to construct a set of continuous functions ρk(x) : Ωx → [0, 1], 1 ≤ k ≤ K, such that
Whence,
Therefore as λ → ∞ we have:
where equation (13) was utilized at the last stage, and the pre-exponential function f (x) at least must be continuous.
When ∇S(z0) = 0 and , the point z0 ∈ Cn is called a degenerate saddle point of a function S(z).
Calculating the asymptotic of
when λ → ∞, f (x) is continuous, and S(z) has a degenerate saddle point, is a very rich problem, whose solution heavily relies on the catastrophe theory. Here, the catastrophe theory replaces the Morse lemma, valid only in the non-degenerate case, to transform the function S(z) into one of the multitude of canonical representations. For further details see, e.g., Poston & Stewart (1978) and Fedoryuk (1987).
Integrals with degenerate saddle points naturally appear in many applications including optical caustics and the multidimensional WKB approximation in quantum mechanics.
The other cases such as, e.g., f (x) and/or S(x) are discontinuous or when an extremum of S(x) lies at the integration region's boundary, require special care (see, e.g., Fedoryuk (1987) and Wong (1989)).
An extension of the steepest descent method is the so-called nonlinear stationary phase/steepest descent method. Here, instead of integrals, one needs to evaluate asymptotically solutions of Riemann–Hilbert factorization problems.
Given a contour C in the complex sphere, a function f defined on that contour and a special point, say infinity, one seeks a function M holomorphic away from the contour C, with prescribed jump across C, and with a given normalization at infinity. If f and hence M are matrices rather than scalars this is a problem that in general does not admit an explicit solution.
An asymptotic evaluation is then possible along the lines of the linear stationary phase/steepest descent method. The idea is to reduce asymptotically the solution of the given Riemann–Hilbert problem to that of a simpler, explicitly solvable, Riemann–Hilbert problem. Cauchy's theorem is used to justify deformations of the jump contour.
The nonlinear stationary phase was introduced by Deift and Zhou in 1993, based on earlier work of the Russian mathematician Alexander Its. A (properly speaking) nonlinear steepest descent method was introduced by Kamvissis, K. McLaughlin and P. Miller in 2003, based on previous work of Lax, Levermore, Deift, Venakides and Zhou. As in the linear case, steepest descent contours solve a min-max problem. In the nonlinear case they turn out to be "S-curves" (defined in a different context back in the 80s by Stahl, Gonchar and Rakhmanov).
The nonlinear stationary phase/steepest descent method has applications to the theory of soliton equations and integrable models, random matrices and combinatorics.
Another extension is the Method of Chester–Friedman–Ursell for coalescing saddle points and uniform asymptotic extensions.
An ellipsoid is a surface that can be obtained from a sphere by deforming it by means of directional scalings, or more generally, of an affine transformation.
In continuum mechanics, the infinitesimal strain theory is a mathematical approach to the description of the deformation of a solid body in which the displacements of the material particles are assumed to be much smaller than any relevant dimension of the body; so that its geometry and the constitutive properties of the material at each point of space can be assumed to be unchanged by the deformation.
In the mathematical field of differential geometry, a metric tensor is an additional structure on a manifold M that allows defining distances and angles, just as the inner product on a Euclidean space allows defining distances and angles there. More precisely, a metric tensor at a point p of M is a bilinear form defined on the tangent space at p, and a metric field on M consists of a metric tensor at each point p of M that varies smoothly with p.
In vector calculus, Green's theorem relates a line integral around a simple closed curve C to a double integral over the plane region D bounded by C. It is the two-dimensional special case of Stokes' theorem.
Linear elasticity is a mathematical model of how solid objects deform and become internally stressed due to prescribed loading conditions. It is a simplification of the more general nonlinear theory of elasticity and a branch of continuum mechanics.
In mathematics, the Hodge star operator or Hodge star is a linear map defined on the exterior algebra of a finite-dimensional oriented vector space endowed with a nondegenerate symmetric bilinear form. Applying the operator to an element of the algebra produces the Hodge dual of the element. This map was introduced by W. V. D. Hodge.
The Schwinger–Dyson equations (SDEs) or Dyson–Schwinger equations, named after Julian Schwinger and Freeman Dyson, are general relations between correlation functions in quantum field theories (QFTs). They are also referred to as the Euler–Lagrange equations of quantum field theories, since they are the equations of motion corresponding to the Green's function. They form a set of infinitely many functional differential equations, all coupled to each other, sometimes referred to as the infinite tower of SDEs.
In mathematics, a volume element provides a means for integrating a function with respect to volume in various coordinate systems such as spherical coordinates and cylindrical coordinates. Thus a volume element is an expression of the form
In continuum mechanics, the finite strain theory—also called large strain theory, or large deformation theory—deals with deformations in which strains and/or rotations are large enough to invalidate assumptions inherent in infinitesimal strain theory. In this case, the undeformed and deformed configurations of the continuum are significantly different, requiring a clear distinction between them. This is commonly the case with elastomers, plastically deforming materials and other fluids and biological soft tissue.
In mathematics, the Fubini–Study metric is a Kähler metric on a complex projective space CPn endowed with a Hermitian form. This metric was originally described in 1904 and 1905 by Guido Fubini and Eduard Study.
The Maxwell stress tensor is a symmetric second-order tensor used in classical electromagnetism to represent the interaction between electromagnetic forces and mechanical momentum. In simple situations, such as a point charge moving freely in a homogeneous magnetic field, it is easy to calculate the forces on the charge from the Lorentz force law. When the situation becomes more complicated, this ordinary procedure can become impractically difficult, with equations spanning multiple lines. It is therefore convenient to collect many of these terms in the Maxwell stress tensor, and to use tensor arithmetic to find the answer to the problem at hand.
There are various mathematical descriptions of the electromagnetic field that are used in the study of electromagnetism, one of the four fundamental interactions of nature. In this article, several approaches are discussed, although the equations are in terms of electric and magnetic fields, potentials, and charges with currents, generally speaking.
The derivation of the Navier–Stokes equations as well as its application and formulation for different families of fluids, is an important exercise in fluid dynamics with applications in mechanical engineering, physics, chemistry, heat transfer, and electrical engineering. A proof explaining the properties and bounds of the equations, such as Navier–Stokes existence and smoothness, is one of the important unsolved problems in mathematics.
In mathematics, the spectral theory of ordinary differential equations is the part of spectral theory concerned with the determination of the spectrum and eigenfunction expansion associated with a linear ordinary differential equation. In his dissertation, Hermann Weyl generalized the classical Sturm–Liouville theory on a finite closed interval to second order differential operators with singularities at the endpoints of the interval, possibly semi-infinite or infinite. Unlike the classical case, the spectrum may no longer consist of just a countable set of eigenvalues, but may also contain a continuous part. In this case the eigenfunction expansion involves an integral over the continuous part with respect to a spectral measure, given by the Titchmarsh–Kodaira formula. The theory was put in its final simplified form for singular differential equations of even degree by Kodaira and others, using von Neumann's spectral theorem. It has had important applications in quantum mechanics, operator theory and harmonic analysis on semisimple Lie groups.
The derivatives of scalars, vectors, and second-order tensors with respect to second-order tensors are of considerable use in continuum mechanics. These derivatives are used in the theories of nonlinear elasticity and plasticity, particularly in the design of algorithms for numerical simulations.
Common integrals in quantum field theory are all variations and generalizations of Gaussian integrals to the complex plane and to multiple dimensions. Other integrals can be approximated by versions of the Gaussian integral. Fourier integrals are also considered.
Curvilinear coordinates can be formulated in tensor calculus, with important applications in physics and engineering, particularly for describing transportation of physical quantities and deformation of matter in fluid mechanics and continuum mechanics.
In continuum mechanics, objective stress rates are time derivatives of stress that do not depend on the frame of reference. Many constitutive equations are designed in the form of a relation between a stress-rate and a strain-rate. The mechanical response of a material should not depend on the frame of reference. In other words, material constitutive equations should be frame-indifferent (objective). If the stress and strain measures are material quantities then objectivity is automatically satisfied. However, if the quantities are spatial, then the objectivity of the stress-rate is not guaranteed even if the strain-rate is objective.
In machine learning, the kernel embedding of distributions comprises a class of nonparametric methods in which a probability distribution is represented as an element of a reproducing kernel Hilbert space (RKHS). A generalization of the individual data-point feature mapping done in classical kernel methods, the embedding of distributions into infinite-dimensional feature spaces can preserve all of the statistical features of arbitrary distributions, while allowing one to compare and manipulate distributions using Hilbert space operations such as inner products, distances, projections, linear transformations, and spectral analysis. This learning framework is very general and can be applied to distributions over any space on which a sensible kernel function may be defined. For example, various kernels have been proposed for learning from data which are: vectors in , discrete classes/categories, strings, graphs/networks, images, time series, manifolds, dynamical systems, and other structured objects. The theory behind kernel embeddings of distributions has been primarily developed by Alex Smola, Le Song , Arthur Gretton, and Bernhard Schölkopf. A review of recent works on kernel embedding of distributions can be found in.
In mathematics, calculus on Euclidean space is a generalization of calculus of functions in one or several variables to calculus of functions on Euclidean space as well as a finite-dimensional real vector space. This calculus is also known as advanced calculus, especially in the United States. It is similar to multivariable calculus but is somewhat more sophisticated in that it uses linear algebra more extensively and covers some concepts from differential geometry such as differential forms and Stokes' formula in terms of differential forms. This extensive use of linear algebra also allows a natural generalization of multivariable calculus to calculus on Banach spaces or topological vector spaces.