Differential of a function

Last updated January 19, 2023

In calculus, the differential represents the principal part of the change in a function $y=f(x)$ with respect to changes in the independent variable. The differential $dy$ is defined by

The precise meaning of the variables $dy$ and $dx$ depends on the context of the application and the required level of mathematical rigor. The domain of these variables may take on a particular geometrical significance if the differential is regarded as a particular differential form, or analytical significance if the differential is regarded as a linear approximation to the increment of a function. Traditionally, the variables $dx$ and $dy$ are considered to be very small (infinitesimal), and this interpretation is made rigorous in non-standard analysis.

History and usage

The differential was first introduced via an intuitive or heuristic definition by Isaac Newton and furthered by Gottfried Leibniz, who thought of the differential $dy$ as an infinitely small (or infinitesimal) change in the value $y$ of the function, corresponding to an infinitely small change $dx$ in the function's argument $x$ . For that reason, the instantaneous rate of change of $y$ with respect to $x$ , which is the value of the derivative of the function, is denoted by the fraction

{\frac {dy}{dx}}

in what is called the Leibniz notation for derivatives. The quotient $dy/dx$ is not infinitely small; rather it is a real number.

The use of infinitesimals in this form was widely criticized, for instance by the famous pamphlet The Analyst by Bishop Berkeley. Augustin-Louis Cauchy (1823) defined the differential without appeal to the atomism of Leibniz's infinitesimals.^[1]^[2] Instead, Cauchy, following d'Alembert, inverted the logical order of Leibniz and his successors: the derivative itself became the fundamental object, defined as a limit of difference quotients, and the differentials were then defined in terms of it. That is, one was free to define the differential $dy$ by an expression

dy=f'(x)\,dx

in which $dy$ and $dx$ are simply new variables taking finite real values,^[3] not fixed infinitesimals as they had been for Leibniz.^[4]

According to Boyer (1959 , p. 12), Cauchy's approach was a significant logical improvement over the infinitesimal approach of Leibniz because, instead of invoking the metaphysical notion of infinitesimals, the quantities $dy$ and $dx$ could now be manipulated in exactly the same manner as any other real quantities in a meaningful way. Cauchy's overall conceptual approach to differentials remains the standard one in modern analytical treatments,^[5] although the final word on rigor, a fully modern notion of the limit, was ultimately due to Karl Weierstrass.^[6]

In physical treatments, such as those applied to the theory of thermodynamics, the infinitesimal view still prevails. Courant & John (1999 , p. 184) reconcile the physical use of infinitesimal differentials with the mathematical impossibility of them as follows. The differentials represent finite non-zero values that are smaller than the degree of accuracy required for the particular purpose for which they are intended. Thus "physical infinitesimals" need not appeal to a corresponding mathematical infinitesimal in order to have a precise sense.

Following twentieth-century developments in mathematical analysis and differential geometry, it became clear that the notion of the differential of a function could be extended in a variety of ways. In real analysis, it is more desirable to deal directly with the differential as the principal part of the increment of a function. This leads directly to the notion that the differential of a function at a point is a linear functional of an increment $\Delta x$ . This approach allows the differential (as a linear map) to be developed for a variety of more sophisticated spaces, ultimately giving rise to such notions as the Fréchet or Gateaux derivative. Likewise, in differential geometry, the differential of a function at a point is a linear function of a tangent vector (an "infinitely small displacement"), which exhibits it as a kind of one-form: the exterior derivative of the function. In non-standard calculus, differentials are regarded as infinitesimals, which can themselves be put on a rigorous footing (see differential (infinitesimal)).

Definition

The differential is defined in modern treatments of differential calculus as follows.^[7] The differential of a function $f(x)$ of a single real variable $x$ is the function $df$ of two independent real variables $x$ and $\Delta x$ given by

df(x,\Delta x){\stackrel {\mathrm {def} }{=}}f'(x)\,\Delta x.

One or both of the arguments may be suppressed, i.e., one may see $df(x)$ or simply $df$ . If $y=f(x)$ , the differential may also be written as $dy$ . Since $dx(x,\Delta x)=\Delta x$ , it is conventional to write $dx=\Delta x$ so that the following equality holds:

df(x)=f'(x)\,dx

This notion of differential is broadly applicable when a linear approximation to a function is sought, in which the value of the increment $\Delta x$ is small enough. More precisely, if $f$ is a differentiable function at $x$ , then the difference in $y$ -values

\Delta y{\stackrel {\rm {def}}{=}}f(x+\Delta x)-f(x)

satisfies

\Delta y=f'(x)\,\Delta x+\varepsilon =df(x)+\varepsilon \,

where the error $\varepsilon$ in the approximation satisfies $\varepsilon /\Delta x\rightarrow 0$ as $\Delta x\rightarrow 0$ . In other words, one has the approximate identity

\Delta y\approx dy

in which the error can be made as small as desired relative to $\Delta x$ by constraining $\Delta x$ to be sufficiently small; that is to say,

{\frac {\Delta y-dy}{\Delta x}}\to 0

as $\Delta x\rightarrow 0$ . For this reason, the differential of a function is known as the principal (linear) part in the increment of a function: the differential is a linear function of the increment $\Delta x$ , and although the error $\varepsilon$ may be nonlinear, it tends to zero rapidly as $\Delta x$ tends to zero.

Differentials in several variables


Operator / Function	$f(x)$	$f(x,y,u(x,y),v(x,y))$
Differential	1: $df\,{\overset {\underset {\mathrm {def} }{}}{=}}\,f'_{x}\,dx$	2: $d_{x}f\,{\overset {\underset {\mathrm {def} }{}}{=}}\,f'_{x}\,dx$ 3: $df\,{\overset {\underset {\mathrm {def} }{}}{=}}\,f'_{x}dx+f'_{y}dy+f'_{u}du+f'_{v}dv$
Partial derivative	$f'_{x}\,{\overset {\underset {\mathrm {(1)} }{}}{=}}\,{\frac {df}{dx}}$	$f'_{x}\,{\overset {\underset {\mathrm {(2)} }{}}{=}}\,{\frac {d_{x}f}{dx}}={\frac {\partial f}{\partial x}}$
Total derivative	${\frac {df}{dx}}\,{\overset {\underset {\mathrm {(1)} }{}}{=}}\,f'_{x}$	${\frac {df}{dx}}\,{\overset {\underset {\mathrm {(3)} }{}}{=}}\,f'_{x}+f'_{u}{\frac {du}{dx}}+f'_{v}{\frac {dv}{dx}};(f'_{y}{\frac {dy}{dx}}=0)$

Following Goursat (1904 , I, §15), for functions of more than one independent variable,

y=f(x_{1},\dots ,x_{n}),

the partial differential of y with respect to any one of the variables x₁ is the principal part of the change in y resulting from a change dx₁ in that one variable. The partial differential is therefore

{\frac {\partial y}{\partial x_{1}}}dx_{1}

involving the partial derivative of y with respect to x₁. The sum of the partial differentials with respect to all of the independent variables is the total differential

dy={\frac {\partial y}{\partial x_{1}}}dx_{1}+\cdots +{\frac {\partial y}{\partial x_{n}}}dx_{n},

which is the principal part of the change in y resulting from changes in the independent variables x_i.

More precisely, in the context of multivariable calculus, following Courant (1937b), if f is a differentiable function, then by the definition of differentiability, the increment

{\begin{aligned}\Delta y&{}{\stackrel {\mathrm {def} }{=}}f(x_{1}+\Delta x_{1},\dots ,x_{n}+\Delta x_{n})-f(x_{1},\dots ,x_{n})\\&{}={\frac {\partial y}{\partial x_{1}}}\Delta x_{1}+\cdots +{\frac {\partial y}{\partial x_{n}}}\Delta x_{n}+\varepsilon _{1}\Delta x_{1}+\cdots +\varepsilon _{n}\Delta x_{n}\end{aligned}}

where the error terms ε_i tend to zero as the increments Δx_i jointly tend to zero. The total differential is then rigorously defined as

dy={\frac {\partial y}{\partial x_{1}}}\Delta x_{1}+\cdots +{\frac {\partial y}{\partial x_{n}}}\Delta x_{n}.

Since, with this definition,

dx_{i}(\Delta x_{1},\dots ,\Delta x_{n})=\Delta x_{i},

one has

dy={\frac {\partial y}{\partial x_{1}}}\,dx_{1}+\cdots +{\frac {\partial y}{\partial x_{n}}}\,dx_{n}.

As in the case of one variable, the approximate identity holds

dy\approx \Delta y

in which the total error can be made as small as desired relative to ${\sqrt {\Delta x_{1}^{2}+\cdots +\Delta x_{n}^{2}}}$ by confining attention to sufficiently small increments.

Application of the total differential to error estimation

In measurement, the total differential is used in estimating the error $\Delta f$ of a function $f$ based on the errors $\Delta x,\Delta y,\ldots$ of the parameters $x,y,\ldots$ . Assuming that the interval is short enough for the change to be approximately linear:

\Delta f(x)=f'(x)\Delta x

and that all variables are independent, then for all variables,

\Delta f=f_{x}\Delta x+f_{y}\Delta y+\cdots

This is because the derivative $f_{x}$ with respect to the particular parameter $x$ gives the sensitivity of the function $f$ to a change in $x$ , in particular the error $\Delta x$ . As they are assumed to be independent, the analysis describes the worst-case scenario. The absolute values of the component errors are used, because after simple computation, the derivative may have a negative sign. From this principle the error rules of summation, multiplication etc. are derived, e.g.:

Let

f(a,b)=ab

;

\Delta f=f_{a}\Delta a+f_{b}\Delta b

; evaluating the derivatives

Δf = bΔa + aΔb; dividing by f, which is a × b

Δf/f = Δa/a + Δb/b

That is to say, in multiplication, the total relative error is the sum of the relative errors of the parameters.

To illustrate how this depends on the function considered, consider the case where the function is $f(a,b)=a\ln b$ instead. Then, it can be computed that the error estimate is

Δf/f = Δa/a + Δb/(b ln b)

with an extra 'ln b' factor not found in the case of a simple product. This additional factor tends to make the error smaller, as ln b is not as large as a bare b.

Higher-order differentials

Higher-order differentials of a function y = f(x) of a single variable x can be defined via:^[8]

d^{2}y=d(dy)=d(f'(x)dx)=(df'(x))dx=f''(x)\,(dx)^{2},

and, in general,

d^{n}y=f^{(n)}(x)\,(dx)^{n}.

Informally, this motivates Leibniz's notation for higher-order derivatives

f^{(n)}(x)={\frac {d^{n}f}{dx^{n}}}.

When the independent variable x itself is permitted to depend on other variables, then the expression becomes more complicated, as it must include also higher order differentials in x itself. Thus, for instance,

{\begin{aligned}d^{2}y&=f''(x)\,(dx)^{2}+f'(x)d^{2}x\\d^{3}y&=f'''(x)\,(dx)^{3}+3f''(x)dx\,d^{2}x+f'(x)d^{3}x\end{aligned}}

and so forth.

Similar considerations apply to defining higher order differentials of functions of several variables. For example, if f is a function of two variables x and y, then

d^{n}f=\sum _{k=0}^{n}{\binom {n}{k}}{\frac {\partial ^{n}f}{\partial x^{k}\partial y^{n-k}}}(dx)^{k}(dy)^{n-k},

where ${\textstyle {\binom {n}{k}}}$ is a binomial coefficient. In more variables, an analogous expression holds, but with an appropriate multinomial expansion rather than binomial expansion.^[9]

Higher order differentials in several variables also become more complicated when the independent variables are themselves allowed to depend on other variables. For instance, for a function f of x and y which are allowed to depend on auxiliary variables, one has

d^{2}f=\left({\frac {\partial ^{2}f}{\partial x^{2}}}(dx)^{2}+2{\frac {\partial ^{2}f}{\partial x\partial y}}dx\,dy+{\frac {\partial ^{2}f}{\partial y^{2}}}(dy)^{2}\right)+{\frac {\partial f}{\partial x}}d^{2}x+{\frac {\partial f}{\partial y}}d^{2}y.

Because of this notational infelicity, the use of higher order differentials was roundly criticized by Hadamard 1935, who concluded:

Enfin, que signifie ou que représente l'égalité

d^{2}z=r\,dx^{2}+2s\,dx\,dy+t\,dy^{2}\,?

A mon avis, rien du tout.

That is: Finally, what is meant, or represented, by the equality [...]? In my opinion, nothing at all. In spite of this skepticism, higher order differentials did emerge as an important tool in analysis.^[10]

In these contexts, the nth order differential of the function f applied to an increment Δx is defined by

d^{n}f(x,\Delta x)=\left.{\frac {d^{n}}{dt^{n}}}f(x+t\Delta x)\right|_{t=0}

or an equivalent expression, such as

\lim _{t\to 0}{\frac {\Delta _{t\Delta x}^{n}f}{t^{n}}}

where $\Delta _{t\Delta x}^{n}f$ is an nth forward difference with increment tΔx.

This definition makes sense as well if f is a function of several variables (for simplicity taken here as a vector argument). Then the nth differential defined in this way is a homogeneous function of degree n in the vector increment Δx. Furthermore, the Taylor series of f at the point x is given by

f(x+\Delta x)\sim f(x)+df(x,\Delta x)+{\frac {1}{2}}d^{2}f(x,\Delta x)+\cdots +{\frac {1}{n!}}d^{n}f(x,\Delta x)+\cdots

The higher order Gateaux derivative generalizes these considerations to infinite dimensional spaces.

Properties

A number of properties of the differential follow in a straightforward manner from the corresponding properties of the derivative, partial derivative, and total derivative. These include:^[11]

Linearity: For constants a and b and differentiable functions f and g,

d(af+bg)=a\,df+b\,dg.

Product rule: For two differentiable functions f and g,

d(fg)=f\,dg+g\,df.

An operation d with these two properties is known in abstract algebra as a derivation. They imply the Power rule

d(f^{n})=nf^{n-1}df

In addition, various forms of the chain rule hold, in increasing level of generality:^[12]

If y = f(u) is a differentiable function of the variable u and u = g(x) is a differentiable function of x, then

dy=f'(u)\,du=f'(g(x))g'(x)\,dx.

If y = f(x₁, ..., x_n) and all of the variables x₁, ..., x_n depend on another variable t, then by the chain rule for partial derivatives, one has

{\begin{aligned}dy&={\frac {dy}{dt}}dt\\&={\frac {\partial y}{\partial x_{1}}}dx_{1}+\cdots +{\frac {\partial y}{\partial x_{n}}}dx_{n}\\&={\frac {\partial y}{\partial x_{1}}}{\frac {dx_{1}}{dt}}\,dt+\cdots +{\frac {\partial y}{\partial x_{n}}}{\frac {dx_{n}}{dt}}\,dt.\end{aligned}}

Heuristically, the chain rule for several variables can itself be understood by dividing through both sides of this equation by the infinitely small quantity dt.

More general analogous expressions hold, in which the intermediate variables x_i depend on more than one variable.

General formulation

A consistent notion of differential can be developed for a function f : Rⁿ → R^m between two Euclidean spaces. Let x,Δx ∈ Rⁿ be a pair of Euclidean vectors. The increment in the function f is

\Delta f=f(\mathbf {x} +\Delta \mathbf {x} )-f(\mathbf {x} ).

If there exists an m × n matrix A such that

\Delta f=A\Delta \mathbf {x} +\|\Delta \mathbf {x} \|{\boldsymbol {\varepsilon }}

in which the vector ε → 0 as Δx → 0, then f is by definition differentiable at the point x. The matrix A is sometimes known as the Jacobian matrix, and the linear transformation that associates to the increment Δx ∈ Rⁿ the vector AΔx ∈ R^m is, in this general setting, known as the differential df(x) of f at the point x. This is precisely the Fréchet derivative, and the same construction can be made to work for a function between any Banach spaces.

Another fruitful point of view is to define the differential directly as a kind of directional derivative:

df(\mathbf {x} ,\mathbf {h} )=\lim _{t\to 0}{\frac {f(\mathbf {x} +t\mathbf {h} )-f(\mathbf {x} )}{t}}=\left.{\frac {d}{dt}}f(\mathbf {x} +t\mathbf {h} )\right|_{t=0},

which is the approach already taken for defining higher order differentials (and is most nearly the definition set forth by Cauchy). If t represents time and x position, then h represents a velocity instead of a displacement as we have heretofore regarded it. This yields yet another refinement of the notion of differential: that it should be a linear function of a kinematic velocity. The set of all velocities through a given point of space is known as the tangent space, and so df gives a linear function on the tangent space: a differential form. With this interpretation, the differential of f is known as the exterior derivative, and has broad application in differential geometry because the notion of velocities and the tangent space makes sense on any differentiable manifold. If, in addition, the output value of f also represents a position (in a Euclidean space), then a dimensional analysis confirms that the output value of df must be a velocity. If one treats the differential in this manner, then it is known as the pushforward since it "pushes" velocities from a source space into velocities in a target space.

Other approaches

Although the notion of having an infinitesimal increment dx is not well-defined in modern mathematical analysis, a variety of techniques exist for defining the infinitesimal differential so that the differential of a function can be handled in a manner that does not clash with the Leibniz notation. These include:

Defining the differential as a kind of differential form, specifically the exterior derivative of a function. The infinitesimal increments are then identified with vectors in the tangent space at a point. This approach is popular in differential geometry and related fields, because it readily generalizes to mappings between differentiable manifolds.
Differentials as nilpotent elements of commutative rings. This approach is popular in algebraic geometry.^[13]
Differentials in smooth models of set theory. This approach is known as synthetic differential geometry or smooth infinitesimal analysis and is closely related to the algebraic geometric approach, except that ideas from topos theory are used to hide the mechanisms by which nilpotent infinitesimals are introduced.^[14]
Differentials as infinitesimals in hyperreal number systems, which are extensions of the real numbers which contain invertible infinitesimals and infinitely large numbers. This is the approach of nonstandard analysis pioneered by Abraham Robinson.^[15]

Examples and applications

Differentials may be effectively used in numerical analysis to study the propagation of experimental errors in a calculation, and thus the overall numerical stability of a problem ( Courant 1937a ). Suppose that the variable x represents the outcome of an experiment and y is the result of a numerical computation applied to x. The question is to what extent errors in the measurement of x influence the outcome of the computation of y. If the x is known to within Δx of its true value, then Taylor's theorem gives the following estimate on the error Δy in the computation of y:

\Delta y=f'(x)\Delta x+{\frac {(\Delta x)^{2}}{2}}f''(\xi )

where ξ = x + θΔx for some 0 < θ < 1. If Δx is small, then the second order term is negligible, so that Δy is, for practical purposes, well-approximated by dy = f'(x)Δx.

The differential is often useful to rewrite a differential equation

{\frac {dy}{dx}}=g(x)

in the form

dy=g(x)\,dx,

in particular when one wants to separate the variables.

Notes

↑ For a detailed historical account of the differential, see Boyer 1959, especially page 275 for Cauchy's contribution on the subject. An abbreviated account appears in Kline 1972 , Chapter 40.
↑ Cauchy explicitly denied the possibility of actual infinitesimal and infinite quantities ( Boyer 1959 , pp. 273–275), and took the radically different point of view that "a variable quantity becomes infinitely small when its numerical value decreases indefinitely in such a way as to converge to zero" (Cauchy 1823 , p. 12; translation from Boyer 1959 , p. 273).
↑ Boyer 1959 , p. 275
↑ Boyer 1959 , p. 12: "The differentials as thus defined are only new variables, and not fixed infinitesimals..."
↑ Courant 1937a , II, §9: "Here we remark merely in passing that it is possible to use this approximate representation of the increment $\Delta y$ by the linear expression $hf(x)$ to construct a logically satisfactory definition of a "differential", as was done by Cauchy in particular."
↑ Boyer 1959 , p. 284
↑ See, for instance, the influential treatises of Courant 1937a, Kline 1977, Goursat 1904, and Hardy 1908. Tertiary sources for this definition include also Tolstov 2001 and Itô 1993 , §106.
↑ Cauchy 1823. See also, for instance, Goursat 1904 , I, §14.
↑ Goursat 1904 , I, §14
↑ In particular to infinite dimensional holomorphy ( Hille & Phillips 1974 ) and numerical analysis via the calculus of finite differences.
↑ Goursat 1904 , I, §17
↑ Goursat 1904 , I, §§14,16
↑ Eisenbud & Harris 1998.
↑ See Kock 2006 and Moerdijk & Reyes 1991.
↑ See Robinson 1996 and Keisler 1986.

Related Research Articles

In calculus, the chain rule is a formula that expresses the derivative of the composition of two differentiable functions $f$ and $g$ in terms of the derivatives of $f$ and $g$ . More precisely, if $is the function such that for every x, then the chain rule is, in Lagrange's notation,$

<span class="mw-page-title-main">Cauchy–Riemann equations</span> Conditions required of holomorphic (complex differentiable) functions

In the field of complex analysis in mathematics, the Cauchy–Riemann equations, named after Augustin Cauchy and Bernhard Riemann, consist of a system of two partial differential equations which, together with certain continuity and differentiability criteria, form a necessary and sufficient condition for a complex function to be holomorphic. This system of equations first appeared in the work of Jean le Rond d'Alembert. Later, Leonhard Euler connected this system to the analytic functions. Cauchy then used these equations to construct his theory of functions. Riemann's dissertation on the theory of functions appeared in 1851.

In mathematics, the derivative of a function of a real variable measures the sensitivity to change of the function value with respect to a change in its argument. Derivatives are a fundamental tool of calculus. For example, the derivative of the position of a moving object with respect to time is the object's velocity: this measures how quickly the position of the object changes when time advances.

In mathematics and physics, Laplace's equation is a second-order partial differential equation named after Pierre-Simon Laplace, who first studied its properties. This is often written as

In mathematics, the Dirac delta distribution, also known as the unit impulse, is a generalized function or distribution over the real numbers, whose value is zero everywhere except at zero, and whose integral over the entire real line is equal to one.

In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample in the sample space can be interpreted as providing a relative likelihood that the value of the random variable would be equal to that sample. Probability density is the probability per unit length, in other words, while the absolute likelihood for a continuous random variable to take on any particular value is 0, the value of the PDF at two different samples can be used to infer, in any particular draw of the random variable, how much more likely it is that the random variable would be close to one sample compared to the other sample.

In mathematics, differential calculus is a subfield of calculus that studies the rates at which quantities change. It is one of the two traditional divisions of calculus, the other being integral calculus—the study of the area beneath a curve.

In vector calculus, Green's theorem relates a line integral around a simple closed curve $C$ to a double integral over the plane region $D$ bounded by $C$ . It is the two-dimensional special case of Stokes' theorem.

In calculus, the product rule is a formula used to find the derivatives of products of two or more functions. For two functions, it may be stated in Lagrange's notation as

In the calculus of variations, a field of mathematical analysis, the functional derivative relates a change in a functional to a change in a function on which the functional depends.

In mathematics, the Hodge star operator or Hodge star is a linear map defined on the exterior algebra of a finite-dimensional oriented vector space endowed with a nondegenerate symmetric bilinear form. Applying the operator to an element of the algebra produces the Hodge dual of the element. This map was introduced by W. V. D. Hodge.

In calculus, Leibniz's notation, named in honor of the 17th-century German philosopher and mathematician Gottfried Wilhelm Leibniz, uses the symbols $dx$ and $dy$ to represent infinitely small increments of $x$ and $y$ , respectively, just as $Δ x$ and $Δ y$ represent finite increments of $x$ and $y$ , respectively.

In mathematics, the symmetry of second derivatives refers to the possibility of interchanging the order of taking partial derivatives of a function

In mathematics, the directional derivative of a multivariable differentiable (scalar) function along a given vector v at a given point x intuitively represents the instantaneous rate of change of the function, moving through x with a velocity specified by v.

In mathematics, differential refers to several related notions derived from the early days of calculus, put on a rigorous footing, such as infinitesimal differences and the derivatives of functions.

In mathematics, the total derivative of a function $f$ at a point is the best linear approximation near this point of the function with respect to its arguments. Unlike partial derivatives, the total derivative approximates the function with respect to all of its arguments, not just a single one. In many situations, this is the same as considering all partial derivatives simultaneously. The term "total derivative" is primarily used when $f$ is a function of several variables, because when $f$ is a function of a single variable, the total derivative is the same as the ordinary derivative of the function.

In calculus, the Leibniz integral rule for differentiation under the integral sign, named after Gottfried Leibniz, states that for an integral of the form

In calculus, the second derivative, or the second order derivative, of a function $f$ is the derivative of the derivative of $f$ . Roughly speaking, the second derivative measures how the rate of change of a quantity is itself changing; for example, the second derivative of the position of an object with respect to time is the instantaneous acceleration of the object, or the rate at which the velocity of the object is changing with respect to time. In Leibniz notation:

The triple product rule, known variously as the cyclic chain rule, cyclic relation, cyclical rule or Euler's chain rule, is a formula which relates partial derivatives of three interdependent variables. The rule finds application in thermodynamics, where frequently three variables can be related by a function of the form f(x, y, z) = 0, so each variable is given as an implicit function of the other two variables. For example, an equation of state for a fluid relates temperature, pressure, and volume in this manner. The triple product rule for such interrelated variables x, y, and z comes from using a reciprocity relation on the result of the implicit function theorem, and is given by

In differential calculus, there is no single uniform notation for differentiation. Instead, various notations for the derivative of a function or variable have been proposed by various mathematicians. The usefulness of each notation varies with the context, and it is sometimes advantageous to use more than one notation in a given context. The most common notations for differentiation are listed below.

References

Boyer, Carl B. (1959), The history of the calculus and its conceptual development, New York: Dover Publications, MR 0124178 .
Cauchy, Augustin-Louis (1823), Résumé des Leçons données à l'Ecole royale polytechnique sur les applications du calcul infinitésimal, archived from the original on 2007-07-08, retrieved 2009-08-19.
Courant, Richard (1937a), Differential and integral calculus. Vol. I, Wiley Classics Library, New York: John Wiley & Sons (published 1988), ISBN 978-0-471-60842-4, MR 1009558 .
Courant, Richard (1937b), Differential and integral calculus. Vol. II, Wiley Classics Library, New York: John Wiley & Sons (published 1988), ISBN 978-0-471-60840-0, MR 1009559 .
Courant, Richard; John, Fritz (1999), Introduction to Calculus and Analysis Volume 1, Classics in Mathematics, Berlin, New York: Springer-Verlag, ISBN 3-540-65058-X, MR 1746554
Eisenbud, David; Harris, Joe (1998), The Geometry of Schemes, Springer-Verlag, ISBN 0-387-98637-5 .
Fréchet, Maurice (1925), "La notion de différentielle dans l'analyse générale", Annales Scientifiques de l'École Normale Supérieure, Série 3, 42: 293–323, doi:10.24033/asens.766, ISSN 0012-9593, MR 1509268 .
Goursat, Édouard (1904), A course in mathematical analysis: Vol 1: Derivatives and differentials, definite integrals, expansion in series, applications to geometry, E. R. Hedrick, New York: Dover Publications (published 1959), MR 0106155 .
Hadamard, Jacques (1935), "La notion de différentiel dans l'enseignement", Mathematical Gazette, XIX (236): 341–342, doi:10.2307/3606323, JSTOR 3606323 .
Hardy, Godfrey Harold (1908), A Course of Pure Mathematics, Cambridge University Press, ISBN 978-0-521-09227-2 .
Hille, Einar; Phillips, Ralph S. (1974), Functional analysis and semi-groups, Providence, R.I.: American Mathematical Society, MR 0423094 .
Itô, Kiyosi (1993), Encyclopedic Dictionary of Mathematics (2nd ed.), MIT Press, ISBN 978-0-262-59020-4 .
Kline, Morris (1977), "Chapter 13: Differentials and the law of the mean", Calculus: An intuitive and physical approach, John Wiley and Sons.
Kline, Morris (1972), Mathematical thought from ancient to modern times (3rd ed.), Oxford University Press (published 1990), ISBN 978-0-19-506136-9
Keisler, H. Jerome (1986), Elementary Calculus: An Infinitesimal Approach (2nd ed.).
Kock, Anders (2006), Synthetic Differential Geometry (PDF) (2nd ed.), Cambridge University Press.
Moerdijk, I.; Reyes, G.E. (1991), Models for Smooth Infinitesimal Analysis, Springer-Verlag.
Robinson, Abraham (1996), Non-standard analysis, Princeton University Press, ISBN 978-0-691-04490-3 .
Tolstov, G.P. (2001) [1994], "Differential", Encyclopedia of Mathematics , EMS Press .

External links

Differential Of A Function at Wolfram Demonstrations Project

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] For a detailed historical account of the differential, see Boyer 1959, especially page 275 for Cauchy's contribution on the subject. An abbreviated account appears in Kline 1972 , Chapter 40.

[2] Cauchy explicitly denied the possibility of actual infinitesimal and infinite quantities ( Boyer 1959 , pp. 273–275), and took the radically different point of view that "a variable quantity becomes infinitely small when its numerical value decreases indefinitely in such a way as to converge to zero" (Cauchy 1823 , p. 12; translation from Boyer 1959 , p. 273).

[3] Boyer 1959 , p. 275

[4] Boyer 1959 , p. 12: "The differentials as thus defined are only new variables, and not fixed infinitesimals..."

[5] Courant 1937a , II, §9: "Here we remark merely in passing that it is possible to use this approximate representation of the increment $\Delta y$ by the linear expression $hf(x)$ to construct a logically satisfactory definition of a "differential", as was done by Cauchy in particular."

[6] Boyer 1959 , p. 284

[7] See, for instance, the influential treatises of Courant 1937a, Kline 1977, Goursat 1904, and Hardy 1908. Tertiary sources for this definition include also Tolstov 2001 and Itô 1993 , §106.

[8] Cauchy 1823. See also, for instance, Goursat 1904 , I, §14.

[9] Goursat 1904 , I, §14

[10] In particular to infinite dimensional holomorphy ( Hille & Phillips 1974 ) and numerical analysis via the calculus of finite differences.

[11] Goursat 1904 , I, §17

[12] Goursat 1904 , I, §§14,16

[13] Eisenbud & Harris 1998.

[14] See Kock 2006 and Moerdijk & Reyes 1991.

[nonstd-15] See Robinson 1996 and Keisler 1986.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]