Characterizations of the exponential function

Last updated

In mathematics, the exponential function can be characterized in many ways. This article presents some common characterizations, discusses why each makes sense, and proves that they are all equivalent.

Contents

The exponential function occurs naturally in many branches of mathematics. Walter Rudin called it "the most important function in mathematics". [1] It is therefore useful to have multiple ways to define (or characterize) it. Each of the characterizations below may be more or less useful depending on context. The "product limit" characterization of the exponential function was discovered by Leonhard Euler. [2]

Characterizations

The six most common definitions of the exponential function for real values are as follows.

  1. Product limit. Define by the limit:
  2. Power series. Define ex as the value of the infinite series (Here n! denotes the factorial of n. One proof that e is irrational uses a special case of this formula.)
  3. Inverse of logarithm integral. Define to be the unique number y > 0 such that That is, is the inverse of the natural logarithm function , which is defined by this integral.
  4. Differential equation. Define to be the unique solution to the differential equation with initial value: where denotes the derivative of y.
  5. Functional equation. The exponential function is the unique function f with the multiplicative property for all and . The condition can be replaced with together with any of the following regularity conditions:
    For the uniqueness, one must impose some regularity condition, since other functions satisfying can be constructed using a basis for the real numbers over the rationals, as described by Hewitt and Stromberg.
  6. Elementary definition by powers. Define the exponential function with base to be the continuous function whose value on integers is given by repeated multiplication or division of , and whose value on rational numbers is given by . Then define to be the exponential function whose base is the unique positive real number satisfying:

Larger domains

One way of defining the exponential function over the complex numbers is to first define it for the domain of real numbers using one of the above characterizations, and then extend it as an analytic function, which is characterized by its values on any infinite domain set.

Also, characterisations (1), (2), and (4) for apply directly for a complex number. Definition (3) presents a problem because there are non-equivalent paths along which one could integrate; but the equation of (3) should hold for any such path modulo . As for definition (5), the additive property together with the complex derivative are sufficient to guarantee . However, the initial value condition together with the other regularity conditions are not sufficient. For example, for real x and y, the functionsatisfies the three listed regularity conditions in (5) but is not equal to . A sufficient condition is that and that is a conformal map at some point; or else the two initial values and together with the other regularity conditions.

One may also define the exponential on other domains, such as matrices and other algebras. Definitions (1), (2), and (4) all make sense for arbitrary Banach algebras.

Proof that each characterization makes sense

Some of these definitions require justification to demonstrate that they are well-defined. For example, when the value of the function is defined as the result of a limiting process (i.e. an infinite sequence or series), it must be demonstrated that such a limit always exists.

Characterization 1

The error of the product limit expression is described by: where the polynomial's degree (in x) in the term with denominator nk is 2k.

Characterization 2

Since it follows from the ratio test that converges for all x.

Characterization 3

Since the integrand is an integrable function of t, the integral expression is well-defined. It must be shown that the function from to defined by is a bijection. Since 1/t is positive for positive t, this function is strictly increasing, hence injective. If the two integrals hold, then it is surjective as well. Indeed, these integrals do hold; they follow from the integral test and the divergence of the harmonic series.

Characterization 6

The definition depends on the unique positive real number satisfying: This limit can be shown to exist for any , and it defines a continuous increasing function with and , so the Intermediate value theorem guarantees the existence of such a value .

Equivalence of the characterizations

The following arguments demonstrate the equivalence of the above characterizations for the exponential function.

Characterization 1 ⇔ characterization 2

The following argument is adapted from Rudin, theorem 3.31, p. 63–65.

Let be a fixed non-negative real number. Define

By the binomial theorem, (using x  0 to obtain the final inequality) so that: One must use lim sup because it is not known if tn converges.

For the other inequality, by the above expression for tn, if 2  mn, we have:

Fix m, and let n approach infinity. Then (again, one must use lim inf because it is not known if tn converges). Now, take the above inequality, let m approach infinity, and put it together with the other inequality to obtain: so that

This equivalence can be extended to the negative real numbers by noting and taking the limit as n goes to infinity.

Characterization 1 ⇔ characterization 3

Here, the natural logarithm function is defined in terms of a definite integral as above. By the first part of fundamental theorem of calculus,

Besides,

Now, let x be any fixed real number, and let

Ln(y) = x, which implies that y = ex, where ex is in the sense of definition 3. We have

Here, the continuity of ln(y) is used, which follows from the continuity of 1/t:

Here, the result lnan = nlna has been used. This result can be established for n a natural number by induction, or using integration by substitution. (The extension to real powers must wait until ln and exp have been established as inverses of each other, so that ab can be defined for real b as eb lna.)

Characterization 1 ⇔ characterization 4

Let denote the solution to the initial value problem . Applying the simplest form of Euler's method with increment and sample points gives the recursive formula:

This recursion is immediately solved to give the approximate value , and since Euler's Method is known to converge to the exact solution, we have:

Characterization 2 ⇔ characterization 4

Let n be a non-negative integer. In the sense of definition 4 and by induction, .

Therefore

Using Taylor series, This shows that definition 4 implies definition 2.

In the sense of definition 2,

Besides, This shows that definition 2 implies definition 4.

Characterization 2 ⇒ characterization 5

In the sense of definition 2, the equation follows from the term-by-term manipulation of power series justified by uniform convergence, and the resulting equality of coefficients is just the Binomial theorem. Furthermore: [3]

Characterization 3 ⇔ characterization 4

Characterisation 3 first defines the natural logarithm:then as the inverse function with . Then by the Chain rule:

i.e. . Finally, , so . That is, is the unique solution of the initial value problem , of characterization 4. Conversely, assume has and , and define as its inverse function with and . Then:

i.e. . By the Fundamental theorem of calculus,

Characterization 5 ⇒ characterization 4

The conditions f'(0) = 1 and f(x + y) = f(x) f(y) imply both conditions in characterization 4. Indeed, one gets the initial condition f(0) = 1 by dividing both sides of the equation by f(0), and the condition that f′(x) = f(x) follows from the condition that f′(0) = 1 and the definition of the derivative as follows:

Characterization 5 ⇒ characterization 4

Assum characterization 5, the multiplicative property together with the initial condition imply that:

Characterization 5 ⇔ characterization 6

By inductively applying the multiplication rule, we get: and thus for . Then the condition means that , so by definition.

Also, any of the regularity conditions of definition 5 imply that is continuous at all real (see below). The converse is similar.

Characterization 5 ⇒ characterization 6

Let be a Lebesgue-integrable non-zero function satisfying the mulitiplicative property with . Following Hewitt and Stromberg, exercise 18.46, we will prove that Lebesgue-integrability implies continuity. This is sufficient to imply according to characterization 6, arguing as above.

First, a few elementary properties:

  1. If is nonzero anywhere (say at ), then it is non-zero everywhere. Proof: implies .
  2. . Proof: and is non-zero.
  3. . Proof: .
  4. If is continuous anywhere (say at ), then it is continuous everywhere. Proof: as by continuity at .

The second and third properties mean that it is sufficient to prove for positive x.

Since is a Lebesgue-integrable function, then we may define . It then follows that

Since is nonzero, some y can be chosen such that and solve for in the above expression. Therefore:

The final expression must go to zero as since and is continuous. It follows that is continuous.

Related Research Articles

<span class="mw-page-title-main">Exponential function</span> Mathematical function, denoted exp(x) or e^x

The exponential function is a mathematical function denoted by or . Unless otherwise specified, the term generally refers to the positive-valued function of a real variable, although it can be extended to the complex numbers or generalized to other mathematical objects like matrices or Lie algebras. The exponential function originated from the operation of taking powers of a number, but various modern definitions allow it to be rigorously extended to all real arguments , including irrational numbers. Its ubiquity in pure and applied mathematics led mathematician Walter Rudin to consider the exponential function to be "the most important function in mathematics".

<span class="mw-page-title-main">Natural logarithm</span> Logarithm to the base of the mathematical constant e

The natural logarithm of a number is its logarithm to the base of the mathematical constant e, which is an irrational and transcendental number approximately equal to 2.718281828459. The natural logarithm of x is generally written as ln x, logex, or sometimes, if the base e is implicit, simply log x. Parentheses are sometimes added for clarity, giving ln(x), loge(x), or log(x). This is done particularly when the argument to the logarithm is not a single symbol, so as to prevent ambiguity.

<span class="mw-page-title-main">Dirac delta function</span> Generalized function whose value is zero everywhere except at zero

In mathematical analysis, the Dirac delta function, also known as the unit impulse, is a generalized function on the real numbers, whose value is zero everywhere except at zero, and whose integral over the entire real line is equal to one. Thus it can be represented heuristically as

A finite difference is a mathematical expression of the form f (x + b) − f (x + a). If a finite difference is divided by ba, one gets a difference quotient. The approximation of derivatives by finite differences plays a central role in finite difference methods for the numerical solution of differential equations, especially boundary value problems.

<span class="mw-page-title-main">Exponential distribution</span> Probability distribution

In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average rate; the distance parameter could be any meaningful mono-dimensional measure of the process, such as time between production errors, or length along a roll of fabric in the weaving manufacturing process. It is a particular case of the gamma distribution. It is the continuous analogue of the geometric distribution, and it has the key property of being memoryless. In addition to being used for the analysis of Poisson point processes it is found in various other contexts.

<span class="mw-page-title-main">Euler's constant</span> Constant value used in mathematics

Euler's constant is a mathematical constant, usually denoted by the lowercase Greek letter gamma, defined as the limiting difference between the harmonic series and the natural logarithm, denoted here by log:

In mathematics, a Gaussian function, often simply referred to as a Gaussian, is a function of the base form and with parametric extension for arbitrary real constants a, b and non-zero c. It is named after the mathematician Carl Friedrich Gauss. The graph of a Gaussian is a characteristic symmetric "bell curve" shape. The parameter a is the height of the curve's peak, b is the position of the center of the peak, and c controls the width of the "bell".

<span class="mw-page-title-main">Product rule</span> Formula for the derivative of a product

In calculus, the product rule is a formula used to find the derivatives of products of two or more functions. For two functions, it may be stated in Lagrange's notation as or in Leibniz's notation as

<span class="mw-page-title-main">Digamma function</span> Mathematical function

In mathematics, the digamma function is defined as the logarithmic derivative of the gamma function:

<span class="mw-page-title-main">Tetration</span> Arithmetic operation

In mathematics, tetration is an operation based on iterated, or repeated, exponentiation. There is no standard notation for tetration, though Knuth's up arrow notation and the left-exponent xb are common.

<span class="mw-page-title-main">Propagator</span> Function in quantum field theory showing probability amplitudes of moving particles

In quantum mechanics and quantum field theory, the propagator is a function that specifies the probability amplitude for a particle to travel from one place to another in a given period of time, or to travel with a certain energy and momentum. In Feynman diagrams, which serve to calculate the rate of collisions in quantum field theory, virtual particles contribute their propagator to the rate of the scattering event described by the respective diagram. Propagators may also be viewed as the inverse of the wave operator appropriate to the particle, and are, therefore, often called (causal) Green's functions.

In mathematical statistics, the Kullback–Leibler (KL) divergence, denoted , is a type of statistical distance: a measure of how one reference probability distribution P is different from a second probability distribution Q. Mathematically, it is defined as

<span class="mw-page-title-main">Rectangular function</span> Function whose graph is 0, then 1, then 0 again, in an almost-everywhere continuous way

The rectangular function is defined as

In mathematics, the K-function, typically denoted K(z), is a generalization of the hyperfactorial to complex numbers, similar to the generalization of the factorial to the gamma function.

In mathematics, the explicit formulae for L-functions are relations between sums over the complex number zeroes of an L-function and sums over prime powers, introduced by Riemann (1859) for the Riemann zeta function. Such explicit formulae have been applied also to questions on bounding the discriminant of an algebraic number field, and the conductor of a number field.

In applied mathematics, polyharmonic splines are used for function approximation and data interpolation. They are very useful for interpolating and fitting scattered data in many dimensions. Special cases include thin plate splines and natural cubic splines in one dimension.

A product integral is any product-based counterpart of the usual sum-based integral of calculus. The product integral was developed by the mathematician Vito Volterra in 1887 to solve systems of linear differential equations.

In discrete calculus the indefinite sum operator, denoted by or , is the linear operator, inverse of the forward difference operator . It relates to the forward difference operator as the indefinite integral relates to the derivative. Thus

In probability theory and statistics, the generalized multivariate log-gamma (G-MVLG) distribution is a multivariate distribution introduced by Demirhan and Hamurkaroglu in 2011. The G-MVLG is a flexible distribution. Skewness and kurtosis are well controlled by the parameters of the distribution. This enables one to control dispersion of the distribution. Because of this property, the distribution is effectively used as a joint prior distribution in Bayesian analysis, especially when the likelihood is not from the location-scale family of distributions such as normal distribution.

References

  1. Walter Rudin (1987). Real and complex analysis (3rd ed.). New York: McGraw-Hill. p. 1. ISBN   978-0-07-054234-1.
  2. Eli Maor. e: the Story of a Number. p. 156.
  3. "Herman Yeung - Calculus - First Principle find d/Dx(e^x) 基本原理求 d/Dx(e^x)". YouTube .