Ekeland's variational principle

Last updated

In mathematical analysis, Ekeland's variational principle, discovered by Ivar Ekeland, [1] [2] [3] is a theorem that asserts that there exist nearly optimal solutions to some optimization problems.

Contents

Ekeland's principle can be used when the lower level set of a minimization problems is not compact, so that the Bolzano–Weierstrass theorem cannot be applied. The principle relies on the completeness of the metric space. [4]

The principle has been shown to be equivalent to completeness of metric spaces. [5] In proof theory, it is equivalent to Π1
1
CA0 over RCA0
, i.e. relatively strong.

It also leads to a quick proof of the Caristi fixed point theorem. [4] [6]

History

Ekeland was associated with the Paris Dauphine University when he proposed this theorem. [1]

Ekeland's variational principle

Preliminary definitions

A function valued in the extended real numbers is said to be bounded below if and it is called proper if it has a non-empty effective domain , which by definition is the set

and it is never equal to In other words, a map is proper if is valued in and not identically The map is proper and bounded below if and only if or equivalently, if and only if

A function is lower semicontinuous at a given if for every real there exists a neighborhood of such that for all A function is called lower semicontinuous if it is lower semicontinuous at every point of which happens if and only if is an open set for every or equivalently, if and only if all of its lower level sets are closed.

Statement of the theorem

Ekeland's variational principle [7]   Let be a complete metric space and let be a proper lower semicontinuous function that is bounded below (so ). Pick such that (or equivalently, ) and fix any real There exists some such that

and for every other than (that is, ),

Proof

Define a function by

which is lower semicontinuous because it is the sum of the lower semicontinuous function and the continuous function Given denote the functions with one coordinate fixed at by

and define the set

which is not empty since An element satisfies the conclusion of this theorem if and only if It remains to find such an element.

It may be verified that for every

  1. is closed (because is lower semicontinuous);
  2. if then
  3. if then in particular,
  4. if then

Let which is a real number because was assumed to be bounded below. Pick such that Having defined and let

and pick such that For any guarantees that and which in turn implies and thus also

So if then and which guarantee

It follows that for all positive integers

which proves that is a Cauchy sequence. Because is a complete metric space, there exists some such that converges to For any since is a closed set that contain the sequence it must also contain this sequence's limit, which is thus and in particular,

The theorem will follow once it is shown that So let and it remains to show Because for all it follows as above that which implies that converges to Because also converges to and limits in metric spaces are unique, Q.E.D.

For example, if and are as in the theorem's statement and if happens to be a global minimum point of then the vector from the theorem's conclusion is

Corollaries

Corollary [8]   Let be a complete metric space, and let be a lower semicontinuous functional on that is bounded below and not identically equal to Fix and a point such that

Then, for every there exists a point such that

and, for all

The principle could be thought of as follows: For any point which nearly realizes the infimum, there exists another point , which is at least as good as , it is close to and the perturbed function, , has unique minimum at . A good compromise is to take in the preceding result. [8]

See also

Related Research Articles

In mathematics, the infimum of a subset of a partially ordered set is the greatest element in that is less than or equal to each element of if such an element exists. In other words, it is the greatest element of that is lower or equal to the lowest element of . Consequently, the term greatest lower bound is also commonly used. The supremum of a subset of a partially ordered set is the least element in that is greater than or equal to each element of if such an element exists. In other words, it is the least element of that is greater than or equal to the greatest element of . Consequently, the supremum is also referred to as the least upper bound.

<span class="mw-page-title-main">Limit inferior and limit superior</span> Bounds of a sequence

In mathematics, the limit inferior and limit superior of a sequence can be thought of as limiting bounds on the sequence. They can be thought of in a similar fashion for a function. For a set, they are the infimum and supremum of the set's limit points, respectively. In general, when there are multiple objects around which a sequence, function, or set accumulates, the inferior and superior limits extract the smallest and largest of them; the type of object and the measure of size is context-dependent, but the notion of extreme limits is invariant. Limit inferior is also called infimum limit, limit infimum, liminf, inferior limit, lower limit, or inner limit; limit superior is also known as supremum limit, limit supremum, limsup, superior limit, upper limit, or outer limit.

In mathematics, the Lp spaces are function spaces defined using a natural generalization of the p-norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue, although according to the Bourbaki group they were first introduced by Frigyes Riesz.

In probability theory, there exist several different notions of convergence of sequences of random variables. The different notions of convergence capture different properties about the sequence, with some notions of convergence being stronger than others. For example, convergence in distribution tells us about the limit distribution of a sequence of random variables. This is a weaker notion than convergence in probability, which tells us about the value a random variable will take, rather than just the distribution.

<span class="mw-page-title-main">Semi-continuity</span> Property of functions which is weaker than continuity

In mathematical analysis, semicontinuity is a property of extended real-valued functions that is weaker than continuity. An extended real-valued function is uppersemicontinuous at a point if, roughly speaking, the function values for arguments near are not much higher than

In mathematics, an infinite series of numbers is said to converge absolutely if the sum of the absolute values of the summands is finite. More precisely, a real or complex series is said to converge absolutely if for some real number Similarly, an improper integral of a function, is said to converge absolutely if the integral of the absolute value of the integrand is finite—that is, if

In mathematics, the uniform boundedness principle or Banach–Steinhaus theorem is one of the fundamental results in functional analysis. Together with the Hahn–Banach theorem and the open mapping theorem, it is considered one of the cornerstones of the field. In its basic form, it asserts that for a family of continuous linear operators whose domain is a Banach space, pointwise boundedness is equivalent to uniform boundedness in operator norm.

<span class="mw-page-title-main">Limit of a sequence</span> Value to which tends an infinite sequence

In mathematics, the limit of a sequence is the value that the terms of a sequence "tend to", and is often denoted using the symbol. If such a limit exists, the sequence is called convergent. A sequence that does not converge is said to be divergent. The limit of a sequence is said to be the fundamental notion on which the whole of mathematical analysis ultimately rests.

In mathematics, Fatou's lemma establishes an inequality relating the Lebesgue integral of the limit inferior of a sequence of functions to the limit inferior of integrals of these functions. The lemma is named after Pierre Fatou.

Vapnik–Chervonenkis theory was developed during 1960–1990 by Vladimir Vapnik and Alexey Chervonenkis. The theory is a form of computational learning theory, which attempts to explain the learning process from a statistical point of view.

In mathematics, the Cauchy principal value, named after Augustin Louis Cauchy, is a method for assigning values to certain improper integrals which would otherwise be undefined. In this method, a singularity on an integral interval is avoided by limiting the integral interval to the non singular domain.

<span class="mw-page-title-main">Integral test for convergence</span> Test for infinite series of monotonous terms for convergence

In mathematics, the integral test for convergence is a method used to test infinite series of monotonous terms for convergence. It was developed by Colin Maclaurin and Augustin-Louis Cauchy and is sometimes known as the Maclaurin–Cauchy test.

In functional analysis and related branches of mathematics, the Banach–Alaoglu theorem states that the closed unit ball of the dual space of a normed vector space is compact in the weak* topology. A common proof identifies the unit ball with the weak-* topology as a closed subset of a product of compact sets with the product topology. As a consequence of Tychonoff's theorem, this product, and hence the unit ball within, is compact.

In mathematics, the Riemann–Lebesgue lemma, named after Bernhard Riemann and Henri Lebesgue, states that the Fourier transform or Laplace transform of an L1 function vanishes at infinity. It is of importance in harmonic analysis and asymptotic analysis.

In linear algebra, a sublinear function, also called a quasi-seminorm or a Banach functional, on a vector space is a real-valued function with only some of the properties of a seminorm. Unlike seminorms, a sublinear function does not have to be nonnegative-valued and also does not have to be absolutely homogeneous. Seminorms are themselves abstractions of the more well known notion of norms, where a seminorm has all the defining properties of a norm except that it is not required to map non-zero vectors to non-zero values.

In the mathematical theory of probability, a Doob martingale is a stochastic process that approximates a given random variable and has the martingale property with respect to the given filtration. It may be thought of as the evolving sequence of best approximations to the random variable based on information accumulated up to a certain time.

In probability theory and theoretical computer science, McDiarmid's inequality is a concentration inequality which bounds the deviation between the sampled value and the expected value of certain functions when they are evaluated on independent random variables. McDiarmid's inequality applies to functions that satisfy a bounded differences property, meaning that replacing a single argument to the function while leaving all other arguments unchanged cannot cause too large of a change in the value of the function.

In mathematics, Schilder's theorem is a generalization of the Laplace method from integrals on to functional Wiener integration. The theorem is used in the large deviations theory of stochastic processes. Roughly speaking, out of Schilder's theorem one gets an estimate for the probability that a (scaled-down) sample path of Brownian motion will stray far from the mean path. This statement is made precise using rate functions. Schilder's theorem is generalized by the Freidlin–Wentzell theorem for Itō diffusions.

This article is supplemental for “Convergence of random variables” and provides proofs for selected results.

In functional analysis and related areas of mathematics, a complete topological vector space is a topological vector space (TVS) with the property that whenever points get progressively closer to each other, then there exists some point towards which they all get closer. The notion of "points that get progressively closer" is made rigorous by Cauchy nets or Cauchy filters, which are generalizations of Cauchy sequences, while "point towards which they all get closer" means that this Cauchy net or filter converges to The notion of completeness for TVSs uses the theory of uniform spaces as a framework to generalize the notion of completeness for metric spaces. But unlike metric-completeness, TVS-completeness does not depend on any metric and is defined for all TVSs, including those that are not metrizable or Hausdorff.

References

    1. 1 2 Ekeland, Ivar (1974). "On the variational principle". J. Math. Anal. Appl. 47 (2): 324–353. doi: 10.1016/0022-247X(74)90025-0 . ISSN   0022-247X.
    2. Ekeland, Ivar (1979). "Nonconvex minimization problems". Bulletin of the American Mathematical Society. New Series. 1 (3): 443–474. doi: 10.1090/S0273-0979-1979-14595-6 . MR   0526967.
    3. Ekeland, Ivar; Temam, Roger (1999). Convex analysis and variational problems. Classics in applied mathematics. Vol. 28 (Corrected reprinting of the (1976) North-Holland ed.). Philadelphia, PA: Society for Industrial and Applied Mathematics (SIAM). pp. 357–373. ISBN   0-89871-450-8. MR   1727362.
    4. 1 2 Kirk, William A.; Goebel, Kazimierz (1990). Topics in Metric Fixed Point Theory. Cambridge University Press. ISBN   0-521-38289-0.
    5. Sullivan, Francis (October 1981). "A characterization of complete metric spaces". Proceedings of the American Mathematical Society. 83 (2): 345–346. doi: 10.1090/S0002-9939-1981-0624927-9 . MR   0624927.
    6. Ok, Efe (2007). "D: Continuity I". Real Analysis with Economic Applications (PDF). Princeton University Press. p. 664. ISBN   978-0-691-11768-3 . Retrieved January 31, 2009.
    7. Zalinescu 2002, p. 29.
    8. 1 2 Zalinescu 2002, p. 30.

    Bibliography