Young's inequality for products

Figure (Young.png): The area of the rectangle with sides $a$ and $b$ cannot be larger than the sum of the areas under the graphs of $f$ (red) and $f^{-1}$ (yellow).

In mathematics, Young's inequality for products is an inequality that bounds the product of two nonnegative numbers by a sum of powers of those numbers. [1] The inequality is named after William Henry Young and should not be confused with Young's convolution inequality.

Young's inequality for products can be used to prove Hölder's inequality. It is also widely used to estimate the norm of nonlinear terms in PDE theory, since it allows one to estimate a product of two terms by a sum of the same terms raised to a power and scaled.

Standard version for conjugate Hölder exponents

The standard form of the inequality is the following:

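Theorem  If $a \ge 0$ and $b \ge 0$ are nonnegative real numbers and if $p > 1$ and $q > 1$ are real numbers such that $\tfrac{1}{p} + \tfrac{1}{q} = 1$, then

$$ab \le \frac{a^p}{p} + \frac{b^q}{q}.$$

Equality holds if and only if $a^p = b^q$.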
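
As a quick numerical sanity check of the standard form $ab \le a^p/p + b^q/q$ with $1/p + 1/q = 1$, the following sketch evaluates the gap between the two sides on random inputs. The helper names `conjugate` and `young_gap` are illustrative choices, not part of any library, and a sampled check is of course no substitute for the proofs.

```python
import random

def conjugate(p):
    """Return the conjugate exponent q with 1/p + 1/q = 1 (requires p > 1)."""
    if p <= 1:
        raise ValueError("p must be greater than 1")
    return p / (p - 1)

def young_gap(a, b, p):
    """Return a**p/p + b**q/q - a*b, which is nonnegative for a, b >= 0."""
    q = conjugate(p)
    return a**p / p + b**q / q - a * b

# Sample random nonnegative a, b and exponents p > 1 with a fixed seed.
rng = random.Random(0)
gaps = [young_gap(rng.uniform(0, 5), rng.uniform(0, 5), rng.uniform(1.1, 6))
        for _ in range(1000)]

# A small tolerance absorbs floating-point rounding.
print(min(gaps) >= -1e-9)

# Equality case a**p == b**q: with p = 3, a = 2 we need b = a**(p - 1) = 4.
print(abs(young_gap(2.0, 4.0, 3.0)) < 1e-9)
```

The second check illustrates the equality condition $a^p = b^q$, which for conjugate exponents is the same as $b = a^{p-1}$.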

It can be used to prove Hölder's inequality.

Proof [2]

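Since $\tfrac{1}{p} + \tfrac{1}{q} = 1$, we have $p - 1 = \tfrac{1}{q - 1}$. A graph $y = x^{p-1}$ on the $xy$-plane is thus also a graph $x = y^{q-1}$. From sketching a visual representation of the integrals of the area between this curve and the axes, and the area in the rectangle bounded by the lines $x = 0$, $x = a$, $y = 0$, $y = b$, and the fact that $y$ is always increasing for increasing $x$ and vice versa, we can see that $\int_0^a x^{p-1}\,dx$ upper bounds the area of the rectangle below the curve (with equality when $b \ge a^{p-1}$) and $\int_0^b y^{q-1}\,dy$ upper bounds the area of the rectangle above the curve (with equality when $b \le a^{p-1}$). Thus,

$$\int_0^a x^{p-1}\,dx + \int_0^b y^{q-1}\,dy \ge ab,$$

with equality when $b = a^{p-1}$ (or equivalently, $a = b^{q-1}$). Young's inequality follows from evaluating the integrals, which equal $a^p/p$ and $b^q/q$ respectively. (See below for a generalization.)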

This form of Young's inequality can also be proved via Jensen's inequality.

Proof [3]

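The claim is certainly true if $a = 0$ or $b = 0$, so henceforth assume that $a > 0$ and $b > 0$. Put $t = 1/p$ and $1 - t = 1/q$. Because the logarithm function is concave,

$$\ln\left(t a^p + (1 - t) b^q\right) \ge t \ln(a^p) + (1 - t) \ln(b^q) = \ln(a) + \ln(b) = \ln(ab),$$

with equality holding if and only if $a^p = b^q$. Young's inequality follows by exponentiating.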

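Young's inequality may equivalently be written as

$$a^{\alpha} b^{\beta} \le \alpha a + \beta b, \qquad 0 \le \alpha, \beta \le 1, \quad \alpha + \beta = 1.$$

In this form the inequality is just the concavity of the logarithm function. Equality holds if and only if $a = b$ or $\{\alpha, \beta\} = \{0, 1\}$. This also follows from the weighted AM-GM inequality.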

Generalizations

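Theorem [4]   Suppose $a > 0$ and $b > 0$. If $1 < p < \infty$ and $q$ are such that $\tfrac{1}{p} + \tfrac{1}{q} = 1$, then

$$ab = \min_{0 < t < \infty} \left( \frac{t^p a^p}{p} + \frac{t^{-q} b^q}{q} \right).$$

Using $t := 1$ and replacing $a$ with $a^{1/p}$ and $b$ with $b^{1/q}$ results in the inequality:

$$a^{1/p}\, b^{1/q} \le \frac{a}{p} + \frac{b}{q},$$

which is useful for proving Hölder's inequality.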

Proof [4]

Define a real-valued function $f$ on the positive real numbers by

$$f(t) = \frac{t^p a^p}{p} + \frac{t^{-q} b^q}{q}$$

for every $t > 0$ and then calculate its minimum.
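
The minimization in this proof can be illustrated numerically. The sketch below assumes the function $f(t) = t^p a^p/p + t^{-q} b^q/q$ with $1/p + 1/q = 1$; setting $f'(t) = 0$ gives the critical point $t^{p+q} = b^q / a^p$, and the code checks that the value there equals $ab$. The names `scaled_sum` and `minimizer` are hypothetical helpers introduced only for this illustration.

```python
def scaled_sum(t, a, b, p):
    """Evaluate f(t) = (t*a)**p / p + (b/t)**q / q for the conjugate exponent q."""
    q = p / (p - 1)
    return (t * a)**p / p + (b / t)**q / q

def minimizer(a, b, p):
    """Critical point of f: solves t**(p + q) == b**q / a**p for t > 0."""
    q = p / (p - 1)
    return (b**q / a**p) ** (1.0 / (p + q))

a, b, p = 2.0, 3.0, 3.0
t_star = minimizer(a, b, p)

# The minimum value should agree with the product a*b up to rounding.
print(abs(scaled_sum(t_star, a, b, p) - a * b) < 1e-9)

# A coarse grid scan over t never goes below a*b.
grid_min = min(scaled_sum(0.01 * k, a, b, p) for k in range(1, 1001))
print(grid_min >= a * b - 1e-9)
```

Taking $t = 1$ instead of the minimizer gives an upper bound for $ab$, which is exactly the standard form of Young's inequality.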

Theorem  If $0 \le p_i \le 1$ with $\sum_i p_i = 1$, then

$$\prod_i a_i^{p_i} \le \sum_i p_i a_i.$$

Equality holds if and only if all the $a_i$ with non-zero $p_i$ are equal.
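
A minimal sketch of this weighted form, $\prod_i a_i^{p_i} \le \sum_i p_i a_i$ for nonnegative $a_i$ and weights $p_i$ summing to 1, is given below. The function name `weighted_am_gm_gap` is an illustrative assumption, and `math.prod` requires Python 3.8 or later.

```python
import math

def weighted_am_gm_gap(values, weights):
    """Return sum(p_i * a_i) - prod(a_i ** p_i), which is nonnegative."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if any(v < 0 for v in values):
        raise ValueError("values must be nonnegative")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-12:
        raise ValueError("weights must be nonnegative and sum to 1")
    arithmetic = sum(w * v for v, w in zip(values, weights))
    geometric = math.prod(v**w for v, w in zip(values, weights))
    return arithmetic - geometric

# Unequal values give a strictly positive gap.
print(weighted_am_gm_gap([1.0, 4.0], [0.5, 0.5]))

# Equal values give (numerically) zero gap, matching the equality condition.
print(abs(weighted_am_gm_gap([3.0, 3.0, 3.0], [0.2, 0.3, 0.5])) < 1e-12)
```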

Elementary case

An elementary case of Young's inequality is the inequality with exponent $2$,

$$ab \le \frac{a^2}{2} + \frac{b^2}{2},$$

which also gives rise to the so-called Young's inequality with $\varepsilon$ (valid for every $\varepsilon > 0$), sometimes called the Peter–Paul inequality: [5]

$$ab \le \frac{a^2}{2\varepsilon} + \frac{\varepsilon b^2}{2}.$$

This name refers to the fact that tighter control of the second term is achieved at the cost of losing some control of the first term: one must "rob Peter to pay Paul".

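Proof: Young's inequality with exponent $2$ is the special case $p = q = 2$. However, it has a more elementary proof.

Start by observing that the square of every real number is zero or positive. Therefore, for every pair of real numbers $a$ and $b$ we can write:

$$0 \le (a - b)^2.$$

Work out the square of the right hand side:

$$0 \le a^2 - 2ab + b^2.$$

Add $2ab$ to both sides:

$$2ab \le a^2 + b^2.$$

Divide both sides by 2 and we have Young's inequality with exponent $2$:

$$ab \le \frac{a^2}{2} + \frac{b^2}{2}.$$

Young's inequality with $\varepsilon$ follows by substituting $a'$ and $b'$ as below into Young's inequality with exponent $2$:

$$a' = \frac{a}{\sqrt{\varepsilon}}, \qquad b' = \sqrt{\varepsilon}\, b.$$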
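
To make the "rob Peter to pay Paul" trade-off concrete, the sketch below evaluates the bound $a^2/(2\varepsilon) + \varepsilon b^2/2$ for several values of $\varepsilon$. The function name `peter_paul_bound` is an illustrative choice.

```python
def peter_paul_bound(a, b, eps):
    """Return a**2 / (2*eps) + eps * b**2 / 2, an upper bound for a*b when eps > 0."""
    if eps <= 0:
        raise ValueError("eps must be positive")
    return a * a / (2 * eps) + eps * b * b / 2

a, b = 3.0, 4.0
for eps in (0.1, 0.5, 1.0, 2.0):
    bound = peter_paul_bound(a, b, eps)
    # Each bound dominates the product a*b = 12; a small eps shrinks the
    # coefficient of b**2 and inflates the coefficient of a**2.
    print(eps, bound, bound >= a * b)

# Equality holds when a == eps * b, for example a = 6, b = 3, eps = 2.
print(peter_paul_bound(6.0, 3.0, 2.0))  # 18.0, equal to a*b
```

In PDE estimates this freedom is typically used by choosing $\varepsilon$ small enough that the $b^2$ term can be absorbed into another part of the inequality.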

Matricial generalization

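T. Ando proved a generalization of Young's inequality for complex matrices ordered by Loewner ordering. [6] It states that for any pair $A, B$ of complex matrices of order $n$ there exists a unitary matrix $U$ such that

$$U^* |A B^*| U \preceq \tfrac{1}{p} |A|^p + \tfrac{1}{q} |B|^q,$$

where $p$ and $q$ are conjugate exponents as above, ${}^*$ denotes the conjugate transpose of the matrix and $|A| = \sqrt{A^* A}$.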

Standard version for increasing functions

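For the standard version [7] [8] of the inequality, let $f$ denote a real-valued, continuous and strictly increasing function on $[0, c]$ with $c > 0$ and $f(0) = 0$. Let $f^{-1}$ denote the inverse function of $f$. Then, for all $a \in [0, c]$ and $b \in [0, f(c)]$,

$$ab \le \int_0^a f(x)\,dx + \int_0^b f^{-1}(x)\,dx,$$

with equality if and only if $b = f(a)$.

With $f(x) = x^{p-1}$ and $f^{-1}(y) = y^{q-1}$, this reduces to the standard version for conjugate Hölder exponents.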
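
The integral form $ab \le \int_0^a f(x)\,dx + \int_0^b f^{-1}(x)\,dx$ can be checked numerically for a concrete increasing function. The sketch below assumes $f(x) = e^x - 1$, whose inverse is $f^{-1}(y) = \ln(1 + y)$, and uses a simple midpoint rule; the helper names `integrate` and `young_integral_bound` are hypothetical.

```python
import math

def integrate(func, lower, upper, steps=10_000):
    """Approximate the integral of func over [lower, upper] by the midpoint rule."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))

def young_integral_bound(f, f_inv, a, b):
    """Return the sum of the areas under f on [0, a] and under f_inv on [0, b]."""
    return integrate(f, 0.0, a) + integrate(f_inv, 0.0, b)

f = math.expm1      # f(x) = exp(x) - 1, strictly increasing with f(0) = 0
f_inv = math.log1p  # inverse function: log(1 + y)

# Generic point: the bound exceeds the rectangle area a*b.
a, b = 1.0, 2.0
print(young_integral_bound(f, f_inv, a, b) >= a * b)

# Equality case b = f(a): the two areas tile the rectangle exactly.
b_eq = f(a)
print(abs(young_integral_bound(f, f_inv, a, b_eq) - a * b_eq) < 1e-6)
```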

For details and generalizations we refer to the paper of Mitroi & Niculescu. [9]

Generalization using Fenchel–Legendre transforms

By denoting the convex conjugate of a real function $f$ by $g$, we obtain

$$ab \le f(a) + g(b).$$

This follows immediately from the definition of the convex conjugate. For a convex function $f$ this also follows from the Legendre transformation.

More generally, if $f$ is defined on a real vector space $X$ and its convex conjugate is denoted by $f^\star$ (and is defined on the dual space $X^\star$), then

$$\langle u, v \rangle \le f^\star(u) + f(v),$$

where $\langle \cdot, \cdot \rangle : X^\star \times X \to \mathbb{R}$ is the dual pairing.

Examples

The convex conjugate of $f(a) = a^p/p$ is $g(b) = b^q/q$ with $q$ such that $\tfrac{1}{p} + \tfrac{1}{q} = 1$, and thus Young's inequality for conjugate Hölder exponents mentioned above is a special case.

The Legendre transform of $f(a) = e^a - 1$ is $g(b) = 1 - b + b \ln b$, hence $ab \le e^a - b + b \ln b$ for all non-negative $a$ and $b$. This estimate is useful in large deviations theory under exponential moment conditions, because $b \ln b$ appears in the definition of relative entropy, which is the rate function in Sanov's theorem.
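
The exponential example, $ab \le e^a - b + b \ln b$, can be sketched numerically as follows. The function name `entropy_bound` is an illustrative assumption, and the code adopts the usual convention $0 \ln 0 = 0$.

```python
import math

def entropy_bound(a, b):
    """Return exp(a) - b + b*ln(b) for b >= 0, with the convention 0*ln(0) = 0."""
    if b < 0:
        raise ValueError("b must be nonnegative")
    b_log_b = b * math.log(b) if b > 0 else 0.0
    return math.exp(a) - b + b_log_b

# Check the inequality a*b <= entropy_bound(a, b) on a small grid.
points = [(0.25 * i, 0.25 * j) for i in range(0, 13) for j in range(0, 13)]
print(all(a * b <= entropy_bound(a, b) + 1e-12 for a, b in points))

# Equality occurs where b = exp(a), the point at which the supremum
# defining the convex conjugate is attained.
a = 1.0
print(abs(entropy_bound(a, math.exp(a)) - a * math.exp(a)) < 1e-12)
```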

Notes

  1. Young, W. H. (1912), "On classes of summable functions and their Fourier series", Proceedings of the Royal Society A, 87 (594): 225–229, Bibcode:1912RSPSA..87..225Y, doi:10.1098/rspa.1912.0076, JFM 43.1114.12, JSTOR 93236
  2. Pearse, Erin. "Math 209D - Real Analysis Summer Preparatory Seminar Lecture Notes" (PDF). Retrieved 17 September 2022.
  3. Bahouri, Chemin & Danchin 2011.
  4. Jarchow 1981, pp. 47–55.
  5. Tisdell, Chris (2013), The Peter Paul Inequality, YouTube video on Dr Chris Tisdell's YouTube channel.
  6. T. Ando (1995). "Matrix Young Inequalities". In Huijsmans, C. B.; Kaashoek, M. A.; Luxemburg, W. A. J.; et al. (eds.). Operator Theory in Function Spaces and Banach Lattices. Springer. pp. 33–38. ISBN 978-3-0348-9076-2.
  7. Hardy, G. H.; Littlewood, J. E.; Pólya, G. (1952) [1934], Inequalities, Cambridge Mathematical Library (2nd ed.), Cambridge: Cambridge University Press, ISBN 0-521-05206-8, MR 0046395, Zbl 0047.05302, Chapter 4.8
  8. Henstock, Ralph (1988), Lectures on the Theory of Integration, Series in Real Analysis Volume I, Singapore, New Jersey: World Scientific, ISBN 9971-5-0450-2, MR 0963249, Zbl 0668.28001, Theorem 2.9
  9. Mitroi, F. C., & Niculescu, C. P. (2011). An extension of Young's inequality. In Abstract and Applied Analysis (Vol. 2011). Hindawi.

References