Nesbitt's inequality

Last updated December 23, 2024

In mathematics, Nesbitt's inequality, named after Alfred Nesbitt, states that for positive real numbers a, b and c,

{\frac {a}{b+c}}+{\frac {b}{c+a}}+{\frac {c}{a+b}}\geq {\frac {3}{2}},

with equality if and only if $a=b=c$ (i.e., for an equilateral triangle, when $a$, $b$ and $c$ are read as side lengths).

There is no corresponding upper bound, since any of the three fractions in the inequality can be made arbitrarily large.
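The statement can be checked on concrete values. The following is a minimal sketch using exact rational arithmetic; the helper name `nesbitt_sum` is an illustrative choice, not something taken from the source.

```python
from fractions import Fraction

def nesbitt_sum(a, b, c):
    """Left-hand side a/(b+c) + b/(c+a) + c/(a+b), computed exactly."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    return a / (b + c) + b / (c + a) + c / (a + b)

# Equality case: a = b = c gives exactly 3/2.
print(nesbitt_sum(1, 1, 1))      # 3/2
# A generic positive triple stays above 3/2: 1/5 + 2/4 + 3/3.
print(nesbitt_sum(1, 2, 3))      # 17/10
# No upper bound: with b = c = 1, the term a/(b+c) grows without limit.
print(nesbitt_sum(1000, 1, 1) > 500)   # True
```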

It is the three-variable case of the rather more difficult Shapiro inequality, and was published at least 50 years earlier.

Proof

First proof: AM-HM inequality

By the AM-HM inequality on $(b+c),(c+a),(a+b)$ ,

{\frac {(b+c)+(c+a)+(a+b)}{3}}\geq {\frac {3}{\displaystyle {\frac {1}{b+c}}+{\frac {1}{c+a}}+{\frac {1}{a+b}}}}.

Clearing denominators yields

[(b+c)+(c+a)+(a+b)]\left({\frac {1}{b+c}}+{\frac {1}{c+a}}+{\frac {1}{a+b}}\right)\geq 9,

from which we obtain

2{\frac {a+b+c}{b+c}}+2{\frac {a+b+c}{c+a}}+2{\frac {a+b+c}{a+b}}\geq 9

by expanding the product and collecting like denominators. This then simplifies directly to the final result.
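The final simplification rests on the fact that the product on the left equals twice the Nesbitt sum plus 6. A small sketch that confirms this bookkeeping exactly (the function names are hypothetical, chosen for illustration):

```python
from fractions import Fraction

def nesbitt_sum(a, b, c):
    """a/(b+c) + b/(c+a) + c/(a+b) as an exact fraction."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    return a / (b + c) + b / (c + a) + c / (a + b)

def amhm_product(a, b, c):
    """[(b+c)+(c+a)+(a+b)] * (1/(b+c) + 1/(c+a) + 1/(a+b))."""
    x, y, z = Fraction(b + c), Fraction(c + a), Fraction(a + b)
    return (x + y + z) * (1 / x + 1 / y + 1 / z)

# For (a, b, c) = (1, 2, 3): the product is 12 * 47/60 = 47/5,
# and 2 * 17/10 + 6 is the same number.
print(amhm_product(1, 2, 3))             # 47/5
print(2 * nesbitt_sum(1, 2, 3) + 6)      # 47/5
```

Since the product is at least 9, the Nesbitt sum is at least (9 - 6)/2 = 3/2.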

Second proof: Rearrangement

Since the inequality is symmetric in $a$, $b$ and $c$, we may assume $a\geq b\geq c$ . Then

{\frac {1}{b+c}}\geq {\frac {1}{c+a}}\geq {\frac {1}{a+b}}.

Define ${\vec {x}}=(a,b,c)$ and ${\vec {y}}=\left({\frac {1}{b+c}},{\frac {1}{c+a}},{\frac {1}{a+b}}\right)$.

By the rearrangement inequality, the dot product of the two sequences is maximized when the terms are arranged to be both increasing or both decreasing. The order here is both decreasing. Let ${\vec {y}}_{1}$ and ${\vec {y}}_{2}$ be the vector ${\vec {y}}$ cyclically shifted by one and by two places; then

{\vec {x}}\cdot {\vec {y}}\geq {\vec {x}}\cdot {\vec {y}}_{1}

{\vec {x}}\cdot {\vec {y}}\geq {\vec {x}}\cdot {\vec {y}}_{2}

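Adding the two inequalities gives $2\,{\vec {x}}\cdot {\vec {y}}\geq {\vec {x}}\cdot {\vec {y}}_{1}+{\vec {x}}\cdot {\vec {y}}_{2}$. The right-hand side regroups as ${\frac {b+c}{b+c}}+{\frac {c+a}{c+a}}+{\frac {a+b}{a+b}}=3$, so ${\vec {x}}\cdot {\vec {y}}\geq {\frac {3}{2}}$, which is Nesbitt's inequality.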
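The three dot products in this argument can be computed directly. This is a sketch under the assumption $a\geq b\geq c$; the names `dot` and `rearrangement_terms` are illustrative only.

```python
from fractions import Fraction

def dot(u, v):
    """Dot product of two equal-length sequences."""
    return sum(p * q for p, q in zip(u, v))

def rearrangement_terms(a, b, c):
    """Return (x.y, x.y1, x.y2), where y1 and y2 are cyclic shifts of y."""
    x = [Fraction(a), Fraction(b), Fraction(c)]
    y = [1 / (x[1] + x[2]), 1 / (x[2] + x[0]), 1 / (x[0] + x[1])]
    y1 = y[1:] + y[:1]   # shifted by one place
    y2 = y[2:] + y[:2]   # shifted by two places
    return dot(x, y), dot(x, y1), dot(x, y2)

xy, xy1, xy2 = rearrangement_terms(3, 2, 1)
print(xy >= xy1 and xy >= xy2)   # True: the similarly ordered pairing is largest
print(xy1 + xy2)                 # 3
```

The shifted products always sum to 3, whatever the ordering, which is what turns the two inequalities into the bound 3/2.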

Third proof: Sum of Squares

The following identity holds for all $a,b,c$ for which the denominators are nonzero:

{\frac {a}{b+c}}+{\frac {b}{a+c}}+{\frac {c}{a+b}}={\frac {3}{2}}+{\frac {1}{2}}\left({\frac {(b-c)^{2}}{(a+b)(a+c)}}+{\frac {(c-a)^{2}}{(b+c)(b+a)}}+{\frac {(a-b)^{2}}{(c+a)(c+b)}}\right).

This clearly proves that the left side is no less than $3/2$ for positive a, b and c.

Note: every nonnegative rational function can be written as a sum of squares of rational functions (see Hilbert's seventeenth problem), so in principle any rational inequality can be demonstrated through a suitable sum-of-squares identity.
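The sum-of-squares identity is easy to spot-check with exact arithmetic. The sketch below compares both sides on a grid of positive integers and on one triple with a negative entry; `nesbitt_sum` and `sos_form` are hypothetical helper names.

```python
from fractions import Fraction
from itertools import product

def nesbitt_sum(a, b, c):
    """Left side of the identity: a/(b+c) + b/(a+c) + c/(a+b)."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    return a / (b + c) + b / (c + a) + c / (a + b)

def sos_form(a, b, c):
    """Right side: 3/2 plus half the sum of the three squared terms."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    squares = (
        (b - c) ** 2 / ((a + b) * (a + c))
        + (c - a) ** 2 / ((b + c) * (b + a))
        + (a - b) ** 2 / ((c + a) * (c + b))
    )
    return Fraction(3, 2) + squares / 2

all_match = all(
    nesbitt_sum(a, b, c) == sos_form(a, b, c)
    for a, b, c in product(range(1, 7), repeat=3)
)
print(all_match)                                      # True
# The identity does not need positivity, only nonzero denominators.
print(nesbitt_sum(5, -1, 2) == sos_form(5, -1, 2))    # True
```

A finite check is not a proof, but a mismatch here would immediately expose a transcription error in the identity.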

Fourth proof: Cauchy–Schwarz

Invoking the Cauchy–Schwarz inequality on the vectors $\displaystyle \left\langle {\sqrt {b+c}},{\sqrt {c+a}},{\sqrt {a+b}}\right\rangle ,\left\langle {\frac {1}{\sqrt {b+c}}},{\frac {1}{\sqrt {c+a}}},{\frac {1}{\sqrt {a+b}}}\right\rangle$ yields

((b+c)+(a+c)+(a+b))\left({\frac {1}{b+c}}+{\frac {1}{a+c}}+{\frac {1}{a+b}}\right)\geq 9,

which can be transformed into the final result as we did in the AM-HM proof.

Fifth proof: AM-GM

Let $x=b+c,y=c+a,z=a+b$ . We then apply the AM-GM inequality to obtain

{\frac {y+z}{x}}+{\frac {z+x}{y}}+{\frac {x+y}{z}}\geq 6,

because ${\frac {y}{x}}+{\frac {z}{x}}+{\frac {z}{y}}+{\frac {x}{y}}+{\frac {x}{z}}+{\frac {y}{z}}\geq 6{\sqrt[{6}]{{\frac {y}{x}}\cdot {\frac {z}{x}}\cdot {\frac {z}{y}}\cdot {\frac {x}{y}}\cdot {\frac {x}{z}}\cdot {\frac {y}{z}}}}=6.$

Substituting out the $x,y,z$ in favor of $a,b,c$ yields

{\frac {2a+b+c}{b+c}}+{\frac {2b+c+a}{c+a}}+{\frac {2c+a+b}{a+b}}\geq 6

{\frac {2a}{b+c}}+{\frac {2b}{c+a}}+{\frac {2c}{a+b}}+3\geq 6,

which then simplifies to the final result.
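Two facts drive this proof: the six ratios multiply to 1, and their sum equals twice the Nesbitt sum plus 3. A brief sketch checking both on one example (the function name is illustrative):

```python
from fractions import Fraction

def amgm_ratios(a, b, c):
    """The six ratios y/x, z/x, z/y, x/y, x/z, y/z with x=b+c, y=c+a, z=a+b."""
    x, y, z = Fraction(b + c), Fraction(c + a), Fraction(a + b)
    return [y / x, z / x, z / y, x / y, x / z, y / z]

ratios = amgm_ratios(1, 2, 3)
prod = Fraction(1)
for r in ratios:
    prod *= r
print(prod)          # 1, so the geometric mean of the six ratios is 1
print(sum(ratios))   # 32/5, which is at least 6
```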

Sixth proof: Titu's lemma

Titu's lemma, a direct consequence of the Cauchy–Schwarz inequality, states that for any sequence of $n$ real numbers $(x_{k})$ and any sequence of $n$ positive numbers $(a_{k})$ , $\displaystyle \sum _{k=1}^{n}{\frac {x_{k}^{2}}{a_{k}}}\geq {\frac {(\sum _{k=1}^{n}x_{k})^{2}}{\sum _{k=1}^{n}a_{k}}}.$

We use the lemma on $(x_{k})=(1,1,1)$ and $(a_{k})=(b+c,a+c,a+b)$ . This gives

{\frac {1}{b+c}}+{\frac {1}{c+a}}+{\frac {1}{a+b}}\geq {\frac {3^{2}}{2(a+b+c)}},

which results in

{\frac {a+b+c}{b+c}}+{\frac {a+b+c}{c+a}}+{\frac {a+b+c}{a+b}}\geq {\frac {9}{2}}

i.e.,

{\frac {a}{b+c}}+{\frac {b}{c+a}}+{\frac {c}{a+b}}\geq {\frac {9}{2}}-3={\frac {3}{2}}.
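Titu's lemma itself can be evaluated on sample data. The following sketch returns both sides of the lemma as exact fractions; `titu_sides` is a hypothetical name, and the positivity check mirrors the hypothesis on $(a_{k})$.

```python
from fractions import Fraction

def titu_sides(xs, weights):
    """Return (sum x_k^2 / a_k, (sum x_k)^2 / sum a_k) for positive weights a_k."""
    xs = [Fraction(x) for x in xs]
    weights = [Fraction(w) for w in weights]
    if any(w <= 0 for w in weights):
        raise ValueError("all weights must be positive")
    left = sum(x * x / w for x, w in zip(xs, weights))
    right = sum(xs) ** 2 / sum(weights)
    return left, right

# The instance used in the proof, with (a, b, c) = (1, 2, 3):
# weights are (b+c, a+c, a+b) = (5, 4, 3).
left, right = titu_sides((1, 1, 1), (5, 4, 3))
print(left, right)   # 47/60 3/4
```

Multiplying both sides by $a+b+c=6$ gives $47/10\geq 9/2$, and subtracting 3 recovers $17/10\geq 3/2$.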

Seventh proof: Using homogeneity

As the left side of the inequality is homogeneous of degree zero, we may assume $a+b+c=1$ . Now define $x=b+c$ , $y=c+a$ , and $z=a+b$ , so that $x+y+z=2$ . The desired inequality turns into ${\frac {1-x}{x}}+{\frac {1-y}{y}}+{\frac {1-z}{z}}\geq {\frac {3}{2}}$ , or, equivalently, ${\frac {1}{x}}+{\frac {1}{y}}+{\frac {1}{z}}\geq {\frac {9}{2}}$ . This follows from Titu's lemma, which gives ${\frac {1}{x}}+{\frac {1}{y}}+{\frac {1}{z}}\geq {\frac {9}{x+y+z}}={\frac {9}{2}}$ .
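The normalization step relies on the sum being unchanged when $a$, $b$ and $c$ are scaled by a common positive factor. A short sketch of that invariance and of the resulting substitution (helper names are illustrative):

```python
from fractions import Fraction

def nesbitt_sum(a, b, c):
    """a/(b+c) + b/(c+a) + c/(a+b) as an exact fraction."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    return a / (b + c) + b / (c + a) + c / (a + b)

def normalize(a, b, c):
    """Rescale a positive triple so that its entries sum to 1."""
    s = Fraction(a) + Fraction(b) + Fraction(c)
    return Fraction(a) / s, Fraction(b) / s, Fraction(c) / s

a, b, c = normalize(2, 3, 7)
x, y, z = b + c, c + a, a + b
print(nesbitt_sum(a, b, c) == nesbitt_sum(2, 3, 7))     # True: scaling changes nothing
print(x + y + z)                                        # 2
print(1 / x + 1 / y + 1 / z == nesbitt_sum(a, b, c) + 3)  # True
```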

Eighth proof: Jensen's inequality

Let $S=a+b+c$ and consider the function $f(x)={\frac {x}{S-x}}$ . This function is convex on $[0,S)$ , since $f''(x)={\frac {2S}{(S-x)^{3}}}>0$ there, and, invoking Jensen's inequality, we get

{\frac {{\frac {a}{S-a}}+{\frac {b}{S-b}}+{\frac {c}{S-c}}}{3}}\geq {\frac {S/3}{S-S/3}}.

A straightforward computation then yields

{\frac {a}{b+c}}+{\frac {b}{c+a}}+{\frac {c}{a+b}}\geq {\frac {3}{2}}.
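Both sides of the Jensen step can be evaluated exactly. In this sketch, `f` follows the definition in the text, while `jensen_sides` is a hypothetical helper returning the mean of the function values and the function value at the mean.

```python
from fractions import Fraction

def f(x, S):
    """f(x) = x / (S - x), defined for 0 <= x < S."""
    return Fraction(x) / (Fraction(S) - Fraction(x))

def jensen_sides(a, b, c):
    """Return (mean of f at a, b, c; f at the mean S/3) with S = a + b + c."""
    S = Fraction(a) + Fraction(b) + Fraction(c)
    mean_of_values = (f(a, S) + f(b, S) + f(c, S)) / 3
    value_at_mean = f(S / 3, S)
    return mean_of_values, value_at_mean

print(jensen_sides(1, 2, 3))   # (Fraction(17, 30), Fraction(1, 2))
```

The right side is always 1/2, because $(S/3)/(S-S/3)=1/2$; multiplying the inequality by 3 gives the bound 3/2.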

Ninth proof: Reduction to a two-variable inequality

By clearing denominators,

{\frac {a}{b+c}}+{\frac {b}{c+a}}+{\frac {c}{a+b}}\geq {\frac {3}{2}}\iff 2(a^{3}+b^{3}+c^{3})\geq a^{2}b+a^{2}c+b^{2}c+b^{2}a+c^{2}a+c^{2}b.

It therefore suffices to prove that $x^{3}+y^{3}\geq xy^{2}+x^{2}y$ for $(x,y)\in \mathbb {R} _{+}^{2}$ , as summing this three times for $(x,y)=(b,c),\ (c,a),\ (a,b)$ completes the proof.

As $x^{3}+y^{3}\geq xy^{2}+x^{2}y\iff (x-y)(x^{2}-y^{2})\geq 0$ and $(x-y)(x^{2}-y^{2})=(x-y)^{2}(x+y)\geq 0$ for positive $x$ and $y$, we are done.
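The algebra in this reduction can be sanity-checked on integers. The sketch below, with illustrative function names, confirms that the two-variable gap factors as $(x-y)^{2}(x+y)$ and that three such gaps add up to the three-variable polynomial gap.

```python
def cubic_gap(x, y):
    """x^3 + y^3 - x*y^2 - x^2*y, the two-variable difference."""
    return x**3 + y**3 - x * y**2 - x**2 * y

def nesbitt_poly_gap(a, b, c):
    """2(a^3+b^3+c^3) minus the six mixed terms a^2 b, a^2 c, ..."""
    mixed = a*a*b + a*a*c + b*b*c + b*b*a + c*c*a + c*c*b
    return 2 * (a**3 + b**3 + c**3) - mixed

# Factorization check on one pair: (5-2)^2 * (5+2) = 63.
print(cubic_gap(5, 2))           # 63
# Summing the two-variable gap over (b,c), (c,a), (a,b) gives the full gap.
print(nesbitt_poly_gap(1, 2, 3)
      == cubic_gap(2, 3) + cubic_gap(3, 1) + cubic_gap(1, 2))   # True
```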
