Farkas' lemma

Last updated

In mathematics, Farkas' lemma is a solvability theorem for a finite system of linear inequalities. It was originally proven by the Hungarian mathematician Gyula Farkas. [1] Farkas' lemma is the key result underpinning the linear programming duality and has played a central role in the development of mathematical optimization (alternatively, mathematical programming). It is used amongst other things in the proof of the Karush–Kuhn–Tucker theorem in nonlinear programming. [2] Remarkably, in the area of the foundations of quantum theory, the lemma also underlies the complete set of Bell inequalities in the form of necessary and sufficient conditions for the existence of a local hidden-variable theory, given data from any specific set of measurements. [3]

Contents

Generalizations of the Farkas' lemma are about the solvability theorem for convex inequalities, [4] i.e., infinite system of linear inequalities. Farkas' lemma belongs to a class of statements called "theorems of the alternative": a theorem stating that exactly one of two systems has a solution. [5]

Statement of the lemma

There are a number of slightly different (but equivalent) formulations of the lemma in the literature. The one given here is due to Gale, Kuhn and Tucker (1951). [6]

Farkas' lemma   Let and Then exactly one of the following two assertions is true:

  1. There exists an such that and
  2. There exists a such that and

Here, the notation means that all components of the vector are nonnegative.

Example

Let m, n = 2, and The lemma says that exactly one of the following two statements must be true (depending on b1 and b2):

  1. There exist x1 ≥ 0, x2 ≥ 0 such that 6x1 + 4x2 = b1 and 3x1 = b2, or
  2. There exist y1, y2 such that 6y1 + 3y2 ≥ 0, 4y1 ≥ 0, and b1y1 + b2y2 < 0.

Here is a proof of the lemma in this special case:

Geometric interpretation

Consider the closed convex cone spanned by the columns of A; that is,

Observe that is the set of the vectors b for which the first assertion in the statement of Farkas' lemma holds. On the other hand, the vector y in the second assertion is orthogonal to a hyperplane that separates b and The lemma follows from the observation that b belongs to if and only if there is no hyperplane that separates it from

More precisely, let denote the columns of A. In terms of these vectors, Farkas' lemma states that exactly one of the following two statements is true:

  1. There exist non-negative coefficients such that
  2. There exists a vector such that for and

The sums with nonnegative coefficients form the cone spanned by the columns of A. Therefore, the first statement tells that b belongs to

The second statement tells that there exists a vector y such that the angle of y with the vectors ai is at most 90°, while the angle of y with the vector b is more than 90°. The hyperplane normal to this vector has the vectors ai on one side and the vector b on the other side. Hence, this hyperplane separates the cone spanned by from the vector b.

For example, let n, m = 2, a1 = (1, 0)T, and a2 = (1, 1)T. The convex cone spanned by a1 and a2 can be seen as a wedge-shaped slice of the first quadrant in the xy plane. Now, suppose b = (0, 1). Certainly, b is not in the convex cone a1x1 + a2x2. Hence, there must be a separating hyperplane. Let y = (1, −1)T. We can see that a1 · y = 1, a2 · y = 0, and b · y = −1. Hence, the hyperplane with normal y indeed separates the convex cone a1x1 + a2x2 from b.

Logic interpretation

A particularly suggestive and easy-to-remember version is the following: if a set of linear inequalities has no solution, then a contradiction can be produced from it by linear combination with nonnegative coefficients. In formulas: if is unsolvable then has a solution. [7] Note that is a combination of the left-hand sides, a combination of the right-hand side of the inequalities. Since the positive combination produces a zero vector on the left and a −1 on the right, the contradiction is apparent.

Thus, Farkas' lemma can be viewed as a theorem of logical completeness: is a set of "axioms", the linear combinations are the "derivation rules", and the lemma says that, if the set of axioms is inconsistent, then it can be refuted using the derivation rules. [8] :92–94

Implications in complexity theory

Farkas' lemma implies that the decision problem "Given a system of linear equations, does it have a non-negative solution?" is in the intersection of NP and co-NP. This is because, according to the lemma, both a "yes" answer and a "no" answer have a proof that can be verified in polynomial time. The problems in the intersection are also called well-characterized problems. It is a long-standing open question whether is equal to P. In particular, the question of whether a system of linear equations has a non-negative solution was not known to be in P, until it was proved using the ellipsoid method. [9] :25

Variants

The Farkas Lemma has several variants with different sign constraints (the first one is the original version): [8] :92

The latter variant is mentioned for completeness; it is not actually a "Farkas lemma" since it contains only equalities. Its proof is an exercise in linear algebra.

There are also Farkas-like lemmas for integer programs. [9] :12--14 For systems of equations, the lemma is simple:

For system of inequalities, the lemma is much more complicated. It is based on the following two rules of inference:

  1. Given inequalities and coefficients , infer the inequality .
  2. Given an inequality , infer the inequality .

The lemma says that:

The variants are summarized in the table below.

SystemConstraints on xAlternative systemConstraints on y
,
, ,
, ,
,
integral, not integral
can be inferred from

Generalizations

Generalized Farkas' lemma   Let is a closed convex cone in and the dual cone of is If convex cone is closed, then exactly one of the following two statements is true:

  1. There exists an such that and
  2. There exists a such that and

Generalized Farkas' lemma can be interpreted geometrically as follows: either a vector is in a given closed convex cone, or there exists a hyperplane separating the vector from the cone; there are no other possibilities. The closedness condition is necessary, see Separation theorem I in Hyperplane separation theorem. For original Farkas' lemma, is the nonnegative orthant hence the closedness condition holds automatically. Indeed, for polyhedral convex cone, i.e., there exists a such that the closedness condition holds automatically. In convex optimization, various kinds of constraint qualification, e.g. Slater's condition, are responsible for closedness of the underlying convex cone

By setting and in generalized Farkas' lemma, we obtain the following corollary about the solvability for a finite system of linear equalities:

Corollary   Let and Then exactly one of the following two statements is true:

  1. There exists an such that
  2. There exists a such that and

Further implications

Farkas's lemma can be varied to many further theorems of alternative by simple modifications, [5] such as Gordan's theorem: Either has a solution x, or has a nonzero solution y with y ≥ 0.

Common applications of Farkas' lemma include proving the strong duality theorem associated with linear programming and the Karush–Kuhn–Tucker conditions. An extension of Farkas' lemma can be used to analyze the strong duality conditions for and construct the dual of a semidefinite program. It is sufficient to prove the existence of the Karush–Kuhn–Tucker conditions using the Fredholm alternative but for the condition to be necessary, one must apply von Neumann's minimax theorem to show the equations derived by Cauchy are not violated.

See also

Notes

  1. Farkas, Julius (Gyula) (1902), "Theorie der Einfachen Ungleichungen", Journal für die reine und angewandte Mathematik , 1902 (124): 1–27, doi:10.1515/crll.1902.124.1, S2CID   115770124
  2. Takayama, Akira (1985). Mathematical Economics (2nd ed.). New York: Cambridge University Press. p.  48. ISBN   0-521-31498-4.
  3. Garg, Anupam; Mermin, N. D. (1984), "Farkas's Lemma and the Nature of Reality: Statistical Implications of Quantum Correlations", Foundations of Physics, 14: 1–39, doi:10.1007/BF00741645, S2CID   123622613
  4. Dinh, N.; Jeyakumar, V. (2014), "Farkas' lemma three decades of generalizations for mathematical optimization", TOP, 22 (1): 1–22, doi:10.1007/s11750-014-0319-y, S2CID   119858980
  5. 1 2 Border, KC (2013). "Alternative Linear Inequalities" (PDF). Retrieved 2021-11-29.
  6. Gale, David; Kuhn, Harold; Tucker, Albert W. (1951), "Linear Programming and the Theory of Games - Chapter XII" (PDF), in Koopmans (ed.), Activity Analysis of Production and Allocation, Wiley. See Lemma 1 on page 318.
  7. Boyd, Stephen P.; Vandenberghe, Lieven (2004), "Section 5.8.3" (pdf), Convex Optimization, Cambridge University Press, ISBN   978-0-521-83378-3 , retrieved October 15, 2011.
  8. 1 2 Gärtner, Bernd; Matoušek, Jiří (2006). Understanding and Using Linear Programming. Berlin: Springer. ISBN   3-540-30697-8. Pages 81–104.
  9. 1 2 Grötschel, Martin; Lovász, László; Schrijver, Alexander (1993), Geometric algorithms and combinatorial optimization, Algorithms and Combinatorics, vol. 2 (2nd ed.), Springer-Verlag, Berlin, doi:10.1007/978-3-642-78240-4, ISBN   978-3-642-78242-8, MR   1261419

Further reading

Related Research Articles

The Hahn–Banach theorem is a central tool in functional analysis. It allows the extension of bounded linear functionals defined on a vector subspace of some vector space to the whole space, and it also shows that there are "enough" continuous linear functionals defined on every normed vector space to make the study of the dual space "interesting". Another version of the Hahn–Banach theorem is known as the Hahn–Banach separation theorem or the hyperplane separation theorem, and has numerous uses in convex geometry.

The Cauchy–Schwarz inequality is an upper bound on the inner product between two vectors in an inner product space in terms of the product of the vector norms. It is considered one of the most important and widely used inequalities in mathematics.

In mathematics, a symmetric matrix with real entries is positive-definite if the real number is positive for every nonzero real column vector where is the transpose of . More generally, a Hermitian matrix is positive-definite if the real number is positive for every nonzero complex column vector where denotes the conjugate transpose of

In mathematics, a linear form is a linear map from a vector space to its field of scalars.

<span class="mw-page-title-main">Convex function</span> Real function with secant line between points above the graph itself

In mathematics, a real-valued function is called convex if the line segment between any two distinct points on the graph of the function lies above the graph between the two points. Equivalently, a function is convex if its epigraph is a convex set. In simple terms, a convex function graph is shaped like a cup , while a concave function's graph is shaped like a cap .

Convex optimization is a subfield of mathematical optimization that studies the problem of minimizing convex functions over convex sets. Many classes of convex optimization problems admit polynomial-time algorithms, whereas mathematical optimization is in general NP-hard.

In linear algebra, a sublinear function, also called a quasi-seminorm or a Banach functional, on a vector space is a real-valued function with only some of the properties of a seminorm. Unlike seminorms, a sublinear function does not have to be nonnegative-valued and also does not have to be absolutely homogeneous. Seminorms are themselves abstractions of the more well known notion of norms, where a seminorm has all the defining properties of a norm except that it is not required to map non-zero vectors to non-zero values.

In mathematical optimization, the Karush–Kuhn–Tucker (KKT) conditions, also known as the Kuhn–Tucker conditions, are first derivative tests for a solution in nonlinear programming to be optimal, provided that some regularity conditions are satisfied.

In mathematics, majorization is a preorder on vectors of real numbers. For two such vectors, , we say that weakly majorizesfrom below, commonly denoted when

<span class="mw-page-title-main">Convex cone</span> Mathematical set closed under positive linear combinations

In linear algebra, a cone—sometimes called a linear cone for distinguishing it from other sorts of cones—is a subset of a vector space that is closed under positive scalar multiplication; that is, C is a cone if implies for every positive scalar s. A cone need not be convex, or even look like a cone in Euclidean space.

<span class="mw-page-title-main">Hyperplane separation theorem</span> On the existence of hyperplanes separating disjoint convex sets

In geometry, the hyperplane separation theorem is a theorem about disjoint convex sets in n-dimensional Euclidean space. There are several rather similar versions. In one version of the theorem, if both these sets are closed and at least one of them is compact, then there is a hyperplane in between them and even two parallel hyperplanes in between them separated by a gap. In another version, if both disjoint convex sets are open, then there is a hyperplane in between them, but not necessarily any gap. An axis which is orthogonal to a separating hyperplane is a separating axis, because the orthogonal projections of the convex bodies onto the axis are disjoint.

Semidefinite programming (SDP) is a subfield of mathematical programming concerned with the optimization of a linear objective function over the intersection of the cone of positive semidefinite matrices with an affine space, i.e., a spectrahedron.

A second-order cone program (SOCP) is a convex optimization problem of the form

In mathematics, a Riesz space, lattice-ordered vector space or vector lattice is a partially ordered vector space where the order structure is a lattice.

<span class="mw-page-title-main">Ordered vector space</span> Vector space with a partial order

In mathematics, an ordered vector space or partially ordered vector space is a vector space equipped with a partial order that is compatible with the vector space operations.

The M. Riesz extension theorem is a theorem in mathematics, proved by Marcel Riesz during his study of the problem of moments.

Conic optimization is a subfield of convex optimization that studies problems consisting of minimizing a convex function over the intersection of an affine subspace and a convex cone.

The dual of a given linear program (LP) is another LP that is derived from the original LP in the following schematic way:

In mathematics a linear inequality is an inequality which involves a linear function. A linear inequality contains one of the symbols of inequality:

This is a glossary for the terminology in a mathematical field of functional analysis.