Pigeonhole principle

Last updated
Pigeons in holes. Here there are n = 10 pigeons in m = 9 holes. Since 10 is greater than 9, the pigeonhole principle says that at least one hole has more than one pigeon. (The top left hole has 2 pigeons.) TooManyPigeons.jpg
Pigeons in holes. Here there are n = 10 pigeons in m = 9 holes. Since 10 is greater than 9, the pigeonhole principle says that at least one hole has more than one pigeon. (The top left hole has 2 pigeons.)

In mathematics, the pigeonhole principle states that if n items are put into m containers, with n > m, then at least one container must contain more than one item. [1] For example, of three gloves (none of which is ambidextrous/reversible), at least two must be right-handed or at least two must be left-handed, because there are three objects but only two categories of handedness to put them into. This seemingly obvious statement, a type of counting argument, can be used to demonstrate possibly unexpected results. For example, given that the population of London is more than one unit greater than the maximum number of hairs that can be on a human's head, the principle requires that there must be at least two people in London who have the same number of hairs on their heads.

Contents

Although the pigeonhole principle appears as early as 1624 in a book attributed to Jean Leurechon, [2] it is commonly called Dirichlet's box principle or Dirichlet's drawer principle after an 1834 treatment of the principle by Peter Gustav Lejeune Dirichlet under the name Schubfachprinzip ("drawer principle" or "shelf principle"). [3]

The principle has several generalizations and can be stated in various ways. In a more quantified version: for natural numbers k and m, if n = km + 1 objects are distributed among m sets, the pigeonhole principle asserts that at least one of the sets will contain at least k + 1 objects. [4] For arbitrary n and m, this generalizes to , where and denote the floor and ceiling functions, respectively.

Though the principle's most straightforward application is to finite sets (such as pigeons and boxes), it is also used with infinite sets that cannot be put into one-to-one correspondence. To do so requires the formal statement of the pigeonhole principle: "there does not exist an injective function whose codomain is smaller than its domain". Advanced mathematical proofs like Siegel's lemma build upon this more general concept.

Etymology

Pigeon-hole messageboxes at Stanford University Pigeon-hole messagebox 3.jpg
Pigeon-hole messageboxes at Stanford University

Dirichlet published his works in both French and German, using either the German Schubfach or the French tiroir . The strict original meaning of these terms corresponds to the English drawer , that is, an open-topped box that can be slid in and out of the cabinet that contains it. (Dirichlet wrote about distributing pearls among drawers.) These terms morphed to pigeonhole in the sense of a small open space in a desk, cabinet, or wall for keeping letters or papers, metaphorically rooted in structures that house pigeons.

Because furniture with pigeonholes is commonly used for storing or sorting things into many categories (such as letters in a post office or room keys in a hotel), the translation pigeonhole may be a better rendering of Dirichlet's original "drawer". That understanding of the term pigeonhole, referring to some furniture features, is fading—especially among those who do not speak English natively but as a lingua franca in the scientific world—in favor of the more pictorial interpretation, literally involving pigeons and holes. The suggestive (though not misleading) interpretation of "pigeonhole" as "dovecote" has lately found its way back to a German back-translation of the "pigeonhole principle" as the "Taubenschlagprinzip". [5]

Besides the original terms "Schubfachprinzip" in German [6] and "Principe des tiroirs" in French, [7] other literal translations are still in use in Arabic ("مبدأ برج الحمام"), Bulgarian ("принцип на чекмеджетата"), Chinese ("抽屉原理"), Danish ("Skuffeprincippet"), Dutch ("ladenprincipe"), Hungarian ("skatulyaelv"), Italian ("principio dei cassetti"), Japanese ("引き出し論法"), Persian ("اصل لانه کبوتری"), Polish ("zasada szufladkowa"), Portuguese ("Princípio das Gavetas"), Swedish ("Lådprincipen"), Turkish ("çekmece ilkesi"), and Vietnamese ("nguyên lý hộp").

Examples

Sock picking

Suppose a drawer contains a mixture of black socks and blue socks, each of which can be worn on either foot. You pull a number of socks from the drawer without looking. What is the minimum number of pulled socks required to guarantee a pair of the same color? By the pigeonhole principle (m = 2 socks, using one pigeonhole per color), the answer is three (n = 3 items). Either you have three of one color, or you have two of one color and one of the other.

Hand shaking

If n people can shake hands with one another (where n > 1), the pigeonhole principle shows that there is always a pair of people who will shake hands with the same number of people. In this application of the principle, the "hole" to which a person is assigned is the number of hands that person shakes. Since each person shakes hands with some number of people from 0 to n 1, there are n possible holes. On the other hand, either the "0" hole, the "n 1" hole, or both must be empty, for it is impossible (if n > 1) for some person to shake hands with everybody else while some person shakes hands with nobody. This leaves n people to be placed into at most n 1 non-empty holes, so the principle applies.

This hand-shaking example is equivalent to the statement that in any graph with more than one vertex, there is at least one pair of vertices that share the same degree. [8] This can be seen by associating each person with a vertex and each edge with a handshake.

Hair counting

One can demonstrate there must be at least two people in London with the same number of hairs on their heads as follows. [9] [10] Since a typical human head has an average of around 150,000 hairs, it is reasonable to assume (as an upper bound) that no one has more than 1,000,000 hairs on their head (m = 1 million holes). There are more than 1,000,000 people in London (n is bigger than 1 million items). Assigning a pigeonhole to each number of hairs on a person's head, and assigning people to pigeonholes according to the number of hairs on their heads, there must be at least two people assigned to the same pigeonhole by the 1,000,001st assignment (because they have the same number of hairs on their heads; or, n > m). Assuming London has 9.002 million people, [11] it follows that at least ten Londoners have the same number of hairs, as having nine Londoners in each of the 1 million pigeonholes accounts for only 9 million people.

For the average case (m = 150,000) with the constraint: fewest overlaps, there will be at most one person assigned to every pigeonhole and the 150,001st person assigned to the same pigeonhole as someone else. In the absence of this constraint, there may be empty pigeonholes because the "collision" happens before the 150,001st person. The principle just proves the existence of an overlap; it says nothing about the number of overlaps (which falls under the subject of probability distribution).

There is a passing, satirical, allusion in English to this version of the principle in A History of the Athenian Society, prefixed to A Supplement to the Athenian Oracle: Being a Collection of the Remaining Questions and Answers in the Old Athenian Mercuries (printed for Andrew Bell, London, 1710). [12] It seems that the question whether there were any two persons in the World that have an equal number of hairs on their head? had been raised in The Athenian Mercury before 1704. [13] [14]

Perhaps the first written reference to the pigeonhole principle appears in a short sentence from the French Jesuit Jean Leurechon's 1622 work Selectæ Propositiones: [2] "It is necessary that two men have the same number of hairs, écus, or other things, as each other." [15] The full principle was spelled out two years later, with additional examples, in another book that has often been attributed to Leurechon, but might be by one of his students. [2]

The birthday problem

The birthday problem asks, for a set of n randomly chosen people, what is the probability that some pair of them will have the same birthday? The problem itself is mainly concerned with counterintuitive probabilities, but we can also tell by the pigeonhole principle that if there are 367 people in the room, there is at least one pair of people who share the same birthday with 100% probability, as there are only 366 possible birthdays to choose from (including February 29, if present).

Team tournament

Imagine seven people who want to play in a tournament of teams (n = 7 items), with a limitation of only four teams (m = 4 holes) to choose from. The pigeonhole principle tells us that they cannot all play for different teams; there must be at least one team featuring at least two of the seven players:

Subset sum

Any subset of size six from the set S = {1,2,3,...,9} must contain two elements whose sum is 10. The pigeonholes will be labeled by the two element subsets {1,9}, {2,8}, {3,7}, {4,6} and the singleton {5}, five pigeonholes in all. When the six "pigeons" (elements of the size six subset) are placed into these pigeonholes, each pigeon going into the pigeonhole that has it contained in its label, at least one of the pigeonholes labeled with a two-element subset will have two pigeons in it. [16]

Uses and applications

The principle can be used to prove that any lossless compression algorithm, provided it makes some inputs smaller (as "compression" suggests), will also make some other inputs larger. Otherwise, the set of all input sequences up to a given length L could be mapped to the (much) smaller set of all sequences of length less than L without collisions (because the compression is lossless), a possibility that the pigeonhole principle excludes.

The pigeonhole principle gives an upper bound of 2n in the no-three-in-line problem for the number of points that can be placed on an n x n lattice without any three being colinear - in this case, 16 pawns on a regular chessboard Dudeney no 3 pawns in line.svg
The pigeonhole principle gives an upper bound of 2n in the no-three-in-line problem for the number of points that can be placed on an n×n lattice without any three being colinear in this case, 16 pawns on a regular chessboard

A notable problem in mathematical analysis is, for a fixed irrational number a, to show that the set of fractional parts is dense in [0, 1]. One finds that it is not easy to explicitly find integers n, m such that where e > 0 is a small positive number and a is some arbitrary irrational number. But if one takes M such that by the pigeonhole principle there must be such that n1a and n2a are in the same integer subdivision of size (there are only M such subdivisions between consecutive integers). In particular, one can find n1, n2 such that

for some p, q integers and k in {0, 1, ..., M 1}. One can then easily verify that

This implies that where n = n2n1 or n = n1n2. This shows that 0 is a limit point of {[na]}. One can then use this fact to prove the case for p in (0, 1]: find n such that then if the proof is complete. Otherwise

and by setting

one obtains

Variants occur in a number of proofs. In the proof of the pumping lemma for regular languages, a version that mixes finite and infinite sets is used: If infinitely many objects are placed into finitely many boxes, then two objects share a box. [18] In Fisk's solution to the Art gallery problem a sort of converse is used: If n objects are placed into k boxes, then there is a box containing at most objects. [19]

Alternative formulations

The following are alternative formulations of the pigeonhole principle.

  1. If n objects are distributed over m places, and if n > m, then some place receives at least two objects. [1]
  2. (equivalent formulation of 1) If n objects are distributed over n places in such a way that no place receives more than one object, then each place receives exactly one object. [1]
  3. (generalization of 1) If S and T are sets, and the cardinality of S is greater than the cardinality of T, then there is no injective function from S to T.
  4. If n objects are distributed over m places, and if n < m, then some place receives no object.
  5. (equivalent formulation of 4) If n objects are distributed over n places in such a way that no place receives no object, then each place receives exactly one object. [20]
  6. (generalization of 4) If S and T are sets, and the cardinality of S is less than the cardinality of T, then there is no surjective function from S to T.

Strong form

Let q1, q2, ..., qn be positive integers. If

objects are distributed into n boxes, then either the first box contains at least q1 objects, or the second box contains at least q2 objects, ..., or the nth box contains at least qn objects. [21]

The simple form is obtained from this by taking q1 = q2 = ... = qn = 2, which gives n + 1 objects. Taking q1 = q2 = ... = qn = r gives the more quantified version of the principle, namely:

Let n and r be positive integers. If n(r - 1) + 1 objects are distributed into n boxes, then at least one of the boxes contains r or more of the objects. [22]

This can also be stated as, if k discrete objects are to be allocated to n containers, then at least one container must hold at least objects, where is the ceiling function, denoting the smallest integer larger than or equal to x. Similarly, at least one container must hold no more than objects, where is the floor function, denoting the largest integer smaller than or equal to x.

Generalizations of the pigeonhole principle

A probabilistic generalization of the pigeonhole principle states that if n pigeons are randomly put into m pigeonholes with uniform probability 1/m, then at least one pigeonhole will hold more than one pigeon with probability

where (m)n is the falling factorial m(m 1)(m 2)...(mn + 1). For n = 0 and for n = 1 (and m > 0), that probability is zero; in other words, if there is just one pigeon, there cannot be a conflict. For n > m (more pigeons than pigeonholes) it is one, in which case it coincides with the ordinary pigeonhole principle. But even if the number of pigeons does not exceed the number of pigeonholes (nm), due to the random nature of the assignment of pigeons to pigeonholes there is often a substantial chance that clashes will occur. For example, if 2 pigeons are randomly assigned to 4 pigeonholes, there is a 25% chance that at least one pigeonhole will hold more than one pigeon; for 5 pigeons and 10 holes, that probability is 69.76%; and for 10 pigeons and 20 holes it is about 93.45%. If the number of holes stays fixed, there is always a greater probability of a pair when you add more pigeons. This problem is treated at much greater length in the birthday paradox.

A further probabilistic generalization is that when a real-valued random variable X has a finite mean E(X), then the probability is nonzero that X is greater than or equal to E(X), and similarly the probability is nonzero that X is less than or equal to E(X). To see that this implies the standard pigeonhole principle, take any fixed arrangement of n pigeons into m holes and let X be the number of pigeons in a hole chosen uniformly at random. The mean of X is n/m, so if there are more pigeons than holes the mean is greater than one. Therefore, X is sometimes at least 2.

Infinite sets

The pigeonhole principle can be extended to infinite sets by phrasing it in terms of cardinal numbers: if the cardinality of set A is greater than the cardinality of set B, then there is no injection from A to B. However, in this form the principle is tautological, since the meaning of the statement that the cardinality of set A is greater than the cardinality of set B is exactly that there is no injective map from A to B. However, adding at least one element to a finite set is sufficient to ensure that the cardinality increases.

Another way to phrase the pigeonhole principle for finite sets is similar to the principle that finite sets are Dedekind finite: Let A and B be finite sets. If there is a surjection from A to B that is not injective, then no surjection from A to B is injective. In fact no function of any kind from A to B is injective. This is not true for infinite sets: Consider the function on the natural numbers that sends 1 and 2 to 1, 3 and 4 to 2, 5 and 6 to 3, and so on.

There is a similar principle for infinite sets: If uncountably many pigeons are stuffed into countably many pigeonholes, there will exist at least one pigeonhole having uncountably many pigeons stuffed into it.

This principle is not a generalization of the pigeonhole principle for finite sets however: It is in general false for finite sets. In technical terms it says that if A and B are finite sets such that any surjective function from A to B is not injective, then there exists an element b of B such that there exists a bijection between the preimage of b and A. This is a quite different statement, and is absurd for large finite cardinalities.

Quantum mechanics

Yakir Aharonov et al. presented arguments that quantum mechanics may violate the pigeonhole principle, and proposed interferometric experiments to test the pigeonhole principle in quantum mechanics. [23] Later research has called this conclusion into question. [24] [25] In a January 2015 arXiv preprint, researchers Alastair Rae and Ted Forgan at the University of Birmingham performed a theoretical wave function analysis, employing the standard pigeonhole principle, on the flight of electrons at various energies through an interferometer. If the electrons had no interaction strength at all, they would each produce a single, perfectly circular peak. At high interaction strength, each electron produces four distinct peaks, for a total of 12 peaks on the detector; these peaks are the result of the four possible interactions each electron could experience (alone, together with the first other particle only, together with the second other particle only, or all three together). If the interaction strength was fairly low, as would be the case in many real experiments, the deviation from a zero-interaction pattern would be nearly indiscernible, much smaller than the lattice spacing of atoms in solids, such as the detectors used for observing these patterns. This would make it very difficult or impossible to distinguish a weak-but-nonzero interaction strength from no interaction whatsoever, and thus give an illusion of three electrons that did not interact despite all three passing through two paths.

See also

Notes

  1. 1 2 3 Herstein 1964 , p. 90
  2. 1 2 3 Rittaud, Benoît; Heeffer, Albrecht (2014). "The pigeonhole principle, two centuries before Dirichlet". The Mathematical Intelligencer. 36 (2): 27–29. doi:10.1007/s00283-013-9389-1. hdl: 1854/LU-4115264 . MR   3207654. S2CID   44193229.
  3. Jeff Miller, Peter Flor, Gunnar Berg, and Julio González Cabillón. "Pigeonhole principle". In Jeff Miller (ed.) Earliest Known Uses of Some of the Words of Mathematics . Electronic document, retrieved November 11, 2006
  4. Fletcher & Patty 1987 , p. 27
  5. Zimmermann, Karl-Heinz (2006). Diskrete Mathematik. p. 367. ISBN   9783833455292.
  6. Weintraub, Steven H. (17 May 2017). The Induction Book. p. 13. ISBN   9780486811994.
  7. James, R. C. (31 July 1992). Mathematics Dictionary. p. 490. ISBN   9780412990410.
  8. Pandey, Avinash. "D3 Graph Theory - Interactive Graph Theory Tutorials". d3gt.com. Retrieved 2021-01-12.
  9. Rignano, Eugenio (1923). The Psychology of Reasoning. Translated by Holl, Winifred A. K. Paul, Trench, Trubner & Company, Limited. p. 72. ISBN   9780415191326.
  10. To avoid a slightly messier presentation, this example only refers to people who are not bald.
  11. "London's Population / Greater London Authority (GLA)". data.london.gov.uk.
  12. "A Supplement to the Athenian Oracle: Being a Collection of the Remaining Questions and Answers in the Old Athenian Mercuries. ... To which is Prefix'd the History of the Athenian Society, ... By a Member of the Athenian Society". 1710.
  13. "The Athenian Oracle being an entire collection of all the valuable questions and answers". 1704.
  14. "The Athenian Oracle: Being an Entire Collection of All the Valuable Questions and Answers in the Old Athenian Mercuries. ... By a Member of the Athenian Society". 1704.
  15. Leurechon, Jean (1622), Selecæe Propositiones in Tota Sparsim Mathematica Pulcherrimæ, Gasparem Bernardum, p. 2
  16. Grimaldi 1994 , p. 277
  17. Gardner, Martin (October 1976). "Combinatorial problems, some old, some new and all newly attacked by computer". Mathematical Games. Scientific American . Vol. 235, no. 4. pp. 131–137. JSTOR   24950467.
  18. Introduction to Formal Languages and Automata, Peter Linz, pp. 115–116, Jones and Bartlett Learning, 2006
  19. Computational Geometry in C, Cambridge Tracts in Theoretical Computer Science, 2nd Edition, Joseph O'Rourke, page 9.
  20. Brualdi 2010 , p. 70
  21. Brualdi 2010 , p. 74 Theorem 3.2.1
  22. In the lead section this was presented with the substitutions m = n and k = r − 1.
  23. Aharonov, Yakir; Colombo, Fabrizio; Popescu, Sandu; Sabadini, Irene; Struppa, Daniele C.; Tollaksen, Jeff (2016). "Quantum violation of the pigeonhole principle and the nature of quantum correlations". Proceedings of the National Academy of Sciences. 113 (3): 532–535. Bibcode:2016PNAS..113..532A. doi: 10.1073/pnas.1522411112 . PMC   4725468 . PMID   26729862.
  24. "Quantum pigeonholes are not paradoxical after all, say physicists". 8 January 2015.
  25. Rae, Alastair; Forgan, Ted (2014-12-03). "On the implications of the Quantum-Pigeonhole Effect". arXiv: 1412.1333 [quant-ph].

Related Research Articles

<span class="mw-page-title-main">Binomial distribution</span> Probability distribution

In probability theory and statistics, the binomial distribution with parameters n and p is the discrete probability distribution of the number of successes in a sequence of n independent experiments, each asking a yes–no question, and each with its own Boolean-valued outcome: success or failure. A single success/failure experiment is also called a Bernoulli trial or Bernoulli experiment, and a sequence of outcomes is called a Bernoulli process; for a single trial, i.e., n = 1, the binomial distribution is a Bernoulli distribution. The binomial distribution is the basis for the popular binomial test of statistical significance.

<span class="mw-page-title-main">Binary search algorithm</span> Search algorithm finding the position of a target value within a sorted array

In computer science, binary search, also known as half-interval search, logarithmic search, or binary chop, is a search algorithm that finds the position of a target value within a sorted array. Binary search compares the target value to the middle element of the array. If they are not equal, the half in which the target cannot lie is eliminated and the search continues on the remaining half, again taking the middle element to compare to the target value, and repeating this until the target value is found. If the search ends with the remaining half being empty, the target is not in the array.

<span class="mw-page-title-main">Floor and ceiling functions</span> Nearest integers from a number

In mathematics, the floor function (or greatest integer function) is the function that takes as input a real number x, and gives as output the greatest integer less than or equal to x, denoted x or floor(x). Similarly, the ceiling function maps x to the smallest integer greater than or equal to x, denoted x or ceil(x).

<span class="mw-page-title-main">Derangement</span> Permutation of the elements of a set in which no element appears in its original position

In combinatorial mathematics, a derangement is a permutation of the elements of a set in which no element appears in its original position. In other words, a derangement is a permutation that has no fixed points.

<span class="mw-page-title-main">Quantization (signal processing)</span> Process of mapping a continuous set to a countable set

Quantization, in mathematics and digital signal processing, is the process of mapping input values from a large set to output values in a (countable) smaller set, often with a finite number of elements. Rounding and truncation are typical examples of quantization processes. Quantization is involved to some degree in nearly all digital signal processing, as the process of representing a signal in digital form ordinarily involves rounding. Quantization also forms the core of essentially all lossy compression algorithms.

In mathematics a polydivisible number is a number in a given number base with digits abcde... that has the following properties:

  1. Its first digit a is not 0.
  2. The number formed by its first two digits ab is a multiple of 2.
  3. The number formed by its first three digits abc is a multiple of 3.
  4. The number formed by its first four digits abcd is a multiple of 4.
  5. etc.

In number theory, a formula for primes is a formula generating the prime numbers, exactly and without exception. Formulas for calculating primes do exist, however, they are computationally very slow. A number of constraints are known, showing what such a "formula" can and cannot be.

In number theory, the Ankeny–Artin–Chowla congruence is a result published in 1953 by N. C. Ankeny, Emil Artin and S. Chowla. It concerns the class number h of a real quadratic field of discriminant d > 0. If the fundamental unit of the field is

In number theory, the integer square root (isqrt) of a non-negative integer n is the non-negative integer m which is the greatest integer less than or equal to the square root of n,

In mathematics, a pairing function is a process to uniquely encode two natural numbers into a single natural number.

<span class="mw-page-title-main">Discrete uniform distribution</span> Probability distribution on equally likely outcomes

In probability theory and statistics, the discrete uniform distribution is a symmetric probability distribution wherein a finite number of values are equally likely to be observed; every one of n values has equal probability 1/n. Another way of saying "discrete uniform distribution" would be "a known, finite number of outcomes equally likely to happen".

In number theory, Dirichlet's theorem on Diophantine approximation, also called Dirichlet's approximation theorem, states that for any real numbers and , with , there exist integers and such that and

Euclid's theorem is a fundamental statement in number theory that asserts that there are infinitely many prime numbers. It was first proven by Euclid in his work Elements. There are several proofs of the theorem.

The statistics of random permutations, such as the cycle structure of a random permutation are of fundamental importance in the analysis of algorithms, especially of sorting algorithms, which operate on random permutations. Suppose, for example, that we are using quickselect to select a random element of a random permutation. Quickselect will perform a partial sort on the array, as it partitions the array according to the pivot. Hence a permutation will be less disordered after quickselect has been performed. The amount of disorder that remains may be analysed with generating functions. These generating functions depend in a fundamental way on the generating functions of random permutation statistics. Hence it is of vital importance to compute these generating functions.

In mathematics, Abel's summation formula, introduced by Niels Henrik Abel, is intensively used in analytic number theory and the study of special functions to compute series.

<span class="mw-page-title-main">Dedekind number</span> Combinatorial sequence of numbers

In mathematics, the Dedekind numbers are a rapidly growing sequence of integers named after Richard Dedekind, who defined them in 1897. The Dedekind number M(n) is the number of monotone boolean functions of n variables. Equivalently, it is the number of antichains of subsets of an n-element set, the number of elements in a free distributive lattice with n generators, and one more than the number of abstract simplicial complexes on a set with n elements.

<span class="mw-page-title-main">Orchard-planting problem</span> Geometry; how many 3-point lines can n points form

In discrete geometry, the original orchard-planting problem asks for the maximum number of 3-point lines attainable by a configuration of a specific number of points in the plane. There are also investigations into how many k-point lines there can be. Hallard T. Croft and Paul Erdős proved

In stochastic analysis, a rough path is a generalization of the notion of smooth path allowing to construct a robust solution theory for controlled differential equations driven by classically irregular signals, for example a Wiener process. The theory was developed in the 1990s by Terry Lyons. Several accounts of the theory are available.

The purpose of this page is to catalog new, interesting, and useful identities related to number-theoretic divisor sums, i.e., sums of an arithmetic function over the divisors of a natural number , or equivalently the Dirichlet convolution of an arithmetic function with one:

An agreeable subset is a subset of items that is considered, by all people in a certain group, to be at least as good as its complement. Finding a small agreeable subset is a problem in computational social choice.

References

Listen to this article (24 minutes)
Sound-icon.svg
This audio file was created from a revision of this article dated 5 June 2021 (2021-06-05), and does not reflect subsequent edits.