The Newton–Pepys problem is a probability problem concerning the chances of throwing at least a given number of sixes with a certain number of dice. [1]
In 1693 Samuel Pepys and Isaac Newton corresponded over a problem posed to Pepys by a school teacher named John Smith. [2] The problem was:
Which of the following three propositions has the greatest chance of success?
- A. Six fair dice are tossed independently and at least one "6" appears.
- B. Twelve fair dice are tossed independently and at least two "6"s appear.
- C. Eighteen fair dice are tossed independently and at least three "6"s appear. [3]
Pepys initially thought that outcome C was the most likely, but Newton correctly concluded that outcome A has the highest probability.
The probabilities of outcomes A, B and C are: [1]

$$P(A) = 1 - \left(\tfrac{5}{6}\right)^{6} \approx 0.6651,$$
$$P(B) = 1 - \sum_{x=0}^{1}\binom{12}{x}\left(\tfrac{1}{6}\right)^{x}\left(\tfrac{5}{6}\right)^{12-x} \approx 0.6187,$$
$$P(C) = 1 - \sum_{x=0}^{2}\binom{18}{x}\left(\tfrac{1}{6}\right)^{x}\left(\tfrac{5}{6}\right)^{18-x} \approx 0.5973.$$
These results may be obtained by applying the binomial distribution (although Newton obtained them from first principles). In general, if P(n) is the probability of throwing at least n sixes with 6n dice, then:

$$P(n) = 1 - \sum_{x=0}^{n-1}\binom{6n}{x}\left(\tfrac{1}{6}\right)^{x}\left(\tfrac{5}{6}\right)^{6n-x}.$$
As n grows, P(n) decreases monotonically towards an asymptotic limit of 1/2.
The solution outlined above can be implemented in R as follows:
for (s in 1:3) {              # looking for s = 1, 2 or 3 sixes
  n = 6 * s                   # ... in n = 6, 12 or 18 dice
  q = pbinom(s - 1, n, 1/6)   # q = Prob( < s sixes in n dice )
  cat("Probability of at least", s, "six in", n, "fair dice:", 1 - q, "\n")
}
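The asymptotic claim can also be checked numerically; a minimal sketch (the sample values of n are our own illustrative choice):

# P(n) = Prob(at least n sixes in 6n fair dice); pbinom gives Prob(< n sixes)
P = function(n) 1 - pbinom(n - 1, 6 * n, 1/6)
sapply(c(1, 2, 3, 10, 100), P)
# P(1) ~ 0.6651, P(2) ~ 0.6187, P(3) ~ 0.5973, then slowly down towards 1/2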
Although Newton correctly calculated the odds of each bet, he provided a separate intuitive explanation to Pepys. He imagined that B and C toss their dice in groups of six, and said that A was most favorable because it required a 6 in only one toss, while B and C required a 6 in each of their tosses. This explanation assumes that a group does not produce more than one 6, so it does not actually correspond to the original problem. [3]
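A quick numerical illustration of that mismatch (this comparison is ours, not Newton's): under the simplified reading, B succeeds only if each of its two groups of six dice shows at least one 6, which is a different, and rarer, event than "at least two 6s in twelve dice".

# Newton's simplified reading of proposition B: each of the two groups of
# six dice must show at least one 6 (tacitly at most one 6 per group)
p_group = 1 - (5/6)^6       # at least one 6 in a group of six dice, ~0.6651
p_group^2                   # ~0.4424 under the simplified reading
1 - pbinom(1, 12, 1/6)      # ~0.6187, the actual proposition B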
A natural generalization of the problem is to consider n dice that are not necessarily fair, with p the probability that each die will select the 6 face when thrown (note that the number of faces of the dice and which face should be selected are actually irrelevant). If r is the total number of dice selecting the 6 face, then

$$\nu^{n}_{k,p} = \Pr(r \geq k) = \sum_{j=k}^{n}\binom{n}{j}p^{j}(1-p)^{n-j}$$

is the probability of having at least k correct selections when throwing exactly n dice. Then the original Newton–Pepys problem can be generalized as follows:
Let $\alpha, \beta$ be positive natural numbers s.t. $\alpha \leq \beta$. Is $\nu^{\alpha n}_{\alpha k,p}$ then not smaller than $\nu^{\beta n}_{\beta k,p}$ for all n, p, k?
Notice that, with this notation, the original Newton–Pepys problem reads as: is $\nu^{6}_{1,1/6} \geq \nu^{12}_{2,1/6} \geq \nu^{18}_{3,1/6}$?
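In R, $\nu^{n}_{k,p}$ can be written directly from its definition; a minimal sketch (the helper name nu is our own):

# nu(n, p, k) = Prob(at least k successes among n independent trials,
# each succeeding with probability p)
nu = function(n, p, k) 1 - pbinom(k - 1, n, p)
nu(6, 1/6, 1)     # ~0.6651
nu(12, 1/6, 2)    # ~0.6187
nu(18, 1/6, 3)    # ~0.5973 -- the original chain of inequalities holds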
As noted in Rubin and Evans (1961), there are no uniform answers to the generalized Newton–Pepys problem, since answers depend on k, n and p. There are nonetheless some variations of the previous question that admit uniform answers:
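The dependence on p is easy to exhibit; a hedged example reusing the nu helper above (the biased value p = 0.9 is our own choice):

# With dice heavily biased towards the 6 face, the original ordering reverses:
nu(6, 0.9, 1)     # 1 - 0.1^6, ~0.999999
nu(12, 0.9, 2)    # ~1 - 1.1e-10, larger than nu(6, 0.9, 1)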
(from Chaundy and Bullard (1960)): [4]
If $k_1, k_2$ are positive natural numbers, and $k_1 < k_2$, then $\nu^{k_1 n}_{k_1, \frac{1}{n}} > \nu^{k_2 n}_{k_2, \frac{1}{n}}$.
If $k, n_1, n_2$ are positive natural numbers, and $n_1 < n_2$, then $\nu^{k n_1}_{k, \frac{1}{n_1}} > \nu^{k n_2}_{k, \frac{1}{n_2}}$.
(from Varagnolo, Pillonetto and Schenato (2013)): [5]
If $k, n_1, n_2$ are positive natural numbers, and $n_1 < n_2$, then $\nu^{k n_1}_{k+1, \frac{1}{n_1}} < \nu^{k n_2}_{k+1, \frac{1}{n_2}}$.
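These statements can be spot-checked numerically with the nu helper defined earlier; a small sketch (the particular values of k, n1 and n2 are illustrative choices of ours):

# Chaundy and Bullard, first form (n = 6, k1 = 1, k2 = 2):
nu(6, 1/6, 1) > nu(12, 1/6, 2)      # TRUE
# Chaundy and Bullard, second form (k = 1, n1 = 6, n2 = 12):
nu(6, 1/6, 1) > nu(12, 1/12, 1)     # TRUE
# Varagnolo, Pillonetto and Schenato (k = 1, n1 = 6, n2 = 12):
nu(6, 1/6, 2) < nu(12, 1/12, 2)     # TRUE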