Buffon's needle problem

Last updated
The a needle lies across a line, while the b needle does not. Buffon needle.svg
The a needle lies across a line, while the b needle does not.

In probability theory, Buffon's needle problem is a question first posed in the 18th century by Georges-Louis Leclerc, Comte de Buffon: [1]

Contents

Suppose we have a floor made of parallel strips of wood, each the same width, and we drop a needle onto the floor. What is the probability that the needle will lie across a line between two strips?

Buffon's needle was the earliest problem in geometric probability to be solved; [2] it can be solved using integral geometry. The solution for the sought probability p, in the case where the needle length l is not greater than the width t of the strips, is

This can be used to design a Monte Carlo method for approximating the number π, although that was not the original motivation for de Buffon's question. [3]

Solution

The problem in more mathematical terms is: Given a needle of length l dropped on a plane ruled with parallel lines t units apart, what is the probability that the needle will lie across a line upon landing?

Let x be the distance from the center of the needle to the closest parallel line, and let θ be the acute angle between the needle and one of the parallel lines.

The uniform probability density function (PDF) of x between 0 and t/2 is

Here, x = 0 represents a needle that is centered directly on a line, and x = t/2 represents a needle that is perfectly centered between two lines. The uniform PDF assumes the needle is equally likely to fall anywhere in this range, but could not fall outside of it.

The uniform probability density function of θ between 0 and π/2 is

Here, θ = 0 represents a needle that is parallel to the marked lines, and θ = π/2 radians represents a needle that is perpendicular to the marked lines. Any angle within this range is assumed an equally likely outcome.

The two random variables, x and θ, are independent, [4] so the joint probability density function is the product

The needle crosses a line if

Now there are two cases.

Case 1: Short needle (lt)

Integrating the joint probability density function gives the probability that the needle will cross a line:

Case 2: Long needle (l > t)

Suppose l > t. In this case, integrating the joint probability density function, we obtain:

where m(θ) is the minimum between l/2 sin θ and t/2.

Thus, performing the above integration, we see that, when l > t, the probability that the needle will cross at least one line is

or

In the second expression, the first term represents the probability of the angle of the needle being such that it will always cross at least one line. The right term represents the probability that the needle falls at an angle where its position matters, and it crosses the line.

Alternatively, notice that whenever θ has a value such that l sin θt, that is, in the range 0 ≤ θ ≤ arcsin t/l, the probability of crossing is the same as in the short needle case. However if l sin θ > t, that is, arcsin t/l < θπ/2 the probability is constant and is equal to 1.

Using elementary calculus

The following solution for the "short needle" case, while equivalent to the one above, has a more visual flavor, and avoids iterated integrals.

We can calculate the probability P as the product of two probabilities: P = P1 · P2, where P1 is the probability that the center of the needle falls close enough to a line for the needle to possibly cross it, and P2 is the probability that the needle actually crosses the line, given that the center is within reach.

Looking at the illustration in the above section, it is apparent that the needle can cross a line if the center of the needle is within l/2 units of either side of the strip. Adding l/2 + l/2 from both sides and dividing by the whole width t, we obtain P1 = l/t.

The red and blue needles are both centered at x. The red one falls within the gray area, contained by an angle of 2th on each side, so it crosses the vertical line; the blue one does not. The proportion of the circle that is gray is what we integrate as the center x goes from 0 to 1 Buffon's needle corrected.PNG
The red and blue needles are both centered at x. The red one falls within the gray area, contained by an angle of 2θ on each side, so it crosses the vertical line; the blue one does not. The proportion of the circle that is gray is what we integrate as the center x goes from 0 to 1

Now, we assume that the center is within reach of the edge of the strip, and calculate P2. To simplify the calculation, we can assume that .

Let x and θ be as in the illustration in this section. Placing a needle's center at x, the needle will cross the vertical axis if it falls within a range of 2θ radians, out of π radians of possible orientations. This represents the gray area to the left of x in the figure. For a fixed x, we can express θ as a function of x: θ(x) = arccos(x). Now we can let x range from 0 to 1, and integrate:

Multiplying both results, we obtain P = P1 · P2 = l/t · 2/π = 2l/ as above.

There is an even more elegant and simple method of calculating the "short needle case". The end of the needle farthest away from any one of the two lines bordering its region must be located within a horizontal (perpendicular to the bordering lines) distance of l cos θ (where θ is the angle between the needle and the horizontal) from this line in order for the needle to cross it. The farthest this end of the needle can move away from this line horizontally in its region is t. The probability that the farthest end of the needle is located no more than a distance l cos θ away from the line (and thus that the needle crosses the line) out of the total distance t it can move in its region for 0 ≤ θπ/2 is given by

Without integrals

The short-needle problem can also be solved without any integration, in a way that explains the formula for p from the geometric fact that a circle of diameter t will cross the distance t strips always (i.e. with probability 1) in exactly two spots. This solution was given by Joseph-Émile Barbier in 1860 [5] and is also referred to as "Buffon's noodle".

Estimating π

An experiment to find p. Matches with the length of 9 squares have been thrown 17 times between rows with the width of 9 squares. 11 of the matches have landed at random across the drawn lines marked by the green points.

2l * n/th =
2 x 9 x 17/9 x 11 [?] 3.1 [?] p. Streicholz-Pi.jpg
An experiment to find π. Matches with the length of 9 squares have been thrown 17 times between rows with the width of 9 squares. 11 of the matches have landed at random across the drawn lines marked by the green points.
2l · n/th = 2 × 9 × 17/9 × 11 ≈ 3.1 ≈ π.
A Python 3 based simulation using Matplotlib to sketch Buffon's needle experiment with the parameters t = 5.0, l = 2.6. Observe the calculated value of p (y-axis) approaching 3.14 as the number of tosses (x-axis) approaches infinity. Buffon needle experiment compressed.gif
A Python 3 based simulation using Matplotlib to sketch Buffon's needle experiment with the parameters t = 5.0, l = 2.6. Observe the calculated value of π (y-axis) approaching 3.14 as the number of tosses (x-axis) approaches infinity.

In the first, simpler case above, the formula obtained for the probability P can be rearranged to

Thus, if we conduct an experiment to estimate P, we will also have an estimate for π.

Suppose we drop n needles and find that h of those needles are crossing lines, so P is approximated by the fraction h/n. This leads to the formula:

In 1901, Italian mathematician Mario Lazzarini performed Buffon's needle experiment. Tossing a needle 3,408 times, he obtained the well-known approximation 355/113 for π, accurate to six decimal places. [6] Lazzarini's "experiment" is an example of confirmation bias, as it was set up to replicate the already well-known approximation of 355/113 (in fact, there is no better rational approximation with fewer than five digits in the numerator and denominator, see also Milü), yielding a more accurate "prediction" of π than would be expected from the number of trials, as follows: [7]

Lazzarini chose needles whose length was 5/6 of the width of the strips of wood. In this case, the probability that the needles will cross the lines is 5/3π. Thus if one were to drop n needles and get x crossings, one would estimate π as

So if Lazzarini was aiming for the result 355/113, he needed n and x such that

or equivalently,

To do this, one should pick n as a multiple of 213, because then 113n/213 is an integer; one then drops n needles, and hopes for exactly x = 113n/213 successes. If one drops 213 needles and happens to get 113 successes, then one can triumphantly report an estimate of π accurate to six decimal places. If not, one can just do 213 more trials and hope for a total of 226 successes; if not, just repeat as necessary. Lazzarini performed 3,408 = 213 × 16 trials, making it seem likely that this is the strategy he used to obtain his "estimate".

The above description of strategy might even be considered charitable to Lazzarini. A statistical analysis of intermediate results he reported for fewer tosses leads to a very low probability of achieving such close agreement to the expected value all through the experiment. This makes it very possible that the "experiment" itself was never physically performed, but based on numbers concocted from imagination to match statistical expectations, but too well, as it turns out. [7]

Dutch science journalist Hans van Maanen argues, however, that Lazzarini's article was never meant to be taken too seriously as it would have been pretty obvious for the readers of the magazine (aimed at school teachers) that the apparatus that Lazzarini said to have built cannot possibly work as described. [8]

Laplace's extension (short needle case)

Now consider the case where the plane contains two sets of parallel lines orthogonal to one another, creating a standard perpendicular grid. We aim to find the probability that the needle intersects at least one line on the grid. Let a and b be the sides of the rectangle that contains the midpoint of the needle whose length is l. Since this is the short needle case, l < a, l < b. Let (x,y) mark the coordinates of the needle's midpoint and let φ mark the angle formed by the needle and the x-axis. Similar to the examples described above, we consider x, y, φ to be independent uniform random variables over the ranges 0 ≤ xa, 0 ≤ yb, π/2φπ/2.

To solve such a problem, we first compute the probability that the needle crosses no lines, and then we take its compliment. We compute this first probability by determining the volume of the domain where the needle crosses no lines and then divide that by the volume of all possibilities, V. We can easily see that V = πab.

Now let V* be the volume of possibilities where the needle does not intersect any line. Developed by J.V. Uspensky, [9]

where F(φ) is the region where the needle does not intersect any line given an angle φ. To determine F(φ), let's first look at the case for the horizontal edges of the bounding rectangle. The total side length is a and the midpoint must not be within l/2 cos φ of either endpoint of the edge. Thus, the total allowable length for no intersection is a − 2(l/2 cos φ) or simply just al cos φ. Equivalently, for the vertical edges with length b, we have b ± l sin φ. The ± accounts for the cases where φ is positive or negative. Taking the positive case and then adding the absolute value signs in the final answer for generality, we get

Now we can compute the following integral:

Thus, the probability that the needle does not intersect any line is

And finally, if we want to calculate the probability, P, that the needle does intersect at least one line, we need to subtract the above result from 1 to compute its compliment, yielding

.

Comparing estimators of π

As mentioned above, Buffon's needle experiment can be used to estimate π. This fact holds for Laplace's extension too since π shows up in that answer as well. The following question then naturally arises and is discussed by E.F. Schuster in 1974. [10] Is Buffon's experiment or Laplace's a better estimator of the value of π? Since in Laplace's extension there are two sets of parallel lines, we compare N drops when there is a grid (Laplace), and 2N drops in Buffon's original experiment.

Let A be the event that the needle intersects a horizontal line (parallel to the x-axis)

and let B be the event that the needle intersects a vertical line (parallel to the y-axis)

For simplicity in the algebraic formulation ahead, let a = b = t = 2l such that the original result in Buffon's problem is P(A) = P(B) = 1/π. Furthermore, let N = 100 drops.

Now let us examine P(AB) for Laplace's result, that is, the probability the needle intersects both a horizontal and a vertical line. We know that

From the above section, P(AB′), or the probability that the needle intersects no lines is

We can solve for P(AB) and P(AB′) using the following method:

Solving for P(AB) and P(AB′) and plugging that into the original definition for P(AB) a few lines above, we get

Although not necessary to the problem, it is now possible to see that P(AB) = P(AB′) = 3/4π. With the values above, we are now able to determine which of these estimators is a better estimator for π. For the Laplace variant, let be the estimator for the probability that there is a line intersection such that

.

We are interested in the variance of such an estimator to understand the usefulness or efficiency of it. To compute the variance of , we first compute Var(xn + yn) where

Solving for each part individually,

We know from the previous section that

yielding

Thus,

Returning to the original problem of this section, the variance of estimator is

Now let us calculate the number of drops, M, needed to achieve the same variance as 100 drops over perpendicular lines. If M < 200 then we can conclude that the setup with only parallel lines is more efficient than the case with perpendicular lines. Conversely if M is equal to or more than 200, than Buffon's experiment is equally or less efficient, respectively. Let be the estimator for Buffon's original experiment. Then,

and

Solving for M,

Thus, it takes 222 drops with only parallel lines to have the same certainty as 100 drops in Laplace's case. This isn't actually surprising because of the observation that Cov(xn,yn) < 0. Because xn and yn are negatively correlated random variables, they act to reduce the total variance in the estimator that is an average of the two of them. This method of variance reduction is known as the antithetic variates method.

See also

Related Research Articles

In physics, the cross section is a measure of the probability that a specific process will take place in a collision of two particles. For example, the Rutherford cross-section is a measure of probability that an alpha particle will be deflected by a given angle during an interaction with an atomic nucleus. Cross section is typically denoted σ (sigma) and is expressed in units of area, more specifically in barns. In a way, it can be thought of as the size of the object that the excitation must hit in order for the process to occur, but more exactly, it is a parameter of a stochastic process.

<span class="mw-page-title-main">Kinetic theory of gases</span> Understanding of gas properties in terms of molecular motion

The kinetic theory of gases is a simple classical model of the thermodynamic behavior of gases. It treats a gas as composed of numerous particles, too small to see with a microscope, which are constantly in random motion. Their collisions with each other and with the walls of their container are used to explain physical properties of the gas—for example the relationship between its temperature, pressure and volume. The particles are now known to be the atoms or molecules of the gas.

<span class="mw-page-title-main">Ellipsoid</span> Quadric surface that looks like a deformed sphere

An ellipsoid is a surface that can be obtained from a sphere by deforming it by means of directional scalings, or more generally, of an affine transformation.

<span class="mw-page-title-main">Tautochrone curve</span> Curve for which the time to roll to the end is equal for all starting points

A tautochrone curve or isochrone curve is the curve for which the time taken by an object sliding without friction in uniform gravity to its lowest point is independent of its starting point on the curve. The curve is a cycloid, and the time is equal to π times the square root of the radius over the acceleration of gravity. The tautochrone curve is related to the brachistochrone curve, which is also a cycloid.

In mechanics and geometry, the 3D rotation group, often denoted SO(3), is the group of all rotations about the origin of three-dimensional Euclidean space under the operation of composition.

<span class="mw-page-title-main">Spherical harmonics</span> Special mathematical functions defined on the surface of a sphere

In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. A list of the spherical harmonics is available in Table of spherical harmonics.

<span class="mw-page-title-main">Inverse trigonometric functions</span> Inverse functions of sin, cos, tan, etc.

In mathematics, the inverse trigonometric functions are the inverse functions of the trigonometric functions. Specifically, they are the inverses of the sine, cosine, tangent, cotangent, secant, and cosecant functions, and are used to obtain an angle from any of the angle's trigonometric ratios. Inverse trigonometric functions are widely used in engineering, navigation, physics, and geometry.

<span class="mw-page-title-main">Clausen function</span> Transcendental single-variable function

In mathematics, the Clausen function, introduced by Thomas Clausen, is a transcendental, special function of a single variable. It can variously be expressed in the form of a definite integral, a trigonometric series, and various other forms. It is intimately connected with the polylogarithm, inverse tangent integral, polygamma function, Riemann zeta function, Dirichlet eta function, and Dirichlet beta function.

In probability theory, the Borel–Kolmogorov paradox is a paradox relating to conditional probability with respect to an event of probability zero. It is named after Émile Borel and Andrey Kolmogorov.

In probability and statistics, a circular distribution or polar distribution is a probability distribution of a random variable whose values are angles, usually taken to be in the range [0, 2π). A circular distribution is often a continuous probability distribution, and hence has a probability density, but such distributions can also be discrete, in which case they are called circular lattice distributions. Circular distributions can be used even when the variables concerned are not explicitly angles: the main consideration is that there is not usually any real distinction between events occurring at the opposite ends of the range, and the division of the range could notionally be made at any point.

In geometry, the area enclosed by a circle of radius r is πr2. Here the Greek letter π represents the constant ratio of the circumference of any circle to its diameter, approximately equal to 3.14159.

In calculus, the Leibniz integral rule for differentiation under the integral sign states that for an integral of the form

<span class="mw-page-title-main">Multiple integral</span> Generalization of definite integrals to functions of multiple variables

In mathematics (specifically multivariable calculus), a multiple integral is a definite integral of a function of several real variables, for instance, f(x, y) or f(x, y, z). Physical (natural philosophy) interpretation: S any surface, V any volume, etc.. Incl. variable to time, position, etc.

<span class="mw-page-title-main">Sine and cosine</span> Fundamental trigonometric functions

In mathematics, sine and cosine are trigonometric functions of an angle. The sine and cosine of an acute angle are defined in the context of a right triangle: for the specified angle, its sine is the ratio of the length of the side that is opposite that angle to the length of the longest side of the triangle, and the cosine is the ratio of the length of the adjacent leg to that of the hypotenuse. For an angle , the sine and cosine functions are denoted simply as and .

There are several equivalent ways for defining trigonometric functions, and the proofs of the trigonometric identities between them depend on the chosen definition. The oldest and most elementary definitions are based on the geometry of right triangles. The proofs given in this article use these definitions, and thus apply to non-negative angles not greater than a right angle. For greater and negative angles, see Trigonometric functions.

A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.

In geometric probability, the problem of Buffon's noodle is a variation on the well-known problem of Buffon's needle, named after Georges-Louis Leclerc, Comte de Buffon who lived in the 18th century. This approach to the problem was published by Joseph-Émile Barbier in 1860.

In quantum computing, the quantum phase estimation algorithm is a quantum algorithm to estimate the phase corresponding to an eigenvalue of a given unitary operator. Because the eigenvalues of a unitary operator always have unit modulus, they are characterized by their phase, and therefore the algorithm can be equivalently described as retrieving either the phase or the eigenvalue itself. The algorithm was initially introduced by Alexei Kitaev in 1995.

In the theory of Lorentzian manifolds, spherically symmetric spacetimes admit a family of nested round spheres. In such a spacetime, a particularly important kind of coordinate chart is the Schwarzschild chart, a kind of polar spherical coordinate chart on a static and spherically symmetric spacetime, which is adapted to these nested round spheres. The defining characteristic of Schwarzschild chart is that the radial coordinate possesses a natural geometric interpretation in terms of the surface area and Gaussian curvature of each sphere. However, radial distances and angles are not accurately represented.

References

  1. Histoire de l'Acad. Roy. des. Sciences (1733), 43–45; Histoire naturelle, générale et particulière Supplément 4 (1777), p. 46.
  2. Seneta, Eugene; Parshall, Karen Hunger; Jongmans, François (2001). "Nineteenth-Century Developments in Geometric Probability: J. J. Sylvester, M. W. Crofton, J.-É. Barbier, and J. Bertrand". Archive for History of Exact Sciences. 55 (6): 501–524. doi:10.1007/s004070100038. ISSN   0003-9519. JSTOR   41134124. S2CID   124429237.
  3. Behrends, Ehrhard. "Buffon: Hat er Stöckchen geworfen oder hat er nicht?" (PDF). Retrieved 14 March 2015.
  4. The problem formulation here avoids having to work with Regular conditional probability densities.
  5. Aigner, Martin; Ziegler, Günter M. (2013). Proofs from THE BOOK (2nd ed.). Springer Science & Business Media. pp. 189–192.
  6. Lazzarini, M. (1901). "Un'applicazione del calcolo della probabilità alla ricerca sperimentale di un valore approssimato di π" [An Application of Probability Theory to Experimental Research of an Approximation of π]. Periodico di Matematica per l'Insegnamento Secondario (in Italian). 4: 140–143.
  7. 1 2 Lee Badger, 'Lazzarini's Lucky Approximation of π', Mathematics Magazine 67, 1994, 83–91.
  8. Hans van Maanen, 'Het stokje van Lazzarini' (Lazzarini's stick), "Skepter" 31.3, 2018.
  9. J.V. Uspensky, 'Introduction to Mathematical Probability', 1937, 255.
  10. E. F. Schuster, Buffon's Needle Experiment, The American Mathematical Monthly, 1974, 29-29.

Bibliography