Bernoulli trial

Last updated January 09, 2025

In the theory of probability and statistics, a Bernoulli trial (or binomial trial) is a random experiment with exactly two possible outcomes, "success" and "failure", in which the probability of success is the same every time the experiment is conducted.^[1] It is named after Jacob Bernoulli, a 17th-century Swiss mathematician, who analyzed them in his Ars Conjectandi (1713).^[2]

Success and failure are in this context labels for the two outcomes, and should not be construed literally or as value judgments. More generally, given any probability space, for any event (set of outcomes), one can define a Bernoulli trial according to whether the event occurred or not (event or complementary event). Examples of Bernoulli trials include:

Flipping a coin. In this context, obverse ("heads") conventionally denotes success and reverse ("tails") denotes failure. A fair coin has the probability of success 0.5 by definition. In this case, there are exactly two possible outcomes.
Rolling a die, where a six is "success" and everything else a "failure". In this case, there are six possible outcomes, and the event is a six; the complementary event "not a six" corresponds to the other five possible outcomes.
In conducting a political opinion poll, choosing a voter at random to ascertain whether that voter will vote "yes" in an upcoming referendum.

Preliminary

Suppose an experiment consists of independently repeated trials, each of which has only two possible outcomes; such trials are called Bernoulli trials. Each of the $n$ realizations, recorded as success (1) or failure (0), is described by a Bernoulli random variable $X$ with probability mass function $f(x)=P(X=x)$, where $f(1)=p$ and $f(0)=1-p$. The observed proportion of successes, $\text{total}_{1}/n$, where $\text{total}_{1}$ is the number of trials that resulted in 1, serves as an empirical estimate of $p$.

Let $p$ be the probability of success in a Bernoulli trial, and $q$ be the probability of failure. Then the probability of success and the probability of failure sum to one, since these are complementary events: "success" and "failure" are mutually exclusive and exhaustive. Thus, one has the following relations:

p=1-q,\quad \quad q=1-p,\quad \quad p+q=1.

Alternatively, these can be stated in terms of odds: given probability $p$ of success and $q$ of failure, the odds for are $p:q$ and the odds against are $q:p$. These can also be expressed as numbers, by dividing, yielding the odds for, $o_{f}$, and the odds against, $o_{a}$:

{\begin{aligned}o_{f}&=p/q=p/(1-p)=(1-q)/q\\o_{a}&=q/p=(1-p)/p=q/(1-q).\end{aligned}}

These are multiplicative inverses, so they multiply to 1, with the following relations:

o_{f}=1/o_{a},\quad o_{a}=1/o_{f},\quad o_{f}\cdot o_{a}=1.

When a Bernoulli trial represents an event defined over finitely many equally likely outcomes, where $S$ of the outcomes are successes and $F$ of the outcomes are failures, the odds for are $S:F$ and the odds against are $F:S$. This yields the following formulas for probability and odds:

{\begin{aligned}p&=S/(S+F)\\q&=F/(S+F)\\o_{f}&=S/F\\o_{a}&=F/S.\end{aligned}}

Here the odds are computed by dividing the number of outcomes, not the probabilities, but the proportion is the same, since these ratios only differ by multiplying both terms by the same constant factor.
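The relations between probability and odds can be checked numerically. The following is a minimal Python sketch, not a reference implementation; the function names are illustrative, and exact fractions are used so that no rounding is involved.

```python
from fractions import Fraction

def odds_for(p):
    """Odds in favour, o_f = p / (1 - p); undefined when p == 1."""
    return p / (1 - p)

def odds_against(p):
    """Odds against, o_a = (1 - p) / p; undefined when p == 0."""
    return (1 - p) / p

def probability_from_counts(successes, failures):
    """p = S / (S + F) for finitely many equally likely outcomes."""
    return Fraction(successes, successes + failures)

# Rolling a die: one face is a six (S = 1), five faces are not (F = 5).
p = probability_from_counts(1, 5)
print(p, odds_for(p), odds_against(p))  # 1/6 1/5 5
```

Here the odds for equal $S/F=1/5$ whether they are computed from the counts or from the probabilities, and the two odds multiply to 1.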

Random variables describing Bernoulli trials are often encoded using the convention that 1 = "success", 0 = "failure".
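Under this 0/1 encoding, a Bernoulli trial is straightforward to simulate. The sketch below assumes Python's standard pseudo-random generator is an adequate source of randomness; the function names, the seed, and the number of trials are arbitrary choices made for illustration.

```python
import random

def bernoulli_trial(p, rng):
    """Return 1 (success) with probability p, otherwise 0 (failure)."""
    return 1 if rng.random() < p else 0

def success_proportion(p, n, rng):
    """Run n independent trials and return the fraction that succeeded."""
    return sum(bernoulli_trial(p, rng) for _ in range(n)) / n

rng = random.Random(0)  # fixed seed (arbitrary) so the run is repeatable
# "Rolling a six" as the success event, with p = 1/6.
die_proportion = success_proportion(1 / 6, 100_000, rng)
print(die_proportion)  # expected to be close to 1/6
```

Because each trial returns 0 or 1, the sum of the outcomes is the number of successes, and dividing by $n$ gives the observed proportion of successes.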

Closely related to a Bernoulli trial is a binomial experiment, which consists of a fixed number $n$ of statistically independent Bernoulli trials, each with a probability of success $p$, and counts the number of successes. A random variable corresponding to a binomial experiment is denoted by $B(n,p)$, and is said to have a binomial distribution. The probability of exactly $k$ successes in the experiment $B(n,p)$ is given by:

P(k)={n \choose k}p^{k}q^{n-k}

where ${n \choose k}$ is a binomial coefficient.
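The formula translates directly into code. This is a small Python sketch using the standard library's `math.comb` for the binomial coefficient; the function name `binomial_pmf` and the example parameters are illustrative.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    q = 1 - p  # probability of failure
    return comb(n, k) * p**k * q**(n - k)

# Full distribution for n = 5 trials with p = 1/2; the values sum to 1.
distribution = [binomial_pmf(k, 5, 0.5) for k in range(6)]
print(distribution, sum(distribution))
```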

Bernoulli trials may also lead to negative binomial distributions (which count the number of successes in a series of repeated Bernoulli trials until a specified number of failures are seen), as well as various other distributions.
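Under the convention just stated (counting successes until a specified number $r$ of failures is seen), the last trial must be the $r$-th failure and the $k$ successes can fall anywhere among the preceding $k+r-1$ trials, which gives $P(k)={k+r-1 \choose k}p^{k}q^{r}$. The Python sketch below evaluates this expression; the symbol $r$ and the function name are introduced here for illustration, and other sources use the opposite convention (failures before a fixed number of successes).

```python
from math import comb

def negative_binomial_pmf(k, r, p):
    """P(exactly k successes occur before the r-th failure)."""
    q = 1 - p  # probability of failure
    return comb(k + r - 1, k) * p**k * q**r

# With p = 1/2 and r = 3, the probabilities over k = 0, 1, 2, ... sum to 1
# (truncated at k = 199, where the remaining tail is negligible).
total = sum(negative_binomial_pmf(k, 3, 0.5) for k in range(200))
mean = sum(k * negative_binomial_pmf(k, 3, 0.5) for k in range(200))
print(total, mean)  # mean should be close to r * p / q = 3
```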

When multiple Bernoulli trials are performed, each with its own probability of success, these are sometimes referred to as Poisson trials.^[3]
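When the trials are independent but have different success probabilities, the distribution of the total number of successes can be built up one trial at a time. The following Python sketch assumes independence and uses a simple dynamic-programming update; the function name and the example probabilities are hypothetical.

```python
def success_count_distribution(probabilities):
    """Return a list d where d[k] = P(exactly k successes)."""
    dist = [1.0]  # with zero trials, zero successes is certain
    for p in probabilities:
        updated = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            updated[k] += mass * (1 - p)  # this trial fails
            updated[k + 1] += mass * p    # this trial succeeds
        dist = updated
    return dist

# Three trials with success probabilities 1/2, 1/6 and 9/10.
mixed = success_count_distribution([0.5, 1 / 6, 0.9])
print(mixed, sum(mixed))
```

When all the probabilities are equal, the result coincides with the binomial distribution $B(n,p)$.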

Examples

Tossing coins

Consider the simple experiment where a fair coin is tossed four times. Find the probability that exactly two of the tosses result in heads.

Solution

For this experiment, let a heads be defined as a success and a tails as a failure. Because the coin is assumed to be fair, the probability of success is $p={\tfrac {1}{2}}$. Thus, the probability of failure, $q$, is given by

q=1-p=1-{\tfrac {1}{2}}={\tfrac {1}{2}}.

Using the equation above, the probability of exactly two tosses out of four total tosses resulting in a heads is given by:

{\begin{aligned}P(2)&={4 \choose 2}p^{2}q^{4-2}\\&=6\times \left({\tfrac {1}{2}}\right)^{2}\times \left({\tfrac {1}{2}}\right)^{2}\\&={\dfrac {3}{8}}.\end{aligned}}
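As a cross-check, the sixteen equally likely sequences of four tosses can be enumerated directly and the ones with exactly two heads counted. This is a minimal Python sketch; the variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

# All 2**4 = 16 sequences of heads (H) and tails (T), equally likely
# for a fair coin.
outcomes = list(product("HT", repeat=4))
favourable = [o for o in outcomes if o.count("H") == 2]

probability = Fraction(len(favourable), len(outcomes))
print(len(favourable), len(outcomes), probability)  # 6 16 3/8
```

The count of favourable sequences, 6, is the binomial coefficient ${4 \choose 2}$ that appears in the formula.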

Rolling dice

What is the probability that when three independent fair six-sided dice are rolled, exactly two yield sixes?

Solution

On one die, the probability of rolling a six is $p={\tfrac {1}{6}}$. Thus, the probability of not rolling a six is $q=1-p={\tfrac {5}{6}}$.

As above, the probability of exactly two sixes out of three is

{\begin{aligned}P(2)&={3 \choose 2}p^{2}q^{3-2}\\&=3\times \left({\tfrac {1}{6}}\right)^{2}\times \left({\tfrac {5}{6}}\right)^{1}\\&={\dfrac {5}{72}}\approx 0.069.\end{aligned}}
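The value can also be approximated by simulation. The sketch below is a Monte Carlo estimate in Python, assuming the standard pseudo-random generator; the function name, the seed, and the number of rounds are arbitrary illustrative choices.

```python
import random

def estimate_two_sixes(num_rounds, rng):
    """Estimate P(exactly two sixes among three fair dice) by simulation."""
    hits = 0
    for _ in range(num_rounds):
        # Roll three dice and count how many show a six.
        sixes = sum(1 for _ in range(3) if rng.randint(1, 6) == 6)
        if sixes == 2:
            hits += 1
    return hits / num_rounds

estimate = estimate_two_sixes(100_000, random.Random(0))
print(estimate, 5 / 72)  # the estimate should be close to 5/72
```

The estimate fluctuates from run to run with different seeds, but it approaches ${\tfrac {5}{72}}$ as the number of rounds grows.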

Related Research Articles

In probability theory and statistics, the binomial distribution with parameters $n$ and $p$ is the discrete probability distribution of the number of successes in a sequence of $n$ independent experiments, each asking a yes–no question, and each with its own Boolean-valued outcome: success or failure. A single success/failure experiment is also called a Bernoulli trial or Bernoulli experiment, and a sequence of outcomes is called a Bernoulli process; for a single trial, i.e., $n=1$, the binomial distribution is a Bernoulli distribution. The binomial distribution is the basis for the binomial test of statistical significance.

In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of possible outcomes for an experiment. It is a mathematical description of a random phenomenon in terms of its sample space and the probabilities of events.

In probability theory and statistics, the negative binomial distribution is a discrete probability distribution that models the number of failures in a sequence of independent and identically distributed Bernoulli trials before a specified (fixed) number of successes occurs. For example, we can define rolling a 6 on a die as a success, and rolling any other number as a failure, and ask how many failure rolls will occur before we see the third success. In such a case, the probability distribution of the number of failures that appear will be a negative binomial distribution.

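In probability theory and statistics, the geometric distribution is either one of two discrete probability distributions: the distribution of the number of Bernoulli trials needed to get one success, supported on $\{1,2,3,\ldots \}$, or the distribution of the number of failures before the first success, supported on $\{0,1,2,\ldots \}$.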

In probability and statistics, a Bernoulli process is a finite or infinite sequence of binary random variables, so it is a discrete-time stochastic process that takes only two values, canonically 0 and 1. The component Bernoulli variables $X_{i}$ are identically distributed and independent. Prosaically, a Bernoulli process is repeated coin flipping, possibly with an unfair coin. Every variable $X_{i}$ in the sequence is associated with a Bernoulli trial or experiment. They all have the same Bernoulli distribution. Much of what can be said about the Bernoulli process can also be generalized to more than two outcomes; this generalization is known as the Bernoulli scheme.

In probability and statistics, a probability mass function is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes it is also known as the discrete probability density function. The probability mass function is often the primary means of defining a discrete probability distribution, and such functions exist for either scalar or multivariate random variables whose domain is discrete.

In probability theory, odds provide a measure of the probability of a particular outcome. Odds are commonly used in gambling and statistics. For example, for an event that is 40% probable, one could say that the odds are "2 in 5", "2 to 3 in favor", or "3 to 2 against".

In probability theory and statistics, the hypergeometric distribution is a discrete probability distribution that describes the probability of $k$ successes (draws of an object with a specified feature) in $n$ draws, without replacement, from a finite population of size $N$ that contains exactly $K$ objects with that feature, wherein each draw is either a success or a failure. In contrast, the binomial distribution describes the probability of $k$ successes in $n$ draws with replacement.

In probability theory and statistics, the Bernoulli distribution, named after Swiss mathematician Jacob Bernoulli, is the discrete probability distribution of a random variable which takes the value 1 with probability $p$ and the value 0 with probability $q=1-p$. Less formally, it can be thought of as a model for the set of possible outcomes of any single experiment that asks a yes-no question. Such questions lead to outcomes that are Boolean-valued: a single bit whose value is success/yes/true/one with probability $p$ and failure/no/false/zero with probability $q$. It can be used to represent a coin toss where 1 and 0 would represent "heads" and "tails", respectively, and $p$ would be the probability of the coin landing on heads. In particular, unfair coins would have $p\neq 1/2$.

In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] or (0, 1) in terms of two positive parameters, denoted by alpha (α) and beta (β), that appear as exponents of the variable and its complement to 1, respectively, and control the shape of the distribution.

In information theory, the information content, self-information, surprisal, or Shannon information is a basic quantity derived from the probability of a particular event occurring from a random variable. It can be thought of as an alternative way of expressing probability, much like odds or log-odds, but which has particular mathematical advantages in the setting of information theory.

In probability theory, the multinomial distribution is a generalization of the binomial distribution. For example, it models the probability of counts for each side of a k-sided die rolled n times. For n independent trials each of which leads to a success for exactly one of k categories, with each category having a given fixed success probability, the multinomial distribution gives the probability of any particular combination of numbers of successes for the various categories.

Lottery mathematics is used to calculate probabilities of winning or losing a lottery game. It is based primarily on combinatorics, particularly the twelvefold way and combinations without replacement. It can also be used to analyze coincidences that happen in lottery drawings, such as repeated numbers appearing across different draws.

In statistics, a binomial proportion confidence interval is a confidence interval for the probability of success calculated from the outcome of a series of success–failure experiments. In other words, a binomial proportion confidence interval is an interval estimate of a success probability $p$ when only the number of experiments and the number of successes are known.

In statistics, binomial regression is a regression analysis technique in which the response has a binomial distribution: it is the number of successes in a series of $n$ independent Bernoulli trials, where each trial has probability of success $p$. In binomial regression, the probability of a success is related to explanatory variables: the corresponding concept in ordinary regression is to relate the mean value of the unobserved response to explanatory variables.

In probability theory and statistics, the beta-binomial distribution is a family of discrete probability distributions on a finite support of non-negative integers arising when the probability of success in each of a fixed or known number of Bernoulli trials is either unknown or random. The beta-binomial distribution is the binomial distribution in which the probability of success at each of n trials is not fixed but randomly drawn from a beta distribution. It is frequently used in Bayesian statistics, empirical Bayes methods and classical statistics to capture overdispersion in binomial type distributed data.

In probability theory, the de Moivre–Laplace theorem, which is a special case of the central limit theorem, states that the normal distribution may be used as an approximation to the binomial distribution under certain conditions. In particular, the theorem shows that the probability mass function of the random number of "successes" observed in a series of $n$ independent Bernoulli trials, each having probability $p$ of success, converges to the probability density function of the normal distribution with expectation $np$ and standard deviation ${\sqrt {np(1-p)}}$, as $n$ grows large, assuming $p$ is not $0$ or $1$.

In probability theory and statistics, the Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time if these events occur with a known constant mean rate and independently of the time since the last event. It can also be used for the number of events in other types of intervals than time, and in dimension greater than 1.

In probability theory and statistics, the Poisson binomial distribution is the discrete probability distribution of a sum of independent Bernoulli trials that are not necessarily identically distributed. The concept is named after Siméon Denis Poisson.

In probability theory, a beta negative binomial distribution is the probability distribution of a discrete random variable $X$ equal to the number of failures needed to get $r$ successes in a sequence of independent Bernoulli trials. The probability of success on each trial stays constant within any given experiment but varies across different experiments following a beta distribution. Thus the distribution is a compound probability distribution.

References

[1] Papoulis, A. (1984). "Bernoulli Trials". Probability, Random Variables, and Stochastic Processes (2nd ed.). New York: McGraw-Hill. pp. 57–63.
[2] Uspensky, James Victor (1937). Introduction to Mathematical Probability. New York: McGraw-Hill. p. 45.
[3] Motwani, Rajeev; Raghavan, P. (1995). Randomized Algorithms. New York: Cambridge University Press. pp. 67–68.

External links

"Bernoulli trials", Encyclopedia of Mathematics , EMS Press, 2001 [1994]
"Simulation of n Bernoulli trials". math.uah.edu. Retrieved 2014-01-21.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
