Margin of error

[Figure: Margin-of-error-95.svg] Probability densities of polls of different sizes, each color-coded to its 95% confidence interval (below), margin of error (left), and sample size (right). Each interval reflects the range within which one may have 95% confidence that the true percentage may be found, given a reported percentage of 50%. The margin of error is half the confidence interval (also, the radius of the interval). The larger the sample, the smaller the margin of error. Also, the further from 50% the reported percentage, the smaller the margin of error.

The margin of error is a statistic expressing the amount of random sampling error in the results of a survey. The larger the margin of error, the less confidence one should have that a poll result would reflect the result of a census of the entire population. The margin of error will be positive whenever a population is incompletely sampled and the outcome measure has positive variance, which is to say, whenever the measure varies.


The term margin of error is often used in non-survey contexts to indicate observational error in reporting measured quantities.

Concept

Consider a simple yes/no poll $P$ as a sample of $n$ respondents drawn from a population $N$ ($n \ll N$) reporting the percentage $p$ of yes responses. We would like to know how close $p$ is to the true result of a survey of the entire population $N$, without having to conduct one. If, hypothetically, we were to conduct poll $P$ over subsequent samples of $n$ respondents (newly drawn from $N$), we would expect those subsequent results $p_1, p_2, \ldots$ to be normally distributed about $\overline{p}$, the true but unknown percentage of the population. The margin of error describes the distance within which a specified percentage of these results is expected to vary from $\overline{p}$.

According to the 68-95-99.7 rule, we would expect that 95% of the results $p_1, p_2, \ldots$ will fall within about two standard deviations ($\pm 2\sigma_P$) either side of the true mean $\overline{p}$. This interval is called the confidence interval, and the radius (half the interval) is called the margin of error, corresponding to a 95% confidence level.

Generally, at a confidence level $\gamma$, a sample of size $n$ of a population having expected standard deviation $\sigma$ has a margin of error

$$\mathrm{MOE}_\gamma = z_\gamma \times \sqrt{\frac{\sigma^2}{n}}$$

where $z_\gamma$ denotes the quantile (also, commonly, a z-score), and $\sqrt{\sigma^2/n}$ is the standard error.
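For concreteness, a minimal Python sketch of this formula, assuming SciPy is available (the helper name moe is an illustrative choice, not standard terminology):

    from math import sqrt

    from scipy.stats import norm  # norm.ppf is the standard normal quantile function

    def moe(gamma: float, n: int, sigma: float = 0.5) -> float:
        """Margin of error at confidence level gamma for a sample of size n.

        The z-score is taken at the (1 + gamma)/2 quantile, so that the standard
        normal distribution has probability gamma within +/-z (e.g. gamma = 0.95
        uses z at the 0.975 quantile, about 1.96).
        """
        z = norm.ppf((1 + gamma) / 2)
        return z * sqrt(sigma**2 / n)  # z times the standard error

    print(round(moe(0.95, 1000), 3))  # 0.031, i.e. +/-3.1%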

Standard deviation and standard error

We would expect the average of normally distributed values $p_1, p_2, \ldots$ to have a standard deviation which somehow varies with $n$. The smaller $n$, the wider the margin. This is called the standard error $\sigma_{\overline{p}}$.

For the single result $p$ from our survey, we assume that $p = \overline{p}$, and that all subsequent results $p_1, p_2, \ldots$ together would have a variance $\sigma_P^2 = P(1-P)$, so that the standard error is

$$\sigma_{\overline{p}} = \sqrt{\frac{\sigma_P^2}{n}} \approx \sqrt{\frac{p(1-p)}{n}}.$$

Note that $p(1-p)$ corresponds to the variance of a Bernoulli distribution.
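For instance, as a worked example using values that also appear in the figure below: a reported $p = 0.71$ from a sample of $n = 1000$ would give a standard error of $\sigma_{\overline{p}} \approx \sqrt{0.71 \times 0.29 / 1000} \approx 0.0143$, or about 1.4 percentage points.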

Maximum margin of error at different confidence levels

[Figure: Empirical Rule.PNG — the 68-95-99.7 rule]

For a confidence level $\gamma$, there is a corresponding confidence interval about the mean $\overline{p} \pm z_\gamma \sigma_{\overline{p}}$, that is, the interval $[\overline{p} - z_\gamma \sigma_{\overline{p}},\ \overline{p} + z_\gamma \sigma_{\overline{p}}]$ within which values of $P$ should fall with probability $\gamma$. Precise values of $z_\gamma$ are given by the quantile function of the normal distribution (which the 68-95-99.7 rule approximates).

Note that $z_\gamma$ is undefined for $\gamma \geq 1$, that is, $z_{1.00}$ is undefined, as is $z_{1.10}$.

 
γ        z_γ                 γ              z_γ
0.84     0.994457883210      0.9995         3.290526731492
0.95     1.644853626951      0.99995        3.890591886413
0.975    1.959963984540      0.999995       4.417173413469
0.99     2.326347874041      0.9999995      4.891638475699
0.995    2.575829303549      0.99999995     5.326723886384
0.9975   2.807033768344      0.999999995    5.730728868236
0.9985   2.967737925342      0.9999999995   6.109410204869
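These values can be reproduced directly from the quantile function; a minimal Python sketch, assuming SciPy is available:

    from scipy.stats import norm

    # z_gamma is the standard normal quantile at gamma (norm.ppf).
    for gamma in (0.84, 0.95, 0.975, 0.99, 0.995, 0.9975, 0.9985):
        print(f"{gamma:<8} {norm.ppf(gamma):.12f}")  # e.g. 0.95 -> 1.644853626951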
[Figure: Margin of error vs sample size and confidence level.svg] Log-log graphs of $\mathrm{MOE}_\gamma(0.5)$ vs sample size $n$ and confidence level $\gamma$. The arrows show that the maximum margin of error for a sample size of 1000 is ±3.1% at a 95% confidence level, and ±4.1% at 99%. The inset parabola $\sigma_p^2 = p - p^2$ illustrates the relationship between $\sigma_p^2$ at $p = 0.71$ and $\sigma_{\max}^2$ at $p = 0.5$. In the example, $\mathrm{MOE}_{95}(0.71) \approx 0.9 \times \pm 3.1\% \approx \pm 2.8\%$.

Since $\sigma_P^2 = P(1-P)$ is maximal at $P = 0.5$, we can arbitrarily set $p = 0.5$, calculate $\sigma_{\overline{p}}$, $z_\gamma$, and $z_\gamma \sigma_{\overline{p}}$ to obtain the maximum margin of error for $P$ at a given confidence level $\gamma$ and sample size $n$, even before having actual results. With $n = 1000$,

$$\mathrm{MOE}_{95}(0.5) = z_{0.975}\sqrt{\frac{0.25}{n}} = \frac{0.98}{\sqrt{n}} \approx \pm 3.1\%$$
$$\mathrm{MOE}_{99}(0.5) = z_{0.995}\sqrt{\frac{0.25}{n}} = \frac{1.29}{\sqrt{n}} \approx \pm 4.1\%$$

Also, usefully, for any reported $p$,

$$\mathrm{MOE}_{95}(p) = z_{0.975}\sqrt{\frac{p(1-p)}{n}} = 2\sqrt{p(1-p)}\;\mathrm{MOE}_{95}(0.5)$$
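A quick numerical check of the two formulas above, in the same sketch style (the helper moe_95 is illustrative):

    from math import sqrt

    def moe_95(p: float, n: int) -> float:
        """95% margin of error for a reported proportion p and sample size n."""
        return 1.96 * sqrt(p * (1 - p) / n)

    n = 1000
    max_moe = 0.98 / sqrt(n)                          # maximum, at p = 0.5
    print(round(max_moe, 3))                          # 0.031
    # Scaling the maximum by 2*sqrt(p(1-p)) reproduces the figure's example:
    print(round(2 * sqrt(0.71 * 0.29) * max_moe, 3))  # 0.028
    print(round(moe_95(0.71, n), 3))                  # 0.028, the same result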

Specific margins of error

If a poll has multiple percentage results (for example, a poll measuring a single multiple-choice preference), the result closest to 50% will have the highest margin of error. Typically, it is this number that is reported as the margin of error for the entire poll. Imagine poll $P$ reports results $p_a$, $p_b$, $p_c$, with $p_a = 0.71$ closest to 50% (as in the figure above); its margin of error

$$\mathrm{MOE}_{95}(p_a) = 2\sqrt{p_a(1-p_a)}\;\mathrm{MOE}_{95}(0.5) \approx \pm 2.8\%$$

would be the one reported.

As a given percentage approaches the extremes of 0% or 100%, its margin of error approaches ±0%.

Comparing percentages

Imagine multiple-choice poll $P$ reports $p_a$, $p_b$, $p_c$, as above. As described above, the margin of error reported for the poll would typically be $\mathrm{MOE}_{95}(p_a)$, as $p_a$ is closest to 50%. The popular notion of a statistical tie or statistical dead heat, however, concerns itself not with the accuracy of the individual results, but with that of the ranking of the results. Which is in first?

If, hypothetically, we were to conduct poll $P$ over subsequent samples of $n$ respondents (newly drawn from $N$), and report the result $p_a - p_b$, we could use the standard error of difference to understand how $p_a - p_b$ is expected to fall about $\overline{p_a} - \overline{p_b}$. For this, we need to apply the sum of variances to obtain a new variance $\sigma_{p_a - p_b}^2$,

$$\sigma_{p_a-p_b}^2 = \sigma_{p_a}^2 + \sigma_{p_b}^2 - 2\sigma_{p_a,p_b} = \frac{p_a(1-p_a)}{n} + \frac{p_b(1-p_b)}{n} + \frac{2 p_a p_b}{n}$$

where $\sigma_{p_a,p_b} = -p_a p_b / n$ is the covariance of $p_a$ and $p_b$.

Thus (after simplifying),

$$\sigma_{p_a-p_b} = \sqrt{\frac{p_a + p_b - (p_a - p_b)^2}{n}}$$
$$\mathrm{MOE}_{95}(p_a - p_b) = z_{0.975}\,\sigma_{p_a-p_b}$$

Note that this assumes that $p_c$ is close to constant, that is, respondents choosing either A or B would almost never choose C (making $p_a$ and $p_b$ close to perfectly negatively correlated). With three or more choices in closer contention, choosing a correct formula for $\sigma_{p_a-p_b}^2$ becomes more complicated.
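As a sketch of this calculation, with illustrative numbers (a 46%-to-42% race with n = 1000 is a hypothetical example, not from the poll above):

    from math import sqrt

    def moe_95_diff(pa: float, pb: float, n: int) -> float:
        """95% margin of error of the difference pa - pb, using the simplified
        standard error (assumes the remaining share is near-constant)."""
        return 1.96 * sqrt((pa + pb - (pa - pb) ** 2) / n)

    # Hypothetical: leader at 46%, runner-up at 42%, sample of 1000.
    print(round(moe_95_diff(0.46, 0.42, 1000), 3))  # 0.058
    # The 4-point lead is inside this +/-5.8% margin: a "statistical dead heat".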

Effect of finite population size

The formulae above for the margin of error assume that there is an infinitely large population and thus do not depend on the size of the population $N$, but only on the sample size $n$. According to sampling theory, this assumption is reasonable when the sampling fraction is small. The margin of error for a particular sampling method is essentially the same regardless of whether the population of interest is the size of a school, city, state, or country, as long as the sampling fraction is small.

In cases where the sampling fraction is larger (in practice, greater than 5%), analysts might adjust the margin of error using a finite population correction (FPC) to account for the added precision gained by sampling a much larger percentage of the population. FPC can be calculated using the formula [1]

$$\mathrm{FPC} = \sqrt{\frac{N-n}{N-1}}$$

...and so, if poll $P$ were conducted over 24% of, say, an electorate of 300,000 voters (so that $n = 72{,}000$),

$$\mathrm{MOE}_{95}(0.5) = \frac{0.98}{\sqrt{72{,}000}} \approx \pm 0.4\%$$
$$\mathrm{MOE}_{95,\mathrm{FPC}}(0.5) = \frac{0.98}{\sqrt{72{,}000}}\sqrt{\frac{300{,}000 - 72{,}000}{300{,}000 - 1}} \approx \pm 0.3\%$$

Intuitively, for appropriately large $N$,

$$\lim_{n \to 0} \sqrt{\frac{N-n}{N-1}} \approx 1$$
$$\lim_{n \to N} \sqrt{\frac{N-n}{N-1}} = 0$$

In the former case, $n$ is so small as to require no correction. In the latter case, the poll effectively becomes a census and sampling error becomes moot.
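The electorate example above can be checked with a short sketch:

    from math import sqrt

    def fpc(N: int, n: int) -> float:
        """Finite population correction factor."""
        return sqrt((N - n) / (N - 1))

    N = 300_000  # electorate size
    n = 72_000   # the poll covers 24% of the electorate
    max_moe = 0.98 / sqrt(n)
    print(round(max_moe, 4))              # 0.0037, about +/-0.4% uncorrected
    print(round(max_moe * fpc(N, n), 4))  # 0.0032, about +/-0.3% after correction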



References

1. Isserlis, L. (1918). "On the value of a mean as calculated from a sample". Journal of the Royal Statistical Society. Blackwell Publishing. 81 (1): 75–81. doi:10.2307/2340569. JSTOR 2340569. (Equation 1)
