Pareto principle

Last updated
The Pareto principle may apply to fundraising, i.e. 20% of the donors contributing towards 80% of the total. Pareto principle.png
The Pareto principle may apply to fundraising, i.e. 20% of the donors contributing towards 80% of the total.

The Pareto principle (also known as the 80/20 rule, the law of the vital few and the principle of factor sparsity [1] [2] ) states that for many outcomes, roughly 80% of consequences come from 20% of causes (the "vital few"). [1]

Contents

In 1941, management consultant Joseph M. Juran developed the concept in the context of quality control and improvement after reading the works of Italian sociologist and economist Vilfredo Pareto, who wrote in 1906 about the 80/20 connection while teaching at the University of Lausanne. [3] In his first work, Cours d'économie politique, Pareto showed that approximately 80% of the land in the Kingdom of Italy was owned by 20% of the population. The Pareto principle is only tangentially related to the Pareto efficiency.

Mathematically, the 80/20 rule is roughly described by a power law distribution (also known as a Pareto distribution) for a particular set of parameters. Many natural phenomena are distributed according to power law statistics. [4] It is an adage of business management that "80% of sales come from 20% of clients." [5]

History

In 1941, Joseph M. Juran, a Romanian-born American engineer, came across the work of Italian polymath Vilfredo Pareto. Pareto noted that approximately 80% of Italy's land was owned by 20% of the population. [6] [4] Juran applied the observation that 80% of an issue is caused by 20% of the causes to quality issues. Later during his career, Juran preferred to describe this as "the vital few and the useful many" to highlight that the contribution of the remaining 80% should not be discarded entirely. [7]

Mathematical explanation

The demonstration of the Pareto principle is explained by a large proportion of process variation being associated with a small proportion of process variables. [2] This is a special case of the wider phenomenon of Pareto distributions. If the Pareto index α, which is one of the parameters characterizing a Pareto distribution, is chosen as α = log45 ≈ 1.16, then one has 80% of effects coming from 20% of causes. [8]

The term 80/20 is only a shorthand for the general principle at work. In individual cases, the distribution could be nearer to 90/10 or 70/30. There is also no need for the two numbers to add up to the number 100, as they are measures of different things. The Pareto principle is an illustration of a "power law" relationship, which also occurs in phenomena such as bush fires and earthquakes. [9] Because it is self-similar over a wide range of magnitudes, it produces outcomes completely different from Normal or Gaussian distribution phenomena. This fact explains the frequent breakdowns of sophisticated financial instruments, which are modeled on the assumption that a Gaussian relationship is appropriate to something like stock price movements. [10]

Gini coefficient and Hoover index

Using the "A:B" notation (for example, 0.8:0.2) and with A + B = 1, inequality measures like the Gini index (G) and the Hoover index (H) can be computed. In this case both are the same:

Analysis

A Pareto analysis in a diagram showing which cause should be addressed first Pareto analysis.svg
A Pareto analysis in a diagram showing which cause should be addressed first

Pareto analysis is a formal technique useful where many possible courses of action are competing for attention. In essence, the problem-solver estimates the benefit delivered by each action, then selects a number of the most effective actions that deliver a total benefit reasonably close to the maximal possible one.

Pareto analysis is a creative way of looking at causes of problems because it helps stimulate thinking and organize thoughts. However, it can be limited by its exclusion of possibly important problems which may be small initially, but will grow with time. It should be combined with other analytical tools such as failure mode and effects analysis and fault tree analysis for example.[ citation needed ]

This technique helps to identify the top portion of causes that need to be addressed to resolve the majority of problems. Once the predominant causes are identified, then tools like the Ishikawa diagram or Fish-bone Analysis can be used to identify the root causes of the problems. While it is common to refer to pareto as "80/20" rule, under the assumption that, in all situations, 20% of causes determine 80% of problems, this ratio is merely a convenient rule of thumb and is not, nor should it be considered, an immutable law of nature.

The application of the Pareto analysis in risk management allows management to focus on those risks that have the most impact on the project. [11]

Steps to identify the important causes using 80/20 rule: [12]

  1. Form a frequency of occurrences as a percentage
  2. Arrange the rows in decreasing order of importance of the causes (i.e., the most important cause first)
  3. Add a cumulative percentage column to the table, then plot the information
  4. Plot (#1) a curve with causes on x- and cumulative percentage on y-axis
  5. Plot (#2) a bar graph with causes on x- and percent frequency on y-axis
  6. Draw a horizontal dotted line at 80% from the y-axis to intersect the curve. Then draw a vertical dotted line from the point of intersection to the x-axis. The vertical dotted line separates the important causes (on the left) and trivial causes (on the right)
  7. Explicitly review the chart to ensure that causes for at least 80% of the problems are captured

Applications

Economics

Pareto's observation was in connection with population and wealth. Pareto noticed that approximately 80% of Italy's land was owned by 20% of the population. [6] He then carried out surveys on a variety of other countries and found to his surprise that a similar distribution applied.[ citation needed ]

A chart that demonstrated the effect appeared in the 1992 United Nations Development Program Report, which showed that the richest 20% of the world's population receives 82.7% of the world's income. [13] However, among nations, the Gini index shows that wealth distributions vary substantially around this norm. [14]

Distribution of world GDP, 1989 [15]
Quintile of populationIncome
Richest 20%82.70%
Second 20%11.75%
Third 20%2.30%
Fourth 20%1.85%
Poorest 20%1.40%

The principle also holds within the tails of the distribution. The physicist Victor Yakovenko of the University of Maryland, College Park and AC Silva analyzed income data from the US Internal Revenue Service from 1983 to 2001 and found that the income distribution of the richest 1–3% of the population also follows Pareto's principle. [16]

In Talent: How to Identify Entrepreneurs, economist Tyler Cowen and entrepreneur Daniel Gross suggest that the Pareto Principle can be applied to the role of the 20% most talented individuals in generating the majority of economic growth. [17] According to the New York Times in 1988, many video rental shops reported that 80% of revenue came from 20% of videotapes (although rarely rented classics such as Gone with the Wind must be stocked to appear to have a good selection). [18]

Computing

In computer science the Pareto principle can be applied to optimization efforts. [19] For example, Microsoft noted that by fixing the top 20% of the most-reported bugs, 80% of the related errors and crashes in a given system would be eliminated. [20] Lowell Arthur expressed that "20% of the code has 80% of the errors. Find them, fix them!" [21] It was also discovered that, in general, 80% of a piece of software can be written in 20% of the total allocated time. Conversely, the hardest 20% of the code takes 80% of the time. This factor is usually a part of COCOMO estimating for software coding.[ citation needed ]

Occupational health and safety

Occupational health and safety professionals use the Pareto principle to underline the importance of hazard prioritization. Assuming 20% of the hazards account for 80% of the injuries, and by categorizing hazards, safety professionals can target those 20% of the hazards that cause 80% of the injuries or accidents. Alternatively, if hazards are addressed in random order, a safety professional is more likely to fix one of the 80% of hazards that account only for some fraction of the remaining 20% of injuries. [22]

Aside from ensuring efficient accident prevention practices, the Pareto principle also ensures hazards are addressed in an economical order, because the technique ensures the utilized resources are best used to prevent the most accidents. [23]

Engineering and quality control

The Pareto principle is the basis for the Pareto chart, one of the key tools used in total quality control and Six Sigma techniques. The Pareto principle serves as a baseline for ABC-analysis and XYZ-analysis, widely used in logistics and procurement for the purpose of optimizing stock of goods, as well as costs of keeping and replenishing that stock. [24] In engineering control theory, such as for electromechanical energy converters, the 80/20 principle applies to optimization efforts. [19]

The remarkable success of statistically based searches for root causes is based upon a combination of an empirical principle and mathematical logic. The empirical principle is usually known as the Pareto principle. [25] With regard to variation causality, this principle states that there is a non-random distribution of the slopes of the numerous (theoretically infinite) terms in the general equation.

All of the terms are independent of each other by definition. Interdependent factors appear as multiplication terms. The Pareto principle states that the effect of the dominant term is very much greater than the second-largest effect term, which in turn is very much greater than the third, and so on. [26] There is no explanation for this phenomenon; that is why we refer to it as an empirical principle.

The mathematical logic is known as the square-root-of-the-sum-of-the-squares axiom. This states that the variation caused by the steepest slope must be squared, and then the result added to the square of the variation caused by the second-steepest slope, and so on. The total observed variation is then the square root of the total sum of the variation caused by individual slopes squared. This derives from the probability density function for multiple variables or the multivariate distribution (we are treating each term as an independent variable).

The combination of the Pareto principle and the square-root-of-the-sum-of-the-squares axiom means that the strongest term in the general equation totally dominates the observed variation of effect. Thus, the strongest term will dominate the data collected for hypothesis testing.

In the systems science discipline, Joshua M. Epstein and Robert Axtell created an agent-based simulation model called Sugarscape, from a decentralized modeling approach, based on individual behavior rules defined for each agent in the economy. Wealth distribution and Pareto's 80/20 principle emerged in their results, which suggests the principle is a collective consequence of these individual rules. [27]

Health and social outcomes

In 2009, the Agency for Healthcare Research and Quality said 20% of patients incurred 80% of healthcare expenses due to chronic conditions. [28] A 2021 analysis showed unequal distribution of healthcare costs, with older patients and those with poorer health incurring more costs. [29] The 80/20 rule has been proposed as a rule of thumb for the infection distribution in superspreading events. [30] [31] However, the degree of infectiousness has been found to be distributed continuously in the population. [31] In epidemics with super-spreading, the majority of individuals infect relatively few secondary contacts.

See also

Related Research Articles

In economics the Pareto index, named after the Italian economist and sociologist Vilfredo Pareto, is a measure of the breadth of income or wealth distribution. It is one of the parameters specifying a Pareto distribution and embodies the Pareto principle. As applied to income, the Pareto principle is sometimes stated in popular expositions by saying q=20% of the population has p=80% of the income. In fact, Pareto's data on British income taxes in his Cours d'économie politique indicates that about 20% of the population had about 80% of the income.. For example, if the population is 100 and the total wealth is $100xm, then together q=20 people have pxm=$80xm. Hence, each of these people has x=pxm/q=$4xm.

<span class="mw-page-title-main">Power law</span> Functional relationship between two quantities

In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a relative change in the other quantity proportional to the change raised to a constant exponent: one quantity varies as a power of another. The change is independent of the initial size of those quantities.

<span class="mw-page-title-main">Vilfredo Pareto</span> Italian polymath (1848–1923)

Vilfredo Federico Damaso Pareto was an Italian polymath, whose areas of interest included sociology, civil engineering, economics, political science, and philosophy. He made several important contributions to economics, particularly in the study of income distribution and in the analysis of individuals' choices. He was also responsible for popularising the use of the term elite in social analysis.

<span class="mw-page-title-main">Zipf's law</span> Probability distribution

Zipf's law is an empirical law stating that when a list of measured values is sorted in decreasing order, the value of the nth entry is often approximately inversely proportional to n.

<span class="mw-page-title-main">Pareto efficiency</span> Weakly optimal allocation of resources

In welfare economics, a Pareto improvement formalizes the idea of an outcome being "better in every possible way". A change is called a Pareto improvement if it leaves everyone in a society better-off. A situation is called Pareto efficient or Pareto optimal if all possible Pareto improvements have already been made; in other words, there are no longer any ways left to make one person better-off, without making some other person worse-off.

<span class="mw-page-title-main">Pareto distribution</span> Probability distribution

The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto, is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actuarial, and many other types of observable phenomena; the principle originally applied to describing the distribution of wealth in a society, fitting the trend that a large portion of wealth is held by a small fraction of the population. The Pareto principle or "80-20 rule" stating that 80% of outcomes are due to 20% of causes was named in honour of Pareto, but the concepts are distinct, and only Pareto distributions with shape value of log45 ≈ 1.16 precisely reflect it. Empirical observation has shown that this 80-20 distribution fits a wide range of cases, including natural phenomena and human activities.

Pareto may refer to:

Social statistics is the use of statistical measurement systems to study human behavior in a social environment. This can be accomplished through polling a group of people, evaluating a subset of data obtained about a group of people, or by observation and statistical analysis of a set of data that relates to people and their behaviors.

In science and engineering, root cause analysis (RCA) is a method of problem solving used for identifying the root causes of faults or problems. It is widely used in IT operations, manufacturing, telecommunications, industrial process control, accident analysis (e.g., in aviation, rail transport, or nuclear plants), medical diagnosis, the healthcare industry (e.g., for epidemiology), etc. Root cause analysis is a form of inductive inference (first create a theory, or root, based on empirical evidence, or causes) and deductive inference (test the theory, i.e., the underlying causal mechanisms, with empirical data).

80 (eighty) is the natural number following 79 and preceding 81.

Statistical process control (SPC) or statistical quality control (SQC) is the application of statistical methods to monitor and control the quality of a production process. This helps to ensure that the process operates efficiently, producing more specification-conforming products with less waste scrap. SPC can be applied to any process where the "conforming product" output can be measured. Key tools used in SPC include run charts, control charts, a focus on continuous improvement, and the design of experiments. An example of a process where SPC is applied is manufacturing lines.

<span class="mw-page-title-main">Pareto chart</span> Type of chart

A Pareto chart is a type of chart that contains both bars and a line graph, where individual values are represented in descending order by bars, and the cumulative total is represented by the line. The chart is named for the Pareto principle, which, in turn, derives its name from Vilfredo Pareto, a noted Italian economist.

In economics, distribution is the way total output, income, or wealth is distributed among individuals or among the factors of production. In general theory and in for example the U.S. National Income and Product Accounts, each unit of output corresponds to a unit of income. One use of national accounts is for classifying factor incomes and measuring their respective shares, as in national Income. But, where focus is on income of persons or households, adjustments to the national accounts or other data sources are frequently used. Here, interest is often on the fraction of income going to the top x percent of households, the next x percent, and so forth, and on the factors that might affect them.

<span class="mw-page-title-main">Rank–size distribution</span>

Rank–size distribution is the distribution of size by rank, in decreasing order of size. For example, if a data set consists of items of sizes 5, 100, 5, and 8, the rank-size distribution is 100, 8, 5, 5. This is also known as the rank–frequency distribution, when the source data are from a frequency distribution. These are particularly of interest when the data vary significantly in scales, such as city size or word frequency. These distributions frequently follow a power law distribution, or less well-known ones such as a stretched exponential function or parabolic fractal distribution, at least approximately for certain ranges of ranks; see below.

<span class="mw-page-title-main">Joseph M. Juran</span> Romanian-American engineer and management consultant

Joseph Moses Juran was a Romanian-born American engineer, management consultant and author. He was an advocate for quality and quality management and wrote several books on the topics. He was the brother of Academy Award winner Nathan Juran.

An empirical statistical law or a law of statistics represents a type of behaviour that has been found across a number of datasets and, indeed, across a range of types of data sets. Many of these observances have been formulated and proved as statistical or probabilistic theorems and the term "law" has been carried over to these theorems. There are other statistical and probabilistic theorems that also have "law" as a part of their names that have not obviously derived from empirical observations. However, both types of "law" may be considered instances of a scientific law in the field of statistics. What distinguishes an empirical statistical law from a formal statistical theorem is the way these patterns simply appear in natural distributions, without a prior theoretical reasoning about the data.

<span class="mw-page-title-main">Dorian Shainin</span> American engineer, author, and professor (1914–2000)

Dorian Shainin was an American quality consultant, aeronautics engineer, author, and college professor most notable for his contributions in the fields of industrial problem solving, product reliability, and quality engineering, particularly the creation and development of the "Red X" concept.

Kinetic exchange models are multi-agent dynamic models inspired by the statistical physics of energy distribution, which try to explain the robust and universal features of income/wealth distributions.

Attention inequality is the inequality of distribution of attention across users on social networks, people in general, and for scientific papers. Yun Family Foundation introduced "Attention Inequality Coefficient" as a measure of inequality in attention and arguments it by the close interconnection with wealth inequality.

References

  1. 1 2 Bunkley, Nick (March 3, 2008). "Joseph Juran, 103, Pioneer in Quality Control, Dies". The New York Times . Archived from the original on September 6, 2017. Retrieved 25 January 2018.
  2. 1 2 Box, George E.P.; Meyer, R. Daniel (1986). "An Analysis for Unreplicated Fractional Factorials". Technometrics. 28 (1): 11–18. doi:10.1080/00401706.1986.10488093.
  3. Pareto, Vilfredo (1896–1897). Cours d'Économie Politique (in two volumes). F. Rouge (Lausanne) & F. Pichon (Paris). Volume 1 Volume 2
  4. 1 2 Newman, MEJ (2005). "Power laws, Pareto Distributions, and Zipf's law" (PDF). Contemporary Physics. 46 (5): 323–351. arXiv: cond-mat/0412004 . Bibcode:2005ConPh..46..323N. doi:10.1080/00107510500052444. S2CID   202719165 . Retrieved 10 April 2011.
  5. Marshall, Perry (2013-10-09). "The 80/20 Rule of Sales: How to Find Your Best Customers". Entrepreneur. Retrieved 2018-01-05.
  6. 1 2 Pareto, Vilfredo; Page, Alfred N. (1971), Translation of Manuale di economia politica ("Manual of political economy"), A.M. Kelley, ISBN   978-0-678-00881-2
  7. "Pareto Principle (80/20 Rule) & Pareto Analysis Guide". Juran. 2019-03-12. Retrieved 2021-02-27.
  8. Dunford (2014), "The Pareto Principle" (PDF), The Plymouth Student Scientist
  9. Bak, Per (1999), How Nature Works: the science of self-organized criticality, Springer, p. 89, ISBN   0-387-94791-4
  10. Taleb, Nassim (2007), The Black Swan, pp. 229–252, 274–285
  11. David Litten, Project Risk and Risk Management, Retrieved May 16, 2010
  12. "Pareto Analysis". Archived from the original on 8 February 2012. Retrieved 12 January 2012.
  13. United Nations Development Program (1992), 1992 Human Development Report, New York: Oxford University Press
  14. Hillebrand, Evan (June 2009). "Poverty, Growth, and Inequality over the Next 50 Years" (PDF). FAO, United Nations – Economic and Social Development Department. Archived from the original (PDF) on 2017-10-20.
  15. Human Development Report 1992, Chapter 3 , retrieved 2007-07-08
  16. Yakovenko, Victor M.; Silva, A. Christian (2005), Chatterjee, Arnab; Yarlagadda, Sudhakar; Chakrabarti, Bikas K. (eds.), "Two-class Structure of Income Distribution in the USA: Exponential Bulk and Power-law Tail", Econophysics of Wealth Distributions: Econophys-Kolkata I, New Economic Windows, Springer Milan, pp. 15–23, doi:10.1007/88-470-0389-x_2, ISBN   978-88-470-0389-7
  17. Paris Aéroport, Paris Vous Aime Magazine, No 13, avril-may-juin 2023, p. 71
  18. Kleinfield, N. R. (1988-05-01). "A Tight Squeeze at Video Stores" . The New York Times. ISSN   0362-4331. Archived from the original on 2015-05-25.
  19. 1 2 Gen, M.; Cheng, R. (2002), Genetic Algorithms and Engineering Optimization, New York: Wiley
  20. Rooney, Paula (October 3, 2002), Microsoft's CEO: 80–20 Rule Applies To Bugs, Not Just Features, ChannelWeb
  21. Pressman, Roger S. (2010). Software Engineering: A Practitioner's Approach (7th ed.). Boston, Mass: McGraw-Hill, 2010. ISBN   978-0-07-337597-7.
  22. Woodcock, Kathryn (2010). Safety Evaluation Techniques. Toronto, ON: Ryerson University. p. 86.
  23. "Introduction to Risk-based Decision-Making" (PDF). USCG Safety Program. United States Coast Guard. Retrieved 14 January 2012.
  24. Rushton, Oxley & Croucher (2000), pp. 107–108.
  25. Juran, Joseph M., Frank M. Gryna, and Richard S. Bingham. Quality control handbook. Vol. 3. New York: McGraw-Hill, 1974.
  26. Shainin, Richard D. “Strategies for Technical Problem Solving.” 1992, Quality Engineering, 5:3, 433-448
  27. Epstein, Joshua; Axtell, Robert (1996), Growing Artificial Societies: Social Science from the Bottom-Up, MIT Press, p. 208, ISBN   0-262-55025-3
  28. Weinberg, Myrl (July 27, 2009). "Myrl Weinberg: In health-care reform, the 20-80 solution". The Providence Journal. Archived from the original on 2009-08-02.
  29. Sawyer, Bradley; Claxton, Gary. "How do health expenditures vary across the population?". Peterson-Kaiser Health System Tracker. Peterson Center on Healthcare and the Kaiser Family Foundation. Retrieved 13 March 2019.
  30. Galvani, Alison P.; May, Robert M. (2005). "Epidemiology: Dimensions of superspreading". Nature. 438 (7066): 293–295. Bibcode:2005Natur.438..293G. doi:10.1038/438293a. PMC   7095140 . PMID   16292292.
  31. 1 2 Lloyd-Smith, JO; Schreiber, SJ; Kopp, PE; Getz, WM (2005). "Superspreading and the effect of individual variation on disease emergence". Nature. 438 (7066): 355–359. Bibcode:2005Natur.438..355L. doi:10.1038/nature04153. PMC   7094981 . PMID   16292310.

Further reading