Hoover index

Last updated

The Hoover index, also known as the Robin Hood index or the Schutz index, is a measure of income inequality. It is equal to the percentage of the total population's income that would have to be redistributed to make all the incomes equal.

Contents

i.e. The Hoover is the total amount (as a percentage of the national-income) by which people have less than their equal income-share.

The Hoover Index can be calculated by the following subtraction: The percentage of the people getting less than their equal-share (i.e. less than the national mean income), minus their percentage of the national income.

It can be graphically represented as the longest vertical distance between the Lorenz curve (which graphs cumulative income vs cumulative population (income-ordered population-percentile)and the 45 degree line representing perfect equality).

It would be informative to express the Hoover in terms of its average cost to individuals who get less than their equal-share:

If the Hoover is divided by the percentage of the population whose income is less than their equal-share (i.e. the mean income), that gives the average cost of that Hoover-value, per person whose income is less than their equal-share. ...that cost being expressed in terms of the national mean income.

If, instead, the Hoover is divided by the percentage of the total national income received by the people getting less than their equal-share (i.e. less than the mean income), then that gives the percentage by which those people, as a group, would get more than they currently do, if income were equal.

...in other words the cost, to them, of that Hoover value, expressed in terms of their actual current income.

That latter cost can also be gotten by dividing, instead of subtracting, the two numbers that were subtracted to get the Hoover.

...i.e. dividing the percentage of the population whose income is less than the mean by their percentage of the national income.

...and then subtracting 1.

The Hoover index is typically used in applications related to socio-economic class (SES) and health. It is conceptually one of the simplest inequality indices used in econometrics.

A more frequently encountered inequality measure is the Gini coefficient which is based onthe summation, over all income-ordered population-percentiles, of the cumulative income up to each percentile. That sum is divided by the maximum value that it could have (its value with complete equality), to express it as a percentage of its maximum-possible value. The result is subtracted from one, to get a measure of inequality.

A report from the National Library of Medicine, of the National Institute of Health, described a statistical study that compared how the Robin Hood and the Gini are correlated with mortality:

Results: The Robin Hood index was positively correlated with total mortality adjusted for age (r = 0.54; P < 0.05). This association remained after adjustment for poverty (P < 0.007), where each percentage increase in the index was associated with' an increase in the total mortality of 21.68 deaths per 100,000. Effects of the index were also found for infant mortality (P = 0.013); coronary heart disease (P = 0.004); malignant neoplasms (P = 0.023); and homicide (P < 0.001). Strong associations were also found between the index and causes of death amenable to medical intervention. The Gini coefficient showed very little correlation with any of the causes of death. [1]

The Gini, like the Theil (below), is an impartial measure of inequality over the entire population. That can be of interest and use, but the Robin Hood differs, as a not-impartial examination of the total amount by which members of the population get less than their equal-share.

Computation

Let be the income of the -th person and be the mean income. Then the Hoover index is:

This value can also be computed using quantiles. For the formula, a notation [2] is used, where the amount of quantiles only appears as upper border of summations. Thus, inequities can be computed for quantiles with different widths . For example, could be the income in the quantile #i and could be the amount (absolute or relative) of earners in the quantile #i. then would be the sum of incomes of all quantiles and would be the sum of the income earners in all quantiles.

Computation of the Robin Hood index :

For comparison, [3] here also the computation of the symmetrized Theil index is given:

Both formulas can be used in spreadsheet computations.

See also

Notes

  1. Kennedy, B. P.; Kawachi, I.; Prothrow-Stith, D. (1996). "Income distribution and mortality: Cross sectional ecological study of the Robin Hood index in the United States". BMJ (Clinical Research Ed.). 312 (7037): 1004–1007. doi:10.1136/bmj.312.7037.1004. PMC   2350807 . PMID   8616345.
  2. The notation using E and A follows the notation of a small calculation published by Lionnel Maugis: Inequality Measures in Mathematical Programming for the Air Traffic Flow Management Problem with En-Route Capacities (für IFORS 96), 1996 [ full citation needed ]
  3. For an explanation of the comparison with Henri Theil's index see: Theil index

Further reading

Related Research Articles

Gini coefficient Measure of inequality in the income or wealth distribution

In economics, the Gini coefficient, also the Gini index and the Gini ratio, is a measure of statistical dispersion intended to represent the income inequality or the wealth inequality within a nation or a social group. The Gini coefficient was developed by the statistician and sociologist Corrado Gini.

Geometric mean N-th root of the product of n numbers

In mathematics, the geometric mean is a mean or average, which indicates the central tendency or typical value of a set of numbers by using the product of their values. The geometric mean is defined as the nth root of the product of n numbers, i.e., for a set of numbers x1, x2, ..., xn, the geometric mean is defined as

Lorenz curve Graphical representation of the distribution of income or of wealth

In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing inequality of the wealth distribution.

Median Middle quantile of a data set or probability distribution

In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic feature of the median in describing data compared to the mean is that it is not skewed by a small proportion of extremely large or small values, and therefore provides a better representation of a "typical" value. Median income, for example, may be a better way to suggest what a "typical" income is, because income distribution can be very skewed. The median is of central importance in robust statistics, as it is the most resistant statistic, having a breakdown point of 50%: so long as no more than half the data are contaminated, the median is not an arbitrarily large or small result.

There are several kinds of mean in mathematics, especially in statistics.

Standard deviation Measure of the amount of variation or dispersion of a set of values

In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean of the set, while a high standard deviation indicates that the values are spread out over a wider range.

Skewness measure of the asymmetry of random variables

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.

In welfare economics, a social welfare function is a function that ranks social states as less desirable, more desirable, or indifferent for every possible pair of social states. Inputs of the function include any variables considered to affect the economic welfare of a society. In using welfare measures of persons in the society as inputs, the social welfare function is individualistic in form. One use of a social welfare function is to represent prospective patterns of collective choice as to alternative social states. The social welfare function provides the government with a simple guideline for achieving the optimal distribution of income.

Income inequality metrics or income distribution metrics are used by social scientists to measure the distribution of income and economic inequality among the participants in a particular economy, such as that of a specific country or of the world in general. While different theories may try to explain how income inequality comes about, income inequality metrics simply provide a system of measurement used to determine the dispersion of incomes. The concept of inequality is distinct from poverty and fairness.

The Theil index is a statistic primarily used to measure economic inequality and other economic phenomena, though it has also been used to measure racial segregation.

The Atkinson index is a measure of income inequality developed by British economist Anthony Barnes Atkinson. The measure is useful in determining which end of the distribution contributed most to the observed inequality.

The mean absolute difference (univariate) is a measure of statistical dispersion equal to the average absolute difference of two independent values drawn from a probability distribution. A related statistic is the relative mean absolute difference, which is the mean absolute difference divided by the arithmetic mean, and equal to twice the Gini coefficient. The mean absolute difference is also known as the absolute mean difference and the Gini mean difference (GMD). The mean absolute difference is sometimes denoted by Δ or as MD.

Measuring poverty

Poverty can be and is measured in different ways by governments, international organisations, policy makers and practitioners. Increasingly, poverty is understood as multidimensional, comprising social, natural and economic factors situated within wider socio-political processes. The capabilities approach also argues that capturing the perceptions of poor people is fundamental in understanding and measuring poverty.

Growth elasticity of poverty (GEP) is the percentage reduction in poverty rates associated with a percentage change in mean income.

Glenn Firebaugh is an American sociologist and leading international authority on social science research methods. Currently he is the Roy C. Buck Distinguished Professor of Sociology (Emeritus) at the Pennsylvania State University. He has also held regular or visiting faculty appointments at Harvard University, Vanderbilt University, Oxford University, and the University of Michigan. Firebaugh is best known for his contributions to statistical methods and for his research on global inequality. In 2018 he received the Paul F. Lazarsfeld Award from the American Sociological Association for "a career of distinguished contributions to the field of sociological methodology." His publications are highly cited by other social scientists.

The Lorenz asymmetry coefficient (LAC) is a summary statistic of the Lorenz curve that measures the degree of asymmetry of the curve. The Lorenz curve is used to describe the inequality in the distribution of a quantity. The most common summary statistic for the Lorenz curve is the Gini coefficient, which is an overall measure of inequality within the population. The Lorenz asymmetry coefficient can be a useful supplement to the Gini coefficient. The Lorenz asymmetry coefficient is defined as

Generalized entropy index Measure of income inequality

The generalized entropy index has been proposed as a measure of income inequality in a population. It is derived from information theory as a measure of redundancy in data. In information theory a measure of redundancy can be interpreted as non-randomness or data compression; thus this interpretation also applies to this index. In additional interpretation of the index is as biodiversity as entropy has also been proposed as a measure of diversity.

Poverty gap index measure of the intensity of poverty

The poverty gap index is a measure of the intensity of poverty. It is defined as the average poverty gap in the population as a proportion of the poverty line.

In statistics and econometrics, the mean log deviation (MLD) is a measure of income inequality. The MLD is zero when everyone has the same income, and takes larger positive values as incomes become more unequal, especially at the high end.

The Kakwani index is a measure of the progressivity of a social intervention, and is used by social scientists, statisticians, and economists. It is named after the economist who first proposed and used it, Nanak Chand Kakwani.