The Theil index is a statistic primarily used to measure economic inequality [1] and other economic phenomena, though it has also been used to measure racial segregation. [2] [3] The Theil index $T_T$ is the same as redundancy in information theory, which is the maximum possible entropy of the data minus the observed entropy. It is a special case of the generalized entropy index. It can be viewed as a measure of redundancy, lack of diversity, isolation, segregation, inequality, non-randomness, and compressibility. It was proposed by Dutch econometrician Henri Theil (1924–2000) at the Erasmus University Rotterdam. [3]
Henri Theil himself said (1967): "The (Theil) index can be interpreted as the expected information content of the indirect message which transforms the population shares as prior probabilities into the income shares as posterior probabilities." [4] Amartya Sen noted, "But the fact remains that the Theil index is an arbitrary formula, and the average of the logarithms of the reciprocals of income shares weighted by income is not a measure that is exactly overflowing with intuitive sense." [4]
For a population of $N$ "agents" each with characteristic $x$, the situation may be represented by the list $x_i$ ($i = 1, \ldots, N$), where $x_i$ is the characteristic of agent $i$. For example, if the characteristic is income, then $x_i$ is the income of agent $i$.
The Theil T index is defined as [5]

$$T_T = \frac{1}{N} \sum_{i=1}^{N} \frac{x_i}{\mu} \ln\!\left(\frac{x_i}{\mu}\right)$$

and the Theil L index is defined as [5]

$$T_L = \frac{1}{N} \sum_{i=1}^{N} \ln\!\left(\frac{\mu}{x_i}\right)$$

where $\mu$ is the mean income:

$$\mu = \frac{1}{N} \sum_{i=1}^{N} x_i$$
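To make the definitions concrete, here is a minimal Python sketch that computes both indices directly from a list of incomes (the helper names `theil_t` and `theil_l` are illustrative, not from the source):

```python
import math

def theil_t(incomes):
    """Theil T: mean of (x_i/mu) * ln(x_i/mu) over all agents."""
    n = len(incomes)
    mu = sum(incomes) / n
    return sum((x / mu) * math.log(x / mu) for x in incomes) / n

def theil_l(incomes):
    """Theil L (mean log deviation): mean of ln(mu/x_i) over all agents."""
    n = len(incomes)
    mu = sum(incomes) / n
    return sum(math.log(mu / x) for x in incomes) / n

incomes = [20_000, 30_000, 50_000, 100_000]
print(theil_t(incomes))  # ~0.1783
print(theil_l(incomes))  # ~0.1835
```

Note that both formulas assume strictly positive incomes; Theil L in particular is undefined if any $x_i$ is zero.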
Theil-L is an income distribution's dis-entropy per person, measured with respect to the maximum entropy (which is achieved with complete equality).
(In an alternative interpretation, Theil-L is the natural logarithm of the geometric mean of the ratio (mean income)/(income i) over all incomes. The related Atkinson(1) index is simply 1 minus the geometric mean of (income i)/(mean income) over the income distribution.)
Because a transfer between a larger income and a smaller one will change the smaller income's ratio more than it changes the larger income's ratio, the transfer principle is satisfied by this index.
Equivalently, if the situation is characterized by a discrete distribution function $f_k$ ($k = 0, \ldots, W$), where $f_k$ is the fraction of the population with income $k$ and $W = N\mu$ is the total income, then $\sum_{k=0}^{W} f_k = 1$ and the Theil index is:

$$T_T = \sum_{k=0}^{W} f_k \frac{k}{\mu} \ln\!\left(\frac{k}{\mu}\right)$$

where $\mu$ is again the mean income:

$$\mu = \sum_{k=0}^{W} k f_k$$
Note that in this case income $k$ is an integer and $k = 1$ represents the smallest increment of income possible (e.g., cents).
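The discrete-distribution form can be sketched as a sum over income levels rather than over individuals. A minimal Python illustration (the helper name `theil_t_discrete` is hypothetical):

```python
import math

def theil_t_discrete(fractions):
    """Theil T from a discrete distribution.

    fractions: dict mapping income level k (an integer) to the
    fraction f_k of the population earning exactly k.
    """
    mu = sum(k * f for k, f in fractions.items())
    return sum(f * (k / mu) * math.log(k / mu)
               for k, f in fractions.items() if f > 0 and k > 0)

# Half the population earns 1 unit, half earns 3 units (mu = 2).
print(theil_t_discrete({1: 0.5, 3: 0.5}))  # ~0.1308
```

This agrees with the per-agent definition: the income list [1, 3] gives the same value.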
Similarly, if the situation is characterized by a continuous distribution function $f(k)$ (supported from 0 to infinity), where $f(k)\,dk$ is the fraction of the population with income $k$ to $k + dk$, then the Theil index is:

$$T_T = \int_0^\infty f(k) \frac{k}{\mu} \ln\!\left(\frac{k}{\mu}\right) dk$$

where the mean is:

$$\mu = \int_0^\infty k f(k)\,dk$$
Theil indices for some common continuous probability distributions are given in the table below:
Income distribution function | PDF(x) (x ≥ 0) | Theil coefficient (nats)
---|---|---
Dirac delta function | $\delta(x - x_0),\ x_0 > 0$ | $0$
Uniform distribution | $\frac{1}{b-a},\ a \le x \le b$ | $\frac{b^2 \ln\frac{b}{\mu} - a^2 \ln\frac{a}{\mu}}{b^2 - a^2} - \frac{1}{2}$, with $\mu = \frac{a+b}{2}$
Exponential distribution | $\lambda e^{-\lambda x},\ x > 0$ | $1 - \gamma \approx 0.4228$
Log-normal distribution | $\frac{1}{x \sigma \sqrt{2\pi}}\, e^{-(\ln x - \mu_0)^2 / (2\sigma^2)}$ | $\frac{\sigma^2}{2}$
Pareto distribution | $\frac{\alpha x_m^\alpha}{x^{\alpha+1}},\ x \ge x_m$ ($\alpha > 1$) | $\frac{1}{\alpha - 1} - \ln\frac{\alpha}{\alpha - 1}$
Chi-squared distribution ($k$ degrees of freedom) | $\frac{1}{2^{k/2}\Gamma(k/2)}\, x^{k/2 - 1} e^{-x/2}$ | $\ln\frac{2}{k} + \psi\!\left(1 + \frac{k}{2}\right)$
Gamma distribution (shape $k$) [6] | $\frac{1}{\Gamma(k)\,\theta^k}\, x^{k-1} e^{-x/\theta}$ | $\psi(1 + k) - \ln k$
Weibull distribution (shape $k$) | $\frac{k}{\lambda}\left(\frac{x}{\lambda}\right)^{k-1} e^{-(x/\lambda)^k}$ | $\frac{1}{k}\,\psi\!\left(1 + \frac{1}{k}\right) - \ln\Gamma\!\left(1 + \frac{1}{k}\right)$

Here $\gamma \approx 0.5772$ is the Euler–Mascheroni constant and $\psi$ is the digamma function.
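As a sanity check on two of the closed forms above, a quick Monte Carlo sketch (assuming NumPy is available) compares sampled Theil T values against the exponential and log-normal entries:

```python
import numpy as np

def theil_t(x):
    # Theil T of a sample: mean of (x/mu) * ln(x/mu).
    r = x / x.mean()
    return np.mean(r * np.log(r))

rng = np.random.default_rng(0)
n = 1_000_000

# Exponential: Theil T should be 1 - gamma, independent of the scale.
exp_sample = rng.exponential(scale=2.0, size=n)
print(theil_t(exp_sample), 1 - np.euler_gamma)   # both ~0.4228

# Log-normal: Theil T should be sigma^2 / 2.
sigma = 0.5
logn_sample = rng.lognormal(mean=0.0, sigma=sigma, size=n)
print(theil_t(logn_sample), sigma**2 / 2)        # both ~0.125
```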
If everyone has the same income, then $T_T$ equals 0. If one person has all the income, then $T_T$ gives the result $\ln N$, which is maximum inequality. Dividing $T_T$ by $\ln N$ can normalize the equation to range from 0 to 1, but then the independence axiom is violated: $T_T / \ln N$ does not qualify as a measure of inequality.
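These two extremes are easy to verify numerically. A small self-contained sketch, taking $0 \cdot \ln 0 = 0$ by the usual convention so that zero incomes are allowed:

```python
import math

def theil_t_safe(incomes):
    # Theil T, taking 0 * ln 0 = 0 by convention for zero incomes.
    n = len(incomes)
    mu = sum(incomes) / n
    return sum((x / mu) * math.log(x / mu) for x in incomes if x > 0) / n

print(theil_t_safe([1, 1, 1, 1]))              # 0.0: complete equality
print(theil_t_safe([4, 0, 0, 0]), math.log(4)) # both ~1.386: one person has all
```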
The Theil index measures an entropic "distance" between the population and the egalitarian state in which everyone has the same income. The numerical result is expressed in terms of negative entropy, so that a higher number indicates more order, further from complete equality. Formulating the index in terms of negative entropy rather than entropy makes it a measure of inequality rather than equality.
The Theil index can be transformed into an Atkinson index, which has a range between 0 and 1 (0% and 100%), where 0 indicates perfect equality and 1 (100%) indicates maximum inequality. (See Generalized entropy index for the transformation.)
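For the $\varepsilon = 1$ case the transformation is simple, and follows from the geometric-mean interpretation of Theil-L given earlier (a worked relation, not stated in this form in the source):

$$A_1 = 1 - e^{-T_L}$$

since $T_L$ is the negative logarithm of the geometric mean of $x_i / \mu$.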
The Theil index is derived from Shannon's measure of information entropy $S$, where entropy is a measure of randomness in a given set of information. In information theory, physics, and the Theil index, the general form of entropy is

$$S = k \sum_{i=1}^{N} p_i \log\!\left(\frac{1}{p_i}\right) = -k \sum_{i=1}^{N} p_i \log p_i$$

where $k$ is a constant (set to 1 here) and $p_i$ is the probability of finding member $i$ in the sample. When looking at the distribution of income in a population, $p_i$ is equal to the ratio of a particular individual's income to the total income of the entire population, $p_i = x_i / (N\mu)$. This gives the observed entropy $S_\theta$ of a population to be:

$$S_\theta = \sum_{i=1}^{N} p_i \ln\!\left(\frac{1}{p_i}\right) = \sum_{i=1}^{N} \frac{x_i}{N\mu} \ln\!\left(\frac{N\mu}{x_i}\right)$$
The Theil index measures how far the observed entropy ($S_\theta$, which represents how randomly income is distributed) is from the highest possible entropy ($S_{max} = \ln N$, which represents income being maximally distributed amongst individuals in the population, a distribution analogous to the most likely outcome of an infinite number of random coin tosses: an equal distribution of heads and tails). The Theil index is therefore the theoretical maximum entropy (which would be reached if every individual's income were equal) minus the observed entropy:

$$T_T = S_{max} - S_\theta = \ln N - \sum_{i=1}^{N} \frac{x_i}{N\mu} \ln\!\left(\frac{N\mu}{x_i}\right) = \frac{1}{N} \sum_{i=1}^{N} \frac{x_i}{\mu} \ln\!\left(\frac{x_i}{\mu}\right)$$
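A short numerical sketch confirms that the entropy difference $\ln N - S_\theta$ matches the direct Theil T formula (illustrative variable names, not from the source):

```python
import math

incomes = [20_000, 30_000, 50_000, 100_000]
n = len(incomes)
total = sum(incomes)
mu = total / n

# Observed entropy: p_i = x_i / (N * mu) is individual i's income share.
s_theta = sum((x / total) * math.log(total / x) for x in incomes)

# Both lines print the same value, ~0.1783.
print(math.log(n) - s_theta)
print(sum((x / mu) * math.log(x / mu) for x in incomes) / n)
```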
When $x$ is in units of population/species, $S_\theta$ is a measure of biodiversity and is called the Shannon index. If the Theil index is used with $x$ = population/species, it is a measure of inequality of population among a set of species, or "bio-isolation", as opposed to "wealth isolation".
The Theil index measures what is called redundancy in information theory. [5] It is the leftover "information space" that was not utilized to convey information, which reduces the effectiveness of the price signal. [original research?] The Theil index is a measure of the redundancy of income (or another measure of wealth) in some individuals. Redundancy in some individuals implies scarcity in others. A high Theil index indicates that the total income is not distributed evenly among individuals, in the same way that an uncompressed text file does not assign a similar number of byte locations to each of its available unique byte characters.
Notation | Information theory | Theil index $T_T$
---|---|---
$N$ | number of unique characters | number of individuals
$i$ | a particular character | a particular individual
$x_i$ | count of the $i$th character | income of the $i$th individual
$N\mu$ | total characters in document | total income in population
$T_T$ | unused information space | unused potential in price mechanism [original research?]
 | data compression | progressive tax [original research?]
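The analogy can be made concrete: treating character counts as "incomes", the same Theil formula measures how unevenly a text uses its characters. A purely illustrative sketch (the helper name `theil_of_text` is hypothetical):

```python
from collections import Counter
import math

def theil_of_text(text):
    # Treat each unique character as an "individual" whose
    # "income" is its count in the text.
    counts = list(Counter(text).values())
    n = len(counts)
    mu = sum(counts) / n
    return sum((c / mu) * math.log(c / mu) for c in counts) / n

print(theil_of_text("aaaaaaab"))  # ~0.316: character usage is very uneven
print(theil_of_text("abcdabcd"))  # 0.0: every character used equally
```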
According to the World Bank,
"The best-known entropy measures are Theil’s T () and Theil’s L (), both of which allow one to decompose inequality into the part that is due to inequality within areas (e.g. urban, rural) and the part that is due to differences between areas (e.g. the rural-urban income gap). Typically at least three-quarters of inequality in a country is due to within-group inequality, and the remaining quarter to between-group differences." [7]
If the population is divided into $m$ subgroups, where $s_k$ is the income share of group $k$, $N_k$ is the population of group $k$, $N$ is the total population, and $T_k$ is the Theil index of subgroup $k$, then Theil's T index is

$$T_T = \sum_{k=1}^{m} s_k T_k + \sum_{k=1}^{m} s_k \ln\!\left(\frac{s_k}{N_k / N}\right)$$

where the first term is the within-group inequality and the second term is the between-group inequality.
For example, inequality within the United States is the average inequality within each state, weighted by state income, plus the inequality between states.
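A self-contained decomposition sketch under the same formula (hypothetical helper names and made-up subgroup incomes):

```python
import math

def theil_t(incomes):
    mu = sum(incomes) / len(incomes)
    return sum((x / mu) * math.log(x / mu) for x in incomes) / len(incomes)

def theil_decompose(groups):
    """Split Theil T into (within-group, between-group) components."""
    pooled = [x for g in groups for x in g]
    total, n = sum(pooled), len(pooled)
    within = sum(sum(g) / total * theil_t(g) for g in groups)
    between = sum(sum(g) / total * math.log((sum(g) / total) / (len(g) / n))
                  for g in groups)
    return within, between

urban = [30_000, 60_000, 90_000]   # hypothetical subgroup incomes
rural = [10_000, 20_000, 30_000]
w, b = theil_decompose([urban, rural])
print(w, b)                           # ~0.0872 within, ~0.1308 between
print(w + b, theil_t(urban + rural))  # equal, up to floating-point error
```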
The decomposition of the Theil index which identifies the share attributable to the between-region component is a helpful tool for the positive analysis of regional inequality, as it suggests the relative importance of the spatial dimension of inequality. [8]
Both Theil's T and Theil's L are decomposable. The difference between them is based on the part of the outcomes distribution that each is used for. Indexes of inequality in the generalized entropy (GE) family are more sensitive to differences in income shares among the poor or among the rich depending on a parameter that defines the GE index. The smaller the parameter value for GE, the more sensitive it is to differences at the bottom of the distribution. [9]
Decomposability is a property of the Theil index that the more popular Gini coefficient does not offer. The Gini coefficient is more intuitive to many people, since it is based on the Lorenz curve, but it is not easily decomposable like the Theil index.
In addition to a multitude of economic applications, the Theil index has been applied to assess the performance of irrigation systems [10] and the distribution of software metrics. [11]
In economics, the Gini coefficient, also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income inequality, the wealth inequality, or the consumption inequality within a nation or a social group. It was developed by Italian statistician and sociologist Corrado Gini.
In information theory, the entropy of a random variable is the average level of "information", "surprise", or "uncertainty" inherent to the variable's possible outcomes. Given a discrete random variable $X$, which takes values in the set $\mathcal{X}$ and is distributed according to $p: \mathcal{X} \to [0, 1]$, the entropy is

$$H(X) = -\sum_{x \in \mathcal{X}} p(x) \log p(x),$$

where $\Sigma$ denotes the sum over the variable's possible values. The choice of base for $\log$, the logarithm, varies for different applications. Base 2 gives the unit of bits, while base $e$ gives "natural units" nat, and base 10 gives units of "dits", "bans", or "hartleys". An equivalent definition of entropy is the expected value of the self-information of a variable.
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing inequality of the wealth distribution.
The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto, is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actuarial, and many other types of observable phenomena; the principle originally applied to describing the distribution of wealth in a society, fitting the trend that a large portion of wealth is held by a small fraction of the population. The Pareto principle or "80-20 rule" stating that 80% of outcomes are due to 20% of causes was named in honour of Pareto, but the concepts are distinct, and only Pareto distributions with shape value of $\log_4 5 \approx 1.16$ precisely reflect it. Empirical observation has shown that this 80-20 distribution fits a wide range of cases, including natural phenomena and human activities.
In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. Thus, if the random variable X is log-normally distributed, then Y = ln(X) has a normal distribution. Equivalently, if Y has a normal distribution, then the exponential function of Y, X = exp(Y), has a log-normal distribution. A random variable which is log-normally distributed takes only positive real values. It is a convenient and useful model for measurements in exact and engineering sciences, as well as medicine, economics and other topics (e.g., energies, concentrations, lengths, prices of financial instruments, and other metrics).
In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] or (0, 1) in terms of two positive parameters, denoted by alpha (α) and beta (β), that appear as exponents of the variable and its complement to 1, respectively, and control the shape of the distribution.
In information geometry, the Fisher information metric is a particular Riemannian metric which can be defined on a smooth statistical manifold, i.e., a smooth manifold whose points are probability measures defined on a common probability space. It can be used to calculate the informational difference between measurements.
Income inequality metrics or income distribution metrics are used by social scientists to measure the distribution of income and economic inequality among the participants in a particular economy, such as that of a specific country or of the world in general. While different theories may try to explain how income inequality comes about, income inequality metrics simply provide a system of measurement used to determine the dispersion of incomes. The concept of inequality is distinct from poverty and fairness.
In mathematical statistics, the Kullback–Leibler (KL) divergence, denoted $D_{\text{KL}}(P \parallel Q)$, is a type of statistical distance: a measure of how one probability distribution P is different from a second, reference probability distribution Q. Mathematically, it is defined as

$$D_{\text{KL}}(P \parallel Q) = \sum_{x \in \mathcal{X}} P(x) \log\!\left(\frac{P(x)}{Q(x)}\right).$$
Variational Bayesian methods are a family of techniques for approximating intractable integrals arising in Bayesian inference and machine learning. They are typically used in complex statistical models consisting of observed variables as well as unknown parameters and latent variables, with various sorts of relationships among the three types of random variables, as might be described by a graphical model. As typical in Bayesian inference, the parameters and latent variables are grouped together as "unobserved variables". Variational Bayesian methods are primarily used for two purposes: to provide an analytical approximation to the posterior probability of the unobserved variables, in order to do statistical inference over these variables, and to derive a lower bound for the marginal likelihood (sometimes called the evidence) of the observed data.
In statistics and information theory, a maximum entropy probability distribution has entropy that is at least as great as that of all other members of a specified class of probability distributions. According to the principle of maximum entropy, if nothing is known about a distribution except that it belongs to a certain class, then the distribution with the largest entropy should be chosen as the least-informative default. The motivation is twofold: first, maximizing entropy minimizes the amount of prior information built into the distribution; second, many physical systems tend to move towards maximal entropy configurations over time.
In probability theory and directional statistics, the von Mises distribution is a continuous probability distribution on the circle. It is a close approximation to the wrapped normal distribution, which is the circular analogue of the normal distribution. A freely diffusing angle on a circle is a wrapped normally distributed random variable with an unwrapped variance that grows linearly in time. On the other hand, the von Mises distribution is the stationary distribution of a drift and diffusion process on the circle in a harmonic potential, i.e. with a preferred orientation. The von Mises distribution is the maximum entropy distribution for circular data when the real and imaginary parts of the first circular moment are specified. The von Mises distribution is a special case of the von Mises–Fisher distribution on the N-dimensional sphere.
The Hoover index, also known as the Robin Hood index or the Schutz index, is a measure of income inequality. It is equal to the percentage of the total population's income that would have to be redistributed to make all the incomes equal.
Differential entropy is a concept in information theory that began as an attempt by Claude Shannon to extend the idea of (Shannon) entropy of a random variable, to continuous probability distributions. Unfortunately, Shannon did not derive this formula, and rather just assumed it was the correct continuous analogue of discrete entropy, but it is not. The actual continuous version of discrete entropy is the limiting density of discrete points (LDDP). Differential entropy is commonly encountered in the literature, but it is a limiting case of the LDDP, and one that loses its fundamental association with discrete entropy.
The Atkinson index is a measure of income inequality developed by British economist Anthony Barnes Atkinson. The measure is useful in determining which end of the distribution contributed most to the observed inequality.
In probability theory and directional statistics, a wrapped normal distribution is a wrapped probability distribution that results from the "wrapping" of the normal distribution around the unit circle. It finds application in the theory of Brownian motion and is a solution to the heat equation for periodic boundary conditions. It is closely approximated by the von Mises distribution, which, due to its mathematical simplicity and tractability, is the most commonly used distribution in directional statistics.
In probability theory and directional statistics, a wrapped Cauchy distribution is a wrapped probability distribution that results from the "wrapping" of the Cauchy distribution around the unit circle. The Cauchy distribution is sometimes known as a Lorentzian distribution, and the wrapped Cauchy distribution may sometimes be referred to as a wrapped Lorentzian distribution.
The Lorenz asymmetry coefficient (LAC) is a summary statistic of the Lorenz curve that measures the degree of asymmetry of the curve. The Lorenz curve is used to describe the inequality in the distribution of a quantity. The most common summary statistic for the Lorenz curve is the Gini coefficient, which is an overall measure of inequality within the population. The Lorenz asymmetry coefficient can be a useful supplement to the Gini coefficient. The Lorenz asymmetry coefficient is defined as

$$S = F(\mu) + L(\mu),$$

where $F(\mu)$ is the proportion of the population with values below the mean $\mu$, and $L(\mu)$ is the share of the total amount held by those individuals.
The generalized entropy index has been proposed as a measure of income inequality in a population. It is derived from information theory as a measure of redundancy in data. In information theory a measure of redundancy can be interpreted as non-randomness or data compression; thus this interpretation also applies to this index. In addition, interpretation of biodiversity as entropy has also been proposed leading to uses of generalized entropy to quantify biodiversity.
In statistics and econometrics, the mean log deviation (MLD) is a measure of income inequality. The MLD is zero when everyone has the same income, and takes larger positive values as incomes become more unequal, especially at the high end.