Index of dissimilarity

The index of dissimilarity is a demographic measure of the evenness with which two groups are distributed across component geographic areas that make up a larger area. A group is evenly distributed when each geographic unit has the same percentage of group members as the total population. The index score can also be interpreted as the percentage of one of the two groups included in the calculation that would have to move to different geographic areas in order to produce a distribution that matches that of the larger area. The index of dissimilarity can be used as a measure of segregation. A score of zero (0%) reflects a fully integrated environment; a score of 1 (100%) reflects full segregation. In terms of black–white segregation, a score of .60 means that 60 percent of blacks would have to exchange places with whites in other units to achieve an even geographic distribution. [1] [2]

Basic formula

The basic formula for the index of dissimilarity is:

$$D = \frac{1}{2} \sum_{i=1}^{N} \left| \frac{a_i}{A} - \frac{b_i}{B} \right|$$

where (comparing a black and white population, for example):

$a_i$ = the population of group A in the $i$th area, e.g. census tract
$A$ = the total population in group A in the large geographic entity for which the index of dissimilarity is being calculated.
$b_i$ = the population of group B in the $i$th area
$B$ = the total population in group B in the large geographic entity for which the index of dissimilarity is being calculated.
$N$ = the number of areas that make up the large geographic entity.
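The basic formula translates almost directly into code. The following is a minimal Python sketch, not a reference implementation; the function name `dissimilarity_index` and the tract counts are hypothetical and chosen only for illustration.

```python
def dissimilarity_index(a, b):
    """Return D = 1/2 * sum(|a_i/A - b_i/B|) for two lists of per-area counts."""
    if len(a) != len(b):
        raise ValueError("both groups must be counted over the same areas")
    total_a = sum(a)  # A: total population of group A
    total_b = sum(b)  # B: total population of group B
    if total_a == 0 or total_b == 0:
        raise ValueError("each group needs at least one member")
    return 0.5 * sum(abs(ai / total_a - bi / total_b) for ai, bi in zip(a, b))


# Hypothetical counts for three census tracts.
print(dissimilarity_index([10, 0, 0], [0, 5, 5]))    # groups never share a tract
print(dissimilarity_index([10, 20, 30], [1, 2, 3]))  # identical shares in every tract
```

The first call describes full segregation and gives 1.0; the second describes an even distribution and gives 0.0.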

The index of dissimilarity is applicable to any categorical variable (whether demographic or not) and because of its simple properties is useful for input into multidimensional scaling and clustering programs. It has been used extensively in the study of social mobility to compare distributions of origin (or destination) occupational categories.

Linear algebra perspective

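The formula for the index of dissimilarity can be made more compact by considering it from the perspective of linear algebra. Suppose we are studying the distribution of rich and poor people in a city (e.g. London). Suppose our city contains $N$ blocks, numbered $1, 2, \ldots, N$.

Let's create a vector $\mathbf{r}$ which shows the number of rich people in each block of our city:

$$\mathbf{r} = (r_1, r_2, \ldots, r_N)$$

Similarly, let's create a vector $\mathbf{p}$ which shows the number of poor people in each block of our city:

$$\mathbf{p} = (p_1, p_2, \ldots, p_N)$$

Now, the $L^1$-norm of a vector is simply the sum of (the magnitude of) each entry in that vector. [3] That is, for a vector $\mathbf{v} = (v_1, v_2, \ldots, v_N)$, we have the $L^1$-norm:

$$\|\mathbf{v}\|_1 = \sum_{i=1}^{N} |v_i|$$

If we denote $R$ as the total number of rich people in our city, then a compact way to calculate $R$ is to use the $L^1$-norm (the entries of $\mathbf{r}$ are counts, so none of them is negative):

$$R = \|\mathbf{r}\|_1$$

Similarly, if we denote $P$ as the total number of poor people in our city, then:

$$P = \|\mathbf{p}\|_1$$

When we divide a vector $\mathbf{v}$ by its norm, we get what is called the normalized vector or unit vector $\hat{\mathbf{v}}$:

$$\hat{\mathbf{v}} = \frac{\mathbf{v}}{\|\mathbf{v}\|_1}$$

Let us normalize the rich vector $\mathbf{r}$ and the poor vector $\mathbf{p}$:

$$\hat{\mathbf{r}} = \frac{\mathbf{r}}{\|\mathbf{r}\|_1} = \frac{\mathbf{r}}{R}, \qquad \hat{\mathbf{p}} = \frac{\mathbf{p}}{\|\mathbf{p}\|_1} = \frac{\mathbf{p}}{P}$$

We finally return to the formula for the index of dissimilarity ($D$); it is simply equal to one-half the $L^1$-norm of the difference between the vectors $\hat{\mathbf{r}}$ and $\hat{\mathbf{p}}$:

$$D = \frac{1}{2} \left\| \hat{\mathbf{r}} - \hat{\mathbf{p}} \right\|_1$$

(Index of dissimilarity in linear algebraic notation)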
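The same steps (take the norm, normalize, take half the norm of the difference) can be sketched in plain Python. The helper names `l1_norm`, `normalize` and `dissimilarity_l1` are assumptions made for this illustration, as are the block counts.

```python
def l1_norm(v):
    """Sum of the absolute values of the entries of v."""
    return sum(abs(x) for x in v)


def normalize(v):
    """Divide every entry of v by the L1 norm of v."""
    norm = l1_norm(v)
    if norm == 0:
        raise ValueError("cannot normalize the zero vector")
    return [x / norm for x in v]


def dissimilarity_l1(r, p):
    """D = 1/2 * || r_hat - p_hat ||_1 for two equally long count vectors."""
    if len(r) != len(p):
        raise ValueError("r and p must describe the same blocks")
    r_hat, p_hat = normalize(r), normalize(p)
    return 0.5 * l1_norm([x - y for x, y in zip(r_hat, p_hat)])


# Hypothetical three-block city: 40 rich and 40 poor people in total.
print(dissimilarity_l1([30, 10, 0], [0, 10, 30]))  # 0.75
```

Keeping the norm and the normalization as separate helpers mirrors the derivation; a vectorized library would serve equally well for large inputs.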

Numerical example

Consider a city consisting of four blocks of 2 people each. One block consists of 2 rich people. One block consists of 2 poor people. Two blocks consist of 1 rich and 1 poor person. What is the index of dissimilarity for this city?

Figure: Our fictional city has 4 blocks: one block containing 2 rich people; another containing 2 poor people; and two blocks containing 1 rich and 1 poor person.

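Firstly, let's find the rich vector $\mathbf{r}$ and poor vector $\mathbf{p}$. Listing the all-rich block first and the all-poor block last (the order of the blocks does not affect the result):

$$\mathbf{r} = (2, 1, 1, 0), \qquad \mathbf{p} = (0, 1, 1, 2)$$

Next, let's calculate the total number of rich people and poor people in our city:

$$R = \|\mathbf{r}\|_1 = 2 + 1 + 1 + 0 = 4, \qquad P = \|\mathbf{p}\|_1 = 0 + 1 + 1 + 2 = 4$$

Next, let's normalize the rich and poor vectors:

$$\hat{\mathbf{r}} = \frac{\mathbf{r}}{R} = (0.5, 0.25, 0.25, 0), \qquad \hat{\mathbf{p}} = \frac{\mathbf{p}}{P} = (0, 0.25, 0.25, 0.5)$$

We can now calculate the difference $\hat{\mathbf{r}} - \hat{\mathbf{p}}$:

$$\hat{\mathbf{r}} - \hat{\mathbf{p}} = (0.5, 0, 0, -0.5)$$

Finally, let's find the index of dissimilarity ($D$):

$$D = \frac{1}{2} \left\| \hat{\mathbf{r}} - \hat{\mathbf{p}} \right\|_1 = \frac{1}{2} \left( |0.5| + |0| + |0| + |-0.5| \right) = 0.5$$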
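The four-block city can be checked step by step with exact fractions. This is a small sketch; the ordering of the blocks inside the two lists is an arbitrary choice made here and does not change the result.

```python
from fractions import Fraction

# Block order (an assumption): all-rich block, two mixed blocks, all-poor block.
r = [2, 1, 1, 0]  # rich people per block
p = [0, 1, 1, 2]  # poor people per block

R, P = sum(r), sum(p)                          # totals: 4 and 4
r_hat = [Fraction(x, R) for x in r]            # 1/2, 1/4, 1/4, 0
p_hat = [Fraction(x, P) for x in p]            # 0, 1/4, 1/4, 1/2
diff = [x - y for x, y in zip(r_hat, p_hat)]   # 1/2, 0, 0, -1/2
D = Fraction(1, 2) * sum(abs(d) for d in diff)

print(D)  # 1/2
```

Using `Fraction` avoids floating-point rounding, so the result is exactly one half.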

Equivalence between formulae

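We can prove that the linear algebraic formula for $D$ is identical to the basic formula for $D$. Let's start with the linear algebraic formula:

$$D = \frac{1}{2} \left\| \hat{\mathbf{r}} - \hat{\mathbf{p}} \right\|_1$$

Let's replace the normalized vectors $\hat{\mathbf{r}}$ and $\hat{\mathbf{p}}$ with $\mathbf{r}/R$ and $\mathbf{p}/P$:

$$D = \frac{1}{2} \left\| \frac{\mathbf{r}}{R} - \frac{\mathbf{p}}{P} \right\|_1$$

Finally, from the definition of the $L^1$-norm, we know that we can replace it with the summation:

$$D = \frac{1}{2} \sum_{i=1}^{N} \left| \frac{r_i}{R} - \frac{p_i}{P} \right|$$

Thus we prove that the linear algebra formula for the index of dissimilarity is equivalent to the basic formula for it, with $r_i, R, p_i, P$ playing the roles of $a_i, A, b_i, B$:

$$D = \frac{1}{2} \left\| \hat{\mathbf{r}} - \hat{\mathbf{p}} \right\|_1 = \frac{1}{2} \sum_{i=1}^{N} \left| \frac{r_i}{R} - \frac{p_i}{P} \right|$$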
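The equivalence can also be spot-checked numerically. The sketch below compares the two formulas on randomly generated cities using exact fractions; the function names and the random test data are assumptions for illustration, and a numerical check of this kind complements the algebraic proof rather than replacing it.

```python
import random
from fractions import Fraction


def basic_formula(r, p):
    """D as a sum over blocks of |r_i/R - p_i/P|, halved."""
    R, P = sum(r), sum(p)
    return Fraction(1, 2) * sum(
        abs(Fraction(ri, R) - Fraction(pi, P)) for ri, pi in zip(r, p)
    )


def l1_formula(r, p):
    """D as half the L1 norm of the difference of the normalized vectors."""
    R, P = sum(r), sum(p)
    r_hat = [Fraction(x, R) for x in r]
    p_hat = [Fraction(x, P) for x in p]
    diff = [x - y for x, y in zip(r_hat, p_hat)]
    return Fraction(1, 2) * sum(abs(d) for d in diff)


rng = random.Random(0)  # fixed seed so the check is reproducible
for _ in range(100):
    n = rng.randint(1, 8)                        # number of blocks
    r = [rng.randint(1, 50) for _ in range(n)]   # made-up rich counts
    p = [rng.randint(1, 50) for _ in range(n)]   # made-up poor counts
    assert basic_formula(r, p) == l1_formula(r, p)
```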

Zero segregation

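When the index of dissimilarity is zero, the community we are studying has zero segregation. For example, if we are studying the segregation of rich and poor people in a city, then $D = 0$ means that every block holds the same share of the city's rich population as of its poor population.

If we set $D = 0$ in the linear algebraic formula, we get the necessary condition for having zero segregation (it is also sufficient, since a norm is zero only for the zero vector):

$$\hat{\mathbf{r}} = \hat{\mathbf{p}}$$

For example, suppose you have a city with 2 blocks. Each block has 4 rich people and 100 poor people:

$$\mathbf{r} = (4, 4), \qquad \mathbf{p} = (100, 100)$$

Then, the total number of rich people is $R = 8$, and the total number of poor people is $P = 200$. Thus:

$$\hat{\mathbf{r}} = \frac{\mathbf{r}}{R} = (0.5, 0.5), \qquad \hat{\mathbf{p}} = \frac{\mathbf{p}}{P} = (0.5, 0.5)$$

Because $\hat{\mathbf{r}} = \hat{\mathbf{p}}$, this city has zero segregation.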

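As another example, suppose you have a city with 3 blocks:

$$\mathbf{r} = (1, 2, 3), \qquad \mathbf{p} = (100, 200, 300)$$

Then, we have $R = 6$ rich people in our city, and $P = 600$ poor people. Thus:

$$\hat{\mathbf{r}} = \frac{\mathbf{r}}{R} = \left( \tfrac{1}{6}, \tfrac{1}{3}, \tfrac{1}{2} \right), \qquad \hat{\mathbf{p}} = \frac{\mathbf{p}}{P} = \left( \tfrac{1}{6}, \tfrac{1}{3}, \tfrac{1}{2} \right)$$

Again, because $\hat{\mathbf{r}} = \hat{\mathbf{p}}$, this city also has zero segregation.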
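The zero-segregation condition can be tested without any division. The sketch below assumes integer head counts and uses a hypothetical helper, `has_zero_segregation`, which checks that the rich share equals the poor share in every block by cross-multiplying.

```python
def has_zero_segregation(r, p):
    """True when r_i / R == p_i / P for every block (integer counts assumed)."""
    if len(r) != len(p):
        raise ValueError("r and p must describe the same blocks")
    R, P = sum(r), sum(p)
    if R == 0 or P == 0:
        raise ValueError("each group needs at least one member")
    # r_i / R == p_i / P  is equivalent to  r_i * P == p_i * R,
    # which stays in exact integer arithmetic.
    return all(ri * P == pi * R for ri, pi in zip(r, p))


print(has_zero_segregation([4, 4], [100, 100]))          # True
print(has_zero_segregation([2, 1, 1, 0], [0, 1, 1, 2]))  # False
```

Cross-multiplication sidesteps floating-point comparisons, which could otherwise report a spurious nonzero difference for shares that are mathematically equal.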

References

  1. US Census Bureau. "Housing Patterns: Appendix B: Measures of Residential Segregation". Census.gov. Retrieved 2022-04-28.
  2. Massey, Douglas S.; Rothwell, Jonathan; Domina, Thurston (2009-10-26). "The Changing Bases of Segregation in the United States". The Annals of the American Academy of Political and Social Science. 626 (1): 74–90. doi:10.1177/0002716209343558. ISSN 0002-7162. PMC 3844132. PMID 24298193.
  3. Wolfram MathWorld: L1 Norm