Change of variables

In mathematics, a change of variables is a basic technique used to simplify problems in which the original variables are replaced with functions of other variables. The intent is that when expressed in new variables, the problem may become simpler, or equivalent to a better understood problem.

Change of variables is an operation that is related to substitution. However, these are different operations, as can be seen when considering differentiation (chain rule) or integration (integration by substitution).

A very simple example of a useful variable change can be seen in the problem of finding the roots of the sixth-degree polynomial equation

x^6 - 9x^3 + 8 = 0.

Sixth-degree polynomial equations are generally impossible to solve in terms of radicals (see Abel–Ruffini theorem). This particular equation, however, may be written

(x^3)^2 - 9(x^3) + 8 = 0

(this is a simple case of a polynomial decomposition). Thus the equation may be simplified by defining a new variable u = x^3. Substituting x^3 by u in the polynomial gives

u^2 - 9u + 8 = 0,

which is just a quadratic equation with the two solutions

u = 1 and u = 8.

The solutions in terms of the original variable are obtained by substituting x^3 back in for u, which gives

x^3 = 1 and x^3 = 8.

Then, assuming that one is interested only in real solutions, the solutions of the original equation are

x = 1 and x = 2.
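The substitution argument can be verified numerically. The sketch below (plain Python, assuming the sextic x^6 - 9x^3 + 8 = 0 treated above) solves the quadratic in u and checks the recovered real roots against the original equation.

```python
# Numerical check of the substitution u = x^3 for x^6 - 9x^3 + 8 = 0
# (the equation assumed in the example above).

def quadratic_roots(a, b, c):
    """Real roots of a*u^2 + b*u + c = 0 via the quadratic formula."""
    disc = b * b - 4 * a * c
    assert disc >= 0, "only real roots handled here"
    s = disc ** 0.5
    return ((-b - s) / (2 * a), (-b + s) / (2 * a))

# Step 1: solve the quadratic in u obtained from the substitution.
u1, u2 = quadratic_roots(1, -9, 8)   # u^2 - 9u + 8 = 0  ->  u = 1, 8

# Step 2: undo the substitution, keeping only the real cube roots.
x1, x2 = u1 ** (1 / 3), u2 ** (1 / 3)

# Step 3: verify the recovered roots against the original sextic.
for root in (x1, x2):
    assert abs(root ** 6 - 9 * root ** 3 + 8) < 1e-9
```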

Simple example

Consider the system of equations

xy + x + y = 71
x^2 y + x y^2 = 880

where x and y are positive integers with x < y. (Source: 1991 AIME)

Solving this normally is not very difficult, but it may get a little tedious. However, we can rewrite the second equation as xy(x + y) = 880. Making the substitutions s = x + y and t = xy reduces the system to s + t = 71, st = 880. Solving this gives (s, t) = (16, 55) and (s, t) = (55, 16). Back-substituting the first ordered pair gives us x + y = 16, xy = 55, which gives the solution (x, y) = (5, 11). Back-substituting the second ordered pair gives us x + y = 55, xy = 16, which gives no positive integer solutions. Hence the solution that solves the system is (x, y) = (5, 11).
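A brute-force check of the substitution s = x + y, t = xy (assuming the system xy + x + y = 71, x^2 y + x y^2 = 880 discussed above) can be sketched as follows.

```python
# Brute-force check of the substitution s = x + y, t = xy for the system
# xy + x + y = 71, x^2*y + x*y^2 = 880 (positive integers, x < y).

# Step 1: find (s, t) with s + t = 71 and s*t = 880.
st_pairs = [(s, 71 - s) for s in range(1, 71) if s * (71 - s) == 880]

# Step 2: for each pair, recover positive integers x < y with
# x + y = s and x*y = t.
solutions = []
for s, t in st_pairs:
    for x in range(1, s):
        y = s - x
        if x < y and x * y == t:
            solutions.append((x, y))
```

Only the pair (s, t) = (16, 55) yields integers, matching the back-substitution in the text.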

Formal introduction

Let A, B be smooth manifolds and let Φ : A → B be a C^r-diffeomorphism between them, that is: Φ is an r times continuously differentiable, bijective map from A to B with r times continuously differentiable inverse from B to A. Here r may be any natural number (or zero), ∞ (smooth) or ω (analytic).

The map Φ is called a regular coordinate transformation or regular variable substitution, where regular refers to the C^r-ness of Φ. Usually one will write x = Φ(y) to indicate the replacement of the variable x by the variable y by substituting the value of Φ in y for every occurrence of x.

Other examples

Coordinate transformation

Some systems can be more easily solved when switching to polar coordinates. Consider for example the equation

U(x, y) := (x^2 + y^2) √(1 - x^2/(x^2 + y^2)) = 0.

This may be a potential energy function for some physical problem. If one does not immediately see a solution, one might try the substitution

(x, y) = Φ(r, θ)

given by

Φ(r, θ) = (r cos(θ), r sin(θ)).

Note that if θ runs outside a 2π-length interval, for example [0, 2π], the map Φ is no longer bijective. Therefore, Φ should be limited to, for example, (0, ∞) × [0, 2π). Notice how r = 0 is excluded, for Φ is not bijective in the origin (θ can take any value, the point will be mapped to (0, 0)). Then, replacing all occurrences of the original variables by the new expressions prescribed by Φ and using the identity sin^2 θ + cos^2 θ = 1, we get

V(r, θ) = r^2 √(1 - cos^2 θ) = r^2 |sin θ|.

Now the solutions can be readily found: sin(θ) = 0, so θ = 0 or θ = π. Applying the inverse of Φ shows that this is equivalent to y = 0 while x ≠ 0. Indeed, we see that for y = 0 the function vanishes, except for the origin.

Note that, had we allowed r = 0, the origin would also have been a solution, though it is not a solution to the original problem. Here the bijectivity of Φ is crucial. The function is always nonnegative (for r > 0), hence the absolute value.
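A quick numerical spot-check (plain Python; the formula for U is the one assumed in this example) confirms that U(x, y) agrees with r^2 |sin θ| away from the origin:

```python
import math

# Spot-check that U(x, y) = (x^2 + y^2) * sqrt(1 - x^2/(x^2 + y^2))
# equals r^2 * |sin(theta)| under x = r*cos(theta), y = r*sin(theta).

def U(x, y):
    r2 = x * x + y * y          # r^2; caller must avoid the origin
    return r2 * math.sqrt(1 - x * x / r2)

for r in (0.5, 1.0, 2.0):
    for theta in (0.1, 1.0, 2.5, 4.0):
        x, y = r * math.cos(theta), r * math.sin(theta)
        assert abs(U(x, y) - r * r * abs(math.sin(theta))) < 1e-12
```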

Differentiation

The chain rule is used to simplify complicated differentiation. For example, consider the problem of calculating the derivative

d/dx sin(x^2).

Let y = sin(u) with u = x^2. Then:

dy/dx = dy/du · du/dx = cos(u) · 2x = 2x cos(x^2).
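A finite-difference check of this chain-rule result (the example d/dx sin(x^2) = 2x cos(x^2) as worked above) can be sketched as:

```python
import math

# Central-difference check that d/dx sin(x^2) = 2*x*cos(x^2).

def numeric_derivative(f, x, h=1e-6):
    """Symmetric difference quotient, error O(h^2)."""
    return (f(x + h) - f(x - h)) / (2 * h)

for x in (0.3, 1.0, 1.7):
    exact = 2 * x * math.cos(x * x)
    approx = numeric_derivative(lambda t: math.sin(t * t), x)
    assert abs(exact - approx) < 1e-6
```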

Integration

Difficult integrals may often be evaluated by changing variables; this is enabled by the substitution rule and is analogous to the use of the chain rule above. In several variables, the effect of the change of variables on the volume element is captured by the corresponding Jacobian matrix and determinant. [1] Using the Jacobian determinant and the corresponding change of variables that it gives is the basis of coordinate systems such as polar, cylindrical, and spherical coordinate systems.
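As a concrete (assumed, not from the source) instance of the Jacobian at work: in polar coordinates the Jacobian determinant is r, so the integral of exp(-(x^2 + y^2)) over the unit disk equals the integral of exp(-r^2)·r over r in [0, 1], θ in [0, 2π]. A numerical sketch:

```python
import math

# The theta integral factors out as 2*pi; integrate exp(-r^2)*r over [0, 1]
# with a 1-D midpoint rule. The extra factor r is the Jacobian determinant
# of the polar-coordinate map.

def polar_value(n=2000):
    dr = 1.0 / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        total += math.exp(-r * r) * r * dr
    return 2 * math.pi * total

exact = math.pi * (1 - math.exp(-1))  # closed form via u = r^2
assert abs(polar_value() - exact) < 1e-6
```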

Change of variables formula in terms of Lebesgue measure

The following theorem allows us to relate integrals with respect to Lebesgue measure to an equivalent integral with respect to the pullback measure under a parameterization G. [2] The proof relies on approximation by Jordan content.

Suppose that Ω is an open subset of ℝ^n and G : Ω → ℝ^n is a C^1 diffeomorphism onto its image.

  • If f is a Lebesgue measurable function on G(Ω), then f ∘ G is Lebesgue measurable on Ω. If f ≥ 0 or f ∈ L^1(G(Ω), m), then ∫_{G(Ω)} f(x) dx = ∫_Ω f(G(x)) |det D_x G| dx.
  • If E ⊆ Ω and E is Lebesgue measurable, then G(E) is Lebesgue measurable, and m(G(E)) = ∫_E |det D_x G| dx.
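The second statement can be illustrated numerically. The sketch below uses a hypothetical linear map G(x, y) = (2x + y, y), for which |det DG| = 2 everywhere, so the measure of the image of the unit square should be 2; a Monte Carlo estimate confirms this.

```python
import random

# Monte Carlo illustration of m(G(E)) = integral over E of |det DG|:
# for the linear map G(x, y) = (2x + y, y), |det DG| = 2 everywhere,
# so the image of the unit square E = [0,1]^2 has measure 2.

random.seed(0)
N = 100_000
hits = 0
# Sample the bounding box [0, 3] x [0, 1] of G(E) and count hits in G(E).
for _ in range(N):
    u, v = random.uniform(0, 3), random.uniform(0, 1)
    # (u, v) = G(x, y) inverts to y = v, x = (u - v) / 2.
    x, y = (u - v) / 2, v
    if 0 <= x <= 1 and 0 <= y <= 1:
        hits += 1
area = 3.0 * hits / N   # box area times hit fraction
assert abs(area - 2.0) < 0.05
```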

As a corollary of this theorem, we may compute the Radon–Nikodym derivatives of both the pullback and pushforward measures of the Lebesgue measure m under G.

Pullback measure and transformation formula

The pullback measure in terms of a transformation G is defined as G^*μ(A) := μ(G(A)). The change of variables formula for pullback measures is

∫_{G(Ω)} f dμ = ∫_Ω (f ∘ G) d(G^*μ).

Pushforward measure and transformation formula

The pushforward measure in terms of a transformation G is defined as G_*μ(A) := μ(G^{-1}(A)). The change of variables formula for pushforward measures is

∫_{G(Ω)} f d(G_*μ) = ∫_Ω (f ∘ G) dμ.

As a corollary of the change of variables formula for Lebesgue measure, we have that

  • Radon–Nikodym derivative of the pullback with respect to Lebesgue measure: d(G^*m)/dm (x) = |det D_x G|
  • Radon–Nikodym derivative of the pushforward with respect to Lebesgue measure: d(G_*m)/dm (x) = |det D_x G^{-1}|

From which we may obtain

  • The change of variables formula for pullback measure: ∫_{G(Ω)} f(x) dm = ∫_Ω f(G(x)) |det D_x G| dm
  • The change of variables formula for pushforward measure: ∫_Ω f(x) dm = ∫_{G(Ω)} f(G^{-1}(x)) |det D_x G^{-1}| dm

Differential equations

Variable changes for differentiation and integration are taught in elementary calculus and the steps are rarely carried out in full.

The very broad use of variable changes is apparent when considering differential equations, where the independent variables may be changed using the chain rule, or the dependent variables may be changed, resulting in some differentiation that must be carried out. Exotic changes, such as the mingling of dependent and independent variables in point and contact transformations, can be very complicated but allow much freedom.

Very often, a general form for a change is substituted into a problem and parameters picked along the way to best simplify the problem.

Scaling and shifting

Probably the simplest change is the scaling and shifting of variables, that is, replacing them with new variables that are "stretched" and "moved" by constant amounts. This is very common in practical applications to get physical parameters out of problems. For an nth order derivative, the change simply results in

d^n y/dx^n = (y_scale / x_scale^n) · d^n ŷ/dx̂^n

where

x = x̂ · x_scale + x_shift
y = ŷ · y_scale + y_shift.

This may be shown readily through the chain rule and linearity of differentiation. For example, consider the boundary value problem

μ d^2u/dy^2 = dp/dx,    u(0) = u(δ) = 0,

which describes parallel fluid flow between flat solid walls separated by a distance δ; μ is the viscosity and dp/dx the pressure gradient, both constants. By scaling the variables the problem becomes

d^2û/dŷ^2 = 1,    û(0) = û(1) = 0,

where

y = δ ŷ   and   u = (δ^2/μ)(dp/dx) û.

Scaling is useful for many reasons. It simplifies analysis both by reducing the number of parameters and by simply making the problem neater. Proper scaling may normalize variables, that is, make them have a sensible unitless range such as 0 to 1. Finally, if a problem mandates numeric solution, the fewer the parameters, the smaller the number of computations.
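The scaling above can be sanity-checked numerically. The sketch below assumes the scalings y = δŷ and u = (δ^2/μ)(dp/dx)û together with hypothetical parameter values; the scaled problem d^2û/dŷ^2 = 1, û(0) = û(1) = 0 has the closed form û = (ŷ^2 - ŷ)/2, and unscaling it must satisfy the original boundary value problem.

```python
# Unscale the closed-form solution of the scaled problem and check it
# against the original BVP: mu * u'' = dp/dx, u(0) = u(delta) = 0.

delta, mu, dpdx = 0.01, 1.5e-3, -4.0   # hypothetical physical parameters

def u(y):
    yhat = y / delta                    # y = delta * yhat
    uhat = (yhat * yhat - yhat) / 2     # solves uhat'' = 1, uhat(0)=uhat(1)=0
    return (delta ** 2 / mu) * dpdx * uhat

# Boundary conditions of the original problem.
assert u(0.0) == 0.0 and u(delta) == 0.0

# Second derivative by central differences (exact for a quadratic):
# mu * u'' should equal dp/dx.
h = delta / 100
upp = (u(delta / 2 + h) - 2 * u(delta / 2) + u(delta / 2 - h)) / (h * h)
assert abs(mu * upp - dpdx) < 1e-6
```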

Momentum vs. velocity

Consider a system of equations

m dv/dt = -∂H/∂x
m dx/dt = ∂H/∂v

for a given function H(x, v). The mass can be eliminated by the (trivial) substitution Φ(p) = p/m. Clearly this is a bijective map from ℝ to ℝ. Under the substitution v = Φ(p) = p/m the system becomes

dp/dt = -∂H/∂x
dx/dt = ∂H/∂p
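As a concrete (assumed, not from the source) instance, take H(x, p) = p^2/(2m) + k x^2/2 in the substituted variables; the system ẋ = ∂H/∂p = p/m, ṗ = -∂H/∂x = -kx then reproduces the familiar oscillation x(t) = cos(ωt) with ω = √(k/m), which a short simulation confirms.

```python
import math

# Simulate xdot = p/m, pdot = -k*x (the momentum form of the system for the
# assumed H) with semi-implicit Euler and compare against x(t) = cos(omega*t).

m, k = 2.0, 8.0
omega = math.sqrt(k / m)

x, p = 1.0, 0.0            # start at x = 1 with zero momentum
dt, steps = 1e-4, 10_000   # integrate to t = 1
for _ in range(steps):
    p -= k * x * dt        # pdot = -dH/dx
    x += (p / m) * dt      # xdot =  dH/dp

assert abs(x - math.cos(omega * 1.0)) < 1e-3
```

Semi-implicit Euler is used because it preserves the oscillation's amplitude well over long runs; a plain Euler step would slowly spiral outward.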

Lagrangian mechanics

Given a force field φ(t, x, v), Newton's equations of motion are

m d^2x/dt^2 = φ(t, x, dx/dt).

Lagrange examined how these equations of motion change under an arbitrary substitution of variables x = Ψ(t, y), under which the velocity transforms as v = ∂Ψ/∂t (t, y) + ∂Ψ/∂y (t, y) · w.

He found that the equations

d/dt (∂L/∂w) = ∂L/∂y

are equivalent to Newton's equations for the function L = T - V, where T is the kinetic, and V the potential energy.

In fact, when the substitution is chosen well (exploiting for example symmetries and constraints of the system) these equations are much easier to solve than Newton's equations in Cartesian coordinates.

References

  1. Kaplan, Wilfred (1973). "Change of Variables in Integrals". Advanced Calculus (Second ed.). Reading: Addison-Wesley. pp. 269–275.
  2. Folland, G. B. (1999). Real Analysis: Modern Techniques and Their Applications (2nd ed.). New York: Wiley. pp. 74–75. ISBN 0-471-31716-0. OCLC 39849337.