Frobenius solution to the hypergeometric equation

In the following we solve the second-order differential equation called the hypergeometric differential equation using Frobenius method, named after Ferdinand Georg Frobenius. This is a method that seeks a solution in the form of a power series about a regular singular point, and it is the usual approach for ordinary differential equations of this kind.

The solution of the hypergeometric differential equation is very important. For instance, Legendre's differential equation can be shown to be a special case of the hypergeometric differential equation. Hence, by solving the hypergeometric differential equation, one may directly compare its solutions to get the solutions of Legendre's differential equation, after making the necessary substitutions. For more details, see the article on the hypergeometric differential equation.

We shall prove that this equation has three singularities, namely at x = 0, x = 1 and x = ∞. However, as these will turn out to be regular singular points, we will be able to assume a solution in the form of a series. Since this is a second-order differential equation, we must have two linearly independent solutions.

The problem, however, is that our assumed solutions may or may not be linearly independent, or worse, may not even be defined (depending on the values of the parameters of the equation). This is why we shall study the different cases for the parameters and modify our assumed solution accordingly.

The equation

Solve the hypergeometric equation around all singularities:

$$x(1-x)y'' + \left[\gamma - (1+\alpha+\beta)x\right]y' - \alpha\beta\,y = 0.$$

Solution around x = 0

Let

$$P_2(x) = x(1-x), \qquad P_1(x) = \gamma - (1+\alpha+\beta)x, \qquad P_0(x) = -\alpha\beta,$$

so that the equation reads $P_2(x)y'' + P_1(x)y' + P_0(x)y = 0$. Then

$$P_2(0) = P_2(1) = 0.$$

Hence, x = 0 and x = 1 are singular points. Let's start with x = 0. To see if it is regular, we study the following limits:

$$\lim_{x\to 0}\frac{(x-0)\,P_1(x)}{P_2(x)} = \lim_{x\to 0}\frac{x\left[\gamma-(1+\alpha+\beta)x\right]}{x(1-x)} = \gamma, \qquad \lim_{x\to 0}\frac{(x-0)^2\,P_0(x)}{P_2(x)} = \lim_{x\to 0}\frac{-\alpha\beta\,x^2}{x(1-x)} = 0.$$

Hence, both limits exist and x = 0 is a regular singular point. Therefore, we assume the solution takes the form

$$y = \sum_{r=0}^{\infty} a_r x^{r+c}$$

with a0 ≠ 0. Hence,

$$y' = \sum_{r=0}^{\infty} a_r(r+c)\,x^{r+c-1}, \qquad y'' = \sum_{r=0}^{\infty} a_r(r+c)(r+c-1)\,x^{r+c-2}.$$

Substituting these into the hypergeometric equation, we get

$$x(1-x)\sum_{r=0}^{\infty}a_r(r+c)(r+c-1)x^{r+c-2} + \left[\gamma-(1+\alpha+\beta)x\right]\sum_{r=0}^{\infty}a_r(r+c)x^{r+c-1} - \alpha\beta\sum_{r=0}^{\infty}a_r x^{r+c} = 0.$$

That is,

$$\sum_{r=0}^{\infty}a_r(r+c)(r+c-1)x^{r+c-1} - \sum_{r=0}^{\infty}a_r(r+c)(r+c-1)x^{r+c} + \gamma\sum_{r=0}^{\infty}a_r(r+c)x^{r+c-1} - (1+\alpha+\beta)\sum_{r=0}^{\infty}a_r(r+c)x^{r+c} - \alpha\beta\sum_{r=0}^{\infty}a_r x^{r+c} = 0.$$

In order to simplify this equation, we need all powers to be the same, equal to r + c − 1, the smallest power. Hence, we switch the indices as follows:

$$\sum_{r=0}^{\infty}a_r(r+c)(r+c-1)x^{r+c-1} + \gamma\sum_{r=0}^{\infty}a_r(r+c)x^{r+c-1} - \sum_{r=1}^{\infty}a_{r-1}\left[(r+c-1)(r+c-2) + (1+\alpha+\beta)(r+c-1) + \alpha\beta\right]x^{r+c-1} = 0.$$

Thus, isolating the first term of the sums starting from 0 we get

$$a_0\left[c(c-1)+\gamma c\right]x^{c-1} + \sum_{r=1}^{\infty}\Big\{a_r\left[(r+c)(r+c-1)+\gamma(r+c)\right] - a_{r-1}\left[(r+c-1)(r+c-2)+(1+\alpha+\beta)(r+c-1)+\alpha\beta\right]\Big\}x^{r+c-1} = 0.$$

Now, from the linear independence of all powers of x, that is, of the functions 1, x, x2, etc., the coefficients of xk vanish for all k. Hence, from the first term, we have

$$a_0\left[c(c-1)+\gamma c\right] = 0,$$

which is the indicial equation. Since a0 ≠ 0, we have

$$c(c+\gamma-1) = 0.$$

Hence,

$$c_1 = 0, \qquad c_2 = 1-\gamma.$$

Also, from the rest of the terms, we have

$$a_r\left[(r+c)(r+c-1)+\gamma(r+c)\right] = a_{r-1}\left[(r+c-1)(r+c-2)+(1+\alpha+\beta)(r+c-1)+\alpha\beta\right].$$

Hence,

$$a_r = \frac{(r+c-1)(r+c+\alpha+\beta-1)+\alpha\beta}{(r+c)(r+c+\gamma-1)}\,a_{r-1}.$$

But

$$(r+c-1)(r+c+\alpha+\beta-1)+\alpha\beta = (r+c+\alpha-1)(r+c+\beta-1).$$

Hence, we get the recurrence relation

$$a_r = \frac{(r+c+\alpha-1)(r+c+\beta-1)}{(r+c)(r+c+\gamma-1)}\,a_{r-1}, \qquad r \ge 1.$$
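The factorization used in the step above is easy to spot-check symbolically. The following minimal sketch assumes Python with sympy; it is an illustration added here, not part of the original derivation:

```python
# Verify (r+c-1)(r+c+alpha+beta-1) + alpha*beta = (r+c+alpha-1)(r+c+beta-1)
import sympy as sp

r, c, alpha, beta = sp.symbols('r c alpha beta')
lhs = (r + c - 1)*(r + c + alpha + beta - 1) + alpha*beta
rhs = (r + c + alpha - 1)*(r + c + beta - 1)
print(sp.simplify(lhs - rhs))  # prints 0, so the identity holds
```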

Let's now simplify this relation by giving ar in terms of a0 instead of ar−1. From the recurrence relation (note: below, expressions of the form (u)r refer to the Pochhammer symbol):

$$a_1 = \frac{(c+\alpha)(c+\beta)}{(c+1)(c+\gamma)}\,a_0, \qquad a_2 = \frac{(c+\alpha+1)(c+\beta+1)}{(c+2)(c+\gamma+1)}\,a_1 = \frac{(c+\alpha)_2(c+\beta)_2}{(c+1)_2(c+\gamma)_2}\,a_0, \quad\ldots$$

As we can see,

$$a_r = \frac{(c+\alpha)_r(c+\beta)_r}{(c+1)_r(c+\gamma)_r}\,a_0.$$

Hence, our assumed solution takes the form

$$y = a_0\sum_{r=0}^{\infty}\frac{(c+\alpha)_r(c+\beta)_r}{(c+1)_r(c+\gamma)_r}\,x^{r+c}.$$
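As a numerical cross-check of this closed form, one can iterate the recurrence and compare against the Pochhammer expression. A minimal sketch assuming Python with mpmath (the parameter values are arbitrary test choices, not taken from the text):

```python
# Compare the iterated recurrence with the closed Pochhammer form
# a_r = (alpha+c)_r (beta+c)_r / ((1+c)_r (gamma+c)_r) * a_0,
# using rf() for the rising factorial (Pochhammer symbol).
from mpmath import rf, mpf

alpha, beta, gamma, c = mpf('0.3'), mpf('1.7'), mpf('0.25'), mpf('0.1')
a = mpf(1)  # a_0 = 1
for r in range(1, 8):
    a *= (r + c + alpha - 1)*(r + c + beta - 1)/((r + c)*(r + c + gamma - 1))
    closed = rf(alpha + c, r)*rf(beta + c, r)/(rf(1 + c, r)*rf(gamma + c, r))
    assert abs(a - closed) < mpf('1e-12')
print("recurrence matches the Pochhammer form")
```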

We are now ready to study the solutions corresponding to the different cases for c1 − c2 = γ − 1 (this reduces to studying the nature of the parameter γ: whether it is an integer or not).

Analysis of the solution in terms of the difference γ − 1 of the two roots

γ not an integer

Then y1 = y|c = 0 and y2 = y|c = 1 − γ. Since

$$y = a_0\sum_{r=0}^{\infty}\frac{(c+\alpha)_r(c+\beta)_r}{(c+1)_r(c+\gamma)_r}\,x^{r+c},$$

we have

$$y_1 = a_0\sum_{r=0}^{\infty}\frac{(\alpha)_r(\beta)_r}{r!\,(\gamma)_r}\,x^{r} = a_0\,{}_2F_1(\alpha,\beta;\gamma;x),$$

$$y_2 = a_0\sum_{r=0}^{\infty}\frac{(\alpha-\gamma+1)_r(\beta-\gamma+1)_r}{r!\,(2-\gamma)_r}\,x^{r+1-\gamma} = a_0\,x^{1-\gamma}\,{}_2F_1(\alpha-\gamma+1,\beta-\gamma+1;2-\gamma;x).$$

Hence, y = A′y1 + B′y2. Let A′a0 = A and B′a0 = B. Then

$$y = A\,{}_2F_1(\alpha,\beta;\gamma;x) + B\,x^{1-\gamma}\,{}_2F_1(\alpha-\gamma+1,\beta-\gamma+1;2-\gamma;x).$$
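Both pieces of this general solution can be checked numerically against the ODE. A minimal sketch assuming Python with mpmath; the parameter values are arbitrary, with γ deliberately not an integer:

```python
# Sanity check: both parts of the general solution satisfy
# x(1-x)y'' + [gamma-(1+alpha+beta)x]y' - alpha*beta*y = 0.
from mpmath import mp, hyp2f1, diff, mpf

mp.dps = 30
alpha, beta, gamma = mpf('0.5'), mpf('1.25'), mpf('0.3')

def residual(f, x):
    return (x*(1 - x)*diff(f, x, 2)
            + (gamma - (1 + alpha + beta)*x)*diff(f, x, 1)
            - alpha*beta*f(x))

y1 = lambda x: hyp2f1(alpha, beta, gamma, x)
y2 = lambda x: x**(1 - gamma)*hyp2f1(alpha - gamma + 1, beta - gamma + 1, 2 - gamma, x)

for x in [mpf('0.1'), mpf('0.4')]:
    assert abs(residual(y1, x)) < mpf('1e-15')
    assert abs(residual(y2, x)) < mpf('1e-15')
print("both solutions satisfy the equation")
```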

γ = 1

Then y1 = y|c = 0 = a0 ₂F₁(α, β; 1; x). Since γ = 1, the two roots coincide: c1 = c2 = 0, and the assumed form yields only one solution. Hence,

$$y_2 = \left.\frac{\partial y}{\partial c}\right|_{c=0}.$$

To calculate this derivative, let

$$M_r(c) = \frac{(c+\alpha)_r(c+\beta)_r}{(c+1)_r(c+\gamma)_r}.$$

Then

$$y = a_0\sum_{r=0}^{\infty}M_r(c)\,x^{r+c}.$$

But

$$\ln M_r(c) = \sum_{k=0}^{r-1}\Big[\ln(c+\alpha+k)+\ln(c+\beta+k)-\ln(c+1+k)-\ln(c+\gamma+k)\Big].$$

Hence,

$$M_r(c) = \exp\left(\sum_{k=0}^{r-1}\Big[\ln(c+\alpha+k)+\ln(c+\beta+k)-\ln(c+1+k)-\ln(c+\gamma+k)\Big]\right).$$

Differentiating both sides of the equation with respect to c, we get:

$$\frac{\partial M_r}{\partial c} = M_r(c)\sum_{k=0}^{r-1}\left[\frac{1}{c+\alpha+k}+\frac{1}{c+\beta+k}-\frac{1}{c+1+k}-\frac{1}{c+\gamma+k}\right].$$

Hence,

$$\frac{\partial y}{\partial c} = a_0\sum_{r=0}^{\infty}M_r(c)\,x^{r+c}\ln x + a_0\sum_{r=0}^{\infty}M_r(c)\left(\sum_{k=0}^{r-1}\left[\frac{1}{c+\alpha+k}+\frac{1}{c+\beta+k}-\frac{1}{c+1+k}-\frac{1}{c+\gamma+k}\right]\right)x^{r+c}.$$

Now, since γ = 1, the factors (c+1)r and (c+γ)r coincide. Hence,

$$\frac{\partial y}{\partial c} = a_0\sum_{r=0}^{\infty}\frac{(c+\alpha)_r(c+\beta)_r}{\left[(c+1)_r\right]^2}\,x^{r+c}\left(\ln x + \sum_{k=0}^{r-1}\left[\frac{1}{c+\alpha+k}+\frac{1}{c+\beta+k}-\frac{2}{c+1+k}\right]\right).$$

For c = 0, we get

$$y_2 = a_0\,{}_2F_1(\alpha,\beta;1;x)\ln x + a_0\sum_{r=0}^{\infty}\frac{(\alpha)_r(\beta)_r}{(r!)^2}\left(\sum_{k=0}^{r-1}\left[\frac{1}{\alpha+k}+\frac{1}{\beta+k}-\frac{2}{1+k}\right]\right)x^{r}.$$

Hence, y = C′y1 + D′y2. Let C′a0 = C and D′a0 = D. Then

$$y = C\,{}_2F_1(\alpha,\beta;1;x) + D\left[{}_2F_1(\alpha,\beta;1;x)\ln x + \sum_{r=0}^{\infty}\frac{(\alpha)_r(\beta)_r}{(r!)^2}\left(\sum_{k=0}^{r-1}\left[\frac{1}{\alpha+k}+\frac{1}{\beta+k}-\frac{2}{1+k}\right]\right)x^{r}\right].$$
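The derivative trick itself can be illustrated numerically: approximate ∂y/∂c at c = 0 by a central difference of the truncated Frobenius series and inspect the ODE residual. A sketch assuming Python with mpmath; α, β and the truncation order are arbitrary test choices:

```python
# Numerical illustration of y2 = dy/dc at c = 0 for gamma = 1.
from mpmath import mp, rf, diff, mpf

mp.dps = 30
alpha, beta, gamma = mpf('0.5'), mpf('0.75'), mpf(1)
N = 40  # series truncation order

def y(c, x):
    # truncated Frobenius series y(c, x) = sum_r M_r(c) x^(r+c)
    return sum(rf(alpha + c, r)*rf(beta + c, r)
               / (rf(1 + c, r)*rf(gamma + c, r)) * x**(r + c)
               for r in range(N))

h = mpf('1e-10')
y2 = lambda x: (y(h, x) - y(-h, x))/(2*h)  # ~ dy/dc at c = 0

def residual(f, x):
    return (x*(1 - x)*diff(f, x, 2)
            + (gamma - (1 + alpha + beta)*x)*diff(f, x, 1)
            - alpha*beta*f(x))

print(residual(y2, mpf('0.2')))  # small, limited by truncation and step size
```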

γ an integer and γ ≠ 1

γ ≤ 0

In this case the root difference 1 − γ is a positive integer. To begin with, we shall simplify matters by concentrating on a particular value of γ and generalise the result at a later stage. We shall use the value γ = −2. The indicial equation has a root at c = 0, and we see from the recurrence relation

$$a_r = \frac{(r+c+\alpha-1)(r+c+\beta-1)}{(r+c)(r+c-3)}\,a_{r-1}$$

that when r = 3 the denominator contains the factor c, which vanishes when c = 0. In this case, a solution can be obtained by putting a0 = b0c, where b0 is a constant.

With this substitution, the coefficients of x^{r+c} vanish when c = 0 and r < 3. The factor of c in the denominator of the recurrence relation cancels with that of the numerator when r ≥ 3. Hence, our solution takes the form

$$y_1 = b_0\sum_{r=3}^{\infty}\frac{(\alpha)_r(\beta)_r}{2\,r!\,(r-3)!}\,x^{r}.$$

If we start the summation at r = 0 rather than r = 3 we see that

$$y_1 = b_0\,\frac{(\alpha)_3(\beta)_3}{12}\;x^{3}\,{}_2F_1(\alpha+3,\beta+3;4;x).$$

The result (as we have written it) generalises easily. For γ = 1 − m, with m a positive integer, then

$$y_1 = b_0\,\frac{(-1)^{m-1}(\alpha)_m(\beta)_m}{(m-1)!\,m!}\;x^{m}\,{}_2F_1(\alpha+m,\beta+m;m+1;x).$$

Obviously, if γ = −2, then m = 3. The expression for y1 we have just given looks a little inelegant since we have a multiplicative constant apart from the usual arbitrary multiplicative constant b0. Later, we shall see that we can recast things in such a way that this extra constant never appears.
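That the surviving solution really is, up to a constant, x^m ₂F₁(α+m, β+m; m+1; x) can be verified numerically. A sketch assuming Python with mpmath; α and β are arbitrary test values, and m = 3 reproduces the γ = −2 example above:

```python
# Check the first solution for gamma = 1 - m against the ODE.
from mpmath import mp, hyp2f1, diff, mpf

mp.dps = 30
alpha, beta, m = mpf('0.4'), mpf('1.3'), 3
gamma = 1 - m  # = -2, the particular value used above

y1 = lambda x: x**m * hyp2f1(alpha + m, beta + m, m + 1, x)

def residual(f, x):
    return (x*(1 - x)*diff(f, x, 2)
            + (gamma - (1 + alpha + beta)*x)*diff(f, x, 1)
            - alpha*beta*f(x))

for x in [mpf('0.15'), mpf('0.5')]:
    assert abs(residual(y1, x)) < mpf('1e-15')
print("y1 solves the hypergeometric equation for gamma =", gamma)
```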

The other root to the indicial equation is c = 1 − γ = 3, but this gives us (apart from a multiplicative constant) the same result as found using c = 0. This means we must take the partial derivative (w.r.t. c) of the usual trial solution in order to find a second independent solution. If we define the linear operator L as

$$L = x(1-x)\frac{d^2}{dx^2} + \left[\gamma-(1+\alpha+\beta)x\right]\frac{d}{dx} - \alpha\beta,$$

then since γ = −2 in our case,

$$L\left[\sum_{r=0}^{\infty}a_r(c)\,x^{r+c}\right] = b_0\,c^2(c-3)\,x^{c-1}.$$

(We insist that b0 ≠ 0.) Taking the partial derivative w.r.t. c,

$$\frac{\partial}{\partial c}\,L\left[\sum_{r=0}^{\infty}a_r(c)\,x^{r+c}\right] = b_0\left[(3c^2-6c)\,x^{c-1} + c^2(c-3)\,x^{c-1}\ln x\right].$$

Note that we must evaluate the partial derivative at c = 0 (and not at the other root c = 3). Otherwise the right hand side is non-zero in the above, and we do not have a solution of Ly = 0. The factor c is not cancelled for the terms r = 0, 1 and 2. This part of the second independent solution is

$$\left.\frac{\partial}{\partial c}\Big[a_0(c)\,x^{c}+a_1(c)\,x^{1+c}+a_2(c)\,x^{2+c}\Big]\right|_{c=0} = b_0\left[1 - \frac{\alpha\beta}{2}\,x + \frac{(\alpha)_2(\beta)_2}{4}\,x^2\right].$$

Now we can turn our attention to the terms where the factor c cancels. First

$$a_3 = b_0\,\frac{(\alpha+c)_3(\beta+c)_3}{(1+c)_3\,(c-2)(c-1)}.$$

After this, the recurrence relations give us

$$a_4(c) = a_3(c)\,\frac{(c+\alpha+3)(c+\beta+3)}{(c+4)(c+1)}, \qquad a_5(c) = a_4(c)\,\frac{(c+\alpha+4)(c+\beta+4)}{(c+5)(c+2)}, \quad\ldots$$

So, if r ≥ 3 we have

$$a_r = b_0\,\frac{(\alpha+c)_r(\beta+c)_r}{(1+c)_r\,(c-2)(c-1)(c+1)_{r-3}}.$$

We need the partial derivatives

$$\left.\frac{\partial a_3}{\partial c}\right|_{c=0} = \frac{b_0(\alpha)_3(\beta)_3}{2\cdot 3!}\left[\sum_{k=0}^{2}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_2 - H_3\right].$$

Similarly, we can write

$$\left.\frac{\partial a_4}{\partial c}\right|_{c=0} = \frac{b_0(\alpha)_4(\beta)_4}{2\cdot 4!\,1!}\left[\sum_{k=0}^{3}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_2 - H_4 - H_1\right]$$

and

$$\left.\frac{\partial a_5}{\partial c}\right|_{c=0} = \frac{b_0(\alpha)_5(\beta)_5}{2\cdot 5!\,2!}\left[\sum_{k=0}^{4}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_2 - H_5 - H_2\right].$$

It becomes clear that for r ≥ 3

$$\left.\frac{\partial a_r}{\partial c}\right|_{c=0} = \frac{b_0(\alpha)_r(\beta)_r}{2\,r!\,(r-3)!}\left[\sum_{k=0}^{r-1}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_2 - H_r - H_{r-3}\right].$$

Here, H_k is the kth partial sum of the harmonic series, and by definition H_0 = 0 and H_k = \sum_{j=1}^{k} 1/j.

Putting these together, for the case γ = −2 we have a second solution

$$y_2 = y_1\ln x + b_0\left[1 - \frac{\alpha\beta}{2}\,x + \frac{(\alpha)_2(\beta)_2}{4}\,x^2\right] + b_0\sum_{r=3}^{\infty}\frac{(\alpha)_r(\beta)_r}{2\,r!\,(r-3)!}\left[\sum_{k=0}^{r-1}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_2 - H_r - H_{r-3}\right]x^{r}.$$

The two independent solutions for γ = 1 − m (where m is a positive integer) are then

$$y_1 = b_0\,\frac{(-1)^{m-1}(\alpha)_m(\beta)_m}{(m-1)!\,m!}\;x^{m}\,{}_2F_1(\alpha+m,\beta+m;m+1;x)$$

and

$$y_2 = y_1\ln x + b_0\sum_{r=0}^{m-1}\frac{(\alpha)_r(\beta)_r}{r!\,(1-m)_r}\,x^{r} + \sum_{r=m}^{\infty}\left.\frac{\partial a_r}{\partial c}\right|_{c=0}x^{r},$$

where the derivative terms carry harmonic-number sums exactly as in the case γ = −2 above.

The general solution is as usual y = Ay1 + By2, where A and B are arbitrary constants. Now, if the reader consults a "standard solution" for this case, such as given by Abramowitz and Stegun [1] in §15.5.21 (which we shall write down at the end of the next section), it shall be found that the solution we have found looks somewhat different from the standard solution. In our solution for γ = 1 − m, the first term in the infinite series part of y2 is a term in x^m. The first term in the corresponding infinite series in the standard solution is a term in x^{m+1}; the term in x^m is missing from the standard solution. Nonetheless, the two solutions are entirely equivalent.

The "Standard" Form of the Solution γ ≤ 0

The reason for the apparent discrepancy between the solution given above and the standard solution in Abramowitz and Stegun [1] §15.5.21 is that there are an infinite number of ways in which to represent the two independent solutions of the hypergeometric ODE. In the last section, for instance, we replaced a0 with b0c. Suppose though, we are given some function h(c) which is continuous and finite everywhere in an arbitrarily small interval about c = 0. Suppose we are also given

$$h(0) \neq 0$$

and

$$\left.\frac{dh}{dc}\right|_{c=0} \text{ finite}.$$

Then, if instead of replacing a0 with b0c we replace a0 with h(c)c, we still find we have a valid solution of the hypergeometric equation. Clearly, we have an infinity of possibilities for h(c). There is however a "natural choice" for h(c). Suppose that a_m(c)x^{m+c} is the first non-zero term in the first solution with c = 0. If we make h(c) the reciprocal of the factor multiplying b0 in a_m(c), then we won't have a multiplicative constant involved in y1 as we did in the previous section. From another point of view, we get the same result if we "insist" that a_m is independent of c, and find a_0, a_1, ..., a_{m−1} by using the recurrence relations backwards.

For the first solution, the function h(c) gives us (apart from a multiplicative constant) the same y1 as we would have obtained using b0c. Suppose that using b0c gives rise to two independent solutions y1 and y2. In the following we shall denote the solutions arrived at given some h(c) as ỹ1 and ỹ2.

The second solution requires us to take the partial derivative w.r.t. c, and substituting the usual trial solution gives us

$$L\left[\frac{\partial}{\partial c}\sum_{r=0}^{\infty}a_r(c)\,x^{r+c}\right] = \frac{\partial}{\partial c}\left[h(c)\,c^2(c+\gamma-1)\,x^{c-1}\right].$$

The operator L is the same linear operator discussed in the previous section. That is to say, the hypergeometric ODE is represented as Ly = 0.

Evaluating the left hand side at c = 0 will give us a second independent solution. Note that this second solution ỹ2 is in fact a linear combination of y1 and y2.

Any two independent linear combinations (ỹ1 and ỹ2) of y1 and y2 are independent solutions of Ly = 0.

The general solution can be written as a linear combination of ỹ1 and ỹ2 just as well as linear combinations of y1 and y2.


We shall review the special case where γ = −2 that was considered in the last section. If we "insist" that a_3 = 1 independently of c, then the recurrence relations (used backwards) yield

$$a_2 = \frac{c(c+3)}{(\alpha+c+2)(\beta+c+2)}, \qquad a_1 = \frac{c(c-1)(c+2)(c+3)}{(\alpha+c+1)(\beta+c+1)(\alpha+c+2)(\beta+c+2)}$$

and

$$a_0 = \frac{c(c-1)(c-2)(c+1)(c+2)(c+3)}{(\alpha+c)(\beta+c)(\alpha+c+1)(\beta+c+1)(\alpha+c+2)(\beta+c+2)}.$$

These three coefficients are all zero at c = 0 as expected. We have three terms involved in the second solution by taking the partial derivative w.r.t. c; we denote the sum of the three terms involving these coefficients as S(x), where

$$S(x) = \left.\frac{\partial a_0}{\partial c}\right|_{c=0} + \left.\frac{\partial a_1}{\partial c}\right|_{c=0}x + \left.\frac{\partial a_2}{\partial c}\right|_{c=0}x^2 = \frac{12}{(\alpha)_3(\beta)_3} - \frac{6}{(\alpha+1)_2(\beta+1)_2}\,x + \frac{3}{(\alpha+2)_1(\beta+2)_1}\,x^2.$$

The reader may confirm that we can tidy this up and make it easy to generalise by putting

$$S(x) = 3!\sum_{r=0}^{2}\frac{(-1)^{2-r}\,(2-r)!}{r!\,(\alpha+r)_{3-r}(\beta+r)_{3-r}}\,x^{r}.$$

Next we can turn to the other coefficients; the recurrence relations yield

$$a_r(c) = \frac{(\alpha+c+3)_{r-3}(\beta+c+3)_{r-3}}{(4+c)_{r-3}(1+c)_{r-3}}, \qquad r \ge 3.$$

Setting c = 0 gives us

$$a_r(0) = \frac{(\alpha+3)_{r-3}(\beta+3)_{r-3}}{(4)_{r-3}\,(r-3)!}, \qquad r \ge 3.$$

This is (apart from the multiplicative constant) the same as y1. Now, to find ỹ2 we need the partial derivatives

$$\left.\frac{\partial a_4}{\partial c}\right|_{c=0} = \frac{(\alpha+3)(\beta+3)}{4}\left[\frac{1}{\alpha+3}+\frac{1}{\beta+3}-\frac{1}{4}-1\right].$$

Then

$$\left.\frac{\partial a_5}{\partial c}\right|_{c=0} = \frac{(\alpha+3)_2(\beta+3)_2}{(4)_2\,2!}\left[\frac{1}{\alpha+3}+\frac{1}{\alpha+4}+\frac{1}{\beta+3}+\frac{1}{\beta+4}-\frac{1}{4}-\frac{1}{5}-1-\frac{1}{2}\right];$$

we can re-write this as

$$\left.\frac{\partial a_5}{\partial c}\right|_{c=0} = a_5(0)\left[\sum_{k=3}^{4}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_3 - H_5 - H_2\right].$$

The pattern soon becomes clear, and for r ≥ 3

$$\left.\frac{\partial a_r}{\partial c}\right|_{c=0} = a_r(0)\left[\sum_{k=3}^{r-1}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_3 - H_r - H_{r-3}\right].$$

Clearly, for r < 3,

$$a_r(0) = 0.$$

The infinite series part of ỹ2 is S_∞(x), where

$$S_{\infty}(x) = \sum_{r=3}^{\infty}\left.\frac{\partial a_r}{\partial c}\right|_{c=0}x^{r} = \sum_{r=3}^{\infty}a_r(0)\left[\sum_{k=3}^{r-1}\left(\frac{1}{\alpha+k}+\frac{1}{\beta+k}\right) + H_3 - H_r - H_{r-3}\right]x^{r}.$$

Now we can write (disregarding the arbitrary constant) for γ = −2:

$$\tilde{y}_2 = \tilde{y}_1\ln x + S(x) + S_{\infty}(x).$$

Some authors prefer to express the finite sums in this last result using the digamma function ψ(z). In particular, the following results are used:

$$H_n = \psi(n+1) + \gamma_E.$$

Here, γ_E is the Euler–Mascheroni constant. Also

$$\sum_{k=0}^{n-1}\frac{1}{z+k} = \psi(z+n) - \psi(z).$$
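Both identities are easy to spot-check numerically. A minimal sketch assuming Python with mpmath; n and z are arbitrary test values:

```python
# Quick check of the digamma identities quoted above.
from mpmath import digamma, euler, mpf

n, z = 7, mpf('0.6')
H_n = sum(mpf(1)/k for k in range(1, n + 1))
assert abs(H_n - (digamma(n + 1) + euler)) < mpf('1e-12')
S = sum(1/(z + k) for k in range(n))
assert abs(S - (digamma(z + n) - digamma(z))) < mpf('1e-12')
print("digamma identities verified")
```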

With these results we obtain the form of the second solution given in Abramowitz and Stegun [1] §15.5.21.

The "Standard" Form of the Solution γ > 1

In this section, we shall concentrate on the "standard solution", and we shall not replace a0 with h(c)c. We shall put γ = 1 + m where m is a positive integer. For the root c = 1 − γ = −m of the indicial equation we had

$$a_r = \frac{(r+c+\alpha-1)(r+c+\beta-1)}{(r+c)(r+c+\gamma-1)}\,a_{r-1},$$

where c = −m, in which case we are in trouble if r = m. For instance, if m = 2, the denominator in the recurrence relations vanishes for r = 2. We can use exactly the same methods that we have just used for the standard solution in the last section. We shall not (in the instance where γ > 1) replace a0 with b0(c + m), as this will not give us the standard form of solution that we are after. Rather, we shall "insist" that a_m is independent of c, as we did in the standard solution for γ ≤ 0 in the last section. (Recall that this defined the function h(c), and that a0 will now be replaced with h(c)(c + m).) Then we may work out the coefficients of a0 to a_{m−1} as functions of c using the recurrence relations backwards. There is nothing new to add here, and the reader may use the same methods as used in the last section to find the results of [1] §15.5.18 and §15.5.19.

Note that the powers of x in the finite sum part of the second solution are now negative, so that this sum diverges as x → 0.

Solution around x = 1

Let us now study the singular point x = 1. To see if it is regular, we study the following limits:

$$\lim_{x\to 1}\frac{(x-1)\,P_1(x)}{P_2(x)} = \lim_{x\to 1}\frac{(x-1)\left[\gamma-(1+\alpha+\beta)x\right]}{x(1-x)} = 1+\alpha+\beta-\gamma, \qquad \lim_{x\to 1}\frac{(x-1)^2\,P_0(x)}{P_2(x)} = \lim_{x\to 1}\frac{-\alpha\beta\,(x-1)^2}{x(1-x)} = 0.$$

Hence, both limits exist and x = 1 is a regular singular point. Now, instead of assuming a solution in the form

$$y = \sum_{r=0}^{\infty}a_r(x-1)^{r+c},$$

we will try to express the solutions of this case in terms of the solutions for the point x = 0. We proceed as follows: we had the hypergeometric equation

$$x(1-x)y'' + \left[\gamma-(1+\alpha+\beta)x\right]y' - \alpha\beta\,y = 0.$$

Let z = 1 − x. Then

$$\frac{dy}{dx} = -\frac{dy}{dz}, \qquad \frac{d^2y}{dx^2} = \frac{d^2y}{dz^2}.$$

Hence, the equation takes the form

$$z(1-z)\frac{d^2y}{dz^2} + \left[(\alpha+\beta-\gamma+1) - (1+\alpha+\beta)z\right]\frac{dy}{dz} - \alpha\beta\,y = 0.$$

Since z = 1 − x, the solution of the hypergeometric equation at x = 1 is the same as the solution of this equation at z = 0. But the solution at z = 0 is identical to the solution we obtained for the point x = 0, if we replace each γ by α + β − γ + 1. Hence, to get the solutions, we just make this substitution in the previous results. For x = 0, c1 = 0 and c2 = 1 − γ. Hence, in our case, c1 = 0 while c2 = γ − α − β. Let us now write the solutions. In what follows we have replaced each z by 1 − x.

Analysis of the solution in terms of the difference γ − α − β of the two roots

To simplify notation, from now on denote γ − α − β by Δ; therefore γ = Δ + α + β.

Δ not an integer

Making the substitutions in the result for γ not an integer, we get

$$y = A\,{}_2F_1(\alpha,\beta;\alpha+\beta-\gamma+1;1-x) + B\,(1-x)^{\gamma-\alpha-\beta}\,{}_2F_1(\gamma-\beta,\gamma-\alpha;\gamma-\alpha-\beta+1;1-x),$$

where A and B are arbitrary constants.
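These substituted solutions can be checked against the original equation. A sketch assuming Python with mpmath; the parameter values are arbitrary, with Δ deliberately not an integer:

```python
# Check that the x = 1 solutions satisfy the original hypergeometric equation.
from mpmath import mp, hyp2f1, diff, mpf

mp.dps = 30
alpha, beta, gamma = mpf('0.5'), mpf('0.8'), mpf('1.9')  # Delta = 0.6

def residual(f, x):
    return (x*(1 - x)*diff(f, x, 2)
            + (gamma - (1 + alpha + beta)*x)*diff(f, x, 1)
            - alpha*beta*f(x))

y1 = lambda x: hyp2f1(alpha, beta, alpha + beta - gamma + 1, 1 - x)
y2 = lambda x: ((1 - x)**(gamma - alpha - beta)
                * hyp2f1(gamma - beta, gamma - alpha, gamma - alpha - beta + 1, 1 - x))

for x in [mpf('0.6'), mpf('0.9')]:
    assert abs(residual(y1, x)) < mpf('1e-15')
    assert abs(residual(y2, x)) < mpf('1e-15')
print("x = 1 solutions verified")
```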

Δ = 0

Here α + β − γ + 1 = 1, so this is the logarithmic case γ = 1 treated above; the solutions are obtained from that case by substituting α + β − γ + 1 (that is, 1) for γ and 1 − x for x.

Δ is a non-zero integer

Δ > 0

Here α + β − γ + 1 = 1 − Δ is an integer ≤ 0, so the solutions follow from the case γ ≤ 0 above, with γ replaced by α + β − γ + 1 and x by 1 − x.

Δ < 0

Here α + β − γ + 1 = 1 − Δ is an integer ≥ 2, so the solutions follow from the case γ > 1 above, with the same substitutions.

Solution around infinity

Finally, we study the singularity as x → ∞. Since we can't study this directly, we let x = s−1. Then the solution of the equation as x → ∞ is identical to the solution of the modified equation when s = 0. We had

$$\frac{dy}{dx} = -s^2\frac{dy}{ds}, \qquad \frac{d^2y}{dx^2} = s^4\frac{d^2y}{ds^2} + 2s^3\frac{dy}{ds}.$$

Hence, the equation takes the new form

$$\frac{1}{s}\left(1-\frac{1}{s}\right)\left[s^4\frac{d^2y}{ds^2} + 2s^3\frac{dy}{ds}\right] + \left[\gamma - \frac{1+\alpha+\beta}{s}\right]\left(-s^2\frac{dy}{ds}\right) - \alpha\beta\,y = 0,$$

which reduces to

$$s^2(1-s)\frac{d^2y}{ds^2} + \left[(\gamma-2)s^2 + (1-\alpha-\beta)s\right]\frac{dy}{ds} + \alpha\beta\,y = 0.$$
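The change of variable can be verified symbolically. A minimal sketch assuming Python with sympy; Y(s) denotes y as a function of s:

```python
# Substitute x = 1/s into the hypergeometric operator and confirm it reduces
# (after multiplying through by -1) to the stated equation in s.
import sympy as sp

s, alpha, beta, gamma = sp.symbols('s alpha beta gamma')
Y = sp.Function('Y')

x = 1/s
dy_dx = -s**2*Y(s).diff(s)                             # dy/dx = -s^2 dy/ds
d2y_dx2 = s**4*Y(s).diff(s, 2) + 2*s**3*Y(s).diff(s)   # d2y/dx2
hyp = x*(1 - x)*d2y_dx2 + (gamma - (1 + alpha + beta)*x)*dy_dx - alpha*beta*Y(s)
target = -(s**2*(1 - s)*Y(s).diff(s, 2)
           + ((gamma - 2)*s**2 + (1 - alpha - beta)*s)*Y(s).diff(s)
           + alpha*beta*Y(s))
print(sp.simplify(sp.expand(hyp - target)))  # prints 0
```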

Let

$$P_2(s) = s^2(1-s), \qquad P_1(s) = (\gamma-2)s^2 + (1-\alpha-\beta)s, \qquad P_0(s) = \alpha\beta.$$

As we said, we shall only study the solution when s = 0. As we can see, this is a singular point since P2(0) = 0. To see if it is regular,

$$\lim_{s\to 0}\frac{(s-0)\,P_1(s)}{P_2(s)} = \lim_{s\to 0}\frac{(\gamma-2)s + (1-\alpha-\beta)}{1-s} = 1-\alpha-\beta, \qquad \lim_{s\to 0}\frac{(s-0)^2\,P_0(s)}{P_2(s)} = \lim_{s\to 0}\frac{\alpha\beta\,s^2}{s^2(1-s)} = \alpha\beta.$$

Hence, both limits exist and s = 0 is a regular singular point. Therefore, we assume the solution takes the form

$$y = \sum_{r=0}^{\infty}a_r s^{r+c}$$

with a0 ≠ 0. Hence,

$$y' = \sum_{r=0}^{\infty}a_r(r+c)\,s^{r+c-1}, \qquad y'' = \sum_{r=0}^{\infty}a_r(r+c)(r+c-1)\,s^{r+c-2}.$$

Substituting in the modified hypergeometric equation we get

$$s^2(1-s)\sum_{r=0}^{\infty}a_r(r+c)(r+c-1)s^{r+c-2} + \left[(\gamma-2)s^2+(1-\alpha-\beta)s\right]\sum_{r=0}^{\infty}a_r(r+c)s^{r+c-1} + \alpha\beta\sum_{r=0}^{\infty}a_r s^{r+c} = 0.$$

And therefore:

$$\sum_{r=0}^{\infty}a_r(r+c)(r+c-1)s^{r+c} - \sum_{r=0}^{\infty}a_r(r+c)(r+c-1)s^{r+c+1} + (\gamma-2)\sum_{r=0}^{\infty}a_r(r+c)s^{r+c+1} + (1-\alpha-\beta)\sum_{r=0}^{\infty}a_r(r+c)s^{r+c} + \alpha\beta\sum_{r=0}^{\infty}a_r s^{r+c} = 0.$$

i.e.,

$$\sum_{r=0}^{\infty}a_r\left[(r+c)(r+c-\alpha-\beta)+\alpha\beta\right]s^{r+c} - \sum_{r=0}^{\infty}a_r(r+c)(r+c+1-\gamma)s^{r+c+1} = 0.$$

In order to simplify this equation, we need all powers to be the same, equal to r + c, the smallest power. Hence, we switch the indices as follows:

$$\sum_{r=0}^{\infty}a_r\left[(r+c)(r+c-\alpha-\beta)+\alpha\beta\right]s^{r+c} - \sum_{r=1}^{\infty}a_{r-1}(r+c-1)(r+c-\gamma)s^{r+c} = 0.$$

Thus, isolating the first term of the sums starting from 0 we get

$$a_0\left[c(c-\alpha-\beta)+\alpha\beta\right]s^{c} + \sum_{r=1}^{\infty}\Big\{a_r\left[(r+c)(r+c-\alpha-\beta)+\alpha\beta\right] - a_{r-1}(r+c-1)(r+c-\gamma)\Big\}s^{r+c} = 0.$$

Now, from the linear independence of all powers of s (i.e., of the functions 1, s, s2, ...), the coefficients of sk vanish for all k. Hence, from the first term we have

$$a_0\left[c(c-\alpha-\beta)+\alpha\beta\right] = 0,$$

which is the indicial equation. Since a0 ≠ 0, we have

$$c^2 - (\alpha+\beta)c + \alpha\beta = (c-\alpha)(c-\beta) = 0.$$

Hence, c1 = α and c2 = β.

Also, from the rest of the terms we have

$$a_r\left[(r+c)(r+c-\alpha-\beta)+\alpha\beta\right] = a_{r-1}(r+c-1)(r+c-\gamma).$$

Hence,

$$a_r = \frac{(r+c-1)(r+c-\gamma)}{(r+c)(r+c-\alpha-\beta)+\alpha\beta}\,a_{r-1}.$$

But

$$(r+c)(r+c-\alpha-\beta)+\alpha\beta = (r+c-\alpha)(r+c-\beta).$$

Hence, we get the recurrence relation

$$a_r = \frac{(r+c-1)(r+c-\gamma)}{(r+c-\alpha)(r+c-\beta)}\,a_{r-1}, \qquad r \ge 1.$$

Let's now simplify this relation by giving ar in terms of a0 instead of ar−1. From the recurrence relation,

$$a_1 = \frac{c\,(c+1-\gamma)}{(c+1-\alpha)(c+1-\beta)}\,a_0, \qquad a_2 = \frac{(c+1)(c+2-\gamma)}{(c+2-\alpha)(c+2-\beta)}\,a_1 = \frac{(c)_2(c-\gamma+1)_2}{(c-\alpha+1)_2(c-\beta+1)_2}\,a_0, \quad\ldots$$

As we can see,

$$a_r = \frac{(c)_r(c-\gamma+1)_r}{(c-\alpha+1)_r(c-\beta+1)_r}\,a_0.$$

Hence, our assumed solution takes the form

$$y = a_0\sum_{r=0}^{\infty}\frac{(c)_r(c-\gamma+1)_r}{(c-\alpha+1)_r(c-\beta+1)_r}\,s^{r+c}.$$

We are now ready to study the solutions corresponding to the different cases for c1 − c2 = α − β.

Analysis of the solution in terms of the difference α − β of the two roots

α − β not an integer

Then y1 = y|c = α and y2 = y|c = β. Since

$$y = a_0\sum_{r=0}^{\infty}\frac{(c)_r(c-\gamma+1)_r}{(c-\alpha+1)_r(c-\beta+1)_r}\,s^{r+c},$$

we have

$$y_1 = a_0\sum_{r=0}^{\infty}\frac{(\alpha)_r(\alpha-\gamma+1)_r}{r!\,(\alpha-\beta+1)_r}\,s^{r+\alpha} = a_0\,s^{\alpha}\,{}_2F_1(\alpha,\alpha-\gamma+1;\alpha-\beta+1;s),$$

$$y_2 = a_0\sum_{r=0}^{\infty}\frac{(\beta)_r(\beta-\gamma+1)_r}{r!\,(\beta-\alpha+1)_r}\,s^{r+\beta} = a_0\,s^{\beta}\,{}_2F_1(\beta,\beta-\gamma+1;\beta-\alpha+1;s).$$

Hence, y = A′y1 + B′y2. Let A′a0 = A and B′a0 = B. Then, noting that s = x−1,

$$y = A\,x^{-\alpha}\,{}_2F_1\!\left(\alpha,\alpha-\gamma+1;\alpha-\beta+1;\tfrac{1}{x}\right) + B\,x^{-\beta}\,{}_2F_1\!\left(\beta,\beta-\gamma+1;\beta-\alpha+1;\tfrac{1}{x}\right).$$
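These large-x solutions can be checked numerically against the original equation. A sketch assuming Python with mpmath; the parameter values are arbitrary, with α − β deliberately not an integer:

```python
# Check the solutions around infinity against the original equation.
from mpmath import mp, hyp2f1, diff, mpf

mp.dps = 30
alpha, beta, gamma = mpf('0.5'), mpf('1.2'), mpf('0.9')  # alpha - beta = -0.7

def residual(f, x):
    return (x*(1 - x)*diff(f, x, 2)
            + (gamma - (1 + alpha + beta)*x)*diff(f, x, 1)
            - alpha*beta*f(x))

y1 = lambda x: x**(-alpha)*hyp2f1(alpha, alpha - gamma + 1, alpha - beta + 1, 1/x)
y2 = lambda x: x**(-beta)*hyp2f1(beta, beta - gamma + 1, beta - alpha + 1, 1/x)

for x in [mpf('3'), mpf('10')]:
    assert abs(residual(y1, x)) < mpf('1e-15')
    assert abs(residual(y2, x)) < mpf('1e-15')
print("solutions at infinity verified")
```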

α − β = 0

Then y1 = y|c = α. Since α = β, the two roots coincide: c1 = c2 = α. Hence,

$$y_2 = \left.\frac{\partial y}{\partial c}\right|_{c=\alpha}.$$

To calculate this derivative, let

$$M_r(c) = \frac{(c)_r(c-\gamma+1)_r}{(c-\alpha+1)_r(c-\beta+1)_r}.$$

Then using the method in the case γ = 1 above, we get

$$\frac{\partial M_r}{\partial c} = M_r(c)\sum_{k=0}^{r-1}\left[\frac{1}{c+k}+\frac{1}{c-\gamma+1+k}-\frac{1}{c-\alpha+1+k}-\frac{1}{c-\beta+1+k}\right].$$

Now,

$$\frac{\partial}{\partial c}\,s^{r+c} = s^{r+c}\ln s.$$

Hence,

$$\frac{\partial y}{\partial c} = a_0\sum_{r=0}^{\infty}M_r(c)\,s^{r+c}\ln s + a_0\sum_{r=0}^{\infty}M_r(c)\left(\sum_{k=0}^{r-1}\left[\frac{1}{c+k}+\frac{1}{c-\gamma+1+k}-\frac{1}{c-\alpha+1+k}-\frac{1}{c-\beta+1+k}\right]\right)s^{r+c}.$$

Therefore, for c = α = β:

$$y_2 = a_0\,s^{\alpha}\,{}_2F_1(\alpha,\alpha-\gamma+1;1;s)\ln s + a_0\,s^{\alpha}\sum_{r=0}^{\infty}\frac{(\alpha)_r(\alpha-\gamma+1)_r}{(r!)^2}\left(\sum_{k=0}^{r-1}\left[\frac{1}{\alpha+k}+\frac{1}{\alpha-\gamma+1+k}-\frac{2}{1+k}\right]\right)s^{r}.$$

Hence, y = C′y1 + D′y2. Let C′a0 = C and D′a0 = D. Noting that s = x−1,

$$y = C\,x^{-\alpha}\,{}_2F_1\!\left(\alpha,\alpha-\gamma+1;1;\tfrac{1}{x}\right) + D\left[x^{-\alpha}\,{}_2F_1\!\left(\alpha,\alpha-\gamma+1;1;\tfrac{1}{x}\right)\ln\frac{1}{x} + x^{-\alpha}\sum_{r=0}^{\infty}\frac{(\alpha)_r(\alpha-\gamma+1)_r}{(r!)^2}\left(\sum_{k=0}^{r-1}\left[\frac{1}{\alpha+k}+\frac{1}{\alpha-\gamma+1+k}-\frac{2}{1+k}\right]\right)x^{-r}\right].$$
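The same derivative trick can be illustrated numerically at the point at infinity: approximate ∂y/∂c at c = α by a central difference of the truncated series in s = 1/x and inspect the ODE residual. A sketch assuming Python with mpmath; α, γ and the truncation order are arbitrary test choices:

```python
# Numerical illustration of y2 = dy/dc at c = alpha for alpha = beta.
from mpmath import mp, rf, diff, mpf

mp.dps = 30
alpha, gamma = mpf('0.8'), mpf('0.3')
beta = alpha  # the case alpha - beta = 0
N = 40

def y(c, x):
    s = 1/x
    return sum(rf(c, r)*rf(c - gamma + 1, r)
               / (rf(c - alpha + 1, r)*rf(c - beta + 1, r)) * s**(r + c)
               for r in range(N))

h = mpf('1e-10')
y2 = lambda x: (y(alpha + h, x) - y(alpha - h, x))/(2*h)

def residual(f, x):
    return (x*(1 - x)*diff(f, x, 2)
            + (gamma - (1 + alpha + beta)*x)*diff(f, x, 1)
            - alpha*beta*f(x))

print(residual(y2, mpf('5')))  # small, limited by truncation and step size
```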

α − β an integer and α − β ≠ 0

α − β > 0

From the recurrence relation

$$a_r = \frac{(r+c-1)(r+c-\gamma)}{(r+c-\alpha)(r+c-\beta)}\,a_{r-1}$$

we see that when c = β (the smaller root), aα−β → ∞. Hence, we must make the substitution a0 = b0(c − ci), where ci is the root for which our solution is infinite. Hence, we take a0 = b0(c − β) and our assumed solution takes the new form

$$y_b = b_0\sum_{r=0}^{\infty}\frac{(c-\beta)\,(c)_r(c-\gamma+1)_r}{(c-\alpha+1)_r(c-\beta+1)_r}\,s^{r+c}.$$

Then y1 = yb|c = β. As we can see, all terms before

$$\frac{(c-\beta)\,(c)_{\alpha-\beta}(c-\gamma+1)_{\alpha-\beta}}{(c-\alpha+1)_{\alpha-\beta}(c-\beta+1)_{\alpha-\beta}}\,s^{\alpha-\beta+c}$$

vanish because of the c − β in the numerator.

But starting from this term, the c − β in the numerator vanishes. To see this, note that

$$(c-\alpha+1)_{\alpha-\beta} = (c-\alpha+1)(c-\alpha+2)\cdots(c-\beta).$$

Hence, our solution takes the form

$$y_1 = b_0\,\frac{(-1)^{\alpha-\beta-1}}{(\alpha-\beta-1)!}\sum_{r=\alpha-\beta}^{\infty}\frac{(\beta)_r(\beta-\gamma+1)_r}{r!\,(r-\alpha+\beta)!}\,s^{r+\beta}.$$

Now,

$$y_2 = \left.\frac{\partial y_b}{\partial c}\right|_{c=\beta}.$$

To calculate this derivative, let

$$M_r(c) = \frac{(c-\beta)\,(c)_r(c-\gamma+1)_r}{(c-\alpha+1)_r(c-\beta+1)_r}.$$

Then using the method in the case γ = 1 above we get

$$\frac{\partial M_r}{\partial c} = M_r(c)\left[\frac{1}{c-\beta} + \sum_{k=0}^{r-1}\left(\frac{1}{c+k}+\frac{1}{c-\gamma+1+k}-\frac{1}{c-\alpha+1+k}-\frac{1}{c-\beta+1+k}\right)\right].$$

Now,

$$\frac{\partial}{\partial c}\,s^{r+c} = s^{r+c}\ln s.$$

Hence,

$$\frac{\partial y_b}{\partial c} = b_0\sum_{r=0}^{\infty}M_r(c)\,s^{r+c}\ln s + b_0\sum_{r=0}^{\infty}\frac{\partial M_r}{\partial c}\,s^{r+c}.$$

Hence,

$$y_2 = y_1\ln s + b_0\sum_{r=0}^{\infty}\left.\frac{\partial M_r}{\partial c}\right|_{c=\beta}s^{r+\beta},$$

where each limit is finite: for r < α − β the factor 1/(c − β) is cancelled by the factor c − β in M_r(c), while for r ≥ α − β it is cancelled by the vanishing factor in (c − α + 1)_r.

Evaluating at c = β in this way we get y2; the other root c = α would just reproduce y1 apart from a multiplicative constant. Hence, y = E′y1 + F′y2. Let E′b0 = E and F′b0 = F. Noting that s = x−1, we obtain the general solution in powers of 1/x.

α − β < 0

From the symmetry of the situation here, we see that the solutions for α − β < 0 are obtained from those of the case α − β > 0 by interchanging α and β.


References

  1. Abramowitz, Milton; Stegun, Irene A. (1964). Handbook of Mathematical Functions. New York: Dover. ISBN 978-0-48-661272-0.