Diophantine approximation

Last updated

Best rational approximants for p (green circle), e (blue diamond), ph (pink oblong), ([?]3)/2 (grey hexagon), 1/[?]2 (red octagon) and 1/[?]3 (orange triangle) calculated from their continued fraction expansions, plotted as slopes y/x with errors from their true values (black dashes)
.mw-parser-output .hlist dl,.mw-parser-output .hlist ol,.mw-parser-output .hlist ul{margin:0;padding:0}.mw-parser-output .hlist dd,.mw-parser-output .hlist dt,.mw-parser-output .hlist li{margin:0;display:inline}.mw-parser-output .hlist.inline,.mw-parser-output .hlist.inline dl,.mw-parser-output .hlist.inline ol,.mw-parser-output .hlist.inline ul,.mw-parser-output .hlist dl dl,.mw-parser-output .hlist dl ol,.mw-parser-output .hlist dl ul,.mw-parser-output .hlist ol dl,.mw-parser-output .hlist ol ol,.mw-parser-output .hlist ol ul,.mw-parser-output .hlist ul dl,.mw-parser-output .hlist ul ol,.mw-parser-output .hlist ul ul{display:inline}.mw-parser-output .hlist .mw-empty-li{display:none}.mw-parser-output .hlist dt::after{content:": "}.mw-parser-output .hlist dd::after,.mw-parser-output .hlist li::after{content:" * ";font-weight:bold}.mw-parser-output .hlist dd:last-child::after,.mw-parser-output .hlist dt:last-child::after,.mw-parser-output .hlist li:last-child::after{content:none}.mw-parser-output .hlist dd dd:first-child::before,.mw-parser-output .hlist dd dt:first-child::before,.mw-parser-output .hlist dd li:first-child::before,.mw-parser-output .hlist dt dd:first-child::before,.mw-parser-output .hlist dt dt:first-child::before,.mw-parser-output .hlist dt li:first-child::before,.mw-parser-output .hlist li dd:first-child::before,.mw-parser-output .hlist li dt:first-child::before,.mw-parser-output .hlist li li:first-child::before{content:" (";font-weight:normal}.mw-parser-output .hlist dd dd:last-child::after,.mw-parser-output .hlist dd dt:last-child::after,.mw-parser-output .hlist dd li:last-child::after,.mw-parser-output .hlist dt dd:last-child::after,.mw-parser-output .hlist dt dt:last-child::after,.mw-parser-output .hlist dt li:last-child::after,.mw-parser-output .hlist li dd:last-child::after,.mw-parser-output .hlist li dt:last-child::after,.mw-parser-output .hlist li li:last-child::after{content:")";font-weight:normal}.mw-parser-output .hlist ol{counter-reset:listitem}.mw-parser-output .hlist ol>li{counter-increment:listitem}.mw-parser-output .hlist ol>li::before{content:" "counter(listitem)"\a0 "}.mw-parser-output .hlist dd ol>li:first-child::before,.mw-parser-output .hlist dt ol>li:first-child::before,.mw-parser-output .hlist li ol>li:first-child::before{content:" ("counter(listitem)"\a0 "}
.mw-parser-output .navbar{display:inline;font-size:88%;font-weight:normal}.mw-parser-output .navbar-collapse{float:left;text-align:left}.mw-parser-output .navbar-boxtext{word-spacing:0}.mw-parser-output .navbar ul{display:inline-block;white-space:nowrap;line-height:inherit}.mw-parser-output .navbar-brackets::before{margin-right:-0.125em;content:"[ "}.mw-parser-output .navbar-brackets::after{margin-left:-0.125em;content:" ]"}.mw-parser-output .navbar li{word-spacing:-0.125em}.mw-parser-output .navbar a>span,.mw-parser-output .navbar a>abbr{text-decoration:inherit}.mw-parser-output .navbar-mini abbr{font-variant:small-caps;border-bottom:none;text-decoration:none;cursor:inherit}.mw-parser-output .navbar-ct-full{font-size:114%;margin:0 7em}.mw-parser-output .navbar-ct-mini{font-size:114%;margin:0 4em}html.skin-theme-clientpref-night .mw-parser-output .navbar li a abbr{color:var(--color-base)!important}@media(prefers-color-scheme:dark){html.skin-theme-clientpref-os .mw-parser-output .navbar li a abbr{color:var(--color-base)!important}}@media print{.mw-parser-output .navbar{display:none!important}}
v
t
e Diophantine approximation graph.svg
Best rational approximants for π (green circle), e (blue diamond), ϕ (pink oblong), (3)/2 (grey hexagon), 1/2 (red octagon) and 1/3 (orange triangle) calculated from their continued fraction expansions, plotted as slopes y/x with errors from their true values (black dashes)  

In number theory, the study of Diophantine approximation deals with the approximation of real numbers by rational numbers. It is named after Diophantus of Alexandria.

Contents

The first problem was to know how well a real number can be approximated by rational numbers. For this problem, a rational number p/q is a "good" approximation of a real number α if the absolute value of the difference between p/q and α may not decrease if p/q is replaced by another rational number with a smaller denominator. This problem was solved during the 18th century by means of continued fractions.

Knowing the "best" approximations of a given number, the main problem of the field is to find sharp upper and lower bounds of the above difference, expressed as a function of the denominator. It appears that these bounds depend on the nature of the real numbers to be approximated: the lower bound for the approximation of a rational number by another rational number is larger than the lower bound for algebraic numbers, which is itself larger than the lower bound for all real numbers. Thus a real number that may be better approximated than the bound for algebraic numbers is certainly a transcendental number.

This knowledge enabled Liouville, in 1844, to produce the first explicit transcendental number. Later, the proofs that π and e are transcendental were obtained by a similar method.

Diophantine approximations and transcendental number theory are very close areas that share many theorems and methods. Diophantine approximations also have important applications in the study of Diophantine equations.

The 2022 Fields Medal was awarded to James Maynard for his work on Diophantine approximation.

Best Diophantine approximations of a real number

Given a real number α, there are two ways to define a best Diophantine approximation of α. For the first definition, [1] the rational number p/q is a best Diophantine approximation of α if

for every rational number p'/q' different from p/q such that 0 < q  q.

For the second definition, [2] [3] the above inequality is replaced by

A best approximation for the second definition is also a best approximation for the first one, but the converse is not true in general. [4]

The theory of continued fractions allows us to compute the best approximations of a real number: for the second definition, they are the convergents of its expression as a regular continued fraction. [3] [4] [5] For the first definition, one has to consider also the semiconvergents. [1]

For example, the constant e = 2.718281828459045235... has the (regular) continued fraction representation

Its best approximations for the second definition are

while, for the first definition, they are

Measure of the accuracy of approximations

The obvious measure of the accuracy of a Diophantine approximation of a real number α by a rational number p/q is However, this quantity can always be made arbitrarily small by increasing the absolute values of p and q; thus the accuracy of the approximation is usually estimated by comparing this quantity to some function φ of the denominator q, typically a negative power of it.

For such a comparison, one may want upper bounds or lower bounds of the accuracy. A lower bound is typically described by a theorem like "for every element α of some subset of the real numbers and every rational number p/q, we have ". In some cases, "every rational number" may be replaced by "all rational numbers except a finite number of them", which amounts to multiplying φ by some constant depending on α.

For upper bounds, one has to take into account that not all the "best" Diophantine approximations provided by the convergents may have the desired accuracy. Therefore, the theorems take the form "for every element α of some subset of the real numbers, there are infinitely many rational numbers p/q such that ".

Badly approximable numbers

A badly approximable number is an x for which there is a positive constant c such that for all rational p/q we have

The badly approximable numbers are precisely those with bounded partial quotients. [6]

Equivalently, a number is badly approximable if and only if its Markov constant is finite and its simple continued fraction is bounded.

Lower bounds for Diophantine approximations

Approximation of a rational by other rationals

A rational number may be obviously and perfectly approximated by for every positive integer i.

If we have

because is a positive integer and is thus not lower than 1. Thus the accuracy of the approximation is bad relative to irrational numbers (see next sections).

It may be remarked that the preceding proof uses a variant of the pigeonhole principle: a non-negative integer that is not 0 is not smaller than 1. This apparently trivial remark is used in almost every proof of lower bounds for Diophantine approximations, even the most sophisticated ones.

In summary, a rational number is perfectly approximated by itself, but is badly approximated by any other rational number.

Approximation of algebraic numbers, Liouville's result

In the 1840s, Joseph Liouville obtained the first lower bound for the approximation of algebraic numbers: If x is an irrational algebraic number of degree n over the rational numbers, then there exists a constant c(x) > 0 such that

holds for all integers p and q where q > 0.

This result allowed him to produce the first proven example of a transcendental number, the Liouville constant

which does not satisfy Liouville's theorem, whichever degree n is chosen.

This link between Diophantine approximations and transcendental number theory continues to the present day. Many of the proof techniques are shared between the two areas.

Approximation of algebraic numbers, Thue–Siegel–Roth theorem

Over more than a century, there were many efforts to improve Liouville's theorem: every improvement of the bound enables us to prove that more numbers are transcendental. The main improvements are due to AxelThue  ( 1909 ), Siegel  ( 1921 ), FreemanDyson  ( 1947 ), and KlausRoth  ( 1955 ), leading finally to the Thue–Siegel–Roth theorem: If x is an irrational algebraic number and ε > 0, then there exists a positive real number c(x, ε) such that

holds for every integer p and q such that q > 0.

In some sense, this result is optimal, as the theorem would be false with ε = 0. This is an immediate consequence of the upper bounds described below.

Simultaneous approximations of algebraic numbers

Subsequently, Wolfgang M. Schmidt generalized this to the case of simultaneous approximations, proving that: If x1, ..., xn are algebraic numbers such that 1, x1, ..., xn are linearly independent over the rational numbers and ε is any given positive real number, then there are only finitely many rational n-tuples (p1/q, ..., pn/q) such that

Again, this result is optimal in the sense that one may not remove ε from the exponent.

Effective bounds

All preceding lower bounds are not effective, in the sense that the proofs do not provide any way to compute the constant implied in the statements. This means that one cannot use the results or their proofs to obtain bounds on the size of solutions of related Diophantine equations. However, these techniques and results can often be used to bound the number of solutions of such equations.

Nevertheless, a refinement of Baker's theorem by Feldman provides an effective bound: if x is an algebraic number of degree n over the rational numbers, then there exist effectively computable constants c(x) > 0 and 0 < d(x) < n such that

holds for all rational integers.

However, as for every effective version of Baker's theorem, the constants d and 1/c are so large that this effective result cannot be used in practice.

Upper bounds for Diophantine approximations

General upper bound

The first important result about upper bounds for Diophantine approximations is Dirichlet's approximation theorem, which implies that, for every irrational number α, there are infinitely many fractions such that

This implies immediately that one cannot suppress the ε in the statement of Thue-Siegel-Roth theorem.

Adolf Hurwitz (1891) [7] strengthened this result, proving that for every irrational number α, there are infinitely many fractions such that

Therefore, is an upper bound for the Diophantine approximations of any irrational number. The constant in this result may not be further improved without excluding some irrational numbers (see below).

Émile Borel (1903) [8] showed that, in fact, given any irrational number α, and given three consecutive convergents of α, at least one must satisfy the inequality given in Hurwitz's Theorem.

Equivalent real numbers

Definition: Two real numbers are called equivalent [9] [10] if there are integers with such that:

So equivalence is defined by an integer Möbius transformation on the real numbers, or by a member of the Modular group , the set of invertible 2 × 2 matrices over the integers. Each rational number is equivalent to 0; thus the rational numbers are an equivalence class for this relation.

The equivalence may be read on the regular continued fraction representation, as shown by the following theorem of Serret:

Theorem: Two irrational numbers x and y are equivalent if and only if there exist two positive integers h and k such that the regular continued fraction representations of x and y

satisfy

for every non negative integer i. [11]

Thus, except for a finite initial sequence, equivalent numbers have the same continued fraction representation.

Equivalent numbers are approximable to the same degree, in the sense that they have the same Markov constant.

Lagrange spectrum

As said above, the constant in Borel's theorem may not be improved, as shown by Adolf Hurwitz in 1891. [12] Let be the golden ratio. Then for any real constant c with there are only a finite number of rational numbers p/q such that

Hence an improvement can only be achieved, if the numbers which are equivalent to are excluded. More precisely: [13] [14] For every irrational number , which is not equivalent to , there are infinite many fractions such that

By successive exclusions — next one must exclude the numbers equivalent to — of more and more classes of equivalence, the lower bound can be further enlarged. The values which may be generated in this way are Lagrange numbers, which are part of the Lagrange spectrum. They converge to the number 3 and are related to the Markov numbers. [15] [16]

Khinchin's theorem on metric Diophantine approximation and extensions

Let be a positive real-valued function on positive integers (i.e., a positive sequence) such that is non-increasing. A real number x (not necessarily algebraic) is called -approximable if there exist infinitely many rational numbers p/q such that

Aleksandr Khinchin proved in 1926 that if the series diverges, then almost every real number (in the sense of Lebesgue measure) is -approximable, and if the series converges, then almost every real number is not -approximable. The circle of ideas surrounding this theorem and its relatives is known as metric Diophantine approximation or the metric theory of Diophantine approximation (not to be confused with height "metrics" in Diophantine geometry) or metric number theory.

Duffin & Schaeffer (1941) proved a generalization of Khinchin's result, and posed what is now known as the Duffin–Schaeffer conjecture on the analogue of Khinchin's dichotomy for general, not necessarily decreasing, sequences . Beresnevich & Velani (2006) proved that a Hausdorff measure analogue of the Duffin–Schaeffer conjecture is equivalent to the original Duffin–Schaeffer conjecture, which is a priori weaker. In July 2019, Dimitris Koukoulopoulos and James Maynard announced a proof of the conjecture. [17] [18]

Hausdorff dimension of exceptional sets

An important example of a function to which Khinchin's theorem can be applied is the function , where c > 1 is a real number. For this function, the relevant series converges and so Khinchin's theorem tells us that almost every point is not -approximable. Thus, the set of numbers which are -approximable forms a subset of the real line of Lebesgue measure zero. The Jarník-Besicovitch theorem, due to V. Jarník and A. S. Besicovitch, states that the Hausdorff dimension of this set is equal to . [19] In particular, the set of numbers which are -approximable for some (known as the set of very well approximable numbers) has Hausdorff dimension one, while the set of numbers which are -approximable for all (known as the set of Liouville numbers) has Hausdorff dimension zero.

Another important example is the function , where is a real number. For this function, the relevant series diverges and so Khinchin's theorem tells us that almost every number is -approximable. This is the same as saying that every such number is well approximable, where a number is called well approximable if it is not badly approximable. So an appropriate analogue of the Jarník-Besicovitch theorem should concern the Hausdorff dimension of the set of badly approximable numbers. And indeed, V. Jarník proved that the Hausdorff dimension of this set is equal to one. This result was improved by W. M. Schmidt, who showed that the set of badly approximable numbers is incompressible, meaning that if is a sequence of bi-Lipschitz maps, then the set of numbers x for which are all badly approximable has Hausdorff dimension one. Schmidt also generalized Jarník's theorem to higher dimensions, a significant achievement because Jarník's argument is essentially one-dimensional, depending on the apparatus of continued fractions.

Uniform distribution

Another topic that has seen a thorough development is the theory of uniform distribution mod 1. Take a sequence a1, a2, ... of real numbers and consider their fractional parts. That is, more abstractly, look at the sequence in , which is a circle. For any interval I on the circle we look at the proportion of the sequence's elements that lie in it, up to some integer N, and compare it to the proportion of the circumference occupied by I. Uniform distribution means that in the limit, as N grows, the proportion of hits on the interval tends to the 'expected' value. Hermann Weyl proved a basic result showing that this was equivalent to bounds for exponential sums formed from the sequence. This showed that Diophantine approximation results were closely related to the general problem of cancellation in exponential sums, which occurs throughout analytic number theory in the bounding of error terms.

Related to uniform distribution is the topic of irregularities of distribution, which is of a combinatorial nature.

Algorithms

Grotschel, Lovasz and Schrijver describe algorithms for finding approximately-best diophantine approximations, both for individual real numbers and for set of real numbers. The latter problem is called simultaneous diophantine approximation. [20] :Sec. 5.2

Unsolved problems

There are still simply stated unsolved problems remaining in Diophantine approximation, for example the Littlewood conjecture and the lonely runner conjecture . It is also unknown if there are algebraic numbers with unbounded coefficients in their continued fraction expansion.

Recent developments

In his plenary address at the International Mathematical Congress in Kyoto (1990), Grigory Margulis outlined a broad program rooted in ergodic theory that allows one to prove number-theoretic results using the dynamical and ergodic properties of actions of subgroups of semisimple Lie groups. The work of D. Kleinbock, G. Margulis and their collaborators demonstrated the power of this novel approach to classical problems in Diophantine approximation. Among its notable successes are the proof of the decades-old Oppenheim conjecture by Margulis, with later extensions by Dani and Margulis and Eskin–Margulis–Mozes, and the proof of Baker and Sprindzhuk conjectures in the Diophantine approximations on manifolds by Kleinbock and Margulis. Various generalizations of the above results of Aleksandr Khinchin in metric Diophantine approximation have also been obtained within this framework.

See also

Notes

  1. 1 2 Khinchin 1997 , p. 21
  2. Cassels 1957 , p. 2
  3. 1 2 Lang 1995 , p. 9
  4. 1 2 Khinchin 1997 , p. 24
  5. Cassels 1957 , pp. 5–8
  6. Bugeaud 2012 , p. 245
  7. Hurwitz 1891 , p. 279
  8. Perron 1913 , Chapter 2, Theorem 15
  9. Hurwitz 1891 , p. 284
  10. Hardy & Wright 1979 , Chapter 10.11
  11. See Perron 1929 , Chapter 2, Theorem 23, p. 63
  12. Hardy & Wright 1979 , p. 164
  13. Cassels 1957 , p. 11
  14. Hurwitz 1891
  15. Cassels 1957 , p. 18
  16. See Michel Waldschmidt: Introduction to Diophantine methods irrationality and transcendence Archived 2012-02-09 at the Wayback Machine , pp 24–26.
  17. Koukoulopoulos, D.; Maynard, J. (2019). "On the Duffin–Schaeffer conjecture". arXiv: 1907.04593 [math.NT].
  18. Sloman, Leila (2019). "New Proof Solves 80-Year-Old Irrational Number Problem". Scientific American .
  19. Bernik et al. 2013 , p. 24
  20. Grötschel, Martin; Lovász, László; Schrijver, Alexander (1993), Geometric algorithms and combinatorial optimization, Algorithms and Combinatorics, vol. 2 (2nd ed.), Springer-Verlag, Berlin, doi:10.1007/978-3-642-78240-4, ISBN   978-3-642-78242-8, MR   1261419

Related Research Articles

In mathematics, a transcendental number is a real or complex number that is not algebraic – that is, not the root of a non-zero polynomial with integer coefficients. The best-known transcendental numbers are π and e. The quality of a number being transcendental is called transcendence.

In mathematics, a continued fraction is an expression obtained through an iterative process of representing a number as the sum of its integer part and the reciprocal of another number, then writing this other number as the sum of its integer part and another reciprocal, and so on. In a finite continued fraction, the iteration/recursion is terminated after finitely many steps by using an integer in lieu of another continued fraction. In contrast, an infinite continued fraction is an infinite expression. In either case, all integers in the sequence, other than the first, must be positive. The integers are called the coefficients or terms of the continued fraction.

In number theory, a Liouville number is a real number with the property that, for every positive integer , there exists a pair of integers with such that

In mathematics, a quadratic irrational number is an irrational number that is the solution to some quadratic equation with rational coefficients which is irreducible over the rational numbers. Since fractions in the coefficients of a quadratic equation can be cleared by multiplying both sides by their least common denominator, a quadratic irrational is an irrational root of some quadratic equation with integer coefficients. The quadratic irrational numbers, a subset of the complex numbers, are algebraic numbers of degree 2, and can therefore be expressed as

<span class="mw-page-title-main">Square root of 2</span> Unique positive real number which when multiplied by itself gives 2

The square root of 2 is the positive real number that, when multiplied by itself or squared, equals the number 2. It may be written in mathematics as or . It is an algebraic number, and therefore not a transcendental number. Technically, it should be called the principal square root of 2, to distinguish it from the negative number with the same property.

One half is the irreducible fraction resulting from dividing one (1) by two (2), or the fraction resulting from dividing any number by its double.

In mathematics, Kronecker's theorem is a theorem about diophantine approximation, introduced by Leopold Kronecker.

In mathematics, Roth's theorem or Thue–Siegel–Roth theorem is a fundamental result in diophantine approximation to algebraic numbers. It is of a qualitative type, stating that algebraic numbers cannot have many rational number approximations that are 'very good'. Over half a century, the meaning of very good here was refined by a number of mathematicians, starting with Joseph Liouville in 1844 and continuing with work of Axel Thue, Carl Ludwig Siegel, Freeman Dyson, and Klaus Roth.

Transcendental number theory is a branch of number theory that investigates transcendental numbers, in both qualitative and quantitative ways.

In number theory, Dirichlet's theorem on Diophantine approximation, also called Dirichlet's approximation theorem, states that for any real numbers and , with , there exist integers and such that and

<span class="mw-page-title-main">Hofstadter's butterfly</span> Fractal describing the theorised behaviour of electrons in a magnetic field

In condensed matter physics, Hofstadter's butterfly is a graph of the spectral properties of non-interacting two-dimensional electrons in a perpendicular magnetic field in a lattice. The fractal, self-similar nature of the spectrum was discovered in the 1976 Ph.D. work of Douglas Hofstadter and is one of the early examples of modern scientific data visualization. The name reflects the fact that, as Hofstadter wrote, "the large gaps [in the graph] form a very striking pattern somewhat resembling a butterfly."

In mathematics, an irrationality measure of a real number is a measure of how "closely" it can be approximated by rationals. If a function , defined for positive real numbers, strictly decreasing in both and is given, consider the following inequality:

The square root of 5 is the positive real number that, when multiplied by itself, gives the prime number 5. It is more precisely called the principal square root of 5, to distinguish it from the negative number with the same property. This number appears in the fractional expression for the golden ratio. It can be denoted in surd form as:

In mathematics, the Lagrange numbers are a sequence of numbers that appear in bounds relating to the approximation of irrational numbers by rational numbers. They are linked to Hurwitz's theorem.

In mathematics, specifically the area of Diophantine approximation, the Davenport–Schmidt theorem tells us how well a certain kind of real number can be approximated by another kind. Specifically it tells us that we can get a good approximation to irrational numbers that are not quadratic by using either quadratic irrationals or simply rational numbers. It is named after Harold Davenport and Wolfgang M. Schmidt.

<span class="mw-page-title-main">Rational number</span> Quotient of two integers

In mathematics, a rational number is a number that can be expressed as the quotient or fraction of two integers, a numerator p and a non-zero denominator q. For example, is a rational number, as is every integer. The set of all rational numbers, also referred to as "the rationals", the field of rationals or the field of rational numbers is usually denoted by boldface Q, or blackboard bold

The Koukoulopoulos–Maynard theorem, also known as the Duffin-Schaeffer conjecture, is a theorem in mathematics, specifically, the Diophantine approximation proposed as a conjecture by R. J. Duffin and A. C. Schaeffer in 1941 and proven in 2019 by Dimitris Koukoulopoulos and James Maynard. It states that if is a real-valued function taking on positive values, then for almost all , the inequality

In mathematics, a Brjuno number is a special type of irrational number named for Russian mathematician Alexander Bruno, who introduced them in Brjuno (1971).

A mathematical constant is a number whose value is fixed by an unambiguous definition, often referred to by a special symbol, or by mathematicians' names to facilitate using it across multiple mathematical problems. Constants arise in many areas of mathematics, with constants such as e and π occurring in such diverse contexts as geometry, number theory, statistics, and calculus.

In number theory, specifically in Diophantine approximation theory, the Markov constant of an irrational number is the factor for which Dirichlet's approximation theorem can be improved for .

References