Sieve theory

Last updated May 10, 2024

Sieve theory is a set of general techniques in number theory, designed to count, or more realistically to estimate the size of, sifted sets of integers. The prototypical example of a sifted set is the set of prime numbers up to some prescribed limit X. Correspondingly, the prototypical example of a sieve is the sieve of Eratosthenes, or the more general Legendre sieve. The direct attack on prime numbers using these methods soon reaches apparently insuperable obstacles, in the way of the accumulation of error terms.^{[ citation needed ]} In one of the major strands of number theory in the twentieth century, ways were found of avoiding some of the difficulties of a frontal attack with a naive idea of what sieving should be.^{[ citation needed ]}

One successful approach is to approximate a specific sifted set of numbers (e.g. the set of prime numbers) by another, simpler set (e.g. the set of almost prime numbers), which is typically somewhat larger than the original set, and easier to analyze. More sophisticated sieves also do not work directly with sets per se, but instead count them according to carefully chosen weight functions on these sets (options for giving some elements of these sets more "weight" than others). Furthermore, in some modern applications, sieves are used not to estimate the size of a sifted set, but to produce a function that is large on the set and mostly small outside it, while being easier to analyze than the characteristic function of the set.

The term sieve was first used by the norwegian mathematician Viggo Brun in 1915.^[1] However Brun's work was inspired by the works of the french mathematician Jean Merlin who died in the World War I and only two of his manuscripts survived.^[2]

Basic sieve theory

For information on notation see at the end.

We start with some countable sequence of non-negative numbers ${\mathcal {A}}=(a_{n})$ . In the most basic case this sequence is just the indicator function $a_{n}=1_{A}(n)$ of some set $A=\{s:s\leq x\}$ we want to sieve. However this abstraction allows for more general situations. Next we introduce a general set of prime numbers called the sifting range ${\mathcal {P}}\subseteq \mathbb {P}$ and their product up to $z$ as a function $P(z)=\prod \limits _{p\in {\mathcal {P}},p<z}p$ .

The goal of sieve theory is to estimate the sifting function

S({\mathcal {A}},{\mathcal {P}},z)=\sum \limits _{n\leq x,{\text{gcd}}(n,P(z))=1}a_{n}.

In the case of $a_{n}=1_{A}(n)$ this just counts the cardinality of a subset $A_{\operatorname {sift} }\subseteq A$ of numbers, that are coprime to the prime factors of $P(z)$ .

The inclusion–exclusion principle

For ${\mathcal {P}}$ define

A_{\operatorname {sift} }:=\{a\in A|(a,p_{1}\cdots p_{k})=1\},\quad p_{1},\dots ,p_{k}\in {\mathcal {P}}

and for each prime $p\in {\mathcal {P}}$ denote the set $E_{p}=\{pn:n\in \mathbb {N} \}$ and let $|E_{p}|$ be the cardinality. Let now ${\mathcal {P}}:=\{2,3,5,7,11,13\dots \}$ be some set of primes.

If one wants to calculate the cardinality of $A_{\operatorname {sift} }$ , one can apply the inclusion–exclusion principle. This algorithm works like this: first one removes from the cardinality of $|A|$ the cardinality $|E_{2}|$ and $|E_{3}|$ . Now since one has removed the numbers that are divisble by $2$ and $3$ twice, one has to add the cardinality $|E_{6}|$ . In the next step one removes $|E_{5}|$ and adds $|E_{10}|$ and $|E_{15}|$ again. Additionally one has now to remove $|E_{30}|$ , i.e. the cardinality of all numbers divisible by $2,3$ and $5$ . This leads to the inclusion–exclusion principle

|A_{\operatorname {sift} }|=|A|-|E_{2}|-|E_{3}|+|E_{6}|-|E_{5}|+|E_{10}|+|E_{15}|-|E_{30}|+\cdots

Legendre's identity

We can rewrite the sifting function with Legendre's identity

S({\mathcal {A}},{\mathcal {P}},z)=\sum \limits _{d\mid P(z)}\mu (d)A_{d}(x)

by using the Möbius function and some functions $A_{d}(x)$ induced by the elements of ${\mathcal {P}}$

A_{d}(x)=\sum \limits _{n\leq x,n\equiv 0{\pmod {d}}}a_{n}.

Example

Let $z=7$ and ${\mathcal {P}}=\mathbb {P}$ . The Möbius function is negative for every prime, so we get

{\begin{aligned}S({\mathcal {A}},\mathbb {P} ,7)&=A_{1}(x)-A_{2}(x)-A_{3}(x)-A_{5}(x)+A_{6}(x)+A_{10}(x)+A_{15}(x)-A_{30}(x).\end{aligned}}

Approximation of the congruence sum

One assumes then that $A_{d}(x)$ can be written as

A_{d}(x)=g(d)X+r_{d}(x)

where $g(d)$ is a density, meaning a multiplicative function such that

g(1)=1,\qquad 0\leq g(p)<1\qquad p\in \mathbb {P}

and $X$ is an approximation of $A_{1}(x)$ and $r_{d}(x)$ is some remainder term. The sifting function becomes

S({\mathcal {A}},{\mathcal {P}},z)=X\sum \limits _{d\mid P(z)}\mu (d)g(d)+\sum \limits _{d\mid P(z)}\mu (d)r_{d}(x)

or in short

S({\mathcal {A}},{\mathcal {P}},z)=XG(x,z)+R(x,z).

One tries then to estimate the sifting function by finding upper and lower bounds for $S$ respectively $G$ and $R$ .

The partial sum of the sifting function alternately over- and undercounts, so the remainder term will be huge. Brun's idea to improve this was to replace $\mu (d)$ in the sifting function with a weight sequence $(\lambda _{d})$ consisting of restricted Möbius functions. Choosing two appropriate sequences $(\lambda _{d}^{-})$ and $(\lambda _{d}^{+})$ and denoting the sifting functions with $S^{-}$ and $S^{+}$ , one can get lower and upper bounds for the original sifting functions

S^{-}\leq S\leq S^{+}.

^[3]

Since $g$ is multiplicative, one can also work with the identity

\sum \limits _{d\mid n}\mu (d)g(d)=\prod \limits _{\begin{array}{c}p|n;\;p\in \mathbb {P} \end{array}}(1-g(p)),\quad \forall \;n\in \mathbb {N} .

Notation: a word of caution regarding the notation, in the literature one often identifies the set of sequences ${\mathcal {A}}$ with the set $A$ itself. This means one writes ${\mathcal {A}}=\{s:s\leq x\}$ to define a sequence ${\mathcal {A}}=(a_{n})$ . Also in the literature the sum $A_{d}(x)$ is sometimes notated as the cardinality $|A_{d}(x)|$ of some set $A_{d}(x)$ , while we have defined $A_{d}(x)$ to be already the cardinality of this set. We used $\mathbb {P}$ to denote the set of primes and $(a,b)$ for the greatest common divisor of $a$ and $b$ .

Types of sieving

Modern sieves include the Brun sieve, the Selberg sieve, the Turán sieve, the large sieve, the larger sieve and the Goldston-Pintz-Yıldırım sieve. One of the original purposes of sieve theory was to try to prove conjectures in number theory such as the twin prime conjecture. While the original broad aims of sieve theory still are largely unachieved, there have been some partial successes, especially in combination with other number theoretic tools. Highlights include:

Brun's theorem , which shows that the sum of the reciprocals of the twin primes converges (whereas the sum of the reciprocals of all primes diverges);
Chen's theorem , which shows that there are infinitely many primes p such that p + 2 is either a prime or a semiprime (the product of two primes); a closely related theorem of Chen Jingrun asserts that every sufficiently large even number is the sum of a prime and another number which is either a prime or a semiprime. These can be considered to be near-misses to the twin prime conjecture and the Goldbach conjecture respectively.
The fundamental lemma of sieve theory , which asserts that if one is sifting a set of N numbers, then one can accurately estimate the number of elements left in the sieve after $N^{\varepsilon }$ iterations provided that $\varepsilon$ is sufficiently small (fractions such as 1/10 are quite typical here). This lemma is usually too weak to sieve out primes (which generally require something like $N^{1/2}$ iterations), but can be enough to obtain results regarding almost primes.
The Friedlander–Iwaniec theorem , which asserts that there are infinitely many primes of the form $a^{2}+b^{4}$ .
Zhang's theorem ( Zhang 2014 ), which shows that there are infinitely many pairs of primes within a bounded distance. The Maynard–Tao theorem ( Maynard 2015 ) generalizes Zhang's theorem to arbitrarily long sequences of primes.

Techniques of sieve theory

The techniques of sieve theory can be quite powerful, but they seem to be limited by an obstacle known as the parity problem , which roughly speaking asserts that sieve theory methods have extreme difficulty distinguishing between numbers with an odd number of prime factors and numbers with an even number of prime factors. This parity problem is still not very well understood.

Compared with other methods in number theory, sieve theory is comparatively elementary, in the sense that it does not necessarily require sophisticated concepts from either algebraic number theory or analytic number theory. Nevertheless, the more advanced sieves can still get very intricate and delicate (especially when combined with other deep techniques in number theory), and entire textbooks have been devoted to this single subfield of number theory; a classic reference is ( Halberstam & Richert 1974 ) and a more modern text is ( Iwaniec & Friedlander 2010 ).

The sieve methods discussed in this article are not closely related to the integer factorization sieve methods such as the quadratic sieve and the general number field sieve. Those factorization methods use the idea of the sieve of Eratosthenes to determine efficiently which members of a list of numbers can be completely factored into small primes.

Literature

Cojocaru, Alina Carmen; Murty, M. Ram (2006), An introduction to sieve methods and their applications, London Mathematical Society Student Texts, vol. 66, Cambridge University Press, ISBN 0-521-84816-4, MR 2200366
Motohashi, Yoichi (1983), Lectures on Sieve Methods and Prime Number Theory, Tata Institute of Fundamental Research Lectures on Mathematics and Physics, vol. 72, Berlin: Springer-Verlag, ISBN 3-540-12281-8, MR 0735437
Greaves, George (2001), Sieves in number theory, Ergebnisse der Mathematik und ihrer Grenzgebiete (3), vol. 43, Berlin: Springer-Verlag, doi:10.1007/978-3-662-04658-6, ISBN 3-540-41647-1, MR 1836967
Harman, Glyn (2007). Prime-detecting sieves. London Mathematical Society Monographs. Vol. 33. Princeton, NJ: Princeton University Press. ISBN 978-0-691-12437-7. MR 2331072. Zbl 1220.11118.
Halberstam, Heini; Richert, Hans-Egon (1974). Sieve Methods. London Mathematical Society Monographs. Vol. 4. London-New York: Academic Press. ISBN 0-12-318250-6. MR 0424730.
Iwaniec, Henryk; Friedlander, John (2010), Opera de cribro, American Mathematical Society Colloquium Publications, vol. 57, Providence, RI: American Mathematical Society, ISBN 978-0-8218-4970-5, MR 2647984
Hooley, Christopher (1976), Applications of sieve methods to the theory of numbers, Cambridge Tracts in Mathematics, vol. 70, Cambridge-New York-Melbourne: Cambridge University Press, ISBN 0-521-20915-3, MR 0404173
Maynard, James (2015). "Small gaps between primes". Annals of Mathematics . 181 (1): 383–413. arXiv: 1311.4600 . doi:10.4007/annals.2015.181.1.7. MR 3272929.
Tenenbaum, Gérald (1995), Introduction to Analytic and Probabilistic Number Theory, Cambridge studies in advanced mathematics, vol. 46, Translated from the second French edition (1995) by C. B. Thomas, Cambridge University Press, pp. 56–79, ISBN 0-521-41261-7, MR 1342300
Zhang, Yitang (2014). "Bounded gaps between primes". Annals of Mathematics . 179 (3): 1121–1174. doi: 10.4007/annals.2014.179.3.7 . MR 3171761.

External links

Bredikhin, B.M. (2001) [1994], "Sieve method", Encyclopedia of Mathematics , EMS Press

Related Research Articles

In mathematics, an abelian group, also called a commutative group, is a group in which the result of applying the group operation to two group elements does not depend on the order in which they are written. That is, the group operation is commutative. With addition as an operation, the integers and the real numbers form abelian groups, and the concept of an abelian group may be viewed as a generalization of these examples. Abelian groups are named after early 19th century mathematician Niels Henrik Abel.

In mathematics, especially order theory, a partial order on a set is an arrangement such that, for certain pairs of elements, one precedes the other. The word partial is used to indicate that not every pair of elements needs to be comparable; that is, there may be pairs for which neither element precedes the other. Partial orders thus generalize total orders, in which every pair is comparable.

A random variable is a mathematical formalization of a quantity or object which depends on random events. The term 'random variable' in its mathematical definition refers to neither randomness nor variability but instead is a mathematical function in which

In mathematics, the $L p$ spaces are function spaces defined using a natural generalization of the $p$ -norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue, although according to the Bourbaki group they were first introduced by Frigyes Riesz.

In complex analysis, a branch of mathematics, analytic continuation is a technique to extend the domain of definition of a given analytic function. Analytic continuation often succeeds in defining further values of a function, for example in a new region where the infinite series representation which initially defined the function becomes divergent.

Vapnik–Chervonenkis theory was developed during 1960–1990 by Vladimir Vapnik and Alexey Chervonenkis. The theory is a form of computational learning theory, which attempts to explain the learning process from a statistical point of view.

In abstract algebra, a semiring is an algebraic structure. It is a generalization of a ring, dropping the requirement that each element must have an additive inverse. At the same time, it is a generalization of bounded distributive lattices.

In mathematics, differential algebra is, broadly speaking, the area of mathematics consisting in the study of differential equations and differential operators as algebraic objects in view of deriving properties of differential equations and operators without computing the solutions, similarly as polynomial algebras are used for the study of algebraic varieties, which are solution sets of systems of polynomial equations. Weyl algebras and Lie algebras may be considered as belonging to differential algebra.

In mathematics, the theory of optimal stopping or early stopping is concerned with the problem of choosing a time to take a particular action, in order to maximise an expected reward or minimise an expected cost. Optimal stopping problems can be found in areas of statistics, economics, and mathematical finance. A key example of an optimal stopping problem is the secretary problem. Optimal stopping problems can often be written in the form of a Bellman equation, and are therefore often solved using dynamic programming.

In mathematics, a cardinal function is a function that returns cardinal numbers.

In the field of number theory, the Brun sieve is a technique for estimating the size of "sifted sets" of positive integers which satisfy a set of conditions which are expressed by congruences. It was developed by Viggo Brun in 1915 and later generalized to the fundamental lemma of sieve theory by others.

<span class="mw-page-title-main">Selberg sieve</span>

In number theory, the Selberg sieve is a technique for estimating the size of "sifted sets" of positive integers which satisfy a set of conditions which are expressed by congruences. It was developed by Atle Selberg in the 1940s.

In number theory, the Turán sieve is a technique for estimating the size of "sifted sets" of positive integers which satisfy a set of conditions which are expressed by congruences. It was developed by Pál Turán in 1934.

In number theory, the parity problem refers to a limitation in sieve theory that prevents sieves from giving good estimates in many kinds of prime-counting problems. The problem was identified and named by Atle Selberg in 1949. Beginning around 1996, John Friedlander and Henryk Iwaniec developed some parity-sensitive sieves that make the parity problem less of an obstacle.

In number theory, the fundamental lemma of sieve theory is any of several results that systematize the process of applying sieve methods to particular problems. Halberstam & Richert write:

A curious feature of sieve literature is that while there is frequent use of Brun's method there are only a few attempts to formulate a general Brun theorem ; as a result there are surprisingly many papers which repeat in considerable detail the steps of Brun's argument.

The Jurkat–Richert theorem is a mathematical theorem in sieve theory. It is a key ingredient in proofs of Chen's theorem on Goldbach's conjecture. It was proved in 1965 by Wolfgang B. Jurkat and Hans-Egon Richert.

In mathematics the Function Field Sieve is one of the most efficient algorithms to solve the Discrete Logarithm Problem (DLP) in a finite field. It has heuristic subexponential complexity. Leonard Adleman developed it in 1994 and then elaborated it together with M. D. Huang in 1999. Previous work includes the work of D. Coppersmith about the DLP in fields of characteristic two.

In mathematics, especially measure theory, a set function is a function whose domain is a family of subsets of some given set and that (usually) takes its values in the extended real number line $which consists of the real numbers and$

In mathematics, an algebraic number field is an extension field $of the field of rational numbers such that the field extension has finite degree . Thus is a field that contains and has finite dimension when considered as a vector space over .$

The Goldston-Pintz-Yıldırım sieve is a sieve method and variant of the Selberg sieve with generalized, multidimensional sieve weights. The sieve led to a series of important breakthroughs in analytic number theory.

References

↑ Brun, Viggo (1915). "Über das Goldbachsche Gesetz und die Anzahl der Primzahlpaare". Archiv for Math. Naturvidenskab. 34.
↑ Cojocaru, Alina Carmen; Murty, M. Ram (2005). An Introduction to Sieve Methods and Their Applications. Cambridge University Press. doi:10.1017/CBO9780511615993.
↑ ( Iwaniec & Friedlander 2010 )

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Brun, Viggo (1915). "Über das Goldbachsche Gesetz und die Anzahl der Primzahlpaare". Archiv for Math. Naturvidenskab. 34.

[2] Cojocaru, Alina Carmen; Murty, M. Ram (2005). An Introduction to Sieve Methods and Their Applications. Cambridge University Press. doi:10.1017/CBO9780511615993.

[3] ( Iwaniec & Friedlander 2010 )

[1]

[2]

[3]