Function field sieve

Last updated April 08, 2024

In mathematics the Function Field Sieve is one of the most efficient algorithms to solve the Discrete Logarithm Problem (DLP) in a finite field. It has heuristic subexponential complexity. Leonard Adleman developed it in 1994 ^[1] and then elaborated it together with M. D. Huang in 1999.^[2] Previous work includes the work of D. Coppersmith ^[3] about the DLP in fields of characteristic two.

The discrete logarithm problem in a finite field consists of solving the equation $a^{x}=b$ for $a,b\in \mathbb {F} _{p^{n}}$ , $p$ a prime number and $n$ an integer. The function $f:\mathbb {F} _{p^{n}}\to \mathbb {F} _{p^{n}},a\mapsto a^{x}$ for a fixed $x\in \mathbb {N}$ is a one-way function used in cryptography. Several cryptographic methods are based on the DLP such as the Diffie-Hellman key exchange, the El Gamal cryptosystem and the Digital Signature Algorithm.

Number theoretical background

Function Fields

Let $C(x,y)$ be a polynomial defining an algebraic curve over a finite field $\mathbb {F} _{p}$ . A function field may be viewed as the field of fractions of the affine coordinate ring $\mathbb {F} _{p}[x,y]/(C(x,y))$ , where $(C(x,y))$ denotes the ideal generated by $C(x,y)$ . This is a special case of an algebraic function field. It is defined over the finite field $\mathbb {F} _{p}$ and has transcendence degree one. The transcendent element will be denoted by $x$ .

There exist bijections between valuation rings in function fields and equivalence classes of places, as well as between valuation rings and equivalence classes of valuations.^[4] This correspondence is frequently used in the Function Field Sieve algorithm.

Divisors

A discrete valuation of the function field $K/\mathbb {F} _{p}$ , namely a discrete valuation ring $\mathbb {F} _{p}\subset O\subset K$ , has a unique maximal ideal $P$ called a prime of the function field. The degree of $P$ is $deg(P)=[O/P:\mathbb {F} _{p}]$ and we also define $f_{O}=[O/P:\mathbb {F} _{p}]$ .

A divisor is a $\mathbb {Z}$ -linear combination over all primes, so ${\textstyle d=\sum \alpha _{P}P}$ where $\alpha _{P}\in \mathbb {Z}$ and only finitely many elements of the sum are non-zero. The divisor of an element $x\in K$ is defined as ${\textstyle {\text{div}}(x)=\sum v_{P}(x)P}$ , where $v_{P}$ is the valuation corresponding to the prime $P$ . The degree of a divisor is ${\textstyle \deg(d)=\sum \alpha _{P}\deg(P)}$ .

Method

The Function Field Sieve algorithm consists of a precomputation where the discrete logarithms of irreducible polynomials of small degree are found and a reduction step where they are combined to the logarithm of $b$ .

Functions that decompose into irreducible function of degree smaller than some bound $B$ are called $B$ -smooth. This is analogous to the definition of a smooth number and such functions are useful because their decomposition can be found relatively fast. The set of those functions $S=\{g(x)\in \mathbb {F} _{p}[x]\mid {\text{ irreductible with }}\deg(g)<B\}$ is called the factor base. A pair of functions $(r,s)$ is doubly-smooth if $rm+s$ and $N(ry+s)$ are both smooth, where $N(\cdot ,\cdot )$ is the norm of an element of $K$ over $\mathbb {F} _{p}$ , $m\in \mathbb {F} _{p}[x]$ is some parameter and $ry+s$ is viewed as an element of the function field of $C$ .

The sieving step of the algorithm consists of finding doubly-smooth pairs of functions. In the subsequent step we use them to find linear relations including the logarithms of the functions in the decompositions. By solving a linear system we then calculate the logarithms. In the reduction step we express $\log _{a}(b)$ as a combination of the logarithm we found before and thus solve the DLP.

Precomputation

Parameter selection

The algorithm requires the following parameters: an irreducible function $f$ of degree $n$ , a function $m\in \mathbb {F} _{p}[x]$ and a curve $C(x,y)$ of given degree $d$ such that $C(x,m)\equiv 0{\text{ mod }}f$ . Here $n$ is the power in the order of the base field $\mathbb {F} _{p^{n}}$ . Let $K$ denote the function field defined by $C$ .

This leads to an isomorphism $\mathbb {F} _{p^{n}}\simeq \mathbb {F} _{p}[x]/f$ and a homomorphism

{\displaystyle \phi

Using the isomorphism each element of $\mathbb {F} _{p^{n}}$ can be considered as a polynomial in $\mathbb {F} _{p}[x]/f$ .

One also needs to set a smoothness bound $B$ for the factor base $S$ .

Sieving

In this step doubly-smooth pairs of functions $(r,s)\in \mathbb {F} _{p}[x]\times \mathbb {F} _{p}[x]$ are found.

One considers functions of the form $f=(rm+s)N(ry+s)$ , then divides $f$ by any $g\in S$ as many times as possible. Any $f$ that is reduced to one in this process is $B$ -smooth. To implement this, Gray code can be used to efficiently step through multiples of a given polynomial.

This is completely analogous to the sieving step in other sieving algorithms such as the Number Field Sieve or the index calculus algorithm. Instead of numbers one sieves through functions in $\mathbb {F} _{p}[x]$ but those functions can be factored into irreducible polynomials just as numbers can be factored into primes.

Finding linear relations

This is the most difficult part of the algorithm, involving function fields, places and divisors as defined above. The goal is to use the doubly-smooth pairs of functions to find linear relations involving the discrete logarithms of elements in the factor base.

For each irreducible function in the factor base we find places $v_{1},v_{2},...$ of $K$ that lie over them and surrogate functions $\alpha _{1},\alpha _{2},...$ that correspond to the places. A surrogate function $\alpha _{i}\in K$ corresponding to a place $v_{i}$ satisfies ${\text{div}}(\alpha _{i})=h(v_{i}-f_{v_{i}}u)$ where $h$ is the class number of $K$ and $u$ is any fixed discrete valuation with $f_{u}=1$ . The function defined this way is unique up to a constant in $\mathbb {F} _{p}$ .

By the definition of a divisor ${\textstyle {\text{div}}(ry+s)=\sum a_{i}v_{i}}$ for $a_{i}=v_{i}(ry+s)$ . Using this and the fact that ${\textstyle \sum a_{i}f_{v_{i}}=\deg({\text{div}}(ry+s))=0}$ we get the following expression:

{\text{div}}((ry+s)^{h})=\sum ha_{i}v_{i}=\sum ha_{i}v_{i}-\sum ha_{i}f_{v_{i}}v+hv\sum a_{i}f_{v_{i}}=\sum a_{i}h(v_{i}-f_{v_{i}}v))={\text{div}}(\prod \alpha _{i}^{a_{i}})

where $v$ is any valuation with $f_{v}=1$ . Then, using the fact that the divisor of a surrogate function is unique up to a constant, one gets

(ry+s)^{h}=c\prod \alpha _{i}^{a_{i}}{\text{ for some }}c\in F_{p}^{*}

\implies \phi ((ry+s)^{h})=\phi (c)\prod \phi (\alpha _{i})^{a_{i}}

We now use the fact that $\phi (ry+s)=rm+s$ and the known decomposition of this expression into irreducible polynomials. Let $e_{g}$ be the power of $g\in S$ in this decomposition. Then

\prod _{g\in S}g^{he_{g}}\equiv \phi (c)\prod \phi (\alpha _{i})^{a_{i}}{\text{ mod }}f

Here we can take the discrete logarithm of the equation up to a unit. This is called the restricted discrete logarithm $\log _{*}(x)$ . It is defined by the equation $a^{\log _{*}(x)}=ux$ for some unit $u\in \mathbb {F} _{p}$ .

\sum _{g\in S}e_{g}\log _{*}g\equiv \sum a_{i}h_{1}\log _{*}(\phi (\alpha _{i})){\text{ mod }}(p^{n}-1)/(p-1),

where $h_{1}$ is the inverse of $h$ modulo $(p^{n}-1)/(p-1)$ .

The expressions $h_{1}\log _{*}(\phi (\alpha _{i}))$ and the logarithms $\log _{*}(g)$ are unknown. Once enough equations of this form are found, a linear system can be solved to find $\log _{*}(g)$ for all $g\in S$ . Taking the whole expression $h_{1}log_{*}(\phi (\alpha _{i}))$ as an unknown helps to gain time, since $h$ , $h_{1}$ , $\alpha _{i}$ or $\phi (\alpha _{i})$ don't have to be computed. Eventually for each $g\in S$ the unit corresponding to the restricted discrete logarithm can be calculated which then gives $\log _{a}(g)=\log _{*}(g)-\log _{a}(u)$ .

Reduction step

First $a^{l}b$ mod $f$ are computed for a random $l<n$ . With sufficiently high probability this is ${\sqrt {nB}}$ -smooth, so one can factor it as $a^{l}b=\prod b_{i}$ for $b_{i}\in \mathbb {F} _{p}[x]$ with $\deg(b_{i})<{\sqrt {nB}}$ . Each of these polynomials $b_{i}$ can be reduced to polynomials of smaller degree using a generalization of the Coppersmith method.^[2] We can reduce the degree until we get a product of $B$ -smooth polynomials. Then, taking the logarithm to the base $a$ , we can eventually compute

\log _{a}(b)=\sum _{g_{i}\in S}\log _{a}(g_{i})-l

, which solves the DLP.

Complexity

The Function Field Sieve is thought to run in subexponential time in

\exp \left(\left({\sqrt[{3}]{\frac {32}{9}}}+o(1)\right)(\ln p)^{\frac {1}{3}}(\ln \ln p)^{\frac {2}{3}}\right)=L_{p}\left[{\frac {1}{3}},{\sqrt[{3}]{\frac {32}{9}}}\right]

using the L-notation. There is no rigorous proof of this complexity since it relies on some heuristic assumptions. For example in the sieving step we assume that numbers of the form $(rm+s)N(ry+s)$ behave like random numbers in a given range.

Comparison with other methods

There are two other well known algorithms that solve the discrete logarithm problem in sub-exponential time: the index calculus algorithm and a version of the Number Field Sieve.^[5] In their easiest forms both solve the DLP in a finite field of prime order but they can be expanded to solve the DLP in $\mathbb {F} _{p^{n}}$ as well.

The Number Field Sieve for the DLP in $\mathbb {F} _{p^{n}}$ has a complexity of $L_{p}[1/3,(64/9)^{1/3}+o(1)]$ ^[6] and is therefore slightly slower than the best performance of the Function Field Sieve. However, it is faster than the Function Field Sieve when $n<<(\log(p))^{1/2}$ . It is not surprising that there exist two similar algorithms, one with number fields and the other one with function fields. In fact there is an extensive analogy between these two kinds of global fields.

The index calculus algorithm is much easier to state than the Function Field Sieve and the Number Field Sieve since it does not involve any advanced algebraic structures. It is asymptotically slower with a complexity of $L_{p}[1/2,{\sqrt {2}}]$ . The main reason why the Number Field Sieve and the Function Field Sieve are faster is that these algorithms can run with a smaller smoothness bound $B$ , so most of the computations can be done with smaller numbers.

Related Research Articles

Elliptic-curve cryptography (ECC) is an approach to public-key cryptography based on the algebraic structure of elliptic curves over finite fields. ECC allows smaller keys compared to non-EC cryptography to provide equivalent security.

<span class="mw-page-title-main">Elliptic curve</span> Algebraic curve

In mathematics, an elliptic curve is a smooth, projective, algebraic curve of genus one, on which there is a specified point $O$ . An elliptic curve is defined over a field $K$ and describes points in $K 2$ , the Cartesian product of $K$ with itself. If the field's characteristic is different from 2 and 3, then the curve can be described as a plane algebraic curve which consists of solutions $(x, y)$ for:

In mathematics, a finite field or Galois field is a field that contains a finite number of elements. As with any field, a finite field is a set on which the operations of multiplication, addition, subtraction and division are defined and satisfy certain basic rules. The most common examples of finite fields are given by the integers mod $p$ when $p$ is a prime number.

In mathematics, the logarithm is the inverse function to exponentiation. That means that the logarithm of a number $x$ to the base $b$ is the exponent to which $b$ must be raised to produce $x$ . For example, since $1000 = 10 3$ , the logarithm base 10 of $1000$ is $3$ , or $log 10 (1000) = 3$ . The logarithm of $x$ to base $b$ is denoted as $log b (x)$ , or without parentheses, $log b x$ , or even without the explicit base, $log x$ , when no confusion is possible, or when the base does not matter such as in big O notation.

Shor's algorithm is a quantum algorithm for finding the prime factors of an integer. It was developed in 1994 by the American mathematician Peter Shor. It is one of the few known quantum algorithms with compelling potential applications and strong evidence of superpolynomial speedup compared to best known classical algorithms. On the other hand, factoring numbers of practical significance requires far more qubits than available in the near future. Another concern is that noise in quantum circuits may undermine results, requiring additional qubits for quantum error correction.

In mathematics, for given real numbers a and b, the logarithm log_b a is a number x such that b^x = a. Analogously, in any group G, powers b^k can be defined for all integers k, and the discrete logarithm log_b a is an integer k such that b^k = a. In number theory, the more commonly used term is index: we can write x = ind_ra (mod m) (read "the index of a to the base r modulo m") for r^x ≡ a (mod m) if r is a primitive root of m and gcd(a,m) = 1.

In mathematics, specifically the algebraic theory of fields, a normal basis is a special kind of basis for Galois extensions of finite degree, characterised as forming a single orbit for the Galois group. The normal basis theorem states that any finite Galois extension of fields has a normal basis. In algebraic number theory, the study of the more refined question of the existence of a normal integral basis is part of Galois module theory.

In number theory, an n-smooth (or n-friable) number is an integer whose prime factors are all less than or equal to n. For example, a 7-smooth number is a number whose every prime factor is at most 7, so 49 = 7² and 15750 = 2 × 3² × 5³ × 7 are both 7-smooth, while 11 and 702 = 2 × 3³ × 13 are not 7-smooth. The term seems to have been coined by Leonard Adleman. Smooth numbers are especially important in cryptography, which relies on factorization of integers. The 2-smooth numbers are just the powers of 2, while 5-smooth numbers are known as regular numbers.

In commutative algebra and field theory, the Frobenius endomorphism is a special endomorphism of commutative rings with prime characteristic $p$ , an important class that includes finite fields. The endomorphism maps every element to its $p$ -th power. In certain contexts it is an automorphism, but this is not true in general.

In computational number theory, the index calculus algorithm is a probabilistic algorithm for computing discrete logarithms. Dedicated to the discrete logarithm in $where is a prime, index calculus leads to a family of algorithms adapted to finite fields and to some families of elliptic curves. The algorithm collects relations among the discrete logarithms of small primes, computes them by a linear algebra procedure and finally expresses the desired discrete logarithm with respect to the discrete logarithms of small primes.$

Pollard's rho algorithm for logarithms is an algorithm introduced by John Pollard in 1978 to solve the discrete logarithm problem, analogous to Pollard's rho algorithm to solve the integer factorization problem.

In mathematics, the resultant of two polynomials is a polynomial expression of their coefficients that is equal to zero if and only if the polynomials have a common root, or, equivalently, a common factor. In some older texts, the resultant is also called the eliminant.

In mathematics, particularly computational algebra, Berlekamp's algorithm is a well-known method for factoring polynomials over finite fields. The algorithm consists mainly of matrix reduction and polynomial GCD computations. It was invented by Elwyn Berlekamp in 1967. It was the dominant algorithm for solving the problem until the Cantor–Zassenhaus algorithm of 1981. It is currently implemented in many well-known computer algebra systems.

In computational algebra, the Cantor–Zassenhaus algorithm is a method for factoring polynomials over finite fields.

In number theory, an average order of an arithmetic function is some simpler or better-understood function which takes the same values "on average".

A hyperelliptic curve is a particular kind of algebraic curve. There exist hyperelliptic curves of every genus $. If the genus of a hyperelliptic curve equals 1, we simply call the curve an elliptic curve. Hence we can see hyperelliptic curves as generalizations of elliptic curves. There is a well-known group structure on the set of points lying on an elliptic curve over some field, which we can describe geometrically with chords and tangents. Generalizing this group structure to the hyperelliptic case is not straightforward. We cannot define the same group law on the set of points lying on a hyperelliptic curve, instead a group structure can be defined on the so-called Jacobian of a hyperelliptic curve. The computations differ depending on the number of points at infinity. Imaginary hyperelliptic curves are hyperelliptic curves with exactly 1 point at infinity: real hyperelliptic curves have two points at infinity.$

In transcendental number theory, a mathematical discipline, Baker's theorem gives a lower bound for the absolute value of linear combinations of logarithms of algebraic numbers. The result, proved by Alan Baker, subsumed many earlier results in transcendental number theory and solved a problem posed by Alexander Gelfond nearly fifteen years earlier. Baker used this to prove the transcendence of many numbers, to derive effective bounds for the solutions of some Diophantine equations, and to solve the class number problem of finding all imaginary quadratic fields with class number 1.

In mathematics and computer algebra the factorization of a polynomial consists of decomposing it into a product of irreducible factors. This decomposition is theoretically possible and is unique for polynomials with coefficients in any field, but rather strong restrictions on the field of the coefficients are needed to allow the computation of the factorization by means of an algorithm. In practice, algorithms have been designed only for polynomials with coefficients in a finite field, in the field of rationals or in a finitely generated field extension of one of them.

Network coding has been shown to optimally use bandwidth in a network, maximizing information flow but the scheme is very inherently vulnerable to pollution attacks by malicious nodes in the network. A node injecting garbage can quickly affect many receivers. The pollution of network packets spreads quickly since the output of honest node is corrupted if at least one of the incoming packets is corrupted.

In representation theory of mathematics, the Waldspurger formula relates the special values of two L-functions of two related admissible irreducible representations. Let $k$ be the base field, $f$ be an automorphic form over $k$ , $π$ be the representation associated via the Jacquet–Langlands correspondence with f. Goro Shimura (1976) proved this formula, when $and f is a cusp form; Günter Harder made the same discovery at the same time in an unpublished paper. Marie-France Vignéras (1980) proved this formula, when and f is a newform. Jean-Loup Waldspurger, for whom the formula is named, reproved and generalized the result of Vignéras in 1985 via a totally different method which was widely used thereafter by mathematicians to prove similar formulas.$

References

↑ L. Adleman. "The function field sieve". In: Algorithmic Number Theory (ANTS-I). Lecture Notes in Computer Science. Springer (1994), pp.108-121.
1 2 L. Adleman, M.D. Huang. "Function Field Sieve Method for Discrete Logarithms over Finite Fields". In: Inf. Comput. 151 (May 1999), pp. 5-16. DOI: 10.1006/inco.1998.2761.
↑ D. Coppersmith. (1984), "Fast evaluation of discrete logarithms in fields of characteristic two". In: IEEE Trans. Inform. Theory IT-39 (1984), pp. 587-594.
↑ M. Fried and M. Jarden. In: "Field Arithmetic". vol. 11. (Jan. 2005). Chap. 2.1. DOI: 10.1007/b138352.
↑ D. Gordon. "Discrete Logarithm in GF(P) Using the Number Field Sieve". In: Siam Journal on Discrete Mathematics - SIAMDM 6 (Feb. 1993), pp. 124-138. DOI: 10.1137/0406010.
↑ R. Barbulescu, P. Gaudry, T. Kleinjung. "The Tower Number Field Sieve". In: Advances in Cryptology – Asiacrypt 2015. Vol. 9453. Springer, May 2015. pp. 31-58

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] L. Adleman. "The function field sieve". In: Algorithmic Number Theory (ANTS-I). Lecture Notes in Computer Science. Springer (1994), pp.108-121.

[adleman-2] 1 2 L. Adleman, M.D. Huang. "Function Field Sieve Method for Discrete Logarithms over Finite Fields". In: Inf. Comput. 151 (May 1999), pp. 5-16. DOI: 10.1006/inco.1998.2761.

[3] D. Coppersmith. (1984), "Fast evaluation of discrete logarithms in fields of characteristic two". In: IEEE Trans. Inform. Theory IT-39 (1984), pp. 587-594.

[4] M. Fried and M. Jarden. In: "Field Arithmetic". vol. 11. (Jan. 2005). Chap. 2.1. DOI: 10.1007/b138352.

[5] D. Gordon. "Discrete Logarithm in GF(P) Using the Number Field Sieve". In: Siam Journal on Discrete Mathematics - SIAMDM 6 (Feb. 1993), pp. 124-138. DOI: 10.1137/0406010.

[6] R. Barbulescu, P. Gaudry, T. Kleinjung. "The Tower Number Field Sieve". In: Advances in Cryptology – Asiacrypt 2015. Vol. 9453. Springer, May 2015. pp. 31-58

[1]

[2]

[3]

[4]

[5]

[6]

v t e Number-theoretic algorithms
Primality tests	AKS APR Baillie–PSW Elliptic curve Pocklington Fermat Lucas Lucas–Lehmer Lucas–Lehmer–Riesel Proth's theorem Pépin's Quadratic Frobenius Solovay–Strassen Miller–Rabin
Prime-generating	Sieve of Atkin Sieve of Eratosthenes Sieve of Pritchard Sieve of Sundaram Wheel factorization
Integer factorization	Continued fraction (CFRAC) Dixon's Lenstra elliptic curve (ECM) Euler's Pollard's rho p − 1 p + 1 Quadratic sieve (QS) General number field sieve (GNFS) Special number field sieve (SNFS) Rational sieve Fermat's Shanks's square forms Trial division Shor's
Multiplication	Ancient Egyptian Long Karatsuba Toom–Cook Schönhage–Strassen Fürer's
Euclidean division	Binary Chunking Fourier Goldschmidt Newton-Raphson Long Short SRT
Discrete logarithm	Baby-step giant-step Pollard rho Pollard kangaroo Pohlig–Hellman Index calculus Function field sieve
Greatest common divisor	Binary Euclidean Extended Euclidean Lehmer's
Modular square root	Cipolla Pocklington's Tonelli–Shanks Berlekamp Kunerth
Other algorithms	Chakravala Cornacchia Exponentiation by squaring Integer square root Integer relation (LLL; KZ) Modular exponentiation Montgomery reduction Schoof Trachtenberg system
Italics indicate that algorithm is for numbers of special forms