Pollard's rho algorithm

Last updated January 24, 2024

Pollard's rho algorithm is an algorithm for integer factorization. It was invented by John Pollard in 1975.^[1] It uses only a small amount of space, and its expected running time is proportional to the square root of the smallest prime factor of the composite number being factorized.

Core ideas

The algorithm is used to factorize a number $n=pq$ , where $p$ is a non-trivial factor. A polynomial modulo $n$ , called $g(x)$ (e.g., $g(x)=(x^{2}+1){\bmod {n}}$ ), is used to generate a pseudorandom sequence. It is important to note that $g(x)$ must be a polynomial. A starting value, say 2, is chosen, and the sequence continues as $x_{1}=g(2)$ , $x_{2}=g(g(2))$ , $x_{3}=g(g(g(2)))$ , etc. The sequence is related to another sequence $\{x_{k}{\bmod {p}}\}$ . Since $p$ is not known beforehand, this sequence cannot be explicitly computed in the algorithm. Yet, in it lies the core idea of the algorithm.

Because the number of possible values for these sequences is finite, both the $\{x_{k}\}$ sequence, which is mod $n$ , and $\{x_{k}{\bmod {p}}\}$ sequence will eventually repeat, even though these values are unknown. If the sequences were to behave like random numbers, the birthday paradox implies that the number of $x_{k}$ before a repetition occurs would be expected to be $O({\sqrt {N}})$ , where $N$ is the number of possible values. So the sequence $\{x_{k}{\bmod {p}}\}$ will likely repeat much earlier than the sequence $\{x_{k}\}$ . When one has found a $k_{1},k_{2}$ such that $x_{k_{1}}\neq x_{k_{2}}$ but $x_{k_{1}}\equiv x_{k_{2}}{\bmod {p}}$ , the number $|x_{k_{1}}-x_{k_{2}}|$ is a multiple of $p$ , so $p$ has been found.

Once a sequence has a repeated value, the sequence will cycle, because each value depends only on the one before it. This structure of eventual cycling gives rise to the name "rho algorithm", owing to similarity to the shape of the Greek letter ρ when the values $x_{1}{\bmod {p}}$ , $x_{2}{\bmod {p}}$ , etc. are represented as nodes in a directed graph.

This is detected by Floyd's cycle-finding algorithm: two nodes $i$ and $j$ (i.e., $x_{i}$ and $x_{j}$ ) are kept. In each step, one moves to the next node in the sequence and the other moves forward by two nodes. After that, it is checked whether $\gcd(x_{i}-x_{j},n)\neq 1$ . If it is not 1, then this implies that there is a repetition in the $\{x_{k}{\bmod {p}}\}$ sequence (i.e. $x_{i}{\bmod {p}}=x_{j}{\bmod {p}})$ . This works because if the $x_{i}$ is the same as $x_{j}$ , the difference between $x_{i}$ and $x_{j}$ is necessarily a multiple of $p$ . Although this always happens eventually, the resulting greatest common divisor (GCD) is a divisor of $n$ other than 1. This may be $n$ itself, since the two sequences might repeat at the same time. In this (uncommon) case the algorithm fails, and can be repeated with a different parameter.

Algorithm

The algorithm takes as its inputs $n$ , the integer to be factored; and $g(x)$ , a polynomial in $x$ computed modulo $n$ . In the original algorithm, $g(x)=(x^{2}-1){\bmod {n}}$ , but nowadays it is more common to use $g(x)=(x^{2}+1){\bmod {n}}$ . The output is either a non-trivial factor of $n$ , or failure. It performs the following steps:^[2]

Pseudocode for Pollard's rho algorithm

    x ← 2 // starting value     y ← x     d ← 1      while d = 1:         x ← g(x)         y ← g(g(y))         d ← gcd(|x - y|, n)      if d = n:          return failureelse:         return d

Here $x$ and $y$ corresponds to $x_{i}$ and $x_{j}$ in the previous section. Note that this algorithm may fail to find a nontrivial factor even when $n$ is composite. In that case, the method can be tried again, using a starting value other than 2 or a different $g(x)$ .

Example factorization

Let $n=8051$ and $g(x)=(x^{2}+1){\bmod {8}}051$ .

$i$	$x$	$y$	$gcd(\| x - y \|, 8051)$
1	5	26	1
2	26	7474	1
3	677	871	97
4	7474	1481	1

Now 97 is a non-trivial factor of 8051. Starting values other than $x = y = 2$ may give the cofactor (83) instead of 97. One extra iteration is shown above to make it clear that $y$ moves twice as fast as $x$ . Note that even after a repetition, the GCD can return to 1.

Variants

In 1980, Richard Brent published a faster variant of the rho algorithm. He used the same core ideas as Pollard but a different method of cycle detection, replacing Floyd's cycle-finding algorithm with the related Brent's cycle finding method.^[3]

A further improvement was made by Pollard and Brent. They observed that if $\gcd(a,n)>1$ , then also $\gcd(ab,n)>1$ for any positive integer $b$ . In particular, instead of computing $\gcd(|x-y|,n)$ at every step, it suffices to define $z$ as the product of 100 consecutive $|x-y|$ terms modulo $n$ , and then compute a single $\gcd(z,n)$ . A major speed up results as 100 $gcd$ steps are replaced with 99 multiplications modulo $n$ and a single $gcd$ . Occasionally it may cause the algorithm to fail by introducing a repeated factor, for instance when $n$ is a square. But it then suffices to go back to the previous $gcd$ term, where $\gcd(z,n)=1$ , and use the regular ρ algorithm from there.

Application

The algorithm is very fast for numbers with small factors, but slower in cases where all factors are large. The ρ algorithm's most remarkable success was the 1980 factorization of the Fermat number $F 8$ = 1238926361552897 × 93461639715357977769163558199606896584051237541638188580280321.^[4] The ρ algorithm was a good choice for $F 8$ because the prime factor $p$ = 1238926361552897 is much smaller than the other factor. The factorization took 2 hours on a UNIVAC 1100/42.^[4]

Example: factoring $n$ = 10403 = 101 · 103

The following table shows numbers produced by the algorithm, starting with $x=2$ and using the polynomial $g(x)=(x^{2}+1){\bmod {1}}0403$ . The third and fourth columns of the table contain additional information not known by the algorithm. They are included to show how the algorithm works.

$x$	$y$	$x{\bmod {1}}01$	$y{\bmod {1}}01$	step
2	2	2	2	0
5	2	5	2	1
26	2	26	2	2
677	26	71	26	3
598	26	93	26	4
3903	26	65	26	5
3418	26	85	26	6
156	3418	55	85	7
3531	3418	97	85	8
5168	3418	17	85	9
3724	3418	88	85	10
978	3418	69	85	11
9812	3418	15	85	12
5983	3418	24	85	13
9970	3418	72	85	14
236	9970	34	72	15
3682	9970	46	72	16
2016	9970	97	72	17
7087	9970	17	72	18
10289	9970	88	72	19
2594	9970	69	72	20
8499	9970	15	72	21
4973	9970	24	72	22
2799	9970	72	72	23

The first repetition modulo 101 is 97 which occurs in step 17. The repetition is not detected until step 23, when $x\equiv y{\pmod {101}}$ . This causes $\gcd(x-y,n)=\gcd(2799-9970,n)$ to be $p=101$ , and a factor is found.

Complexity

If the pseudorandom number $x=g(x)$ occurring in the Pollard ρ algorithm were an actual random number, it would follow that success would be achieved half the time, by the birthday paradox in $O({\sqrt {p}})\leq O(n^{1/4})$ iterations. It is believed that the same analysis applies as well to the actual rho algorithm, but this is a heuristic claim, and rigorous analysis of the algorithm remains open.^[5]

Related Research Articles

In mathematics, the Euclidean algorithm, or Euclid's algorithm, is an efficient method for computing the greatest common divisor (GCD) of two integers (numbers), the largest number that divides them both without a remainder. It is named after the ancient Greek mathematician Euclid, who first described it in his Elements . It is an example of an algorithm, a step-by-step procedure for performing a calculation according to well-defined rules, and is one of the oldest algorithms in common use. It can be used to reduce fractions to their simplest form, and is a part of many other number-theoretic and cryptographic calculations.

<span class="mw-page-title-main">Quadratic reciprocity</span> Gives conditions for the solvability of quadratic equations modulo prime numbers

In number theory, the law of quadratic reciprocity is a theorem about modular arithmetic that gives conditions for the solvability of quadratic equations modulo prime numbers. Due to its subtlety, it has many formulations, but the most standard statement is:

RSA (Rivest–Shamir–Adleman) is a public-key cryptosystem, one of the oldest widely used for secure data transmission. The initialism "RSA" comes from the surnames of Ron Rivest, Adi Shamir and Leonard Adleman, who publicly described the algorithm in 1977. An equivalent system was developed secretly in 1973 at Government Communications Headquarters (GCHQ), the British signals intelligence agency, by the English mathematician Clifford Cocks. That system was declassified in 1997.

The Lenstra elliptic-curve factorization or the elliptic-curve factorization method (ECM) is a fast, sub-exponential running time, algorithm for integer factorization, which employs elliptic curves. For general-purpose factoring, ECM is the third-fastest known factoring method. The second-fastest is the multiple polynomial quadratic sieve, and the fastest is the general number field sieve. The Lenstra elliptic-curve factorization is named after Hendrik Lenstra.

The Rabin cryptosystem is a family of public-key encryption schemes based on a trapdoor function whose security, like that of RSA, is related to the difficulty of integer factorization.

Pollard's p − 1 algorithm is a number theoretic integer factorization algorithm, invented by John Pollard in 1974. It is a special-purpose algorithm, meaning that it is only suitable for integers with specific types of factors; it is the simplest example of an algebraic-group factorisation algorithm.

The quadratic sieve algorithm (QS) is an integer factorization algorithm and, in practice, the second-fastest method known. It is still the fastest for integers under 100 decimal digits or so, and is considerably simpler than the number field sieve. It is a general-purpose factorization algorithm, meaning that its running time depends solely on the size of the integer to be factored, and not on special structure or properties. It was invented by Carl Pomerance in 1981 as an improvement to Schroeppel's linear sieve.

In computer science, cycle detection or cycle finding is the algorithmic problem of finding a cycle in a sequence of iterated function values.

In number theory, Dixon's factorization method is a general-purpose integer factorization algorithm; it is the prototypical factor base method. Unlike for other factor base methods, its run-time bound comes with a rigorous proof that does not rely on conjectures about the smoothness properties of the values taken by a polynomial.

Pollard's rho algorithm for logarithms is an algorithm introduced by John Pollard in 1978 to solve the discrete logarithm problem, analogous to Pollard's rho algorithm to solve the integer factorization problem.

In mathematics, Hensel's lemma, also known as Hensel's lifting lemma, named after Kurt Hensel, is a result in modular arithmetic, stating that if a univariate polynomial has a simple root modulo a prime number $p$ , then this root can be lifted to a unique root modulo any higher power of $p$ . More generally, if a polynomial factors modulo $p$ into two coprime polynomials, this factorization can be lifted to a factorization modulo any higher power of $p$ .

In computational number theory, Williams's p + 1 algorithm is an integer factorization algorithm, one of the family of algebraic-group factorisation algorithms. It was invented by Hugh C. Williams in 1982.

In mathematics, the Eisenstein integers, occasionally also known as Eulerian integers, are the complex numbers of the form

In mathematics, the rational sieve is a general algorithm for factoring integers into prime factors. It is a special case of the general number field sieve. While it is less efficient than the general algorithm, it is conceptually simpler. It serves as a helpful first step in understanding how the general number field sieve works.

The Blum–Goldwasser (BG) cryptosystem is an asymmetric key encryption algorithm proposed by Manuel Blum and Shafi Goldwasser in 1984. Blum–Goldwasser is a probabilistic, semantically secure cryptosystem with a constant-size ciphertext expansion. The encryption algorithm implements an XOR-based stream cipher using the Blum-Blum-Shub (BBS) pseudo-random number generator to generate the keystream. Decryption is accomplished by manipulating the final state of the BBS generator using the private key, in order to find the initial seed and reconstruct the keystream.

In mathematics and computer algebra, factorization of polynomials or polynomial factorization expresses a polynomial with coefficients in a given field or in the integers as the product of irreducible factors with coefficients in the same domain. Polynomial factorization is one of the fundamental components of computer algebra systems.

In mathematics, particularly computational algebra, Berlekamp's algorithm is a well-known method for factoring polynomials over finite fields. The algorithm consists mainly of matrix reduction and polynomial GCD computations. It was invented by Elwyn Berlekamp in 1967. It was the dominant algorithm for solving the problem until the Cantor–Zassenhaus algorithm of 1981. It is currently implemented in many well-known computer algebra systems.

In mathematics, elliptic curve primality testing techniques, or elliptic curve primality proving (ECPP), are among the quickest and most widely used methods in primality proving. It is an idea put forward by Shafi Goldwasser and Joe Kilian in 1986 and turned into an algorithm by A. O. L. Atkin the same year. The algorithm was altered and improved by several collaborators subsequently, and notably by Atkin and François Morain, in 1993. The concept of using elliptic curves in factorization had been developed by H. W. Lenstra in 1985, and the implications for its use in primality testing followed quickly.

In mathematics and computer algebra the factorization of a polynomial consists of decomposing it into a product of irreducible factors. This decomposition is theoretically possible and is unique for polynomials with coefficients in any field, but rather strong restrictions on the field of the coefficients are needed to allow the computation of the factorization by means of an algorithm. In practice, algorithms have been designed only for polynomials with coefficients in a finite field, in the field of rationals or in a finitely generated field extension of one of them.

In number theory, Berlekamp's root finding algorithm, also called the Berlekamp–Rabin algorithm, is the probabilistic method of finding roots of polynomials over the field $with elements. The method was discovered by Elwyn Berlekamp in 1970 as an auxiliary to the algorithm for polynomial factorization over finite fields. The algorithm was later modified by Rabin for arbitrary finite fields in 1979. The method was also independently discovered before Berlekamp by other researchers.$

References

↑ Pollard, J. M. (1975). "A Monte Carlo method for factorization". BIT Numerical Mathematics. 15 (3): 331–334. doi:10.1007/bf01933667. S2CID 122775546.
↑ Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L. & Stein, Clifford (2009). "Section 31.9: Integer factorization". Introduction to Algorithms (third ed.). Cambridge, MA: MIT Press. pp. 975–980. ISBN 978-0-262-03384-8. (this section discusses only Pollard's rho algorithm).
↑ Brent, Richard P. (1980). "An Improved Monte Carlo Factorization Algorithm". BIT. 20 (2): 176–184. doi:10.1007/BF01933190. S2CID 17181286.
1 2 Brent, R.P.; Pollard, J. M. (1981). "Factorization of the Eighth Fermat Number". Mathematics of Computation. 36 (154): 627–630. doi: 10.2307/2007666 . JSTOR 2007666.
↑ Galbraith, Steven D. (2012). "14.2.5 Towards a rigorous analysis of Pollard rho". Mathematics of Public Key Cryptography. Cambridge University Press. pp. 272–273. ISBN 9781107013926..

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Pollard, J. M. (1975). "A Monte Carlo method for factorization". BIT Numerical Mathematics. 15 (3): 331–334. doi:10.1007/bf01933667. S2CID 122775546.

[2] Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L. & Stein, Clifford (2009). "Section 31.9: Integer factorization". Introduction to Algorithms (third ed.). Cambridge, MA: MIT Press. pp. 975–980. ISBN 978-0-262-03384-8. (this section discusses only Pollard's rho algorithm).

[3] Brent, Richard P. (1980). "An Improved Monte Carlo Factorization Algorithm". BIT. 20 (2): 176–184. doi:10.1007/BF01933190. S2CID 17181286.

[FotEFN-4] 1 2 Brent, R.P.; Pollard, J. M. (1981). "Factorization of the Eighth Fermat Number". Mathematics of Computation. 36 (154): 627–630. doi: 10.2307/2007666 . JSTOR 2007666.

[5] Galbraith, Steven D. (2012). "14.2.5 Towards a rigorous analysis of Pollard rho". Mathematics of Public Key Cryptography. Cambridge University Press. pp. 272–273. ISBN 9781107013926..

[1]

[2]

[3]

[4]

[5]

v t e Number-theoretic algorithms
Primality tests	AKS APR Baillie–PSW Elliptic curve Pocklington Fermat Lucas Lucas–Lehmer Lucas–Lehmer–Riesel Proth's theorem Pépin's Quadratic Frobenius Solovay–Strassen Miller–Rabin
Prime-generating	Sieve of Atkin Sieve of Eratosthenes Sieve of Pritchard Sieve of Sundaram Wheel factorization
Integer factorization	Continued fraction (CFRAC) Dixon's Lenstra elliptic curve (ECM) Euler's Pollard's rho p − 1 p + 1 Quadratic sieve (QS) General number field sieve (GNFS) Special number field sieve (SNFS) Rational sieve Fermat's Shanks's square forms Trial division Shor's
Multiplication	Ancient Egyptian Long Karatsuba Toom–Cook Schönhage–Strassen Fürer's
Euclidean division	Binary Chunking Fourier Goldschmidt Newton-Raphson Long Short SRT
Discrete logarithm	Baby-step giant-step Pollard rho Pollard kangaroo Pohlig–Hellman Index calculus Function field sieve
Greatest common divisor	Binary Euclidean Extended Euclidean Lehmer's
Modular square root	Cipolla Pocklington's Tonelli–Shanks Berlekamp Kunerth
Other algorithms	Chakravala Cornacchia Exponentiation by squaring Integer square root Integer relation (LLL; KZ) Modular exponentiation Montgomery reduction Schoof Trachtenberg system
Italics indicate that algorithm is for numbers of special forms

Pollard's rho algorithm

Contents

Core ideas

Algorithm

Example factorization

Variants

Application

Example: factoring $n$ = 10403 = 101 · 103

Complexity

See also

Related Research Articles

References

Further reading

External links

Pollard's rho algorithm

Contents

Core ideas

Algorithm

Example factorization

Variants

Application

Example: factoring n = 10403 = 101 · 103

Complexity

See also

Related Research Articles

References

Further reading

External links

Example: factoring $n$ = 10403 = 101 · 103