Gijswijt's sequence

Last updated

In mathematics, Gijswijt's sequence (named after Dion Gijswijt by Neil Sloane [1] ) is a self-describing sequence where each term counts the maximum number of repeated blocks of numbers in the sequence immediately preceding that term.

Contents

The sequence begins with:

1, 1, 2, 1, 1, 2, 2, 2, 3, 1, 1, 2, 1, 1, 2, 2, 2, 3, 2, 1, ... (sequence A090822 in the OEIS )

The sequence is similar in definition to the Kolakoski sequence, but instead of counting the longest run of single terms, the sequence counts the longest run of blocks of terms of any length. Gijswijt's sequence is known for its remarkably slow rate of growth. For example, the first 4 appears at the 220th term, and the first 5 appears near the rd term. [1]

Definition

The process to generate terms in the sequence can be defined by looking at the sequence as a series of letters in the alphabet of natural numbers:

  1. , and
  2. , where is the largest natural number such that the word can be written in the form for some words and , with having non-zero length.

The sequence is base-agnostic. That is, if a run of 10 repeated blocks is found, the next term in the sequence would be a single number 10, not a 1 followed by a 0.

Explanation

The sequence begins with 1 by definition. The 1 in the second term then represents the length 1 of the block of 1s that is found immediately before it in the first term. The 2 in the third term represents the length 2 of the block of 1s that are in the first and second term. At this point, the sequence decreases for the first time: The 1 in the fourth term represents the length 1 of the block of 2s in the 3rd term, as well as the length 1 of the block "1, 2" spanning the second and third term. There is no block of any repeated sequence immediately preceding the fourth term that is longer than length 1. The block of two 1s in the first and second term cannot be considered for the 4th term because they are separated by a different number in the 3rd term.

The 1 in the fifth term represents the length 1 of the "repeating" blocks "1" and "2, 1" and "1, 2, 1" and "1, 1, 2, 1" that immediately precede the fifth term. None of these blocks are repeated more than once, so the fifth term is 1. The 2 in the sixth term represents the length of the repeated block of 1s immediately leading up to the sixth term, namely the ones in the 4th and 5th terms. The 2 in the seventh term represents the 2 repetitions of the block "1, 1, 2" spanning terms 1-3 and then 4–6. This "3-number word" occurs twice immediately leading up to the seventh term - so the value of the seventh term is 2.

The 2 in the eighth term represents the length of the repeated block of 2s immediately leading up to the eighth term, namely the twos in the sixth and seventh terms. The 3 in the 9th term represents the thrice-repeated block of single 2s immediately leading up to the 9th term, namely the twos in the sixth, seventh, and eighth terms.

Properties

Only limited research has focused on Gijswijt's sequence. As such, very little has been proven about the sequence and many open questions remain unsolved.

Rate of growth

In 2006 Gijswijt proved that the sequence contains every natural number. [2] The sequence grows roughly super-logarithmically, with the first occurrence of any natural positioned at approximately . A closed-form expression for the earliest index at which a given positive integer appears is known, in terms of a constant and a sequence of constants . [3]

Average value

Though it is known that each natural number occurs at a finite position within the sequence, it has been conjectured that the sequence may have a finite mean. To define this formally on an infinite sequence, where re-ordering of the terms may matter, the conjecture is that:

Likewise, the density of any given natural number within the sequence is not known. [1]

First occurrences

As mentioned, the first 4 appears at position 220. The first instance of two consecutive 4's starts at position

255,895,648,634,818,208,370,064,452,304,769,558,261,700,170,817,472,823,
398,081,655,524,438,021,806,620,809,813,295,008,281,436,789,493,636,144.

This number has 108 digits, and was first published by Levi van de Pol. [4]

Van de Pol also gives an exact formula for the position of the first 5:

where . Expanded out, this number is approximately

Recursive structure

The sequence can be broken into discrete "block" and "glue" sequences, which can be used to recursively build up the sequence. For example, at the base level, we can define and as the first block and glue sequences, respectively. Together, we can see how they form the beginning of the sequence:

The next step is to recursively build up the sequence. Define . Noting that the sequence starts with , we can define a glue string which gives:

We assigned to a particular sequence which gives the property that also occurs at the top of the sequence.

This process can be continued indefinitely with . It turns out that we can discover a glue string by noting that will never have a 1, and that it stops once it reaches the first 1 to follow . [5] It has also been proven that Gijswijt's sequence can be built up in this fashion indefinitely ‒ that is, and are always finite in length for any . [2]

Clever manipulation of the glue sequences in this recursive structure can be used to demonstrate that Gijswijt's sequence contains all the natural numbers, among other properties of the sequence.

See also

Related Research Articles

<span class="mw-page-title-main">Cauchy sequence</span> Sequence of points that get progressively closer to each other

In mathematics, a Cauchy sequence is a sequence whose elements become arbitrarily close to each other as the sequence progresses. More precisely, given any small positive distance, all excluding a finite number of elements of the sequence are less than that given distance from each other. Cauchy sequences are named after Augustin-Louis Cauchy; they may occasionally be known as fundamental sequences.

<span class="mw-page-title-main">Law of large numbers</span> Averages of repeated trials converge to the expected value

In probability theory, the law of large numbers (LLN) is a mathematical theorem that states that the average of the results obtained from a large number of independent and identical random samples converges to the true value, if it exists. More formally, the LLN states that given a sample of independent and identically distributed values, the sample mean converges to the true mean.

In continuum mechanics, the infinitesimal strain theory is a mathematical approach to the description of the deformation of a solid body in which the displacements of the material particles are assumed to be much smaller than any relevant dimension of the body; so that its geometry and the constitutive properties of the material at each point of space can be assumed to be unchanged by the deformation.

In information theory, the typical set is a set of sequences whose probability is close to two raised to the negative power of the entropy of their source distribution. That this set has total probability close to one is a consequence of the asymptotic equipartition property (AEP) which is a kind of law of large numbers. The notion of typicality is only concerned with the probability of a sequence and not the actual sequence itself.

<span class="mw-page-title-main">Abundant number</span> Number that is less than the sum of its proper divisors

In number theory, an abundant number or excessive number is a positive integer for which the sum of its proper divisors is greater than the number. The integer 12 is the first abundant number. Its proper divisors are 1, 2, 3, 4 and 6 for a total of 16. The amount by which the sum exceeds the number is the abundance. The number 12 has an abundance of 4, for example.

In number theory, a Wall–Sun–Sun prime or Fibonacci–Wieferich prime is a certain kind of prime number which is conjectured to exist, although none are known.

In mathematics, the Lucas–Lehmer test (LLT) is a primality test for Mersenne numbers. The test was originally developed by Édouard Lucas in 1878 and subsequently proved by Derrick Henry Lehmer in 1930.

<span class="mw-page-title-main">Power of two</span> Two raised to an integer power

A power of two is a number of the form 2n where n is an integer, that is, the result of exponentiation with number two as the base and integer n as the exponent.

<span class="mw-page-title-main">Look-and-say sequence</span> Integer sequence

In mathematics, the look-and-say sequence is the sequence of integers beginning as follows:

Vector autoregression (VAR) is a statistical model used to capture the relationship between multiple quantities as they change over time. VAR is a type of stochastic process model. VAR models generalize the single-variable (univariate) autoregressive model by allowing for multivariate time series. VAR models are often used in economics and the natural sciences.

The Engel expansion of a positive real number x is the unique non-decreasing sequence of positive integers such that

In information theory, the noisy-channel coding theorem, establishes that for any given degree of noise contamination of a communication channel, it is possible to communicate discrete data nearly error-free up to a computable maximum rate through the channel. This result was presented by Claude Shannon in 1948 and was based in part on earlier work and ideas of Harry Nyquist and Ralph Hartley.

In number theory, the Dedekind psi function is the multiplicative function on the positive integers defined by

In mathematics, the Fibonacci numbers form a sequence defined recursively by:

A repeating decimal or recurring decimal is a decimal representation of a number whose digits are periodic and the infinitely repeated portion is not zero. It can be shown that a number is rational if and only if its decimal representation is repeating or terminating. For example, the decimal representation of 1/3 becomes periodic just after the decimal point, repeating the single digit "3" forever, i.e. 0.333.... A more complicated example is 3227/555, whose decimal becomes periodic at the second digit following the decimal point and then repeats the sequence "144" forever, i.e. 5.8144144144.... At present, there is no single universally accepted notation or phrasing for repeating decimals. Another example of this is 593/53, which becomes periodic after the decimal point, repeating the 13-digit pattern "1886792452830" forever, i.e. 11.18867924528301886792452830....

In mathematics the Baum–Sweet sequence is an infinite automatic sequence of 0s and 1s defined by the rule:

In mathematics, the Rudin–Shapiro sequence, also known as the Golay–Rudin–Shapiro sequence, is an infinite 2-automatic sequence named after Marcel Golay, Walter Rudin, and Harold S. Shapiro, who independently investigated its properties.

<span class="mw-page-title-main">Regular paperfolding sequence</span>

In mathematics the regular paperfolding sequence, also known as the dragon curve sequence, is an infinite sequence of 0s and 1s. It is obtained from the repeating partial sequence

Samplesort is a sorting algorithm that is a divide and conquer algorithm often used in parallel processing systems. Conventional divide and conquer sorting algorithms partitions the array into sub-intervals or buckets. The buckets are then sorted individually and then concatenated together. However, if the array is non-uniformly distributed, the performance of these sorting algorithms can be significantly throttled. Samplesort addresses this issue by selecting a sample of size s from the n-element sequence, and determining the range of the buckets by sorting the sample and choosing p−1 < s elements from the result. These elements then divide the array into p approximately equal-sized buckets. Samplesort is described in the 1970 paper, "Samplesort: A Sampling Approach to Minimal Storage Tree Sorting", by W. D. Frazer and A. C. McKellar.

In mathematics, Hooley's delta function , also called Erdős--Hooley delta-function, defines the maximum number of divisors of in for all , where is the Euler's number. The first few terms of this sequence are

References

  1. 1 2 3 Sloane, N. J. A. (ed.). "SequenceA090822(Gijswijt's sequence: a(1) = 1; for n>1, a(n) = largest integer k such that the word a(1)a(2)...a(n-1) is of the form xy^k for words x and y (where y has positive length), i.e., the maximal number of repeating blocks at the end of the sequence so far)". The On-Line Encyclopedia of Integer Sequences . OEIS Foundation.
  2. 1 2 Gijswijt, D.C. (2006). "A Slow-Growing Sequence Defined by an Unusual Recurrence". arXiv: math/0602498 .
  3. L. van de Pol, "The first occurrence of a number in Gijswijt's sequence", 2022. Accessed 4 July 2023.
  4. van de Pol, Levi. "The first occurrence of a number in Gijswijt's sequence". arXiv: 2209.04657 .
  5. Sloane, Neil. "Seven Staggering Sequences" (PDF). AT&T Shannon Lab. p. 3.
OEIS sequenceA090822(Gijswijt's sequence)