Gijswijt's sequence

In mathematics, Gijswijt's sequence (named after Dion Gijswijt by Neil Sloane ) is a self-describing sequence where each term counts the maximum number of repeated blocks of numbers in the sequence immediately preceding that term.

The sequence begins with:


 * 1, 1, 2, 1, 1, 2, 2, 2, 3, 1, 1, 2, 1, 1, 2, 2, 2, 3, 2, 1, ...

The sequence is similar in definition to the Kolakoski sequence, but instead of counting the longest run of single terms, the sequence counts the longest run of blocks of terms of any length. Gijswijt's sequence is known for its remarkably slow rate of growth. For example, the first 4 appears at the 220th term, and the first 5 appears near the $$10^{10^{23}}$$rd term.

Definition
The process to generate terms in the sequence can be defined by looking at the sequence as a series of letters in the alphabet of natural numbers:


 * 1) $$a(1) = 1$$, and
 * 2) $$a(n+1) = k$$, where $$k$$ is the largest natural number such that the word $$a(1)a(2)a(3)...a(n)$$ can be written in the form $$xy^k$$ for some words $$x$$ and $$y$$, with $$y$$ having non-zero length.

The sequence is base-agnostic. That is, if a run of 10 repeated blocks is found, the next term in the sequence would be a single number 10, not a 1 followed by a 0.

Explanation
The sequence begins with 1 by definition. The 1 in the second term then represents the length 1 of the block of 1s that is found immediately before it in the first term. The 2 in the third term represents the length 2 of the block of 1s that are in the first and second term. At this point, the sequence decreases for the first time: The 1 in the fourth term represents the length 1 of the block of 2s in the 3rd term, as well as the length 1 of the block "1, 2" spanning the second and third term. There is no block of any repeated sequence immediately preceding the fourth term that is longer than length 1. The block of two 1s in the first and second term cannot be considered for the 4th term because they are separated by a different number in the 3rd term.

The 1 in the fifth term represents the length 1 of the "repeating" blocks "1" and "2, 1" and "1, 2, 1" and "1, 1, 2, 1" that immediately precede the fifth term. None of these blocks are repeated more than once, so the fifth term is 1. The 2 in the sixth term represents the length of the repeated block of 1s immediately leading up to the sixth term, namely the ones in the 4th and 5th terms. The 2 in the seventh term represents the 2 repetitions of the block "1, 1, 2" spanning terms 1-3 and then 4–6. This "3-number word" occurs twice immediately leading up to the seventh term - so the value of the seventh term is 2.

The 2 in the eighth term represents the length of the repeated block of 2s immediately leading up to the eighth term, namely the twos in the sixth and seventh terms. The 3 in the 9th term represents the thrice-repeated block of single 2s immediately leading up to the 9th term, namely the twos in the sixth, seventh, and eighth terms.

Properties
Only limited research has focused on Gijswijt's sequence. As such, very little has been proven about the sequence and many open questions remain unsolved.

Average value
Though it is known that each natural number occurs at a finite position within the sequence, it has been shown that the sequence has a finite mean. To define this formally on an infinite sequence, where re-ordering of the terms may matter, it is known that


 * $$\lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^n a(i) \approx 1.904 < \infty $$.

Likewise, the density of any given natural number within the sequence is known to be finite.

Rate of growth and first occurrences
In 2006 Gijswijt proved that the sequence contains every natural number. The sequence grows roughly super-logarithmically, with the first occurrence of any natural $$n$$ positioned at approximately $$2^{2^{3^{4^{5^{.^{.^{.{n-1}}}}}}}}$$. A closed-form expression for the earliest index at which a given positive integer $$n$$ appears was found by Levi van de Pol, in terms of a constant $$\epsilon_1$$ and a sequence of constants $$\nu_k$$.

For example, the position of the first 5 is given by
 * $$\phi^{(1)}(5)=\lfloor 1-\epsilon_1+\epsilon_1\cdot 2^{418090195952691922788353} \rfloor$$

where $$\epsilon_1\approx 3.48669886438365597023$$. Expanded out, this number is approximately
 * $$3.2719044223289929745 \cdot 10^{125857689874791897769333}$$.

The first instance of two consecutive 4's starts at position
 * 255,895,648,634,818,208,370,064,452,304,769,558,261,700,170,817,472,823,
 * 398,081,655,524,438,021,806,620,809,813,295,008,281,436,789,493,636,144.

These number both have 108 digits, and were first published by van de Pol.

Recursive structure
The sequence can be broken into discrete "block" and "glue" sequences, which can be used to recursively build up the sequence. For example, at the base level, we can define $$B_1=1$$ and $$S_1=2$$ as the first block and glue sequences, respectively. Together, we can see how they form the beginning of the sequence:


 * $$B_1B_1S_1 = 1, 1, 2$$

The next step is to recursively build up the sequence. Define $$B_2 = B_1B_1S_1$$. Noting that the sequence starts with $$B_1B_1$$, we can define a glue string $$S_2 = 2,2,3$$ which gives:


 * $$B_2B_2S_2 = 1, 1, 2, 1, 1, 2, 2, 2, 3$$

We assigned $$S_2$$ to a particular sequence which gives the property that $$B_2B_2S_2B_2B_2S_2$$ also occurs at the top of the sequence.

This process can be continued indefinitely with $$B_{n+1} = B_nB_nS_n$$. It turns out that we can discover a glue string $$S_n$$ by noting that $$S_n$$ will never have a 1, and that it stops once it reaches the first 1 to follow $$B_nB_n$$. It has also been proven that Gijswijt's sequence can be built up in this fashion indefinitely ‒ that is, $$B_n$$ and $$S_n$$ are always finite in length for any $$n$$.

Clever manipulation of the glue sequences in this recursive structure can be used to demonstrate that Gijswijt's sequence contains all the natural numbers, among other properties of the sequence.