¹¹institutetext: Laboratoire de Combinatoire et d’Informatique Mathématique,
Université du Québec à Montréal, Montréal (QC), Canada,
¹¹email: [email protected], [email protected] ²²institutetext: Institute for Advanced Study in Mathematics
Harbin Institute of Technology, Harbin, PR China
²²email: [email protected]

Block-counting sequences are not purely morphic

Antoine Abram 11 Yining Hu 22 Shuo Li 11

Abstract

Let $m$ be a positive integer larger than $1$ , let $w$ be a finite word over $\left\{0,1,...,m-1\right\}$ and let $a_{m;w}(n)$ be the number of occurrences of the word $w$ in the $m$ -expansion of $n$ mod $p$ for any non-negative integer $n$ . In this article, we first give a fast algorithm to generate all sequences of the form $(a_{m;w}(n))_{n\in\mathbf{N}}$ ; then, under the hypothesis that $m$ is a prime, we prove that all these sequences are $m$ -uniformly but not purely morphic, except for $w=1,2,...,m-1$ ; finally, under the same assumption of $m$ as before, we prove that the power series $\sum_{i=0}^{\infty}a_{m;w}(n)t^{n}$ is algebraic of degree $m$ over $\mathbb{F}_{m}(t)$ .

1 Introduction, definitions and notation

Given a positive integer $m$ larger than $1$ and a finite word $w$ over $\left\{0,1,2,...,m-1\right\}$ , the block-counting sequence $(e_{m;w}(n))_{n\in\mathbf{N}}$ counts the number of occurrences of the word $w$ in the $m$ -expansion of $n$ for each non-negative integer $n$ . Let us define $(a_{m;w}(n))_{n\in\mathbf{N}}$ to be a sequence over $\left\{0,1,2,...,m-1\right\}$ such that $a_{m;w}(n)\equiv e_{m;w}(n)\mod(m)$ for all non-negative integer $n$ . The analytical as well as the combinatorial properties of these sequences have been studied since 1900’s after Thue and some well-known sequences are strongly related to this notion. Recall that the $0,1$ -Thue-Morse sequence can be defined as $(a_{2;1}(n))_{n\in\mathbf{N}}$ (see, for example, Page 15 in [4]) and the $0,1$ -Rudin-Shapiro sequence can also be defined as $(a_{2;11}(n))_{n\in\mathbf{N}}$ (see, for example, Example 3.3.1 in [4]). In this article, we review some common properties of usual block-counting sequences and generalize them to all block-counting sequences.

To be able to announce our results, here we recall some definitions and notation. Let $A$ be a finite set. It will be called an alphabet and its elements will be called letters. Let $A^{*}$ denote the free monoid generated by $A$ under concatenations and let $A^{\mathbf{N}}$ denote the set of infinite concatenations of elements in $A$ . Let $A^{\infty}=A^{*}\cup A^{\mathbf{N}}$ . A finite word over the alphabet $A$ is an element in $A^{*}$ and an infinite word over $A$ is an element in $A^{\mathbf{N}}$ . Particularly, the empty word is an element in $A^{*}$ and it is denoted by $\epsilon$ . The length of a word $w$ , denoted by $|w|$ , is the number of letters that it contains. The length of the empty word is $0$ and the length of any infinite word is infinite. For any non-empty word $w\in A^{\infty}$ , it can be denoted by $w[0]w[1]w[2]...$ , where $w[i]$ are elements in $A$ . A word $x$ is called a factor of $w$ if there exist two integers $0\leq i\leq j\leq|w|-1$ such that $x=w[i]w[i+1]...w[j]$ , this factor can also be denoted by $w[i..j]$ . A factor $x$ is called a prefix (resp. a suffix ) of the word $w$ if there exists a positive integer $i$ such that $0\leq i\leq|w|$ and $x=w[0..i]$ (resp. $x=w[i..|w|-1]$ ). For any finite word $w$ and any positive integer $n$ , let $w^{n}$ denote the concatenation of $n$ copies of $w$ , i.e. $w^{n}=ww...w$ $n$ times. Particularly, $w^{0}=\epsilon$ . For any pair of words $w,v$ such that $v$ is a factor of $w$ , let $|w|_{v}$ denote the number of occurrences of $v$ in $w$ .

Let $A$ and $B$ be two alphabets, a morphism $\phi$ from $A$ to $B$ is a map from $A^{\infty}$ to $B^{\infty}$ satisfying $\phi(xy)=\phi(x)\phi(y)$ for any pair of elements $x,y$ in $A^{\infty}$ . The morphism $\phi$ is called $k$ -uniform if for all elements $a\in A$ , $|\phi(a)|=k$ and it is called non-uniform otherwise. A morphism $\phi$ is called a coding function if it is $1$ -uniform and it is called non-erasing if $\phi(a)\neq\epsilon$ for all $a\in A$ .

Let $A$ be a finite alphabet and let $(a_{n})_{n\in\mathbf{N}}$ be an infinite sequence over $A$ , it is called morphic if there exists an alphabet $B$ , an infinite sequence $(b_{n})_{n\in\mathbf{N}}$ over $B$ , a non-erasing morphism $\phi$ from $B^{\infty}$ to $B^{\infty}$ and a coding function $\psi$ from $B^{\infty}$ to $A^{\infty}$ , such that $(b_{n})_{n\in\mathbf{N}}$ is a fixed point of $\phi$ and $(a_{n})_{n\in\mathbf{N}}=\psi((b_{n})_{n\in\mathbf{N}})$ . Moreover, the sequence $(a_{n})_{n\in\mathbf{N}}$ is called uniformly morphic if $\phi$ is $k$ -uniform for some integer $k$ , and it is called non-uniformly morphic otherwise. The sequence $(a_{n})_{n\in\mathbf{N}}$ is called purely morphic if $A=B$ and $\psi=Id$ .

For any positive integer $m$ , let $[\![m]\!]=\left\{0,1,2,...,m-1\right\}$ . For any $t\in[\![m]\!]$ let $t^{+}\equiv t+1\mod m$ ; for any $w\in[\![m]\!]^{*}$ , let $w^{+}=w[0]^{+}w[1]^{+}...w[|w|-1]^{+}$ .

In Section2, we give a fast algorithm to generate all block-counting sequences. It is well-known that the Thue-Morse sequence can be generated by the following algorithm (see, for example, [12, A008277]):

Example 1

Let $(w_{n})_{n\in\mathbf{N}}$ be a sequence of words over the $[\![2]\!]^{*}$ such that $w_{0}=0$ and that $w_{i+1}=w_{i}w_{i}^{+}$ for all $i$ , then the Thue-Morse sequence $(a_{2;1}(n))_{n\in\mathbf{N}}$ satisfies $(a_{2;1}(n))_{n\in\mathbf{N}}=\lim_{i\to\infty}w_{i}$ .

In Section2, we prove that the Rudin-Shapiro sequence can also be generalized by a similar algorithm, see 4. More generally, we find fast algorithms to generate all block-counting sequences. These algorithms are given by 3 and 5 in Section2.

From the definitions recalled as above, any morphic word can be classified as either a uniformly morphic word or a non-uniformly morphic word. However, from a recent article [5], Allouche and Shallit proved that all uniformly morphic sequences are also non-uniformly morphic. This result implies that all sequences in the family of morphic sequences are also in its subfamily of non-uniformly morphic sequences. Indeed, many works can be found in the literature in the direction of characterizing all those non-uniformly morphic sequences which are not uniformly morphic, for example, one can find [2][13][7][1][8][9][6]. However, in [5], it is proved actually that all uniformly morphic sequences are also non-uniformly non-purely morphic. In other words, from the construction of the proof given in [5], a nontrivial coding function is required. In Section 3, we investigate all those uniformly morphic sequences which are not purely morphic. It is already known that the Rudin-Shapiro sequence is in this case (Example 26 in [3]). In section 3, we prove that all other sequences in the form of $(a_{m,w}(n))_{n\in\mathbf{N}}$ have the same property when $|w|\neq 1$ and $m$ is a prime. The result is announced as follows:

Theorem 2

Let $p$ be a prime number and $w\in[\![p]\!]^{*}$ . The sequence $(a_{p;w}(n))_{n\in\mathbf{N}}$ is a $p$ -uniformly morphic. Moreover, if $|w|=1$ and $w\neq 0$ , this sequence is purely morphic and if not is it non-purely morphic.

In Section 4, under the assumption that $p$ is a prime number, we prove that the formal power series $f_{p;w}=\sum_{i=0}^{\infty}a_{m;w}(n)t^{n}$ is algebraic and of degree $p$ over $\mathbb{F}_{p}(t)$ . Indeed, from Christol’s theorem [11], we know that the power series $f_{p;w}$ is algebraic over $\mathbb{F}_{p}(t)$ . In Section 4, we prove that $f$ is algebraic of degree $p$ .

2 Windows functions and $(a_{p;w}(n))_{n\in\mathbf{N}}$

For any positive integer $m$ and non-negative integer $n$ , let $[n]_{m}$ denote the expansion of $n$ in the base $m$ . For a given word $w\in[\![m]\!]^{*}=\{0,1,\cdots,m-1\}^{*}$ , $w=w[0]w[1]...w[|w|-1]$ , let $(w)_{m}=\sum_{i=0}^{|w|-1}w[i]m^{|w|-1-i}$ and let $w^{\prime}=w[1]w[2]\cdots w[|w|-1]$ . A word $w$ is called a $x$ -word if $w[0]=x$ . For a given string $w$ , let $\alpha_{w}=\frac{(w^{\prime})_{m}}{m^{|w|-1}}$ , $\beta_{w}=\frac{(w^{\prime})_{m}+1}{m^{|w|-1}}$ and let $\phi_{w}:[\![m]\!]^{*}\to[\![m]\!]^{*}$ be a function such that for any $v\in[\![m]\!]^{*}$ , $\phi_{w}(v)$ satisfies the following propriety:

\phi_{w}(v)[i]=\begin{cases}v[i]+1\!\mod\!m\;\;\text{if }\alpha_{w}|v|\leq i<\beta_{w}|v|\\ v[i]\;\;\;\text{otherwise}.\end{cases}

2.1 Block-counting sequences for non- $0$ -words

Proposition 3

Let $m$ a positive number, let $x\in[\![m]\!]\backslash\{0\}$ , let $w\in[\![m]\!]^{*}$ be a $x$ -word and let $t=(v)_{m}$ . If we let $(u_{i})_{i\in\mathbb{N}}$ be a sequence of words such that $|u_{0}|=m^{|w|}$ , that

u_{0}[i]=\begin{cases}1\;\;\text{if i=t}\\ 0\;\;\;\text{otherwise},\end{cases}

and that $u_{k+1}=u_{k}^{x}\phi_{w}(u_{k})u_{k}^{m-x-1}$ , then $\lim_{k\to\infty}u_{k}=(a_{m;w}(n))_{n\in\mathbb{N}}$ .

Proof

First, it is obvious that $u_{0}$ is a prefix of $(a_{m;w}(n))_{n\in\mathbf{N}}$ . Now let $y\in[\![m]\!]\backslash\{0\}$ . For any integers $r$ and $m^{k}$ such that $0\leq r<m^{k}$ , $0\leq e_{m;w}(r+ym^{k})-e_{m;w}(r)\leq 1$ . Indeed, since $y\neq 0$ , $[r+ym^{k}]_{m}=y0..0[r]_{m}$ , thus, $[r+ym^{k}]_{m}$ has exactly one more $x$ -factor of length $|v|$ than $[r]_{p}$ only if $y=x$ , and this factor can be $w$ or not. Moreover, $e_{m;w}(r+ym^{k})-e_{m;w}(r)=1$ only if $w$ is a prefix of $[r+ym^{k}]_{m}$ . Consequently, $e_{m;w}(r+ym^{k})=e_{m;w}(r)+1$ only if $\alpha_{w}m^{k}\leq r<\beta_{w}m^{k}$ and $y=x$ . Hence, for any $t\in[\![m]\!]\backslash\{x\}$ ,

	$\displaystyle(a_{m;w}(n))_{tm^{k}\leq n<(t+1)m^{k}}$	$\displaystyle=(a_{m;w}(n))_{0\leq n<m^{k}}$
	$\displaystyle(a_{m;w}(n))_{xm^{k}\leq n<(x+1)m^{k}}$	$\displaystyle=\phi_{w}((a_{m;w}(n))_{0\leq n<m^{k}}).$

This implies that

(a_{m;w}(n))_{0\leq n<m^{k+1}}=u_{k}^{x}\phi_{w}(u_{k})u_{k}^{m-x-1},

which concludes the proof.∎

Example 4

Let us compute the Rudin-Shapiro sequence using windows function. From Example 3.3.1 in [4], the Rudin-Shapiro sequence can be defined as $(a_{2;11}(n))_{n\in\mathbf{N}}$ . From Proposition 3, set $\alpha_{11}=\frac{1}{2}$ , $\beta_{11}=\frac{2}{2}$ and $s_{0}=0,0,0,1$ . For any words $w\in\left\{0,1\right\}^{*}$ such that $w=w_{1}w_{2}$ with $|w_{1}|=|w_{2}|$ , $\phi_{s}(w)=w_{1}(w_{2}^{+})$ . Thus, one can compute

s_{1}=0,0,0,1,0,0,1,0;\;\;\;\;\;\;\;\;\;\;\;s_{2}=0,0,0,1,0,0,1,0,0,0,0,1,1,1,0,1;

s_{3}=0,0,0,1,0,0,1,0,0,0,0,1,1,1,0,1,0,0,0,1,0,0,1,0,1,1,1,0,0,0,1,0;

$(e_{s}(n))_{n\in\mathbf{N}}$ is the limit of $s_{n}$ when $n$ tends to infinite.∎

2.2 Block-counting sequences for $0$ -words

Proposition 5

Let $m$ be a positive number, let $w\in[\![m]\!]^{*}$ a $0$ -word and let $t=(w)_{m}$ . Let $u_{0}$ be such that $|u_{0}|=m^{|w|}$ and

u_{0}[i]=\begin{cases}1\;\;\text{if i=t}\\ 0\;\;\;\text{otherwise},\end{cases}

and let $u_{k+1}=\phi_{w}(u_{k})u_{k}^{m-1}$ .

By letting $w_{-1}=u_{0}$ if $w=0^{|w|}$ and $w_{-1}=0^{m^{|w|}}$ if not, $w_{k}=u_{k}^{m-1}$ for $k\geq 0$ , then

(a_{m;w}(n))_{n\in\mathbb{N}}=w_{-1}w_{0}w_{1}w_{2}\cdots w_{n}\cdots.

Lemma 6

Let $m$ be a positive number, $y\in[\![m]\!]\backslash\{0\}$ , $w\in[\![m]\!]^{*}$ a $0$ -word and let $t=(w)_{m}$ , then for any integer $r$ satisfying $t<m^{k}\leq r<m^{k+1}$ :
1) $e_{m;w}(r+ym^{k+1})=e_{m;w}(r)$ ;
2) $0\leq e_{m;w}(r+m^{k})-e_{m;w}(r)\leq 1$ ;
3) $e_{m;w}(r+m^{k})-e_{m;w}(r)=1$ only if $[r]_{m}$ is a $m-1$ -word and $\alpha_{w}m^{k}\leq r<\beta_{w}m^{k}$ .

Proof

For any integer $r$ satisfying $t<m^{k}\leq r<m^{k+1}$ , we first remark that $[r+ym^{k+1}]_{m}=y[r]_{m}$ . Since $[r+ym^{k+1}]_{m}$ and $y[r]_{m}$ have the same set of $0$ -factors, $e_{m;w}(r+ym^{k+1})=e_{m;w}(r)$ . Second, if $[r]_{m}$ is not a $m-1$ -word than $[r]_{m}$ and $[r+m^{k}]_{m}$ has the same set of $0$ -factors. But if $[r]_{m}$ is a $m-1$ -word, then $[r+m^{k}]_{m}=10[r]_{m}^{\prime}$ and thus, can have at most one more $0$ factors of length $|w|$ than $[r]_{m}$ . Consequently, $0\leq e_{m;w}(r+m^{k})-e_{m;w}(r)\leq 1$ . Moreover, in the latter case, $e_{m;w}(r+m^{k})-e_{m;w}(r)=1$ only if $1w$ is a prefix of $[r+m^{k}]_{m}$ . So $e_{m;w}(r+m^{k})=e_{m;w}(r)+1$ only if $\alpha_{w}m^{k}<r\leq\beta_{w}m^{k}$ .∎

Proof (of Proposition 5)

We first remark that $w_{-1}w_{0}$ is a prefix of $(a_{m;w}(n))_{n\in\mathbf{N}}$ .

Further, for any integer $k$ satisfying $(w)_{m}<m^{k}$ and $x\in[\![m]\!]\backslash\{0\}$ , from Lemma 6,

	$\displaystyle(a_{m;w}(n))_{xm^{k}\leq n<(x+1)m^{k}}$	$\displaystyle=(a_{m;w}(n))_{m^{k}\leq n<2m^{k}}$
	$\displaystyle(a_{m;w}(n))_{m^{k+1}\leq n<m^{k+1}+m^{k}}$	$\displaystyle=(\phi(a_{m;w}(n))_{(p-1)m^{k}\leq n<m^{k+1}})).$

This implies that

(a_{m;w}(n))_{m^{k}\leq n<m^{k+1}}=\left(\phi_{w}((a_{m;w}(n))_{m^{k-1}\leq n<m^{k}})(a_{m;w}(n))_{m^{k-1}\leq n<m^{k}}^{m-1}\right)^{m-1},

which concludes the proof.∎

Example 7

Let us compute the sequence $(a_{2;01}(n))_{n\in\mathbf{N}}$ with. From the previous theorem, set $\alpha_{01}=\frac{1}{2}$ , $\beta_{01}=\frac{2}{2}$ , $s_{-1}=0,0,0,0$ and $s_{0}=0,1,0,0$ . For any words $w\in\left\{0,1\right\}^{*}$ such that $w=w_{1}w_{2}$ with $|w_{1}|=|w_{2}|=k$ for some integer $k$ , $\phi_{s}(w)=w_{1}w_{2}^{+}$ . Thus, one can compute

s_{1}=0,1,1,1,0,1,0,0;\;\;s_{2}=0,1,1,1,1,0,1,1,0,1,1,1,0,1,0,0;

s_{3}=0,1,1,1,1,0,1,1,1,0,0,0,1,0,1,1,0,1,1,1,1,0,1,1,0,1,1,1,0,1,0,0;

$(a_{2;01}(n))_{n\in\mathbf{N}}$ is the limit of $s_{-1}s_{0}s_{1}s_{2}s_{3}...s_{n}$ when $n$ tends to infinite.∎

3 $(a_{p;w}(n))_{n\in\mathbf{N}}$ are not purely morphic

From now on, we work with $p$ a prime number.

We first prove that $(a_{p;w}(n))_{n\in\mathbf{N}}$ is not purely morphic when $|w|>1$ . We will need a simple notation, for $w=w[0]\cdots w[|w|-1]$ , let $w^{\diamond}=w[0]\cdots w[|w|-2]$ .

Proposition 8

For any prime number $p$ and for any $w\in[\![p]\!]^{*}$ , the sub-sequences of the form $(a_{p;w}(pn+i))_{0\leq i\leq p-1}$ are either constant (called type 1) or of the form

a_{p;w}(pn+i)=\begin{cases}t^{+}\;\;\text{if $i=w[|w|-1]$},\\ t\;\;\;\text{otherwise};\end{cases}

for some integer $t\in[\![p]\!]$ (called type 2). Moreover, $(a_{p;w}(pn+i))_{0\leq i\leq p-1}$ is of type 2 if and only if $w^{\diamond}$ is a suffix of $[n]_{p}$ .∎

For the sequence $(a_{p;w}(n))_{n\in\mathbf{N}}$ , let us define a $p$ -block to be a sub-sequence of the form $(a_{p;w}(pn+i))_{0\leq i\leq p-1}$ for some integer $n$ . From the previous proposition, a $p$ -block is either of type 1 or type 2. For a $p$ -block $(a_{p;w}(pn+i))_{0\leq i\leq p-1}$ of type 2, let us define its index to be an integer $i\in[\![p]\!]$ such that $a_{p;w}(pn+i)\neq a_{p;w}(pn+j)$ for all $j\neq i$ .

Proposition 9

For any prime number $p$ and any $w\in[\![p]\!]^{*}$ , if there exists a word $v$ such that $v^{p+1}$ is a prefix of $(a_{p;w}(n))_{n\in\mathbf{N}}$ and that $|v|\geq 2p^{|w|}$ , then $|v|$ is a multiple of $p^{|w|-1}$ .

Proof

If $|v|\geq 2p^{|w|}$ , then, from Proposition 3 and 5, $v$ contains a $p$ -block of the form

a_{p;w}(pm+i)=\begin{cases}1\;\;\text{if $i=w[|w|-1]$},\\ 0\;\;\text{otherwise},\end{cases}

for some $m$ . Since $v^{p+1}$ is a prefix of $(a_{p;w}(n))_{n\in\mathbf{N}}$ , $(a_{p;w}(pm+p|v|+i))_{0\leq i\leq p-1}=(a_{p;w}(pm+i))_{0\leq i\leq p-1}$ , which is also a $p$ -factor of type 2. From Proposition 8, $w^{\diamond}$ is a suffix of both $[m]_{p}$ and $[m+|v|]_{p}$ . Thus, $m+|v|-m$ is a multiple of $p^{|w|-1}$ . ∎

Proposition 10

For any prime integer $p$ and any $w\in[\![p]\!]^{*}$ , the sequence $(a_{p;w}(n))_{n\in\mathbf{N}}$ cannot have a prefix $v^{p+1}$ such that $|v|=ip^{|w|-1}$ for some positive integer $i\geq p+1$ .

This proposition will be proved with the help of the following lemmas.

Lemma 11

Let $w\in[\![p]\!]^{*}$ , then for any words $a,b\in[\![p]\!]^{*}$ and for any positive integer $\ell$ , there exists a word $u$ such that $|u|=l$ , $|au|_{w}=|a|_{w}$ and $|bu|_{w}=|b|_{w}$ .

Proof

Let $x\in[\![p]\!]\backslash\{w[|w|-1]\}$ and $u=x^{\ell}$ . It is clear that $|au|_{w}=|a|_{w}$ and $|bu|_{w}=|b|_{w}$ because none of the added factor of size $|w|$ ends with $x$ .

Lemma 12

Let $w$ be a word in $[\![p]\!]^{*}$ such that $|w|>1$ . Let $a,b\in[\![p]\!]^{*}$ such that $a_{w}\neq b_{w}$ where $a_{w}$ and $b_{w}$ are the longest suffixes of respectively $a$ and $b$ that are prefixes of $w$ . Then there exists a word $u$ such that $|u|\leq|w|-1$ and that $|au|_{w}\not\equiv|bu|_{w}\mod p$ .

Proof

If $|a|_{w}\not\equiv|b|_{w}\mod p$ , then let $v=\epsilon$ .
If $|a|_{w}\equiv|b|_{w}\mod p$ , because have $a_{w}\neq b_{w}$ , then $|a|_{w}\neq|b_{w}|$ because $w$ doesn’t have multiple suffixes of the same length. Suppose that $a_{w}$ is the longest. It is clear that $|a_{w}|>0$ . We define $v$ to be a word satisfying $a_{w}v=w$ . In this case, $|v|\leq|w|-1$ , $|av|_{w}=|a|_{w}+1$ and $|bv|_{w}=|b|_{w}$ .

Now we are able to prove Proposition 10.

Proof (of Proposition 10)

We only need to prove that there exist $k,k^{\prime}\in[\![p]\!]$ such that

(a_{p;w}(n))_{kip^{|w|-1}\leq n<(k+1)ip^{|w|-1}}\neq(a_{p;w}(n))_{k^{\prime}ip^{|w|-1}\leq n<(k^{\prime}+1)ip^{|w|-1}},

i.e. there exists some $j$ such that $0\leq j<|v|$ and

a_{p;w}(kip^{|w|-1}+j)\neq a_{p;w}(k^{\prime}ip^{|w|-1}+j).

For $1\leq k\leq p$ , let $t_{k}=[kip^{|w|-1}]_{p}$ . One has $t_{k}=u_{k}x_{k}0^{j}$ for some word $u_{k}$ , some letter $x_{k}\in[\![p]\!]\backslash\{0\}$ and some non-negative integer $j\geq|w|-1$ . Note that $u_{1}\neq 0$ . Since $p$ is prime, one has $x_{k}\neq x_{k^{\prime}}$ if $k\neq k^{\prime}$ . Thus, there exists $k\in[\![p]\!]$ such that $x_{k}=w[0]$ .

Now, let $k^{\prime}\in[\![p]\!]\backslash\{k\}$ and let $v_{k}$ and $v_{k}^{\prime}$ be the longest suffixes of respectively $u_{k}x_{k}$ and $u_{k^{\prime}}x_{k^{\prime}}$ that are prefixes of $w$ . Because $x_{k}=w[0]$ , $v_{k}\neq\epsilon$ and thus $v_{k}\neq v_{k^{\prime}}$ . Therefore, by Lemma 12 and Lemma 11, there exists $u$ such that $|v_{k}u|_{w}\not\equiv|v_{k^{\prime}}u|_{w}\mod p$ and that $|u|=|w|-1$ . Let $j=[u]_{p}$ , clearly $j<ip^{|w|-1}$ and one has

a_{p;w}(kip^{|w|-1}+j)\neq a_{p;w}(k^{\prime}ip^{|w|-1}+j),

which proves the result.

Now we are able to prove the principle theorem in most cases:

Theorem 13

For any prime number $p$ and any $w\in[\![p]\!]^{*}$ , the sequence $(a_{p;w}(n))_{n\in\mathbf{N}}$ is $p$ -uniformly morphic for any $w$ and non-purely morphic when $|w|>1$ and $w\neq 10$ .

Proof

First, the fact that $(a_{p;w}(n))_{n\in\mathbf{N}}$ is $p$ -automatic for any word $w$ follows from the Proposition 3.1 in [10], Page 7 and Theorem 16.1.5 in [4].

Now, if $w\neq 10$ and $|w|>1$ and the sequence $(a_{p;w}(n))_{n\in\mathbf{N}}$ is purely morphic, then $0^{p+1}$ is a prefix of $(a_{p;w}(n))_{n\in\mathbf{N}}$ . Thus, $(a_{p;w}(n))_{n\in\mathbf{N}}$ will have infinitely many prefix of type $v^{p+1}$ . However, from Proposition 9 and 10, $(a_{p;w}(n))_{n\in\mathbf{N}}$ can only have finitely many prefix of the form $v^{p+1}$ . We conclude.∎

Here we prove the $p$ particular cases.

Proposition 14

For any prime number $p$ and for any $w\in[\![p]\!]\backslash\{0\}$ , the sequence $(a_{p,w}(n))_{n\in\mathbf{N}}$ is purely morphic.

Proof

It is easy to check that for any non-negative integer $m$ , $(a_{p;w}(pm+i))_{0\leq i\leq p-1}$ satisfies the following property:

a_{p;w}(pm+i)=\begin{cases}a_{p;w}(m)^{+}\;\;\text{if $i=w$},\\ a_{p;w}(m)\;\;\text{otherwise}.\end{cases}

Thus, it is easy to check that $(a_{p;w}(pm+i))_{0\leq i\leq p-1}$ is the fixed point of the morphism: $i\to v_{i}$ for all $i\in[\![p]\!]$ , where,

v_{i}[k]=\begin{cases}i^{+}\;\;\text{if $k=w$},\\ i\;\;\text{otherwise}.\end{cases}

∎

Proposition 15

The sequence $(a_{2,0}(n))_{n\in\mathbf{N}}$ is non-purely morphic.

Proof

The sequence $(a_{2,0}(n))_{n\in\mathbf{N}}$ begins with $1,0,1,0$ . Thus, if this sequence is purely morphic, then this sequence has infinitely many prefixes of the form $v^{2}$ . Here we prove that $(a_{2,0}(n))_{n\in\mathbf{N}}$ cannot have a prefix of the form $v^{2}$ with $|v|\geq 5$ .

If $(a_{2,0}(n))_{n\in\mathbf{N}}$ has a prefix of the form $v^{2}$ with $|v|\geq 5$ . Let us suppose that $|v|=4k+i$ for some non-negative integers $k,i$ such that $i=0,1,2,3$ and $k\geq 1$ . First note that $a_{2,0}(r)=a_{2,0}(4k+i+r)$ for any $r$ satisfying $0\leq r<4k+i$ .

Also note that, for $k\geq 0$ , $a_{2,0}(k)=a_{2,0}(4k)=a_{2,0}(4k+3)=a_{2,0}(2k+1)=x$ and $a_{2,0}(4k+1)=a_{2,0}(4k+2)=a_{2,0}(2k)=x^{+}$ for some $x\in\{0,1\}$ . This is because if $[k]_{2}=u$ , then $[2k]_{2}=u0$ , $[2k+1]_{2}=u1$ , $[4k]_{2}=u00$ , $[4k+1]_{2}=u01$ , $[4k+2]_{2}=u10$ and $[4k+3]_{2}=u11$ .

Finally, note that $a_{2,0}(2^{t}k-1)=a_{2,0}(k-1)$ , because, we know that $k\geq 1$ so if $[k-1]_{2}=u$ then $[2^{t}k-1]_{2}=u1^{t}$ . Similarly, $a_{2,0}(2^{t}k-2^{s}-1)=a_{2,0}(2^{t}k-1)^{+}$ if $1<s<t$ .

Now if $i=0$ , then $|v|=4k$ and $a_{2,0}(1)=a_{2,0}(4k+1)=0$ , $a_{2,0}(2)=a_{2,0}(4k+2)=1$ which contradicts to $a_{2,0}(4k+1)=a_{2,0}(4k+2)$ .

If $i=1$ , then $|v|=4k+1$ and $a_{2,0}(0)=a_{2,0}(4k+1)=1$ , $a_{2,0}(1)=a_{2,0}(4k+2)=0$ which contradicts to $a_{2,0}(4k+2)=a_{2,0}(4k+2)$ .

If $i=2$ , then $|v|=4k+2$ and $a_{2,0}(k-1)=a_{2,0}(4k-1)$ but $a_{2,0}(k-1)=a_{2,0}(8k-1)=a_{2,0}(4k-3)=a_{2,0}(4k-1)^{+}$ .

If $i=3$ , then $|v|=4k+3$ and $a_{2,0}(2(2k+1))=a_{2,0}(4k+2)=a_{2,0}(4k+3)^{+}=a_{2,0}(0)^{+}=0$ . Thus, we have $a_{2,0}(2k+1)=1$ . But on the other hand, $a_{2,0}(2k+1)=a_{2,0}(4(2k+1))=a_{2,0}(8k+4)=a_{2,0}(4k+1)=a_{2,0}(4k+2)=0$ , which is a contradiction.

In all cases, $(a_{2,0}(n))_{0\leq n<|v|}\neq(a_{2,0}(n))_{|v|\leq n<2|v|}$ . ∎

Proposition 16

For any prime number $p\geq 3$ , the sequence $(a_{p,0}(n))_{n\in\mathbf{N}}$ is non-purely morphic.

Proof

The sequence $(a_{p,0}(n))_{n\in\mathbf{N}}$ begins with $(10^{p-1})^{p}$ . Thus, if this sequence is purely morphic, then this sequence has infinitely many prefixes of the form $v^{2}$ . Here we prove that $(a_{p,0}(n))_{n\in\mathbf{N}}$ cannot have a prefix of the form $v^{2}$ with $|v|\geq p^{2}$ .

First, let us prove that if $v^{2}$ is a prefix of $(a_{p,0}(n))_{n\in\mathbf{N}}$ , then $|v|$ is a multiple of $p$ . It is easy to check that $v^{2}$ is not a prefix of $(a_{p,0}(n))_{n\in\mathbf{N}}$ when $|v|=1,2$ . Let us suppose that $|v|\geq 3$ . In this case, $v$ begins with $1,0,0$ . Thus, $a_{p,0}(|v|)=1,a_{p,0}(|v|+1)=a_{p,0}(|v|+2)=0$ .

Let us suppose that $|v|=kp+t$ for some nonnegative integers $k,t$ such that $0\leq t\leq k-1$ . We first prove that $t\neq k-1$ . If it is the case, then $|v|+1$ is a multiple of $p$ and $a_{p,0}(|v|+1)\neq a_{p,0}(|v|+2)=0$ since $|v|+2$ has one $0$ less than $|v|+1$ in their p-expansions. this contradicts the fact that $a_{p,0}(|v|+1)=a_{p,0}(|v|+2)=0$ .

Now let us suppose that $t\neq k-1$ . In this case, $(a_{p,0}(n))_{kp\leq n\leq(k+1)p-1}$ contains the factor $a_{p,0}(|v|)a_{p,0}(|v|+1)=1,0$ . Thus, $(a_{p,0}(n))_{kp\leq n\leq(k+1)p-1}$ is a factor of type 2 announced in proposition 8. But the word $(a_{p,0}(n))_{0\leq n\leq p-1}=10^{p-1}$ is also a word of type 2 and the word $(a_{p,0}(n))_{n\in\mathbf{N}}$ cannot have two different factors of type 2 such that the special letters are at different positions. Thus, $a_{p,0}(|v|)a_{p,0}(|v|+1)$ should be a prefix of $(a_{p,0}(n))_{kp\leq n\leq(k+1)p-1}$ and consequently $|v|$ is a multiple of $p$ .

Second, $|v|$ is not a multiple of $p^{2}$ . Because, if it is in this case, $a_{p,0}(|v|)=a_{p,0}(0)=1$ and $a_{p,0}(|v|+p)=a_{p,0}(|v|)-1=0$ . But $a_{p,0}(p)=1$ , thus, $a_{p,0}(|v|+p)\neq a_{p,0}(p)$ . Consequently, $(a_{p,0}(n))_{0\leq n<|v|-1}\neq(a_{p,0}(n))_{|v|\leq n<2|v|-1}$ .

Third, if $|v|$ is not a multiple of $p^{2}$ but larger than $p^{2}+1$ , let us suppose that $|v|=kp^{2}+tp$ for some positive integers $k,t$ such that $1\leq t\leq p-1$ . Let $x=(p-t)p$ , we then have $a_{p,0}(|v|+x)=a_{p,0}(x)=1$ . But in this case, $a_{p,0}(x+p)=1$ or $2$ , but $a_{p,0}(|v|+x+p)=0$ . Thus, $(a_{p,0}(n))_{0\leq n<|v|-1}\neq(a_{p,0}(n))_{|v|\leq n<2|v|-1}$ . ∎

Proposition 17

The sequence $(a_{2;10}(n))_{n\in\mathbf{N}}$ is non-purely morphic.

Proof

The sequence $(a_{2;10}(n))_{n\in\mathbf{N}}$ begins with $0010$ . Thus, if this sequence is purely morphic, then this sequence has infinitely many prefixes of the form $v^{2}$ . We will prove that its only prefix of square shape is $00$ .

Let $v^{2}$ be a prefix of $(a_{p;10}(n))_{n\in\mathbf{N}}$ . Because of that, one can note that $(a_{p;10}(n))_{0\leq n<|v|}=a_{p;10}(|v|+n))_{0\leq n<|v|}$ , in particular, $(a_{p;10}(|v|+n))_{0\leq n\leq 4}=0010$ . Using this, we will prove this proposition by proving all the different possibility for the word $[|v|]_{2}$ .

For now on, $u$ can be any word in $[\![p]\!]^{*}$ and $s$ and $t$ positive integer. Note that the computation is made in binary basis.

i) If $[|v|]_{2}=1^{t}$ with $t>1$ , we have $a_{p;10}(|v|+1)=1\neq 0$ because $1^{t}+1=10^{t}$ .

ii) If $[|v|]_{2}=1^{t}01$ , one can simply note that $a_{p;10}(|v|)=1\neq 0$ .

iii) If $[|v|]_{2}=u101^{t}01$ then $a_{p;10}(|v|+3)=a_{p;10}(|v|)^{+}$ because $u101^{t}01+11=u110^{s+2}$ .

iv) If $[|v|]_{2}=u101^{t}$ with $t>1$ then $a_{p;10}(|v|+2)=a_{p;10}(|v|)$ , because $u101^{t}+11=u110^{t-1}1$ .

v) If $[|v|]_{2}=u10^{s}1^{t}$ with $s>1$ we have $a_{p;10}(|v|+1)=a_{p;10}(|v|)^{+}$ , because $u10^{s}1^{t}+1=u10^{s-1}10^{t}$ .

vi) Finally, if $[|v|]_{2}=u10^{t}$ with $t>1$ , we have on one hand $a_{p;10}(|v|+(1^{t-1}0)_{2})=a_{p;10}(|v|)=0$ because $u10^{t}+1^{t-1}0=u1^{t}0$ . We also have $a_{p;10}(|v|+(1^{t-1}0)_{2})=a_{p;10}((1^{t-1}0)_{2})=1$ , which is a contradiction.

An attentive reader will remark that this cover all the number strictly bigger than 1. ∎

Proposition 18

For any prime number $p\geq 3$ the sequence $(a_{p;10}(n))_{n\in\mathbf{N}}$ is non-purely morphic.

Proof

The sequence $(a_{p;10}(n))_{n\in\mathbf{N}}$ begins with $0^{p}1$ . Thus, if this sequence is purely morphic, then this sequence has infinitely many prefixes of the form $v^{p}$ . It suffices to prove that if $|v|>p^{2}$ then $v^{p}$ is not a prefix of $(a_{p;10}(n))_{n\in\mathbf{N}}$ .

Let $v^{p}$ be a prefix of $(a_{p;10}(n))_{n\in\mathbf{N}}$ .

Suppose that $p\nmid|v|>p^{2}$ . This means that $v=uyx$ for a word $u$ and some letters $y,x$ with $x\neq 0$ . Because $v^{2}$ is a prefix of $(a_{p;10}(n))_{n\in\mathbf{N}}$ , $v$ begins with the letters $0^{p}1$ and $(a_{p;10}(n))_{0\leq n\leq p}=a_{p;10}((v)_{p}+n))_{0\leq n\leq p}$ . Thus $a_{p;10}((v)_{p})=0$ .

Let $c\in[\![p]\!]$ such that $x+c=p$ ; it exists because $x\neq 0$ and $p>2$ . Thus, $[(v)_{p}+c]_{p}=u^{\prime}y^{\prime}0$ . Because $a_{p;w}(c)=0$ , $a_{p;w}((v)_{p}+c)=0$ also and $a_{p;w}((v)_{p}+p)=0$ or $p-1$ which is not equal to $a_{p;w}(p)=1$ . Therefore, $v^{2}$ is not a prefix of $(a_{p;10}(n))_{n\in\mathbf{N}}$ .

Suppose now that $p\mid|v|\geq p$ . Let $|v|=sp^{t}$ for some positive integer $s,t$ such that $t\geq 1$ and $p\nmid s$ and let $[v]_{p}=ux0^{t}$ for some word $u$ and some letter $x\in[\![p]\!]\backslash\{0\}$ .

Since $p$ is prime, there exists $k\in[\![p]\!]$ such that $[kv]_{p}=u^{\prime}10^{t}$ for some word $u^{\prime}$ . Let $m=p^{t+1}-1$ , thus $[m]_{p}=(p-1)^{t}$ and $[kv+m]_{p}=u^{\prime}1(p-1)^{t}$ .

Since $(a_{p;10}(n))_{k_{1}|v|\leq n\leq(k_{1}+1)|v|-1}=(a_{p;10}(n))_{k_{2}|v|\leq n\leq(k_{2}+1)|v|-1}$ , for any $k_{1}$ , $k_{2}\in[\![p]\!]$ we have $a_{p;10}(0)=0=a_{p;10}(kv)$ thus $a_{p;10}(u^{\prime})=p-1$ which means that $a_{p;10}(m)=0\neq a_{p;10}(kv+m)=p-1$ .

Hence, $v^{p}$ cannot be a prefix of $(a_{p;10}(n))_{n\in\mathbf{N}}$ if $v>p^{2}$ which concludes the proof. ∎

Proof (of Theorem 2)

It is a direct result of Theorem 13, Proposition 14, Proposition 15, Proposition 16, Proposition 17 and Proposition 18.∎

4 Algebraicity

By Christol’s theorem [11], we know that the power series $f=\sum_{i=0}^{\infty}a_{p;w}(n)t^{n}$ is algebraic over $\mathbb{F}_{p}(t)$ . Now we prove that $f$ is algebraic of degree $p$ . Indeed, if we let $[w]_{p}$ denote $w_{1}p^{k-1}+\cdots+w_{k}$ , and write $a_{n}=a_{p;w}(n)$ for short, then

	$\displaystyle\quad(1+t+\cdots+t^{p-1})f^{p}-f$
	$\displaystyle=\sum_{n\geq 0}\sum_{j=0}^{p-1}(a_{n}-a_{pn+j})t^{pn+j}$
	$\displaystyle=\sum_{n\geq 0}(a_{n}-a_{pn+w_{k}})t^{pn+w_{k}}$
	$\displaystyle=\sum_{n\geq 0}\sum_{j=0}^{p-1}(a_{np+j}-a_{np^{2}+jp+w_{k}})t^{np^{2}+jp+w_{k}}$
	$\displaystyle=\sum_{n\geq 0}(a_{np+w_{k-1}}-a_{np^{2}+w_{k-1}p+w_{k}})t^{np^{2}+w_{k-1}p+w_{k}}$
	$\displaystyle\ldots$
	$\displaystyle=\sum_{n\geq 0}(a_{np^{k-1}+w_{1}p^{k-2}\cdots+w_{k-1}}-a_{np^{k}+w_{1}p^{k-1}+\cdots+w_{k}})t^{np^{k}+[w]_{p}}$
	$\displaystyle=\begin{cases}\sum_{n\geq 0}-t^{np^{k}+[w]_{p}}=t^{[w]_{p}}/(t^{p^{k}}-1),\;&\text{if }w_{1}\neq 0\\ \sum_{n\geq 1}-t^{np^{k}+[w]_{p}}=t^{p^{k}+[w]_{p}}/(t^{p^{k}}-1),\;&\text{if }w_{1}=0.\end{cases}$

The irreduciblity of the the above functional equations is straightforward from the Eisenstein’s criterion. We thus have the following propriety:

Proposition 19

For any prime number $p$ and any finite word $w$ in $[\![p]\!]^{*}$ , the power series $\sum_{i=0}^{\infty}a_{p;w}(n)t^{n}$ is algebraic of degree $p$ over $\mathbb{F}_{p}(t)$ .

5 Final remarks

The authors remark that the fast algorithms introduced in Section 2 for $0$ -words and non- $0$ -words are much different. However, the generating functions given in Section 4 for $0$ -words and non- $0$ -words are quite similar. Thus, we believe that the algorithms in Section 2 can be unified for both $0$ -words and non- $0$ -words.

References

[1] Allouche, G., Allouche, J.P., Shallit, J.: Kolam indiens, dessins sur le sable aux îles Vanuatu, courbe de Sierpinski et morphismes de monoïde. Annales de l’Institut Fourier 56(7), 2115–2130 (2006), https://aif.centre-mersenne.org/articles/10.5802/aif.2235/
[2] Allouche, J.P., Bétréma, J., Shallit, J.O.: Sur des points fixes de morphismes d’un monoïde libre. RAIRO - Theoretical Informatics and Applications - Informatique Théorique et Applications 23(3), 235–249 (1989), http://www.numdam.org/item/ITA_1989__23_3_235_0/
[3] Allouche, J.P., Cassaigne, J., Shallit, J., Zamboni, L.Q.: A taxonomy of morphic sequences (2017), https://arxiv.org/abs/1711.10807
[4] Allouche, J.P., Shallit, J.: Automatic Sequences: Theory, Applications, Generalizations. Cambridge University Press (2003)
[5] Allouche, J.P., Shallit, J.: Automatic Sequences Are Also Non-uniformly Morphic, pp. 1–6. Springer International Publishing, Cham (2020), https://doi.org/10.1007/978-3-030-55857-4_1
[6] Allouche, J.P., Shallit, J., Wen, Z.X., Wu, W., Zhang, J.M.: Sum-free sets generated by the period- $k$ -folding sequences and some sturmian sequences. Discrete Mathematics 343(9), 111958 (2020), https://www.sciencedirect.com/science/article/pii/S0012365X20301448
[7] Bartholdi, L.: Endomorphic presentations of branch groups. Journal of Algebra 268(2), 419–443 (2003), https://www.sciencedirect.com/science/article/pii/S0021869303002680
[8] Bartholdi, L., Siegenthaler, O.: The twisted twin of the Grigorchuk group. International Journal of Algebra and Computation 20(04), 465–488 (2010), https://doi.org/10.1142/S0218196710005728
[9] Benli, M.G.: Profinite completion of Grigorchuk’s group is not finitely presented. International Journal of Algebra and Computation 22(05), 1250045 (2012), https://doi.org/10.1142/S0218196712500452
[10] Cateland, E.: Suites digitales et suites k-régulières. Theses, Université Sciences et Technologies - Bordeaux I (Jun 1992), https://tel.archives-ouvertes.fr/tel-00845511
[11] Christol, G., Kamae, T., Mendès France, M., Rauzy, G.: Suites algébriques, automates et substitutions. Bull. Soc. Math. France 108(4), 401–419 (1980), http://www.numdam.org/item?id=BSMF_1980__108__401_0
[12] OEIS Foundation Inc.: The On-Line Encyclopedia of Integer Sequences (2022), published electronically at http://oeis.org
[13] Shallit, J.: Automaticity iv: sequences, sets, and diversity. Journal de Théorie des Nombres de Bordeaux 8(2), 347–367 (1996), http://www.jstor.org/stable/43974217

Block-counting sequences are not purely morphic

Abstract

1 Introduction, definitions and notation

Example 1

Theorem 2

2 Windows functions and (ap;w​(n))n∈𝐍(a_{p;w}(n))_{n\in\mathbf{N}}

2.1 Block-counting sequences for non-0-words

Proposition 3

Proof

Example 4

2.2 Block-counting sequences for 0-words

Proposition 5

Lemma 6

Proof

Proof (of Proposition 5)

Example 7

3 (ap;w​(n))n∈𝐍(a_{p;w}(n))_{n\in\mathbf{N}} are not purely morphic

Proposition 8

Proposition 9

Proof

Proposition 10

Lemma 11

Proof

Lemma 12

Proof

Proof (of Proposition 10)

Theorem 13

Proof

Proposition 14

Proof

Proposition 15

Proof

Proposition 16

Proof

Proposition 17

Proof

Proposition 18

Proof

Proof (of Theorem 2)

4 Algebraicity

Proposition 19

5 Final remarks

References

2 Windows functions and $(a_{p;w}(n))_{n\in\mathbf{N}}$

2.1 Block-counting sequences for non- $0$ -words

2.2 Block-counting sequences for $0$ -words

3 $(a_{p;w}(n))_{n\in\mathbf{N}}$ are not purely morphic