On Periodic Points in Covering Systems

Abstract

We study a system of intervals $I_{1},\ldots,I_{k}$ on the real line and a continuous map $f$ with $f(I_{1}\cup I_{2}\cup\ldots\cup I_{k})\supseteq I_{1}\cup I_{2}\cup\ldots\cup I_{k}$ . It’s conjectured that there exists a periodic point of period $\leq k$ in $I_{1}\cup\ldots\cup I_{k}$ . In this paper, we prove the conjecture by a discretization method and reduce the initial problem to an interesting combinatorial lemma concerning cyclic permutations. We also obtain a non-concentration property of periodic points of small periods in intervals.

1 Introduction

We consider a continuous mapping $f$ : $\mathbb{R}\to\mathbb{R}$ , and $k$ closed intervals $I_{1},I_{2},...,I_{k}$ . Throughout this paper, $f^{l}$ denotes the $l$ -th iterate of $f$ . And all the mappings are assumed to be continuous unless otherwise noted.

Definition 1.1.

We call $(I_{1},I_{2},\ldots,I_{k};f)$ a $covering\ system$ if $f(I_{1}\cup I_{2}\cup\ldots\cup I_{k})\supseteq I_{1}\cup I_{2}\cup\ldots\cup I_{k}$ .

This concept was first introduced by S. A. Bogatyi and E. T. Shavgulidze in their paper [1], when they tried to obtain an analogue of Sharkovskii’s theorem for an arbitrary tree. They proved the existence of a periodic point of period $\leq g(n)$ in a covering system and established an upper bound for $g(n)$ $(g(n)\leq 2(n^{2}-1)g(n-1)^{n-2})$ . Moreover, they showed $g(n)=n$ for $n\leq 4$ and conjectured in the remark that $g(n)=n$ holds for all $n\in\mathbb{N}$ , which is proved in this paper as the main theorem.

Theorem 1.2.

For any covering system $(I_{1},I_{2},\ldots,I_{k};f)$ , there exists $x_{0}\in I_{1}\cup I_{2}\cup\ldots\cup I_{k}$ and $l\leq k$ such that $f^{l}(x_{0})=x_{0}$ .

We will prove this theorem by a discretization method, which reduces the initial problem to a lemma in combinatorics. The main idea is to cut the intervals into small subintervals and perturb the mapping $f$ to get a cyclic permutation. And the proof of the theorem is completed by observing a property of cyclic elements of $S_{n}$ .

The proof of the main theorem 1.2 will be divided into several steps and discussed in the following sections. We now state a well-known proposition which will be frequently used in the proof of theorem 1.2.

Proposition 1.3 (the case $k=1$ ).

Let $f:I\to\mathbb{R}$ satisfy $f(I)\supseteq I$ . Then there exists a fixed point of $f$ in $I$ .

This fact is an easy consequence of the intermediate value theorem in calculus so we omit the proof. And we will see later that for $k$ larger, most cases can be reduced to this fundamental one.

We would like to end this introduction with the statement of the main lemma used in the proof of theorem 1.2, which studies the properties of cyclic permutations. Actually, this lemma is an extension of a lemma proved in Sharkovskii’s celebrated paper [2]. See the remark below.

Definition 1.4.

Let $f$ be a cyclic permutation in the symmetric group $S_{n}$ , $i.e.$ , $f$ can be written as $(i_{1}i_{2}\ldots i_{n})$ , meaning that $f:i_{1}\to i_{2}\to\ldots\to i_{n}\to i_{1}$ . And let $A_{i}=\{i,i+1\}$ $(i=1,2,\ldots,n-1)$ be $(n-1)$ particular sets. We define the characteristic number $m_{i}$ of each $A_{i}$ as

m_{i}=\min\{m\arrowvert\ (convf)^{m}(A_{i})\supseteq A_{i}\}.

Here, $convf(A)=$ convex hull of $f(A)=\{\min f(A),\min f(A)+1,\ldots,\max f(A)\}$ for any finite set $A\subseteq\mathbb{N}$ .

And we define the characteristic sequence of $f$ to be

m_{1}^{\prime}\leq m_{2}^{\prime}\leq\ldots\leq m_{n-1}^{\prime},

where $\{m_{i}^{\prime}\}$ is a rearrangement of $\{m_{i}\}$ .

Example 1.5.

If $f=(136245)\in S_{6}$ , then we can calculate $m_{1}=m_{3}=m_{5}=3$ , $m_{2}=2$ , $m_{4}=1$ . The characteristic sequence is $1\leq 2\leq 3\leq 3\leq 3$ .

Take $m_{2}$ as an example. Since $f(A_{2})=f(\{2,3\})=\{4,6\}$ , $convf(A_{2})=conv\{4,6\}=\{4,5,6\}$ ; and $f(\{4,5,6\})=\{1,2,5\}$ , $conv\{1,2,5\}=\{1,2,3,4,5\}$ ; therefore, $(convf)^{2}(A_{2})=\{1,2,3,4,5\}\supseteq A_{2}$ and we have $m_{2}=2$ .

Now we can state our main lemma.

Lemma 1.6.

For any cyclic permutation $f$ in the symmetric group $S_{n}$ , the characteristic sequence $\{m_{i}^{\prime}\}$ of $f$ satisfies $m_{i}^{\prime}\leq i$ , $i=1,2,\ldots,n-1$ .

Remark 1.7.

(1) For another form of this lemma more closely related to the initial problem, see lemma 2.3 in the next section.

(2) In theorem 7 of Sharkovskii’s paper [2], it is actually proved that $m_{i}^{\prime}\leq n-1$ , for $i=1,2,\ldots,n-1$ . Therefore, this lemma is a generalization of that result.

Meanwhile, this lemma leads in some sense to a non-concentration property of periodic points with lower periods. Namely, it follows that if $x_{k_{1}}<x_{k_{2}}<\ldots<x_{k_{n}}$ is an orbit of periodic points of period $n$ under $f$ , then between any two adjacent points $x_{k_{i}}$ and $x_{k_{i+1}}$ , there exists a periodic point $y_{i}$ of period $m_{i}\leq n-1$ . Futhermore, these $(n-1)$ numbers $m_{i}$ can be rearranged to have $m_{i}^{\prime}\leq i$ . In other words, all the periodic points of period $\leq i$ cannot lie only in $(i-1)$ of the intervals $[x_{k_{j}},x_{k_{j+1}}]$ , $1\leq j\leq n-1$ .

(3) One can see that the statement of lemma 1.6 doesn’t involve any notations in analysis. And hence it is of independent interest in the study of the symmetric group $S_{n}$ in combinatorics.

(4) It is necessary to require the permutation be cyclic. As a counterexmaple, consider $f=(13)(2)\in S_{3}$ , then the characteristic numbers are $m_{1}=m_{2}=2$ .

We organize the paper as follows. In section 2, we introduce the discretization method and reduce theorem 1.2 in dynamics to a lemma in combinatorics. Then in section 3, we prove the equivalence of the lemma in section 2 and the one stated above. Finally in section 4, we prove lemma 1.6 .

2 Discretization

In this section we show that Theorem 1.2 can be proved by solving another discrete problem related to it. The main idea is to divide the initial intervals into smaller subintervals, such that the image of each subinterval under $f$ contains the whole of some other subintervals, not just part of them. However, it may happen that no matter how you divide the intervals, there will be an interval whose image under $f$ contains only part of another interval. What saves us here is that, we can consider perturbation of $f$ if the intervals are cut into enough small pieces. Indeed, we will use the following lemma:

Lemma 2.1.

If there is a covering system $(I_{1},I_{2},\ldots,I_{k};f)$ such that $\forall x\in I_{1}\cup\ldots\cup I_{k}$ , and $\forall\>l\leq k,f^{l}(x)\neq x,$ then $\exists\>\delta>0$ depending on $f$ , $s.t.$ , whenever $\lVert g-f\rVert_{C(\mathbb{R})}<\delta$ , it also holds that $g^{l}(x)\neq x$ , for all $l\leq k$ , $x\in I_{1}\cup\ldots\cup I_{k}$ .

Proof.

Recall that $f$ has a periodic point of period $\leq k$ in $I_{1}\cup\ldots\cup I_{k}$ is equivalent to that $F(x)=(f(x)-x)\ldots(f^{k}(x)-x)$ has a zero in $I_{1}\cup\ldots\cup I_{k}$ . Therefore, we only need to control $\lvert f^{l}(x)-g^{l}(x)\rvert$ for $1\leq l\leq k$ , $x\in I_{1}\cup\ldots\cup I_{k}$ . And this will be done inductively. Indeed, we can write:

f^{l+1}(x)-g^{l+1}(x)=f(f^{l}(x))-f(g^{l}(x))+f(g^{l}(x))-g(g^{l}(x))

And if we assume we have proved $\lvert f^{l}(x)-g^{l}(x)\rvert$ is sufficiently small (at least less than 1) for all $x\in I_{1}\cup\ldots\cup I_{k}$ , then choose a closed interval $[p,q]$ containing $f^{l}(I_{1}\cup\ldots\cup I_{k})$ . We will have $g^{l}(I_{1}\cup\ldots\cup I_{k})\subseteq[\,p-1,\,q+1\,]$ . Thus, we will obtain a control of $\lvert f^{l+1}(x)-g^{l+1}(x)\rvert$ when $x\in I_{1}\cup\ldots\cup I_{k}$ , since $f$ is uniformly continuous on $[\,p-1,\,q+1\,]$ and $\lVert g-f\rVert_{C(\mathbb{R})}<\delta$ . Finally, we repeat the procedure $k$ times and get the conclusion. ∎

Now we can introduce the discrete problem related to Theorem 1.2. To begin with, we make a technical assumption on the initial covering system in Lemma 2.1. (See remark 2.4 for the reason). Namely, if $I_{j}=[a_{j},b_{j}]$ and $f(I_{j})=[c_{j},d_{j}]$ in the given covering system $(I_{1},I_{2},\ldots,I_{k};f)$ , then we can assume $f(\{a_{j},b_{j}\})=\{c_{j},d_{j}\}$ without loss of generality, since we can replace $I_{j}$ by a smaller subinterval if necessary.

Let $a_{1}<b_{1}<a_{2}<b_{2}<\ldots<a_{k}<b_{k}$ be all the end points of the intervals $I_{1},\ldots,I_{k}$ and denote $M_{0}=\{a_{1},b_{1},\ldots,a_{k},b_{k}\}$ . Consider their images $f(M_{0})=\{f(a_{1}),\ldots,f(b_{k})\}$ and we eliminate the points which don’t belong to $I_{1}\cup\ldots\cup I_{k}$ to obtain a set $S_{1}\subseteq f(M_{0})$ . Put $M_{1}=M_{0}\cup S_{1}$ . Inductively, in each step we consider the image of $M_{i}$ under the mapping $f$ and we eliminate those points dropping out of $I_{1}\cup...\cup I_{k}$ to get a set $S_{i+1}\subseteq f(M_{i})$ . Then let $M_{i+1}=M_{i}\cup S_{i+1}$ .

Finally we obtain a sequence of sets $M_{0}\subseteq M_{1}\subseteq\ldots\subseteq M_{i}\subseteq\ldots$ , where each $M_{i}$ contains the images of the initial end point set $M_{0}$ under { $id,f,f^{2},\ldots,f^{i}$ } except for those dropping out of $I_{1}\cup\ldots\cup I_{k}$ .

Then we can find a sufficiently large integer $N$ such that $\forall x\in M_{N}-M_{N-1}$ , $dist(x,M_{N-1})<\delta$ , where $\delta$ comes from Lemma 2.1. Such $N$ exists because if not, there will be a sequence of disjoint points in $[a_{1},b_{k}]$ with pairwise distance larger than $\delta$ , which is absurd. Therefore, we can take small perturbation of $f$ to get a map $\tilde{f}$ with $\lVert\tilde{f}-f\rVert<\delta$ , and $\widetilde{M_{N-1}}=\widetilde{M_{N}}$ for $\tilde{f}$ . Indeed, we just move each point $(x_{0},f(x_{0}))\in M_{N-1}\times M_{N}$ on the graph of $f$ to the nearest point $(x_{0},y_{0})\in\{x_{0}\}\times M_{N-1}$ preserving the continuity and the covering property of $f$ . For example, in a neighborhood of each $x_{0}\in M_{N}$ , we can define $\tilde{f}(x)=f(x)\pm\sigma\phi(x-x_{0})$ , where $0\leq\phi\leq 1$ is a cut-off function, $\phi(0)=1$ , $\sigma<\delta$ is chosen so that $\tilde{f}(x_{0})\in M_{N-1}$ . And for different $x_{0}$ we choose different signs $\pm$ so that $(I_{1},I_{2},\ldots,I_{k};\tilde{f})$ is still a covering system. (See Figure 1 and 2.)

Thus, if we divide the intervals into small pieces using the points in $\widetilde{M_{N}}$ , then $\tilde{f}$ will have the nice property that the image of each subinterval contains entirely another one or more subintervals. In other words, $\tilde{f}$ can be regarded as a discrete mapping from $\{1,2,\ldots,n\}$ to {subsets of $\{1,2,\ldots,n\}$ }. Here each number $j$ represents a small piece of interval $J_{j}$ . And $j^{\prime}\in f(j)$ if $f(J_{j})\supseteq J_{j^{\prime}}$ . Note that the image of some interval may be empty, since we ignore the part outside $I_{1}\cup\ldots\cup I_{k}$ . And we only care about the interval determined by $\{f(a),f(b)\}$ if $a,b\in\widetilde{M_{N}}$ , although the image $f([a,b])$ may be larger.

Example 2.2.

The two figures illustrate how we perturb $f$ locally to obtain $\tilde{f}$ with $\tilde{f}$ still continuous and inducing a covering system close to the initial one.

Refer to caption — Figure 1: the initial $f$

It should also be remarked here that the only useful information of $f$ on the so called “gaps” (see [1]) between two adjacent intervals $I_{i}$ and $I_{i+1}$ is the behavior of the end points of each gap. In other words, for the image $f(J_{j})$ of each small subinterval $J_{j}$ , we only focus on the part $f(J_{j})\cap(I_{1}\cup\ldots\cup I_{k})$ . This is the reason why we eliminate the points outside $I_{1}\cup\ldots\cup I_{k}$ in each step. We will see later that this information is enough for the proof.

To sum up, once we have a mapping $f$ satisfying the conditions in Lemma 2.1, we can find an $\tilde{f}$ close enough to $f$ , which also satisfies the conditions but can be viewed as a discrete mapping. Such $f$ and $\tilde{f}$ are counterexamples to Theorem 1.2. Therefore, if we argue by contradiction, we can easily see that Theorem 1.2 is a corollary of the following lemma, whose proof will be discussed later.

Lemma 2.3.

For any mapping $f$ : $\{1,2,\ldots,n\}$ $\to$ {subsets of $\{1,2,\ldots,n\}\}$ such that $\bigcup_{i=1}^{n}f(i)=\{1,2,\ldots,n\}$ , and for any partition

I_{1}=\{1,2,\ldots,i_{1}\},I_{2}=\{i_{1}+1,i_{1}+2,\ldots,i_{2}\},\ldots,I_{k}=\{i_{k-1}+1,\ldots,n\},

there exists $j\in\{1,2,\ldots,k\}$ and $r,s\in I_{j}$ ( $r=s$ is allowed) such that

(convf)^{l}(\{r,s\})\supseteq\{r,s\},\quad for\ some\quad l\leq k.

Here, $convf(A)=$ convex hull of $f(A)=\{\min f(A),\min f(A)+1,\ldots,\max f(A)\}$ for any finite set $A\subseteq\mathbb{N}$ .

Remark 2.4.

We emphasize here again the difference between $f([a,b])$ and $[f(a),f(b)]$ . In general the former is larger. And one may worry that the discrete mapping $f$ induced from a covering system does not satisfy the corresponding covering property $\bigcup_{i=1}^{n}f(i)=\{1,2,\ldots,n\}$ , since in the definition of $f$ , $j^{\prime}\in f(j)$ iff $f(J_{j})\supseteq J_{j^{\prime}}$ , where $f(J_{j})$ is understood in the sense of $[f(a),f(b)]$ rather than $f([a,b])$ . However, this problem can be easily handled by considering the initial covering system to be minimal. (See [1] for more details.) That is, in particular, the end points of each $I_{j}$ are mapped to the end points of $f(I_{j})$ . And it’s now easily checked that the perturbation and discretization preserve the covering property of $f$ , since the perturbation does not change the position of the end points of $f(I_{j})$ essentially.

Proof of Theorem 1.2 by lemma 2.3.

Suppose the covering system $(I_{1},\ldots,I_{k};f)$ is a counterexample to Theorem 1.2. Then as mentioned above, after perturbation, we can assume $f$ satisfies the conditions of lemma 2.3. Therefore the conclusion of lemma 2.3 gives us a subinterval of $I_{j}$ represented by $conv\{r,s\}$ whose image under $f^{l}$ contains itself. Thus there must exist a periodic point of period $\leq l\leq k$ in $I_{j}$ , a contradiction. ∎

3 Equivalence of the Two Lemmas

Although lemma 2.3 reduces Theorem 1.2 to a discrete problem, it is not satisfactory since it involves something inconvenient to deal with, like convex hull and partition. In the following we will simplify the conditions in lemma 2.3 and find another description of it which only depends on the own property of a permutation. And the main goal of this section is to prove the equivalence of lemma 1.6 and lemma 2.3.

Let $f$ be in lemma 2.3. We can assume further:

(1) $\quad\forall i\neq j$ , $f(i)\cap f(j)=\emptyset$ .

(2) $\quad\forall i$ , $f(i)\neq\emptyset$ .

(3) $\quad\forall i$ , $f(i)$ contains exactly one element.

(4) $\quad f$ is a cyclic permutation in the symmetric group $S_{n}$ , $i.e.$ , $f$ can be written as $(i_{1}i_{2}\ldots i_{n})$ .

Indeed, (1) can be realized since we can move out the common part of $f(i)$ and $f(j)$ from one of them without changing the property $\bigcup_{i=1}^{n}f(i)=\{1,2,\ldots,n\}$ .

(2) is also satisfied because we can eliminate the number $i$ and restrict the mapping $f$ to $\{1,2,\ldots,n\}-\{i\}$ if necessary.

(3) is a consequence of (1), (2) and the condition $\bigcup_{i=1}^{n}f(i)=\{1,2,\ldots,n\}$ . Indeed, one can apply the operations in (1) and (2) repeatedly until a minimal case. This couldn’t be an endless process because we are dealing with finite sets. And one can prove that the minimal set cannot be empty because the covering property is preserved.

For (4), note that $f\in S_{n}$ because of (3). Suppose $f$ is not cyclic. Then choose an orbit $(i_{1}\ldots i_{m})$ of $f$ ( $m<n$ ) and restrict both $f$ and the partition $I_{1},\ldots,I_{k}$ to $\{i_{1},\ldots,i_{m}\}$ .

In conclusion, we obtain:

Proposition 3.1.

It is sufficient to consider $f\in S_{n}$ and $f$ being cyclic in lemma 2.3.

Now we turn to the proof of the equivalence of lemma 2.3 and lemma 1.6. (See also Definition 1.4.)

Proposition 3.2.

Lemma 1.6 implies lemma 2.3.

Proof.

As mentioned above, we can assume $f\in S_{n}$ being cyclic, without loss of generality.

Suppose lemma 1.6 is true. Then for any partition

I_{1}=\{1,2,\ldots,i_{1}\},\quad I_{2}=\{i_{1}+1,\ldots,i_{2}\},\quad\ldots,\quad I_{k}=\{i_{k-1}+1,...,n\},

the $(k-1)$ characteristic numbers $\{m_{i_{1}},m_{i_{2}},\ldots,m_{i_{k-1}}\}$ cannot cover $\{m_{1}^{\prime},\ldots,m_{k}^{\prime}\}$ . Therefore, $\exists\,t\neq i_{1},\ldots,i_{k-1}$ , and $l\leq k$ such that $m_{t}=m_{l}^{\prime}\leq k$ , since $m_{1}^{\prime}\leq\ldots\leq m_{k}^{\prime}\leq k$ in the characteristic sequence. In other words,

(convf)^{m_{t}}(A_{t})\supseteq A_{t},\quad where\quad A_{t}=\{t,t+1\}\subseteq some\ I_{j}.

Therefore, we can just choose $r=t,\ s=t+1$ in lemma 2.3.

∎

Recall that in one-dimensional dynamics we often consider a directed graph associated to a periodic orbit (For more details, see [3] or chapter 1 of [4]). Namely, if $f$ is a cyclic permutation in Definition 1.4, we can construct a directed graph $\Gamma_{f}$ with $(n-1)$ vertices $\{A_{1},A_{2},\ldots,A_{n-1}\}$ , where $A_{i}=\{i,i+1\}$ is defined in 1.4. And there is an edge from $A_{i}$ to $A_{j}$ if and only if $convf(A_{i})\supseteq A_{j}$ .

Example 3.3.

Let $f=(136245)\in S_{6}$ as in example 1.5. Then the associated directed graph $\Gamma_{f}$ is drawn in figure 3.

Thus, we see directly from the definition that the following proposition holds:

Proposition 3.4.

For a cyclic $f\in S_{n}$ and its associated directed graph $\Gamma_{f}$ , the characteristic number $m_{i}$ is equal to the minimal length of cycles starting from $A_{i}$ in $\Gamma_{f}$ .

Using directed graphs, we can complete the proof of the equivalence of the two lemmas.

Proposition 3.5.

Lemma 1.6 is equivalent to lemma 2.3.

Proof.

Suppose lemma 2.3 is true. For a cyclic $f\in S_{n}$ , suppose on the contrary that there is a $k\leq n-1$ such that $m_{k}^{\prime}\geq k+1$ , and $k$ is the minimal number with this property. Then there are only $(k-1)$ characteristic numbers $m_{i_{1}},m_{i_{2}},\ldots,m_{i_{k-1}}\leq k$ . Without loss of generality, assume $i_{1}<\ldots<i_{k-1}$ . And consider the partition:

I_{1}=\{1,2,\ldots,i_{1}\},\quad I_{2}=\{i_{1}+1,\ldots,i_{2}\},\quad\ldots,\quad I_{k}=\{i_{k-1}+1,\ldots,n\}.

By lemma 2.3, we can find $j,l\leq k$ and $r,s\in I_{j}$ ( $r<s$ ) such that

(convf)^{l}(\{r,s\})\supseteq\{r,s\}.

Note that $r=s$ cannot happen in this case, since $m_{r}=m_{s}\geq k+1$ . Now if we extend $f$ to be a piecewise linear mapping from the interval $[1,n]$ to $[1,n]$ , then we will obtain $f^{l}([r,s])\supseteq[r,s]$ . Therefore, there exists $x_{0}\in[r,s]$ with $f^{l}(x_{0})=x_{0}$ . Note that since $f$ is cyclic as a permutation in $S_{n}$ and $l\leq k\leq n-1$ , $x_{0}$ cannot belong to $\{1,2,\ldots,n\}$ . Consequently, $x_{0}\in(t,t+1)$ for some integer $t\in[r,s-1]$ and it is similar for $f(x_{0}),f^{2}(x_{0}),\ldots,f^{l-1}(x_{0})$ . Since $f$ is piecewise linear, $x_{0}\in(t,t+1)$ , $f(x_{0})\in(t^{\prime},t^{\prime}+1)$ imply $f([t,t+1])\supseteq[t^{\prime},t^{\prime}+1]$ , and it is similar for other pairs of points. So the orbit $\{x_{0},f(x_{0}),\ldots,f^{l-1}(x_{0}),f^{l}(x_{0})\}$ gives a cycle of length $l\leq k$ starting from the vertex $A_{t}$ in the associated directed graph $\Gamma_{f}$ , which implies $m_{t}\leq k$ . But this $t$ together with $i_{1},\ldots,i_{k-1}$ gives $k$ characteristic numbers $\leq k$ , contradicting $m_{k}^{\prime}\geq k+1$ .

Combined with Proposition 3.2, the proof is completed. ∎

Therefore, we have reduced the initial problem on the existence of periodic points to a lemma in combinatorics on the properties of cyclic permutations. In the next section, we will prove the lemma and that will complete the proof of theorem 1.2.

We end this section with some examples of cyclic permutations whose characteristic sequences are known.

Example 3.6.

(1) Let $f\in S_{n}$ be $(1\to 2\to\ldots\to n\to 1)$ . And more generally, $f(i)\equiv i+m\pmod{n}$ , where $(m,n)=1$ . Then the characteristic sequence of $f$ is $1\leq 2\leq\ldots\leq n-1$ .

(2) Let $f\in S_{2n+1}$ be of Stefan type (see [2] or [5] for more details), that is,

1\to(n+1)\to(n+2)\to n\to(n+3)\to(n-1)\to\ldots\to 2n\to 2\to(2n+1)\to 1.

Then the characteristic sequence of $f$ is

1\leq 2\leq 2\leq 4\leq 4\leq\ldots\leq 2n-2\leq 2n-2\leq 2n.

(3) In [6], the author gives all the directed graphs associated to cyclic permutations in $S_{5}$ .

4 Proof of the Lemma in Combinatorics

We finally prove lemma 1.6 in this section. We first recall some definitions and notations from one-dimensional dynamics and linear algebra.

Definition 4.1.

Let $f\in S_{n}$ be a cyclic permutation and $\Gamma_{f}$ be its associated directed graph. (See the discussion before propositon 3.4.) We can define $T_{f}$ to be the $(n-1)\times(n-1)$ adjacency matrix of $f$ and $\Gamma_{f}$ . That is, $T_{ij}=1$ , if $convf(A_{j})\supseteq A_{i}$ , or equivalently, if there is an edge from vertex $A_{j}$ to vertex $A_{i}$ in $\Gamma_{f}$ . Otherwise, $T_{ij}=0$ .

Example 4.2.

Let $f=(136245)\in S_{6}$ be the permutation in example 1.5 and example 3.3. Then the adjacency matrix $T_{f}$ is

\begin{bmatrix}&0&0&0&1&1&\\ &0&0&0&1&0&\\ &1&0&0&1&0&\\ &0&1&0&1&0&\\ &0&1&1&0&0&\end{bmatrix}

We denote the field $\{0,1\}$ with exactly two elements by $\mathbb{F}_{2}$ . And we say $A\equiv B\pmod{2}$ for integer valued matrices $A$ and $B$ , when $A=B$ as matrices in $M(\mathbb{F}_{2})$ .

Proposition 4.3.

For any cyclic permutation $f\in S_{n}$ and $l\in\mathbb{N}_{+}$ , it holds that $T_{f}^{l}\equiv T_{f^{l}}\pmod{2}$ .

A proof of this proposition can be found in [4], Chapter 1, Propositon 20. And for readers’ convenience, we give a sketch of proof here.

Sketch of proof.

Given $f=(i_{1}i_{2}\ldots i_{n})\in S_{n}$ cyclic, we first extend $f$ to be a continuous piecewise linear mapping from the closed interval $[1,n]$ to itself. And recall we define $A_{i}=\{i,i+1\}$ . Later on we will also regard $A_{i}$ as the closed interval $[i,i+1]$ .

Now by the definition of $T_{f}$ and the continuity of $f$ , $(T_{f})_{ij}=1$ iff $f(A_{j})\supseteq A_{i}$ . Consider the interval $A_{i}=[i,i+1]$ and its image $f^{l}(A_{i})$ . In general, we cannot expect $f^{l}(A_{i})=[f^{l}(i),f^{l}(i+1)]$ to hold, where $[a,b]$ denotes the closed interval whose end points are $a$ and $b$ . We can only say that $f^{l}(A_{i})\supseteq[f^{l}(i),f^{l}(i+1)]$ for continuous $f$ . However, if we count the multiplicity of each point in the image $f^{l}(A_{i})$ , we will find that $f^{l}(A_{i})=[f^{l}(i),f^{l}(i+1)]$ actually holds in the sense of mod 2. Indeed, $f^{l}(A_{i})$ can be viewed as a continuous curve in $\mathbb{R}$ starting from the point $f^{l}(i)$ and ending at $f^{l}(i+1)$ . Therefore, every point $p\in f^{l}(A_{i})$ is counted an even number of times in this curve, unless $p\in[f^{l}(i),f^{l}(i+1)]$ .

Finally recall the definition of $T_{f}$ and $T_{f^{l}}$ , and we get the conclusion of the proposition in the sense of mod 2. ∎

Remark 4.4.

If the readers are familiar with representation theory of the symmetric group $S_{n}$ , then they may find that $T_{f}$ defined above is just the matrix of the tautological representation of $f$ mod 2.

Corollary 4.5.

For cyclic $f\in S_{n}$ , we have $T_{f}^{n}\equiv I\pmod{2}$ , where $I=I_{n-1}$ is the unit matrix.

Theorem 4.6.

For cyclic $f\in S_{n}$ , the characteristic polynomial of $T_{f}$ , viewed as the determinant of a matrix in $M_{n-1}(\mathbb{F}_{2})$ , is exactly:

\det(\lambda I-T_{f})=1+\lambda+\lambda^{2}+\ldots+\lambda^{n-1}\pmod{2}.

We need the following lemma:

Lemma 4.7.

There exists a vector $\alpha\in\mathbb{F}_{2}^{n-1}$ such that $\alpha,T_{f}\alpha,\ldots,T_{f}^{n-2}\alpha$ are linearly independent over the field $\mathbb{F}_{2}$ .

Proof.

We claim that if $f\in S_{n}$ is written as $(i_{1}i_{2}\ldots i_{n})$ , then the vector $\alpha=(0,\ldots,0,1,\ldots,1,0,\ldots,0)^{T}=\sum_{j=i_{1}}^{i_{2}-1}e_{j}$ (or $\sum_{j=i_{2}}^{i_{1}-1}e_{j}$ ) satisfies the desired property, where $e_{j}\in\mathbb{F}_{2}^{n-1}$ is the unit vector whose $j$ -th component is $1$ .

We prove the claim by induction on $n$ . The case $n=2$ is trivial. We assume the statement holds for all natural numbers $<n$ , and we will prove it for $n$ .

Suppose

\mu_{1}\alpha+\mu_{2}T_{f}\alpha+\ldots+\mu_{n-1}T_{f}^{n-2}\alpha=0.

By the definition of $\alpha$ , $f=(i_{1}i_{2}\ldots i_{n})\in S_{n}$ , as well as the argument in the proof of proposition 4.3, we can see that

T_{f}^{m}\alpha=\sum_{j=i_{m+1}}^{{i_{m+2}-1}}e_{j},\quad or\sum_{j=i_{m+2}}^{{i_{m+1}-1}}e_{j},\quad\forall m\leq n-2.

In other words, if we view each vector $e_{j}$ as the closed interval $[j,j+1]$ it represents, then we find that

\alpha=[i_{1},i_{2}],\quad T_{f}\alpha=[i_{2},i_{3}],\quad\ldots\quad,T_{f}^{n-2}\alpha=[i_{n-1},i_{n}].

Now if $i_{1}=1$ , take the inner product with $e_{1}$ , and we obtain:

0=\left(\mu_{1}\alpha+\mu_{2}T_{f}\alpha+\ldots+\mu_{n-1}T_{f}^{n-2}\alpha,\>e_{1}\right)=\mu_{1},

because $i_{2},\ldots,i_{n}\geq 2$ and hence $(T_{f}^{m}\alpha,e_{1})=0$ for $1\leq m\leq n-2$ . Similarly we can obtain $\mu_{1}=0$ when $i_{1}=n$ .

If $2\leq i_{1}\leq n-1$ , then since $i_{1}$ does not appear as the end points of $T_{f}^{m}\alpha$ , $[i_{1}-1,i_{1}]$ and $[i_{1},i_{1}+1]\subseteq or\not\subseteq T_{f}^{m}\alpha$ at the same time, where $m\geq 1$ . Or equivalently,

(T_{f}^{m}\alpha,e_{i_{1}-1})=(T_{f}^{m}\alpha,e_{i_{1}}),\quad 1\leq m\leq n-2.

And then if we take the inner product with $(e_{i_{1}-1}+e_{i_{1}})$ , we obtain:

0=\left(\mu_{1}\alpha+\mu_{2}T_{f}\alpha+\ldots+\mu_{n-1}T_{f}^{n-2}\alpha,\>(e_{i_{1}-1}+e_{i_{1}})\right)=\mu_{1}\pmod{2},

Anyway, we find that $\mu_{1}=0$ and consequently,

\mu_{2}T_{f}\alpha+\ldots+\mu_{n-1}T_{f}^{n-2}\alpha=0.

Now if we put $g=(i_{2}i_{3}\ldots i_{n})\in S_{n-1}$ and $\beta=T_{f}\alpha$ with the $i_{1}$ -th component dropped, we can see that $T_{f}^{m}\alpha=T_{g}^{m-1}\beta$ , regardless of the $i_{1}$ -th component of $T_{f}^{m}\alpha$ . This is easily seen to be true since one can find $T_{f}^{m}\alpha=T_{g}^{m-1}\beta=[i_{m+1},i_{m+2}]$ as intervals. And the following holds for $\beta$ and $T_{g}$ :

\mu_{2}\beta+\mu_{3}T_{g}\beta+\ldots+\mu_{n-1}T_{g}^{n-3}\beta=0.

By induction hypothesis we conclude that $\mu_{2}=\ldots=\mu_{n-1}=0$ . Therefore, we have proved that the claim also holds for $n$ . This completes the proof of the lemma. ∎

Corollary 4.8.

The minimal polynomial $g(\lambda)$ of $T_{f}$ has degree at least $n-1$ , and therefore equals the characteristic polynomial.

Proof.

Suppose $\deg g(\lambda)\leq n-2$ , then we have $g(T_{f})\alpha=0$ , which contradicts the lemma above. And the second statement follows by Cayley-Hamilton’s theorem. (Note that $T_{f}$ is an $(n-1)\times(n-1)$ matrix.) ∎

With these results, we can compute the characteristic polynomial of $T_{f}$ .

Proof of Theorem 4.6.

It follows from corollary 4.5 and 4.8 that

\det(\lambda I-T_{f})=g(\lambda)\mid(\lambda^{n}-1)=(1+\lambda)(1+\lambda+\lambda^{2}+\ldots+\lambda^{n-1})\pmod{2}.

Moreover, since the only degree-one irreducible polynomials are $\lambda$ and $\lambda+1$ in $\mathbb{F}_{2}[\lambda]$ , the right hand side can be further decomposed as

1+\lambda+\lambda^{2}+\ldots+\lambda^{n-1}=(1+\lambda)^{m}h(\lambda),

where $m\geq 0$ , and $h(\lambda)$ is a product of several irreducible polynomials of degree $\geq 2$ . Hence, by comparing the degree we conclude that

\det(\lambda I-T_{f})=1+\lambda+\lambda^{2}+\ldots+\lambda^{n-1}\pmod{2}.

∎

Proof of lemma 1.6.

We show that if the coefficient of $\lambda^{n-1-i}$ in $\det(\lambda I-T_{f})$ is nonzero, then $m_{i}^{\prime}\leq i$ holds for the characteristic sequence in lemma 1.6.

Let us consider how the coefficients in $\det(\lambda I-T_{f})$ are computed. It is then easily seen that the coefficient of $\lambda^{n-1-i}$ is nonzero implies the existence of an $i\times i$ principle matrix $P_{i}$ of $T_{f}$ whose determinant is nonzero. It then follows that there exists at least one diagonal of $P_{i}$ whose elements are all equal to 1. Now it is easily checked that this diagonal give rise to several cycles of length $\leq i$ in the associated directed graph $\Gamma_{f}$ . And the vertices of these cycles correspond to the subscripts of the columns of $P_{i}$ . Thus there exist at least $i$ vertices in $\Gamma_{f}$ , from whom the minimal length of cycles starting are all $\leq i$ . By proposition 3.4, this means that at least $i$ of the characteristic numbers $\leq i$ . So $m_{i}^{\prime}\leq i$ . ∎

Acknowledgement

I would like to thank Prof. S. A. Bogatyi for introducing the problem to me. And I thank my supervisor, Prof. Tian Gang, for his encouragement as well as helpful suggestions. I’m also grateful to Dr. Ye Yanan and Dr. Zhao Yikai for writing computer programs to verify some claims.

References

[1] S. A. Bogatyi and E. T. Shavgulidze. Periodic points of a map of a system of intervals. Mathematical Notes of the Academy of Sciences of the Ussr, 43(3):210–219, 1988.
[2] A. N. Sharkovskii. Coexistence of cycles of a continuous map of the line into itself. International Journal of Bifurcation and Chaos, 5(5):1263–1273, 1995.
[3] Uhland Burkart. Interval mapping graphs and periodic points of continuous function. Journal of Combinatorial Theory, Series B, 32(1):57–68, 1982.
[4] L. S. Block and W. A. Coppel. Dynamics in One Dimension. Springer, 1992.
[5] P. Štefan. A theorem of Šarkovskii on the existence of periodic orbits of continuous endomorphisms of the real line. Communications in Mathematical Physics, 54(3):237–248, 1977.
[6] Philip D. Straffin. Periodic points of continuous functions. Mathematics Magazine, 51(2):99–105, 1978.

On Periodic Points in Covering Systems

Abstract

1 Introduction

Definition 1.1.

Theorem 1.2.

Proposition 1.3 (the case k=1k=1).

Definition 1.4.

Example 1.5.

Lemma 1.6.

Remark 1.7.

2 Discretization

Lemma 2.1.

Proof.

Example 2.2.

Lemma 2.3.

Remark 2.4.

Proof of Theorem 1.2 by lemma 2.3.

3 Equivalence of the Two Lemmas

Proposition 3.1.

Proposition 3.2.

Proof.

Example 3.3.

Proposition 3.4.

Proposition 3.5.

Proof.

Example 3.6.

4 Proof of the Lemma in Combinatorics

Definition 4.1.

Example 4.2.

Proposition 4.3.

Sketch of proof.

Remark 4.4.

Corollary 4.5.

Theorem 4.6.

Lemma 4.7.

Proof.

Corollary 4.8.

Proof.

Proof of Theorem 4.6.

Proof of lemma 1.6.

Acknowledgement

References

Proposition 1.3 (the case $k=1$ ).