Continuously Increasing Subsequences of Random Multiset Permutations
Abstract
For a word $\pi$ and integer $i$, we define $L_i(\pi)$ to be the length of the longest subsequence of $\pi$ of the form $i,i+1,\ldots,j$, and we let $L(\pi)=\max_i L_i(\pi)$. In this paper we estimate the expected values of $L_1(\pi)$ and $L(\pi)$ when $\pi$ is chosen uniformly at random from all words which use each of the first $n$ integers exactly $m$ times. We show that $\mathbb{E}[L_1(\pi)]\sim m$ if $n$ is sufficiently large in terms of $m$ as $m$ tends towards infinity, confirming a conjecture of Diaconis, Graham, He, and Spiro. We also show that $\mathbb{E}[L(\pi)]$ is asymptotic to the inverse gamma function $\Gamma^{-1}(n)$ if $n$ is sufficiently large in terms of $m$ as $n$ tends towards infinity.
1 Introduction
1.1 Main Results
Given integers $m$ and $n$, let $S_{n,m}$ denote the set of words which use each integer in $\{1,\ldots,n\}$ exactly $m$ times, and we will refer to $\pi\in S_{n,m}$ as a multiset permutation. For example, $S_{2,2}=\{1122,1212,1221,2112,2121,2211\}$. For $\pi\in S_{n,m}$ and an integer $i$, we define $L_i(\pi)$ to be the length of the longest subsequence of $\pi$ of the form $i,i+1,\ldots,j$, which we call an $i$-continuously increasing subsequence. We say that a subsequence is a continuously increasing subsequence if it is an $i$-continuously increasing subsequence for some $i$, and we define $L(\pi)=\max_i L_i(\pi)$ to be the length of a longest continuously increasing subsequence of $\pi$. For example, if $\pi=23411432\in S_{4,2}$, then $L_1(\pi)=2$ due to the subsequence $12$, and $L(\pi)=3$ due to the subsequence $234$.
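To make these definitions concrete, here is a minimal Python sketch (our illustration, not part of the paper; the helper names are ours). It relies on the standard greedy fact that scanning left to right and always taking the earliest copy of the next needed value finds a longest $i$-continuously increasing subsequence.

```python
def L_i(pi, i):
    # Greedy scan: take the earliest copy of each next needed value.
    target = i
    for letter in pi:
        if letter == target:
            target += 1
    return target - i

def L(pi):
    # L(pi) is the maximum of L_i(pi) over all starting values i.
    return max(L_i(pi, i) for i in set(pi))

pi = [2, 3, 4, 1, 1, 4, 3, 2]  # an element of S_{4,2}
print(L_i(pi, 1), L(pi))  # prints: 2 3
```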
The focus of this paper is to study $\mathbb{E}[L_1(\pi)]$ and $\mathbb{E}[L(\pi)]$ when $\pi$ is chosen uniformly at random from $S_{n,m}$. We focus on the regime where $n$ is much larger than $m$, as in the opposite regime $L_i(\pi)$ is very likely to be its maximum possible value for all $i$.
We first consider $\mathbb{E}[L_1(\pi)]$. This quantity was briefly studied by Diaconis, Graham, He, and Spiro [DGHS21] due to its relationship with a certain card game that we describe later in this paper. They proved that $\mathbb{E}[L_1(\pi)]\le(1+o(1))m$, with an explicit error term involving an absolute constant, provided $n$ is sufficiently large in terms of $m$. It was conjectured in [DGHS21] that this upper bound for $\mathbb{E}[L_1(\pi)]$ is asymptotically tight for $n$ sufficiently large in terms of $m$. We verify this conjecture in a strong form, obtaining an exact formula for $\lim_{n\to\infty}\mathbb{E}[L_1(\pi)]$ for any fixed $m$ and precise estimates of this value as $m$ tends towards infinity.
Theorem 1.1.
(a) For any integer $m$, let $r_1, r_2, \ldots$ be the zeroes of the polynomial $q$. If $\pi\in S_{n,m}$ is chosen uniformly at random, then
(1)
(b) There exists an absolute constant $c>0$ such that
For example, when $m=2$ the zeroes of $q$ can be computed explicitly, yielding a value of $\lim_{n\to\infty}\mathbb{E}[L_1(\pi)]$ which can also be proven by elementary means. For $m=3$ the zeroes can again be computed explicitly, and from this Theorem 1.1(a) gives the following closed form expression for $\lim_{n\to\infty}\mathbb{E}[L_1(\pi)]$.
Corollary 1.2.
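Such limits are also easy to approximate empirically. The following Monte Carlo sketch (our own illustration, not part of the proofs; `estimate` is a hypothetical helper name) samples uniform elements of $S_{n,m}$ for a large $n$ and averages $L_1$. As a sanity check, for $m=1$ one can compute the limit directly: $\Pr[L_1(\pi)\ge k]=1/k!$, so $\lim_{n\to\infty}\mathbb{E}[L_1(\pi)]=\sum_{k\ge1}1/k!=e-1\approx1.7183$, which the simulation reproduces.

```python
import random

def L1(pi):
    # Greedy computation of L_1(pi).
    t = 1
    for x in pi:
        if x == t:
            t += 1
    return t - 1

def estimate(n, m, trials=4000):
    # Monte Carlo estimate of E[L_1(pi)] over uniform pi in S_{n,m}.
    deck = [i for i in range(1, n + 1) for _ in range(m)]
    total = 0
    for _ in range(trials):
        random.shuffle(deck)  # uniform multiset permutation
        total += L1(deck)
    return total / trials

for m in (1, 2, 3):
    print(m, estimate(60, m))  # m = 1 should be near e - 1 = 1.7183
```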
Our next result gives precise bounds for the length of a longest continuously increasing subsequence in a random permutation of $S_{n,m}$. We recall that the gamma function $\Gamma$ is a function which, in particular, gives an increasing bijection from $[2,\infty)$ to $[1,\infty)$ and which satisfies $\Gamma(k+1)=k!$ for non-negative integers $k$.
Theorem 1.3.
If $n$ is sufficiently large in terms of $m$, then
where $\Gamma^{-1}$ is the inverse of the gamma function when restricted to $[2,\infty)$.
Note that when $m=1$ the error term of Theorem 1.3 is quite small, but for $m\ge2$ it is fairly close to the main term of $\Gamma^{-1}(n)$, which grows only like $\log n/\log\log n$. Thus the behavior of $\mathbb{E}[L(\pi)]$ changes somewhat dramatically as soon as one starts to consider multiset permutations as opposed to just permutations.
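Since $\Gamma^{-1}$ may be unfamiliar, here is a small numerical sketch (ours, not from the paper) that inverts the increasing branch of the gamma function by bisection on $\log\Gamma$; it makes the slow growth of the main term of Theorem 1.3 visible.

```python
import math

def inverse_gamma(y):
    # Invert the increasing branch of Gamma on [2, infinity) for y >= 1,
    # using bisection on log-Gamma for numerical stability.
    target = math.log(y)
    lo, hi = 2.0, 2.0
    while math.lgamma(hi) < target:
        hi *= 2
    for _ in range(80):
        mid = (lo + hi) / 2
        if math.lgamma(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

print(inverse_gamma(24))     # Gamma(5) = 4! = 24, so this is about 5.0
print(inverse_gamma(10**6))  # the main term of Theorem 1.3 when n = 10^6
```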
1.2 History and Related Work
Determining $\mathbb{E}[L_1(\pi)]$ and $\mathbb{E}[L(\pi)]$ can be viewed as variants of the well-studied problem of determining the length of the longest increasing subsequence in a random permutation of length $N$. It was shown by Logan and Shepp [LS77] and Vershik and Kerov [VK77] that this length is asymptotic to $2\sqrt{N}$ in expectation, answering a famous problem of Ulam. Later Baik, Deift, and Johansson [BDJ99] showed that the limiting distribution of this length, suitably normalized, is the Tracy-Widom distribution. The analogous problem for multiset permutations has been considered recently by Al-Meanazel and Johnson [AMJ20]. Much more can be said about this topic, and we refer the reader to the excellent book by Romik [Rom15] for more information.
The initial motivation for studying $L_1(\pi)$ was due to its relationship to a card guessing experiment introduced by Diaconis and Graham [DG81]. To start the experiment, one shuffles a deck of cards which consists of $n$ distinct card types, each appearing with multiplicity $m$. In each round, a subject guesses what the top card of the deck is according to some strategy. After each guess, the subject is told whether their guess was correct or not, the top card is discarded, and then the experiment continues with the next card. This experiment is known as the partial feedback model. For more on the history of the partial feedback model we refer the reader to [DGS21].
If $\mathcal{S}$ is a strategy for the subject in the partial feedback model and $\pi\in S_{n,m}$, we let $C(\mathcal{S},\pi)$ denote the number of correct guesses made by the subject if they follow strategy $\mathcal{S}$ and the deck is shuffled according to $\pi$. We say that $\mathcal{S}^*$ is an optimal strategy if $\mathbb{E}[C(\mathcal{S}^*,\pi)]=\max_{\mathcal{S}}\mathbb{E}[C(\mathcal{S},\pi)]$, where $\mathcal{S}$ ranges over all strategies and $\pi$ is chosen uniformly at random. Optimal strategies are unknown in general, and even if they were known they would likely be too complex for a human subject to implement in practice. As such there is interest in coming up with (simple) strategies $\mathcal{S}$ such that $\mathbb{E}[C(\mathcal{S},\pi)]$ is relatively large.
One strategy is the trivial strategy which guesses card type 1 every single round, guaranteeing a score of exactly $m$ at the end of the experiment. A slightly better strategy is the safe strategy which guesses card type 1 every round until all $m$ copies are guessed correctly, then card type 2 until all $m$ copies are guessed correctly, and so on. It can be deduced from arguments given by Diaconis, Graham, and Spiro [DGS21] that the expected score of the safe strategy exceeds $m$ by at most a constant, up to an exponentially small error term, so the safe strategy does just a little better than the trivial strategy.
Another natural strategy is the shifting strategy, defined by guessing 1 until you are correct, then 2 until you are correct, and so on; with the strategy being defined arbitrarily in the (very rare) event that one correctly guesses a copy of each card type. It is not difficult to see that the score of the shifting strategy is at least $L_1(\pi)$, with equality holding provided the player does not correctly guess a copy of every card type. Thus Theorem 1.1(b) determines the expected number of correct guesses under the shifting strategy up to exponentially small errors, showing that it is slightly better than the trivial strategy, and very slightly better than the safe strategy.
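The relationship between the shifting strategy and $L_1$ is transparent in code: the simulation below (our sketch; `shifting_score` is a hypothetical name) plays the shifting strategy on a shuffled deck, and the loop it runs is precisely the greedy computation of $L_1(\pi)$, so the two agree whenever the player never exhausts all $n$ card types.

```python
import random

def shifting_score(deck, n):
    # Guess g until a correct guess, then advance to g + 1.  Once g > n the
    # real strategy is defined arbitrarily; we pessimistically score 0 from
    # that point on, so the returned value is a lower bound in general.
    g, score = 1, 0
    for card in deck:
        if g <= n and card == g:
            score += 1
            g += 1
    return score

n, m = 50, 3
deck = [i for i in range(1, n + 1) for _ in range(m)]
random.shuffle(deck)
print(shifting_score(deck, n))  # equals L_1(deck) unless all n types were guessed
```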
1.3 Preliminaries
We let $[n]:=\{1,2,\ldots,n\}$ and let $[n]^k$ be the set of tuples of length $k$ with entries in $[n]$. Whenever we write, for example, $\pi\in S_{n,m}$, we will assume $\pi$ is chosen uniformly at random from $S_{n,m}$ unless stated otherwise.
Throughout this paper we use several basic results from probability theory. One such result is that if $X$ is a non-negative integer-valued random variable, then $\mathbb{E}[X]=\sum_{k\ge1}\Pr[X\ge k]$.
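For completeness, this identity follows from a single swap in the order of summation:
\[
\mathbb{E}[X]\;=\;\sum_{k\ge 1}k\,\Pr[X=k]\;=\;\sum_{k\ge 1}\sum_{j=1}^{k}\Pr[X=k]\;=\;\sum_{j\ge 1}\sum_{k\ge j}\Pr[X=k]\;=\;\sum_{j\ge 1}\Pr[X\ge j].
\]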
A crucial observation that we use throughout the text is the following.
Observation 1.4.
For $n'\le n$, if $\pi\in S_{n,m}$ and $\pi'\in S_{n',m}$ are drawn uniformly at random, then $\Pr[L_1(\pi)\ge\ell]=\Pr[L_1(\pi')\ge\ell]$ for all $\ell\le n'$.
Proof.
For $\pi\in S_{n,m}$, let $\pi'$ be the word obtained by deleting every letter from $\pi$ which is larger than $n'$. Note that for $\ell\le n'$ we have $L_1(\pi)\ge\ell$ if and only if $L_1(\pi')\ge\ell$. Moreover, it is not difficult to see that $\pi'$ is distributed uniformly at random in $S_{n',m}$ provided $\pi$ is distributed uniformly at random in $S_{n,m}$, proving the result. ∎
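For intuition, the deletion map in this proof can be checked exhaustively in small cases. The following sketch (ours) verifies for $(n,m)=(3,2)$ and $n'=2$ that each word of $S_{2,2}$ arises from exactly $|S_{3,2}|/|S_{2,2}|=90/6=15$ words of $S_{3,2}$, so the restriction of a uniform word is uniform.

```python
from itertools import permutations
from collections import Counter

S32 = set(permutations([1, 1, 2, 2, 3, 3]))  # S_{3,2}, 90 words
restricted = Counter(tuple(x for x in pi if x <= 2) for pi in S32)
print(restricted)  # each of the 6 words of S_{2,2} appears exactly 15 times
```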
2 Proof of Theorem 1.1
2.1 Theorem 1.1(a): Generating Functions
We say that a word $\pi\in S_{n,m}$ has a complete increasing subsequence if $L_1(\pi)=n$. Let $f(n,m)$ be the number of words $\pi\in S_{n,m}$ which have a complete increasing subsequence. Horton and Kurn [HK81, Corollary (c)] give the following formula for $f(n,m)$.
Theorem 2.1 ([HK81]).
The number of words $\pi\in S_{n,m}$ which have a complete increasing subsequence, $f(n,m)$, is given by
where
is the set of weak compositions of into parts, i.e.,
and
is a multinomial coefficient.
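Formulas of this kind are easy to sanity-check by brute force for tiny parameters. The sketch below (our illustration; `count_complete` is a hypothetical name, and $f(n,m)$ is the notation introduced above) counts the words of $S_{n,m}$ containing $1,2,\ldots,n$ as a subsequence by direct enumeration.

```python
from itertools import permutations

def count_complete(n, m):
    # Enumerate S_{n,m} and count words containing 1, 2, ..., n as a subsequence.
    words = set(permutations([i for i in range(1, n + 1) for _ in range(m)]))
    def complete(pi):
        t = 1
        for x in pi:
            if x == t:
                t += 1
        return t > n
    return sum(complete(pi) for pi in words), len(words)

print(count_complete(2, 2))  # (5, 6): every word of S_{2,2} except 2211
print(count_complete(3, 2))  # compare against Theorem 2.1 with n = 3, m = 2
```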
Notice that $\mathbb{E}[L_1(\pi)]$ can be expressed in terms of $f$ as follows:
(2)
where the third equality is due to Observation 1.4. Note that . Thus, as a consequence of Theorem 2.1, we have
(3a)
(3b) |
Intuitively, if the  were removed from the right-hand side expression in (3b), then by using the multinomial theorem we could write this expression as an $n$th power, turning (2) into a geometric series. The next few paragraphs formalize this idea.
We begin by replacing by in the right-hand-side of (3a) to obtain the polynomial
Thus,
Next, we define an operator in order to remove the  from the denominator. Let $R$ be a commutative ring containing $\mathbb{Q}$ and let $\Phi$ be an $R$-linear map defined on the monomials by
We can extend $\Phi$ to an $R$-linear map on $R[[x]]$, which we also refer to as $\Phi$ by abuse of notation. Throughout this article, $R$ is either $\mathbb{Q}$ or $\mathbb{Q}[y]$ for an indeterminate $y$, and we shall refer to this $R$-linear map as $\Phi$ in both cases. Notice that $\Phi$ is invertible for any such ring $R$. A key property that we use about $\Phi$ is
(4)
Consider the polynomial
(5)
Notice that,
Let and be the ordinary generating functions of and respectively, i.e.
(6)
Putting everything together, we see that
(7)
and thus it suffices to find a nice closed form expression for . Note that
where we recall the polynomial $q$ is defined in Theorem 1.1 by . As , we have
(8)
Hence,
and thus
We now prove the main result of this subsection.
Proposition 2.2.
Let $r_1, r_2, \ldots$ be the zeroes of the polynomial $q$. The formal power series satisfies
Proof.
Let . Since are the zeroes of , we have
Notice that $q$ has no repeated zeroes. This is true because, if $r$ were a repeated zero of $q$, it would also be a zero of its derivative $q'$. But then $r$ would have to be a zero of $q-q'$, which is only possible if $r=0$, a contradiction as $0$ is not a zero of $q$.
Thus are pairwise distinct, and hence the zeroes of being , are also pairwise distinct. This and (8) means has the partial fraction decomposition
The derivative of is
2.2 Theorem 1.1(b): Exponential Sums of Zeroes
We remind the reader that Theorem 1.1(b) claims
for some constant . Given Theorem 1.1(a), proving this claim is equivalent to showing that for some positive constant . We do this by following the approach used by Conrey and Ghosh [CG88] to evaluate a similar exponential sum.
For each $k$, let $x_k$ and $y_k$ be the real and imaginary parts, respectively, of $r_k$. Let  be arbitrary positive numbers such that . We partition the indices into disjoint sets $A$ and $B$, where $k\in A$ when  and $k\in B$ when , allowing us to rewrite our desired sum as
Proposition 2.3.
We begin our argument by restricting our attention to the indices in $A$:
Lemma 2.4.
Proof.
By the triangle inequality,
Note that . Since , we know that , so . Thus for , we have that
Adding over the elements of $A$, of which there are at most , we have . ∎
To evaluate the sum for the indices in $B$, we utilize the polynomial $q$. Since the $r_k$'s are the roots of $q$, we have $q(r_k)=0$ for each $k$, so .
Lemma 2.5.
For , we have
with .
Proof.
All of this follows from Proposition 2.3(b) except for showing that . To show this, we observe by definition of that
Conrey and Ghosh [CG88] note that when , so this can be expanded as a convergent geometric series in such cases. Thus,
The only time an term can appear in the double infinite sum is when , so this term has coefficient as desired.
∎
Note that for we have from Proposition 2.3(a). Thus we can use Lemma 2.5 to conclude that
(9)
where and .
Using the values of for from Proposition 2.3(c) and that and , we see that
By Lemma 2.4, we have  for some , so we are left to consider the sum over $B$. The first three terms above match our claimed expression, so it suffices to show that the leftover terms are .
Using the triangle inequality, and recalling that for all by Proposition 2.3(b), we obtain
Since , this is at most .
Now we make use of Proposition 2.3(a). In the first summation, the quantity is negative, so . In the second summation, is nonnegative, so . Putting this all together, we have
Note that the first summation is finite and it is bounded above by the convergent infinite sum , which is a constant. Since , we have that , so the second summation also converges. Let be some constant which serves as an upper bound for both of these summations. The total expression is then at most
We need to show that this expression will still be after we multiply it by . By the Stirling approximation, is asymptotically , so for sufficiently large , we have
Now we examine
Let . Since , we have that and hence that . However, can be chosen arbitrarily close to to ensure . In this case
Note that and that the is negligible for sufficiently large , so indeed the sum we are considering is for some positive , completing our proof of Theorem 1.1(b).
3 Proof of Theorem 1.3
At a high level, the proof of Theorem 1.3 revolves around showing that $\Pr[L(\pi)\ge\ell]$ tends to 0 or 1 depending on whether a certain expected count tends to 0 or infinity as $n$ tends to infinity. The following lemma will be used to determine the threshold where this count shifts from being very small to very large. Here and throughout the text, $\log$ denotes the natural logarithm.
Lemma 3.1.
Given integers and , let be a real number such that is an integer. We have
and if , we have
Here and throughout the text we define the falling factorial $(x)_k:=x(x-1)\cdots(x-k+1)$.
Proof.
Note that , so it suffices to show
with the upper bound holding when . When , we have , so . This implies
Similarly for all , proving the result. ∎
Before delving into the details of the proof, we introduce some auxiliary definitions that will make our arguments somewhat cleaner. The main idea is that we wish to reduce multiset permutations to set permutations by labeling each of the $m$ copies of each letter.
To this end, let $\tilde S_{n,m}$ denote the set of permutations of the set of labeled letters $\{i_j : i\in[n],\ j\in[m]\}$. For example, $\tilde S_{1,2}=\{1_11_2,\,1_21_1\}$. If $\tilde\pi\in\tilde S_{n,m}$ contains a subsequence of the form $w^{(1)}_{j_1}\cdots w^{(\ell)}_{j_\ell}$, then we will say that $\tilde\pi$ has a subsequence of type $(w,j)$ where $w=w^{(1)}\cdots w^{(\ell)}$ and $j=(j_1,\ldots,j_\ell)\in[m]^\ell$. We say that $\tilde\pi$ has a subsequence of type $w$ if it has a subsequence of type $(w,j)$ for some $j$. For example, $\tilde\pi=2_11_21_12_2$ has a subsequence of type $(12,(2,2))$ and hence of type $12$, but it contains no subsequence of type $(12,(1,1))$.
Observation 3.2.
If $\pi\in S_{n,m}$ and $\tilde\pi\in\tilde S_{n,m}$ are chosen uniformly at random, then for any word $w$ with letters in $[n]$ we have
$\Pr[w\text{ is a subsequence of }\pi]\ =\ \Pr[\tilde\pi\text{ has a subsequence of type }w].$
The intuition for this observation is as follows. We can view $\tilde S_{n,m}$ as describing a deck of cards with $n$ card types each having $m$ suits, and we view $\tilde\pi$ as a way of shuffling this deck. The property that $\tilde\pi$ contains a subsequence of type $w$ is independent of the suits of the cards. Thus if we let $\pi$ denote the shuffling after ignoring suits, then $\pi$ contains $w$ as a subsequence if and only if $\tilde\pi$ contains a subsequence of type $w$. More formally, one can prove this result by considering the map $\tilde S_{n,m}\to S_{n,m}$ which deletes the subscripts in the letters of $\tilde\pi$. We omit the details.
3.1 The Upper Bound
To prove the upper bound of Theorem 1.3, essentially the only fact we need is that there are at most $n$ continuously increasing words of a given length $\ell$ with letters in $[n]$ (one for each starting value), and as such our proof easily generalizes to a wider set of subsequence problems.
To this end, let $W$ be a set of words with letters in $[n]$. For $\pi\in S_{n,m}$, we define $L_W(\pi)$ to be the maximum length of a word $w\in W$ which appears as a subsequence in $\pi$. For example, if $W$ consists of every word of the form $i,i+1,\ldots,j$ for some $i\le j$, then $L_W(\pi)=L(\pi)$. We will say that a set of words $W$ is prefix closed if for every $w^{(1)}\cdots w^{(\ell)}\in W$ we have $w^{(1)}\cdots w^{(k)}\in W$ for all $k\le\ell$.
Lemma 3.3.
Let $W$ be a prefix closed set of words with letters in $[n]$ and let $W_\ell$ be the set of words of length $\ell$ in $W$. If $\pi\in S_{n,m}$ is chosen uniformly at random, then
$\Pr[L_W(\pi)\ge\ell]\ \le\ |W_\ell|\cdot\frac{m^\ell}{\ell!}.$
Proof.
For $\tilde\pi\in\tilde S_{n,m}$ we define $\tilde L_W(\tilde\pi)$ to be the length of a longest $w\in W$ such that $\tilde\pi$ contains a subsequence of type $w$. By Observation 3.2, it suffices to bound $\Pr[\tilde L_W(\tilde\pi)\ge\ell]$ with $\tilde\pi$ chosen uniformly at random from $\tilde S_{n,m}$.
Because $W$ is prefix closed, we have $\tilde L_W(\tilde\pi)\ge\ell$ if and only if $\tilde\pi$ contains some subsequence of type $w$ with $w\in W_\ell$, and by definition this happens if and only if $\tilde\pi$ contains some subsequence of type $(w,j)$ with $w\in W_\ell$ and $j\in[m]^\ell$. For $w\in W_\ell$ and $j\in[m]^\ell$, let $X_{w,j}$ be the indicator variable which is 1 if $\tilde\pi$ contains a subsequence of type $(w,j)$ and which is 0 otherwise. Let $X=\sum_{w,j}X_{w,j}$. By our observations above and Markov's inequality, we find
$\Pr[\tilde L_W(\tilde\pi)\ge\ell]\ =\ \Pr[X\ge1]\ \le\ \mathbb{E}[X]\ =\ \sum_{w\in W_\ell}\sum_{j\in[m]^\ell}\Pr[X_{w,j}=1]\ =\ |W_\ell|\cdot\frac{m^\ell}{\ell!},$
where the last step used that $X_{w,j}=1$ if and only if the $\ell$ distinct letters $w^{(1)}_{j_1},\ldots,w^{(\ell)}_{j_\ell}$ appear in the correct relative order in $\tilde\pi$, and this happens with probability $1/\ell!$. This proves the result. ∎
Proposition 3.4.
Let $W$ be a prefix closed set of words with letters in $[n]$ and let $W_\ell$ be the words of length $\ell$ in $W$. Assume there exists an $a>0$ such that $|W_\ell|\le n^a$ for all $\ell$. If $\pi\in S_{n,m}$ is chosen uniformly at random and $n$ is sufficiently large in terms of $m$, then
In particular, for the set of continuously increasing words $W=\{i(i+1)\cdots j : i\le j\}$ we have $|W_\ell|\le n$ for all $\ell$, so taking $a=1$ gives the upper bound of Theorem 1.3. As another example, if $W$ is the set of arithmetic progressions with letters in $[n]$, then one can take $a=2$, giving an upper bound of roughly $\Gamma^{-1}(n^2)$ for $\mathbb{E}[L_W(\pi)]$. Recent work of Goh and Zhao [GZ20] shows that this bound for arithmetic progressions is tight.
Proof.
By using Lemma 3.3 and the trivial bound $\Pr[L_W(\pi)\ge\ell]\le1$, we find for all integers $\ell_0\ge0$ that
(10) $\mathbb{E}[L_W(\pi)]\ =\ \sum_{\ell\ge1}\Pr[L_W(\pi)\ge\ell]\ \le\ \ell_0+\sum_{\ell>\ell_0}|W_\ell|\,\frac{m^\ell}{\ell!}\ \le\ \ell_0+\sum_{\ell>\ell_0}\frac{n^a m^\ell}{\ell!}.$
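To see the shape of the resulting bound, the sketch below (ours; it assumes the expected-count bound $|W_\ell|\,m^\ell/\ell!$ from Lemma 3.3 together with $|W_\ell|\le n$) computes the largest $\ell$ for which the expected number of continuously increasing subsequences of length $\ell$ is still at least one; for fixed $m$ this first-moment threshold grows like $\Gamma^{-1}(n)\approx\log n/\log\log n$.

```python
import math

def first_moment_threshold(n, m):
    # Largest l with n * m**l / l! >= 1, computed in log-space to avoid overflow.
    l = 1
    while math.log(n) + (l + 1) * math.log(m) - math.lgamma(l + 2) >= 0:
        l += 1
    return l

for n in (10**3, 10**6, 10**9):
    print(n, first_moment_threshold(n, 2))
```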
3.2 The Lower Bound
For $x,y\in[n]^k$, we define their Hamming distance $d_H(x,y):=|\{i : x_i\ne y_i\}|$. Our key lemma for proving the lower bound of Theorem 1.3 is the following:
Lemma 3.5.
Let $C\subseteq[m]^k$ be such that any two distinct $x,y\in C$ have $d_H(x,y)\ge d$ for some integer $d$. Then
$\Pr[L_1(\pi)\ge k]\ \ge\ \frac{|C|}{k!}-\binom{|C|}{2}\frac{1}{k!\,d!}.$
Proof.
For $\tilde\pi\in\tilde S_{n,m}$, let $\tilde L_1(\tilde\pi)$ denote the length of the longest subsequence of $\tilde\pi$ of type $12\cdots j$ for some $j$. By Observation 3.2, it suffices to prove this lower bound for $\tilde L_1(\tilde\pi)$ where $\tilde\pi\in\tilde S_{n,m}$ is chosen uniformly at random. For $x\in C$, let $E_x$ be the event that $\tilde\pi$ contains a subsequence of type $(12\cdots k,\ x)$. Observe that
(11) $\Pr[\tilde L_1(\tilde\pi)\ge k]\ \ge\ \Pr\Big[\bigcup_{x\in C}E_x\Big]\ \ge\ \sum_{x\in C}\Pr[E_x]-\sum_{\{x,y\}\subseteq C}\Pr[E_x\cap E_y],$
where the last inequality used the Bonferroni inequality (which is essentially a weakening of the principle of inclusion-exclusion); see e.g. [Spe14] for further details on this inequality. To bound (11), we use the following:
Claim 3.6.
If $x,y\in C$ with $x\ne y$, then $\Pr[E_x]=\frac{1}{k!}$ and $\Pr[E_x\cap E_y]\le\frac{1}{k!\,d!}$.
Proof.
Observe that $E_x$ occurs if and only if the letters $1_{x_1},2_{x_2},\ldots,k_{x_k}$ occur in the correct relative order in $\tilde\pi$, so $\Pr[E_x]=1/k!$. Let $I$ be any set of $d$ indices such that $x_i\ne y_i$ for all $i\in I$, and note that such a set exists by assumption of $d_H(x,y)\ge d$. Let $F$ be the event that the letters $i_{y_i}$ with $i\in I$ appear in the correct relative order in $\tilde\pi$. Observe that $\Pr[F]=1/d!$ and that this event is independent of $E_x$ since these two events concern disjoint sets of letters. Because $E_y$ implies $F$, we have
$\Pr[E_x\cap E_y]\ \le\ \Pr[E_x\cap F]\ =\ \Pr[E_x]\Pr[F]\ =\ \frac{1}{k!\,d!},$
proving the result. ∎
The problem of finding $C\subseteq[m]^k$ with the minimum Hamming distance $d$ and the size $|C|$ both large is the central problem of coding theory. In particular, a basic greedy argument from coding theory gives the following:
Lemma 3.7.
For any $d$ and $k$, there exists $C\subseteq[m]^k$ such that any two distinct $x,y\in C$ have $d_H(x,y)\ge d$ and such that
$|C|\ \ge\ \frac{m^k}{\sum_{i=0}^{d-1}\binom{k}{i}(m-1)^i}.$
Proof.
Let $C\subseteq[m]^k$ be a set such that $d_H(x,y)\ge d$ for distinct $x,y\in C$ and such that $|C|$ is as large as possible. Let $B(x)=\{z\in[m]^k : d_H(x,z)\le d-1\}$, and note that for all $x$,
$|B(x)|\ =\ \sum_{i=0}^{d-1}\binom{k}{i}(m-1)^i,$
with this last step using the fact that there are exactly $\binom{k}{i}(m-1)^i$ words $z\in[m]^k$ with $d_H(x,z)=i$. By the maximality of $C$, we must have $[m]^k=\bigcup_{x\in C}B(x)$, and thus
$m^k\ \le\ |C|\cdot\sum_{i=0}^{d-1}\binom{k}{i}(m-1)^i,$
giving the desired bound on $|C|$. ∎
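The greedy argument in this proof is constructive, and for tiny parameters one can run it directly. The following sketch (ours; exponential-time, for illustration only) scans $[m]^k$ and keeps every word at Hamming distance at least $d$ from all words kept so far, producing a maximal code of the kind Lemma 3.7 guarantees.

```python
from itertools import product

def hamming(x, y):
    return sum(a != b for a, b in zip(x, y))

def greedy_code(m, k, d):
    # Keep a word whenever it has distance >= d from everything kept so far;
    # the resulting code is maximal, which is what the volume bound needs.
    code = []
    for x in product(range(m), repeat=k):
        if all(hamming(x, y) >= d for y in code):
            code.append(x)
    return code

C = greedy_code(3, 4, 3)
print(len(C))  # at least 3**4 / (1 + 4*2 + 6*4) = 81/33, i.e. at least 3
```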
Proposition 3.8.
For sufficiently large in terms of , we have
Proof.
We start with the following fact.
Claim 3.9.
For all $\epsilon>0$, there exists a constant $c_\epsilon>0$ such that if $d\le c_\epsilon k$, then $\sum_{i=0}^{d-1}\binom{k}{i}\le 2^{\epsilon k}$.
Proof.
In [Cov99] it is noted that $\sum_{i\le\delta k}\binom{k}{i}\le 2^{H(\delta)k}$ for all $\delta\le1/2$, where $H(\delta)=-\delta\log_2\delta-(1-\delta)\log_2(1-\delta)$ is the binary entropy function. Because $H(\delta)$ tends to 0 as $\delta$ tends to 0, there exists a constant $c_\epsilon>0$ such that $H(c_\epsilon)\le\epsilon$, and the result follows by taking $\delta=c_\epsilon$. ∎
Let , and assume is sufficiently large in terms of so that , i.e. so that . We also choose sufficiently large so that , or equivalently so that . Let be a set as in Lemma 3.7, and by our assumptions above we find
Possibly by deleting elements from $C$ we can assume that $|C|$ is exactly the quantity stated above (strictly speaking we should take $|C|$ to be the floor of this value to guarantee that it is an integer; this would change our ultimate bound by at most a factor of 2, and this factor of 2 can easily be recovered by sharpening our analysis in various places), so by Lemma 3.5 it suffices to show . Using the inequality and that is sufficiently large, we have
Thus , proving the result. ∎
With this we can now prove Theorem 1.3.
Proof of Theorem 1.3.
The upper bound follows from Proposition 3.4. To prove the lower bound, fix an integer $\ell$. For $0\le i<\lfloor n/\ell\rfloor$, let $E_i$ be the event that $\pi$ contains the subsequence $i\ell+1,\,i\ell+2,\,\ldots,\,(i+1)\ell$.
Claim 3.10.
We have the following:
(a) If any event $E_i$ occurs, then $L(\pi)\ge\ell$,
(b) The events $E_i$ are mutually independent, and
(c) For all $i$, we have $\Pr[E_i]=\Pr[L_1(\pi')\ge\ell]$ where $\pi'\in S_{\ell,m}$ is chosen uniformly at random.
Proof.
Part (a) is clear, and (b) follows from the fact that the events $E_i$ involve the relative ordering of disjoint sets of letters. For (c), one can consider the map which sends $\pi$ to a word $\pi'\in S_{\ell,m}$ by deleting every letter in $\pi$ except for $i\ell+1,\ldots,(i+1)\ell$ and then relabeling these to $1,\ldots,\ell$. It is not difficult to see that $E_i$ occurs if and only if $L_1(\pi')\ge\ell$, and that $\pi$ being chosen uniformly at random implies $\pi'$ is chosen uniformly at random. ∎
Let and let be the integer such that . The claim above implies that for all we have
(12)
with this last step using for , that , and that .
It is easy to see by definition that for all ; and for sufficiently large, we have . For such , by (12) we have for all that
(13)
Summing this bound over all for gives
This gives the desired lower bound of for since .
We now consider . By Proposition 3.8 we have for sufficiently large in terms of that . Let be large enough in terms of so that this bound holds for . Also let be large enough so that . By Lemma 3.1, if , then . Thus by (12) we have
where this last step used . This quantity is at least for (and hence ) sufficiently large. Using this together with (13) for gives, for sufficiently large in terms of ,
proving the desired result. ∎
4 Concluding Remarks
In this paper we resolved a conjecture of Diaconis, Graham, He, and Spiro [DGHS21] by asymptotically determining $\mathbb{E}[L_1(\pi)]$ provided $n$ is sufficiently large in terms of $m$. Using similar ideas, it is possible to compute the asymptotic limit of the $k$-th moment $\mathbb{E}[L_1(\pi)^k]$ for any fixed $k$. Based on computational evidence for these higher moments, we conjecture the following:
Conjecture 4.1.
For all $k$, if $n$ is sufficiently large in terms of $m$, then
where and
One can show that the standard deviation of $L_1(\pi)$ is asymptotic to . Thus, this conjecture would imply that the standardized moments converge to 0 for odd $k$ and to $(k-1)!!$ for even $k$. These are exactly the moments of a standard normal distribution, and in fact this would imply that $L_1(\pi)$, after standardization, converges in distribution to a standard normal distribution; see for example [FK16, Corollary 21.8].
Perhaps a first step towards proving Conjecture 4.1 would be to get a better understanding of the (non-centralized) moments $\mathbb{E}[L_1(\pi)^k]$, and to this end we conjecture the following:
Conjecture 4.2.
For all $k$, if $n$ is sufficiently large in terms of $m$, then
We can prove that the moment is asymptotic to , but we do not know how to determine the coefficient of . We were unable to observe any pattern for the coefficients of lower order terms.
In this paper, we considered continuously increasing subsequences in multiset permutations, and it is natural to consider other types of subsequences in multiset permutations. Perhaps the most natural to consider is the following:
Question 4.3.
For $\pi\in S_{n,m}$, let $\mathrm{LIS}(\pi)$ denote the length of a longest increasing subsequence in $\pi$. What is $\mathbb{E}[\mathrm{LIS}(\pi)]$ asymptotic to when $m$ is fixed?
When $m=1$ it is well known that $\mathbb{E}[\mathrm{LIS}(\pi)]\sim 2\sqrt{n}$ (see [Rom15]), so Question 4.3 is a natural generalization of this classical problem. See also recent work of Al-Meanazel and Johnson [AMJ20] for some results concerning the distribution of $\mathrm{LIS}(\pi)$.
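For readers who wish to experiment with Question 4.3, longest increasing subsequences can be computed in $O(N\log N)$ time by patience sorting. The sketch below (ours) does this for a random element of $S_{n,m}$, assuming "increasing" means strictly increasing; replacing bisect_left with bisect_right computes the weakly increasing analogue.

```python
import bisect
import random

def lis_length(pi):
    # Patience sorting: tails[k] is the smallest possible last element of a
    # strictly increasing subsequence of length k + 1.
    tails = []
    for x in pi:
        k = bisect.bisect_left(tails, x)
        if k == len(tails):
            tails.append(x)
        else:
            tails[k] = x
    return len(tails)

n, m = 400, 3
pi = [i for i in range(1, n + 1) for _ in range(m)]
random.shuffle(pi)
print(lis_length(pi))  # for m = 1 this concentrates around 2*sqrt(n)
```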
Acknowledgments. We thank S. Venkitesh for fruitful conversations and Persi Diaconis for comments regarding an earlier draft. We thank the Graduate Student Combinatorics Conference 2021 for hosting an open problem session, at which the fourth author presented the problem which inspired this work.
References
- [AMJ20] Ayat Al-Meanazel and Brad C. Johnson. The distribution of the length of the longest increasing subsequence in random permutations of arbitrary multi-sets. Methodology and Computing in Applied Probability, 22:1009–1021, 2020.
- [BDJ99] Jinho Baik, Percy Deift, and Kurt Johansson. On the distribution of the length of the longest increasing subsequence of random permutations. Journal of the American Mathematical Society, 12(4):1119–1178, 1999.
- [CG88] Brian Conrey and Amit Ghosh. On the zeros of the Taylor polynomials associated with the exponential function. The American Mathematical Monthly, 95(6):528–533, 1988.
- [Cov99] Thomas M. Cover and Joy A. Thomas. Elements of information theory. John Wiley & Sons, 1999.
- [DG81] Persi Diaconis and Ronald Graham. The analysis of sequential experiments with feedback to subjects. The Annals of Statistics, 9(1):3–23, 1981.
- [DGHS21] Persi Diaconis, Ron Graham, Xiaoyu He, and Sam Spiro. Card guessing with partial feedback. Combinatorics, Probability and Computing, pages 1–20, 2021.
- [DGS21] Persi Diaconis, Ron Graham, and Sam Spiro. Guessing about guessing: Practical strategies for card guessing with feedback. American Mathematical Monthly (to appear), 2021.
- [FK16] Alan Frieze and Michal Karoński. Introduction to random graphs. Cambridge University Press, 2016.
- [GZ20] Marcel K. Goh and Rosie Y. Zhao. Arithmetic subsequences in a random ordering of an additive set. arXiv preprint arXiv:2012.12339, 2020.
- [HK81] J.D. Horton and Andrew Kurn. Counting sequences with complete increasing subsequences. Congressus Numerantium, pages 75–80, 1981.
- [LS77] B.F. Logan and L.A. Shepp. A variational problem for random Young tableaux. Advances in Mathematics, 26(2):206–222, 1977.
- [Rom15] Dan Romik. The surprising mathematics of longest increasing subsequences. Cambridge University Press, 2015.
- [Spe14] Joel Spencer. Asymptopia, volume 71. American Mathematical Society, 2014.
- [VK77] A.M. Vershik and S.V. Kerov. Asymptotics of the Plancherel measure of the symmetric group and a limiting form for Young tableaux. Doklady Akademii Nauk SSSR, 233(6):1024–1027, 1977.
- [Zem05] Stephen Zemyan. On the zeroes of the nth partial sum of the exponential series. American Mathematical Monthly, 112(10):891–909, 2005.