12
Height of walks with resets, the Moran model, and the discrete Gumbel distribution
Abstract.
In this article, we consider several models of random walks in one or several dimensions, additionally allowing, at any unit of time, a reset (or “catastrophe”) of the walk with probability . We establish the distribution of the final altitude. We prove algebraicity of the generating functions of walks of bounded height (showing in passing the equivalence between Lagrange interpolation and the kernel method). To get these generating functions, our approach offers an algorithm of cost , instead of cost if a Markov chain approach would be used. The simplest nontrivial model corresponds to famous dynamics in population genetics: the Moran model.
We prove that the height of these Moran walks asymptotically follows a discrete Gumbel distribution.
For , this generalizes a model of carry propagation over binary numbers considered e.g. by von Neumann and Knuth.
For generic , using a Mellin transform approach,
we show that the asymptotic height exhibits fluctuations for which we get an explicit description (and, in passing, new bounds for the digamma function).
We end by showing how to solve multidimensional generalizations of these walks (where any subset of particles is attributed a different probability of dying)
and we give an application to the soliton wave model.
Key words and phrases:
Random walks, renewal process, Moran model, analytic combinatorics, discrete Gumbel distribution, Mellin transform, kernel method, digamma function1. Introduction
The height of random walks is a fundamental parameter which occurs in many domains: in computer science (evolution of a stack, tree traversals, or cache algorithms [39]), in reliability or failure theory (maximal age of a component and inference statistics on the longevity before replacement [24]), in queueing theory (maximal length of the queue, with e.g. applications to traffic jam analysis [37]), in mathematical finance (e.g. in risk theory [28]), in bioinformatics (pattern matching and sequence alignment [2]), etc.
In combinatorics, random walks are studied via the corresponding notion of lattice paths, which play a central role, not only for intrinsic properties of such paths, but also as they are in bijection with many fundamental structures (trees, words, maps, …). We refer to the nice magnum opus of Flajolet and Sedgewick on analytic combinatorics [22] for many enumerative and asymptotic examples.
While the behavior of an extremal parameter such as the height is well understood for walks corresponding to Brownian motion theory, it becomes more subtle when a notion of reset/renewal/resetting/catastrophe [8, 14, 33, 9, 29, 42, 40] is introduced in the model: indeed, typical behaviors in this model are often established by conditioning on events of probability zero in the model without reset, leading to possibly counterintuitive results.
In this article, we give several enumerative and asymptotic results on different statistics (final altitude, waiting time, height) of walks with resets, focusing on the so-called Moran walks (walks related to biological/population models considered by Moran in 1958; see Section 5 for more on this).
Plan of the article.
In Section 2, we consider a generic model of walks with resets (allowing any finite set of steps and a reset step). We describe the behavior of their final altitude (at finite time, and asymptotically). We obtain an algebraic closed form for the bivariate generating function (length/final altitude) for walks of bounded height . Our approach uses a variant of the so-called kernel method, which has the advantage to avoid any case-by-case computation based on Markov chains/transfer matrices of size . In passing, we show the intimate link between Lagrange interpolation and the kernel method.
In Section 3, we consider Moran walks, a model described in Figure 1, for which we generalize an enumerative formula due to Pippenger [45]. We show that their height asymptotically follows a distribution which involves non-trivial fluctuations. We prove that this distribution is a discrete Gumbel distribution, and we clarify its links with the continuous Gumbel distribution. We give an application to the waiting time for reaching any given altitude.
In Section 4, we begin with a brief presentation of the Mellin transform method, and then use it to derive a precise analysis of the asymptotic average and variance of the height. The second asymptotic term involves some fluctuations given by a Fourier series (which we prove to be infinitely differentiable, and for which we also derive generic bounds of independent interest). This extends (and fixes some error terms) in earlier analyses by von Neumann, Knuth, Flajolet and Sedgewick [13, 38, 22].
In Section 5, we tackle some multidimensional generalizations of Moran walks, with applications to a model in population genetics and to a wave propagation model (a soliton model), as considered by Itoh, Mahmoud, and Takahashi in [35, 34].
In Section 6, we conclude with a few possible extensions for future work.

2. Walks with resets: final altitude and height
We consider walks with steps in (where is a nonempty finite subset of ), which can additionally have a reset at any altitude. That is, we have the following process on :
(So if we have with probability .)
Thus, is the altitude of the process after steps and is its height. It is convenient to encode the steps and their probabilities by the Laurent polynomial
(1) |
We assume to avoid degenerate cases. We do not require that or . Of course, if , the walk will live by design in (it is e.g. the case for Moran walks of Figure 1). In Section 2.1, we determine the distribution of the final altitude (as illustrated in Figure 2 for different families of steps) and we investigate the height in Section 2.2.




2.1. Final altitude
Let us start with a simple result which paves the way for the more subtle generating function manipulations for the height that we tackle later in Section 2.2.
We use the classical convenient notations:
-
•
stands for the coefficient of in the power series ,
-
•
is the -th derivative of with respect to , evaluated at .
Theorem 2.1 (Final altitude at finite time).
The final altitude of walks with resets follows a discrete law with probability generating function
(2) |
where is the Laurent polynomial encoding the allowed steps (a finite subset of ). Equivalently, for , we have
(3) |
Let be the drift111We recall that , so another convention could have been to call drift the quantity , i.e., we would then condition on having no reset (instead of considering walks without reset, weighted by the initial model (1)). This alternative convention does not simplify the subsequent formulas. of the walk without reset, and its second factorial moment. The mean and the variance of the final altitude of the walk with resets are given by
For Moran walks (i.e., and ), the mean and the variance simplify to
Proof 2.2.
The probability generating function can be written as
where the ’s are Laurent polynomials encoding the location of the walk at time ; thus we have , with . Multiplying both sides of this recurrence by , and summing over , one gets
As , one obtains Formula (2). Note that the generating function can also be obtained by using a regular expression encoding these walks (by factorizing the walk in factors ending by a reset): , which translates to
where the occurrences of and reflect that only the altitudes after the last reset contribute to the final altitude of the full walk. Using , we get Formula (2).
The mean of is then obtained via , while its variance is obtained via a second-order derivative: .
We can now establish the corresponding limit distribution.
Theorem 2.3 (Final altitude: asymptotics).
Consider walks with , , and (these three constraints bring no loss of generality222 There is no loss of generality. Indeed, if the walk as a periodic support (i.e., if with ) we rescale (without loss of generality) the step set by dividing each step by . Now, if , then we multiply each step by . Last, if we consider instead the equivalent model and .). Therefore the support of the walk is either (with all altitudes being reachable), or (with a finite set of altitudes impossible to reach, known as the unreachable set in the coin-exchange problem of Frobenius). The final altitude of these walks with resets behaves asymptotically according to these two cases.
-
a)
For walks with , we have for (not in the Frobenius unreachable set):
(4) In particular, for Moran walks, we have for and so .
-
b)
For walks with and , we have for :
Moreover, both in Case a) and in Case b), has a geometric decay for large .
Proof 2.4.
In Case a), we have ; the definition of in (1) then entails for large . The limit of Equation (3) thus gives
(5) |
In particular, when it is not , this quantity is lower bounded by and upper bounded by , and therefore decreases geometrically.
In Case b), the proof is more complicated and will recycle ingredients of the asymptotics of walks without reset. To this aim, first set , i.e., the step setprobabilities are renormalized to have global mass . Let be the probability generating function of walks without reset, i.e., . We then rewrite Equation (3) as
(6) |
If and , then there is a unique real such that . It is proven in [5] that is the radius of convergence of and that , where .
Note that, as we have a probability generating function, we have . The asymptotics of (6) then follows by singularity analysis, as is singular at , that is, before which is singular at :
(7) |
Note that Formulas (10) and (11) in [5, Theorem 1] give a closed form for . It implies in particular
(8) |
where and are constants independent of ; thus decays geometrically for . This concludes our analysis of Case b) and gives the theorem.
These limiting behaviors are thus in sharp contrast with the asymptotic behavior of the final altitude of walks on with no resets, which is , with fluctuations given by a continuous distribution (Rayleigh or Gaussian; see [5]).
2.2. The height
In order to study the height of these walks with resets, one considers the subset of them made of walks conditioned to have a height smaller than . We want to obtain an explicit formula for their generating function
If these walks are generated by a step set having only positive jumps, a natural but naive approach to enumerate them would be to create a deterministic finite automaton (a finite discrete Markov chain) with states encoding the possible altitudes of the process. It leads to a system of linear equations which would allow us to get the corresponding rational generating function. However, this approach to obtain the generating function (given and the transition probabilities) suffers from three drawbacks:
-
•
it would be of complexity (computing determinants of matrices),
-
•
it would be a case-by-case approach (new computations are needed for each ),
-
•
it would fail if the step set has some negative steps (then the support of the walkis , and thus one would need an automaton with an infinite number of states).
So, we prefer here to use a more efficient approach, which relies on a powerful method (namely, the kernel method [7]): the complexity to obtain a closed-form formula for then drops333The PhD thesis of Louis Dumont [17] compares the cost of different methods to compute the coefficients of such generating functions (which can be related to diagonals of rational functions); the full analysis has to take into account the space and time complexities, and some precomputation steps, of cost of course higher than , but in all cases it is more efficient than a Markov chain approach (see however Bacher [3] for a clever use of a transfer matrix point of view). from to for any finite step set ! This leads to the following theorem.
Theorem 2.5.
Let be the probability generating function of walks on of height with resets, where the length and the final altitude of the walks are respectively encoded by the exponents of and . Let encode the allowed jumps as in (1). One has
(9) |
where
(10) |
is the generating function of walks of height without reset, and where are the roots of such that .
Remark 2.6 (A rational simplification).
These generating functions are algebraic, as they rationally depends on the roots , which are themselves algebraic functions. Now, when the step set has only positive steps, is a polynomial and simplifies to a rational function (despite the fact that their closed forms (10) and (9) involve algebraic functions!). This simplification can be seen either by the automaton point of view and the Kleene theorem, or by using the Vieta formulas on Newton sums (as, when one has only positive jumps, the ’s are then all the roots of the kernel ). For example, for and , we have
(11) |
(the Vieta formulas are here: and ); then, the quotient (9) involving these algebraic functions and simplifies, leading to
Proof 2.7 (Proof of Theorem 2.5).
The probability generating function can be written as
where encodes the possible values of (constrained to be bounded by over the full process), and where
is the probability generating function of bounded walks ending at altitude .
The dynamics of the process then entails the recurrence
where extracts monomials having a degree in strictly larger than . This mimics that at time , either, with probability , we increase by the altitude of where we were at time (that is, we multiply by , and this is allowed as long as the walk stays at some altitude , thus we removed here the cases corresponding to the walks which would reach an altitude at time ); or, with probability , we have a reset to altitude 0 (i.e., all the mass of the walks at any altitude , corresponding to the coefficient of , is sent back to ; this is thus captured by the substitution ).
This directly translates to the functional equation
Setting , we get the functional equation for the generating function of walks of height without reset:
(12) |
Of course, the factorization of walks with resets into entails , which is Formula (9). So if we find a closed form for , we are happy as this also solves the initial problem for . Now, on the right-hand side of (12), the sum for from to is a polynomial in , which we conveniently rewrite as
(13) |
It is possible to solve such an equation via the kernel method: the kernel is the factor in (13), and if one considers the equation on the variety defined by , this brings additional equations which will allow us to get a closed form for . First, observe that this kernel is a (Laurent) polynomial in of “positive” degree . Then, from an analysis of its Newton polygon, one gets that it has roots such that for (the other roots being convergent at ; see [5] for more on this issue). Thus, setting (for ) in the functional equation (13) gives new equations. Some care is required in this step: we have to check that one does not create series involving an infinite number of monomials with negative exponents444Let be the ring of series . The Cauchy product of two series in is well defined only with some additional convergence conditions, and, even if we restrict ourselves to series for which the product is well defined, we have to take care to the fact that they do not form an integral ring: indeed, we have many divisors of zero (e.g. for , we have and thus ). Most algebraic manipulations in this ring, if they are temporarily handling quantities which are not in the subring of power series (or Laurent/Puiseux/Fourier series), would lead to invalid identities in ..
In fact, in our case, the substitution is legitimate as becomes a well-defined Puiseux series in : this follows from the fact that the coefficients are (Laurent) polynomials with “positive” degree bounded by (and “negative” degree lower bounded by ), so is a Puiseux series with exponents from to . Then, multiplying by and summing over , only a finite number of summands contribute to each monomial of , which is thus well defined. Via these substitutions , we obtain a linear system of equations (which only contains the ’s as unknowns). Then, by Cramer’s rule, we get , where
that is, is the matrix with its -th column entries replaced by . Thus, as is a Vandermonde matrix, its determinant is
(14) |
Now, to compute , one first proves that
(15) |
where we used the classical notation for the elementary symmetric polynomials:
(16) |
e.g., . Formula (15) follows from 2 facts:
-
•
If , then two rows of are equal and thus the determinant is 0; this explains the Vandermonde product on the right-hand side of Formula (15).
-
•
Now writing the determinant as a sum over the permutations of the entries gives a sum of monomials, each of total degree in the ’s. being of total degree , it implies that is a polynomial which is symmetric and homogeneous of total degree . Up to a constant factor (determined to be 1, by comparing any monomial), this polynomial has to be , which captures exactly the missing ’s in each of the summands.
Then, performing a Laplace expansion of on its -th column and using Formula (15), one gets (after simplification in the Cramer formula):
(17) |
Remark 2.8 (Link with Lagrange interpolation).
As we know the evaluation of the right-hand side of (13) in each of the , Formula (18) is also equivalent to the Lagrange interpolation formula (which we thus reproved en passant). Moreover, this Lagrange interpolation approach offers a nice advantage: it is circumventing the fact that the factorization argument used to get the closed forms for the generating functions in [12, 5] works only if the walks start at altitude 0.
Now, if we go back to Moran walks (i.e., for ; see Figure 1), the generating function simplifies to the following noteworthy shape.
Corollary 2.9.
The probability generating function of Moran walks of height is
(19) |
where, in the power series, the length and the final altitude of the walks are respectively encoded by the exponents of and . Accordingly,
(20) | ||||
(21) |
with the convention that if .
Proof 2.10.
The closed form (21) is obtained via the power series expansion by applying the binomial theorem to each term , with .
The binomial sum (21) generalizes a formula obtained (for ) by Pippenger in [45]. Therein, it is derived by an inclusion-exclusion principle (guided by the combinatorics of the carry propagation in binary words); for his problem, the generating function, and thus the corresponding binomial sum, are a little bit simpler than (20) and (21), and are then used to perform some real analysis for the asymptotics of the expected length.
In our case, equipped with this explicit expression for the probability generating function of Moran walks of bounded height, we can now tackle the question of the asymptotic distribution of this extremal parameter.
3. Asymptotic height of Moran walks
In this section, we establish a local limit law for the distribution of the height of Moran walks. One noteworthy consequence of the generating function explicit formula that we get in the previous section is that it allows us to have very efficient computations and simulations of the process at time , for large , as stressed by the following remark.
Remark 3.1 (Fast computation scheme for any given and ).
One does not need to run the process for steps to have the exact distribution of . Indeed, using the rational generating function from Corollary 2.9, for any , , and , it is possible to get the exact value of in time via binary exponentiation.
This allows us to plot the distribution , for quite large values of (as an example, see Figure 3). Note that for our other generating functions, which are algebraic, there exists a fast algorithm of cost to compute their -th coefficient (this algorithm works more generally for all D-finite functions). This algorithm due to the brothers Chudnovsky is e.g. implemented in the Maple computer algebra system via the package Gfun; see [49]


3.1. Localization of the dominant singularity
As (as given by Equation (19)) is a rational function, all its singularities are poles. The asymptotic behavior of the coefficients of is governed by the closest pole(s) to zero (also called “dominant singularities” of ). A natural candidate for being such a dominant singularity of would be , but it is in fact a removable singularity, as one has (e.g. via L’Hôpital’s rule) . Thus, we can focus on the other roots of the denominator of .
Lemma 3.2 (Localization of the singularities of ).
For , the roots of are such that we have for large enough:
-
(i)
is the unique root strictly between 1 and ;
-
(ii)
is the unique root of modulus ;
-
(iii)
the remaining roots are all of modulus , and arbitrarily close (in modulus) to (for );
-
(iv)
all the roots are simple.
Accordingly, is the dominant singularity of .
Proof 3.3.
Let be the unique positive zero of given by
As tends to from the left, we thus have for large enough. Moreover, is decreasing for all in the interval and increasing in the interval . As , one thus has . And since , the intermediate value theorem implies the existence of (at least) one zero of between and . Combined with the (non)decreasing properties of , this entails the unicity of this zero; let us call it . Then, Pringsheim’s theorem (see e.g. [22]) asserts that has a real positive dominant singularity which is thus , the first real positive zero of . As is a probability generating function, all its singularities are of modulus . So we have and thus proved (i).
We now prove (ii). The fact that is a root follows from . Is there any other root of the same modulus? If (with ) would be a root of , then this would imply . By the reverse triangle inequality (with equality only if or ), this would entail .
To prove (iii), we use the following version of Rouché’s theorem: if on the boundary of a disk , then and have the same number of roots inside . We can apply this theorem to with , for the disk : on its boundary, one indeed has , where the first strict inequality holds for and the next strict inequality holds for any small enough (independently of ), as we have then . As the constraint on is independent of , letting , we infer that has only one root strictly inside .
Now we can also apply this theorem to with : on the boundary of the disk , one indeed has, for large enough (depending on ),
where the last is just a crude bound of the term
which converges to for .
So , like , has roots inside this disk.
To prove (iv), note that the equation is forcing , but for , therefore all the zeros are simple for large enough.
3.2. Limit distribution of the height: the discrete Gumbel distribution
The height distribution exhibits some a priori surprising asymptotic aspects, having a flavor of number theory/Diophantine approximation. Such phenomena, however, appear for a few other probabilistic processes where some statistics could have different asymptotic behaviors depending on some resonance between and (see e.g. Janson [36] or Flajolet, Vallée, and Roux [21] for some examples related to tries or binary search trees). In our case, it appears that a resonance between and plays a role.
Theorem 3.4 (Distribution of the height of Moran walks).
We have
(22) |
where the error term is uniform for . Accordingly, is unimodal, with a peak at , the closest integer to , where , and we have
(23) |
Moreover, the mass is sharply concentrated around , as better seen by the following result, with a uniform error term in :
with (where stands for the fractional part of , and where stands for the floor function of ). [See Figure 3 on page 3 for an illustration of the distribution of and Figure 4 for the behavior of the function .]

Proof 3.5.
In the sequel, as the context is explicit, we simply denote by the zeros of . From Lemma 3.2, for large enough, all these zeros are simple; the partial fraction decomposition of is then
and as , one thus gets
It is combinatorially obvious that for all . So we now focus on , for which we have, as and :
(24) |
where (note that the summand involving cancels), and where is the contribution coming from the pole .

Set . Then , thus this implies ; therefore we have . Now, for tending to , this entails that the contribution of the pole (as given by Equation (24)) satisfies
(25) | ||||
(26) | ||||
(27) |
Observe that
(28) |
(Here and in the sequel we always consider and . In fact, is not needed right now, but this will be required for the asymptotics of the mean of in Section 4.)
For such values of , the asymptotics of the first factor in Equation (27) is
(29) |
and the asymptotics of the second factor in Equation (27) is
(30) | |||
(31) |
In this expansion, one now has to check which error term dominates. It is the big-oh term with if and the big-oh with if . Multiplying with the asymptotic expansion from Equation (29) and using the approximation (24), we get the following result (in which we simplified the part of the error term in a non-optimal way which will be enough for our purpose):
(32) |
Moreover, this approximation holds for all : first, for this follows from the fact that is increasing with respect to , and then for this follows from the bound (50) hereafter.
In conclusion, for , for any such that (with ), we have uniformly in (when ):
and we get Theorem 3.4 by setting .
If , we have (where the symbol stands for the binary logarithm, ). This subcase of particular interest corresponds to a problem initially considered in 1946 by Burks, Goldstine, and von Neumann [13]: the study of carry propagation in computer binary arithmetic; it constitutes one of the first analyses of the cost of an algorithm! They gave crude bounds which were deeply improved by Knuth in 1978 [38]. This problem can also be seen as runs in binary words, and, as such, is analyzed by Flajolet and Sedgewick [22, Theorem V.1]. Therein, the analysis unfortunately contains a few typos which affect some of the error terms. Our proofs are incidentally fixing this issue.
These extremal parameters (runs, longest carry) are archetypal examples of problems leading to a Gumbel distribution (or a discrete version of it). This distribution indeed often appears in combinatorics as the distribution of parameters encoding a maximal value: e.g., maximum of i.i.d. geometric distributions [51], longest repetition of a pattern in lattice paths [46], runs in integer compositions [23], carry propagation in signed digit representations [30], largest part in some integer compositions, longest chain of nodes with a given arity in trees, maximum degree in some families of trees [47], the maximum protection number in simply generated trees [31]. For some of these examples, it was proven only for some specific families of structures, but there is no doubt that it holds generically. A general framework leading to such double exponential laws is given by Gourdon [26, Theorem 4] for the largest component in supercritical composition schemes (see also Bender and Gao [10]). We refer to Figure 6 for an illustration of some of these parameters.
|
|
|
||||||||||
|
|
|
The Gumbel distribution is also called the “double exponential distribution”, or the “type-I generalized extreme value distribution”, and can also be expressed as a subcase of the Fisher–Tippett distribution. Let us give a formal definition.
Definition 3.6 (Gumbel distribution).
A continuous random variable with support follows a Gumbel distribution (of parameters and ), denoted by , if
Its mean satisfies (where is Euler’s constant) and its variance satisfies . It is unimodal with a peak at and its median is at .
Definition 3.7 (Discrete Gumbel distribution).
A discrete random variable follows a discrete Gumbel distribution of parameters and , which we denote 555With a slight abuse of notation, we use the same notation for both the continuous distribution and the discrete distribution, adding the right adjective if needed to remove any ambiguity., if
(33) |
In particular, one can always write , where follows a continuous ; note on the other side that follows a discrete .
To obtain a nice formula for the mean and variance of a discrete Gumbel distribution remains an open problem: for example, for , we have
(and it takes 5 seconds to get thousands of digits, as the terms decrease doubly exponentially fast), but will anybody find a closed form for this mysterious constant? Some insight on the variance of the discrete distribution can be obtained from the continuous distribution via the following trivial but useful bounds which hold more generally as soon as :
(34) |
We can now restate our previous theorem in terms of this discrete Gumbel distribution.
Corollary 3.8 (Gumbel limit law).
The sequence of random variables converges for (in distribution and in moments) to the discrete distribution with . Accordingly, it implies that
Proof 3.9.
Consider the sequence of random variables . Then, the change of variable in Equation (22), with allows us to match (where the limit is in distribution) with the discrete Gumbel defined in (33), for and . Due to the exponentially small uniform error term in (22) on the support of , we have a convergence in moments of to . Then, the asymptotics of the moments follow by applying the bounds (34) on the link between the mean/variance of the discrete and continuous Gumbel distribution.
These moment asymptotics already constitute a notable result (falling as a good ripe fruit!), but a very interesting phenomenon is hidden in these imprecise errors terms: some bodacious fluctuations, that we fully describe in Section 4.
3.3. Waiting time
Let us end this section with an application to a natural statistic: the waiting time , i.e., the number of steps spent by the random walk when it reaches a given altitude for the first time. There is an intimate relationship between height and waiting time (stated more formally in Equation (37) hereafter); it is thus natural that they have enumerative and asymptotic formulas of a similar nature, as better shown by the following corollary.
Corollary 3.10.
The waiting time for reaching height satisfies
(35) |
The distribution function of satisfies
(36) |
Proof 3.11.
Consider a walk reaching for the first time altitude at time . Cut it after each reset. It gives a sequence of factors of length , followed by a last factor with up steps. This translates into the combinatorial formula
which simplifies to Formula (35). Now, for the distribution function, instead of redoing a full analysis based on a partial fraction decomposition of this generating function, it is more convenient to use the relation
(37) |
thus this waiting time also satisfies
(38) |
Then, using Theorem 3.4, we also have
Via Formula (38) linking the waiting time and the height , this entails (36).
We now turn to a finer analysis of the mean and variance of .
4. Mean and variance of the height
4.1. Fundamental properties of the Mellin transform
In order to get a fine estimation of the average height, we use a Mellin transform, which, as we shall see, is the key tool to handle the corresponding asymptotics. We now present the needed definitions and formulas. We refer e.g. to Flajolet, Gourdon, and Dumas [19] or to the book Analytic Combinatorics [22, Appendix B.7] for more on the Mellin transform and numerous applications to asymptotics of harmonic sums, digital sums, and divide-and-conquer recurrences.
Definition 4.1 (Mellin transform).
Let be a continuous function defined on the positive real axis . The Mellin transform of is the function defined by
This integral exists only for such that the function is integrable on . Thus, if there exist two real numbers and , such that and
(39) |
then the function is well defined for any complex number with real part such that ; this domain is called the fundamental strip of . Moreover, for all in this domain, if converges uniformly to 0 for , then the function can be expressed for as the following inverse Mellin transform:
(40) |
As an example, let us consider the gamma function, which illustrates well the role of the fundamental strip (and this example will also play a role in the next pages).
Example 4.2 (The gamma function as a Mellin transform).
The gamma function satisfies
(41) | ||||
(42) |
An important consequence of Formula (40) is that, if is a meromorphic function on , and if , then one can push the integration contour of Formula (40) to the right (taking ) and one then collects in passing the contributions from the residue at each pole to the right of the fundamental strip. Now, for and , multiplying by the Laurent series of at , we see that can be expressed666The notation stands for the residue of at . as a sum of terms, and one gets
(43) | ||||
(44) |
4.2. Average height of Moran walks
We now state the main result of this section.
Theorem 4.3 (Average height).
The average height of Moran walks of length is given by
(45) |
where is Euler’s constant, and where is an oscillating function (a Fourier series of period ) given by
(46) |
Remark 4.4 (Fourier series representation).
The fact that is a Fourier series of period and is real for is better seen via the alternative equivalent expression
where and stands for the real and imaginary parts. This is illustrated in Figure 7.
Remark 4.5 (Fourier series differentiability).
Such asymptotics involving fluctuations dictated by a Fourier series are typical of results obtained via Mellin transforms. They often appear in the asymptotic cost of divide-and-conquer algorithms, or of expressions involving digital sums, harmonic sums, or finite differences (see the work of de Bruijn, Knuth, and Rice [15, 38], or Flajolet, Gourdon, and Dumas [19]). It is sometimes also possible to get them via some real analysis (like Pippenger did [45]), or like in the seminal work of Delange [16] on the sum of digits. Note that the Delange series is nowhere differentiable, while our Fourier series is infinitely differentiable, as proven in Theorem 4.15.
|
|
Proof 4.6 (Proof of Theorem 4.3).
The proof exploits the fact that the mean asymptotically behaves like ; this is proven by rewriting as follows:
(47) |
with
The key is to prove that, for some , , and adequately chosen, the sums , , and are asymptotically negligible, while the main contribution to comes from the last sum (namely, ), which we will evaluate via a Mellin transform approach.
The reader not enjoying delta-epsilon proofs could have the feeling that “cutting epsilons into 5 parts” like above is a little bit discouraging but this is the price to pay to get the error term in Formula (45). In fact, in Equation (47) for , it is possible to cut the sum into only 4 parts, but then this would lead to a final weaker error term.
So let’s be brave and begin with . Here, for the range , with ,
we get
where, for the second line we used that the sequences are increasing with respect to , and for the third line we used Formula (28) for and the approximation of Theorem 3.4. Note that this bound for also implies the uniform bound
(48) |
Now, for , in the range , with , we rewrite as . Such values of correspond to using and in the Formula (28) for .
Via the exponential bound on from Formula (32), we get
Then, for , in the range , with , we rewrite as . Such values of correspond to using and in the Formula (28) for . Via Formula (32), we get .
For the next sum, using the power series expansion of the exponential in Equation (27) (and keeping in mind that our choice of implies ), we get
(49) | ||||
(50) |
Finally, for the sum , we use the power series expansions of and of and we get:
We got that , , , , and are . It remains to evaluate . Such a sum is typical of expressions which can be evaluated by Mellin transform techniques. To this aim, let and set and , then
Let and be, respectively, the Mellin transform of the functions and . Using Identity (42) given in Example 4.2, we have on its fundamental strip and, as is a harmonic sum, its Mellin transform is
(51) |
This function extends analytically to the full complex plane, with isolated poles at the negative integers (due to poles of there), and with another set of isolated poles (the roots of ). These two sets of poles have in common. This implies that for the poles of are
(52) |
Using Formula (44) for the inverse Mellin transform, we obtain
We finally get the claim of the theorem by noting that .
4.3. Variance of the height of Moran walks
We now prove that the height of Moran walks, despite a mean of order and a second moment of order , has a variance which involves surprising cancellations at these two orders, leading to an oscillating function of order (in ), as implied by the following much more precise asymptotics.
Theorem 4.7.
Proof 4.8.
To obtain the variance of we first consider the second moment
(53) |
where we know from Theorem 3.4 that the summand can be approximated by
Then, partitioning the last sum in (53) into the same intervals as in Formula (47), we get that , where is the function defined by
From the behavior of at and , using the property given in (39), we get that the Mellin transform of is defined on the fundamental strip . Using the harmonic sum summation (51), one gets for in this strip:
Here, as we have
we finally get
(54) |
What are the poles of ? These are (a pole of order 3) and (for , which are poles of order 2). Using Formula (44) for the inverse Mellin transform, one thus obtains
(55) |
with the same as in (46), and where is another Fourier series given by
(56) |
(Similarly to , this Fourier series is always real, as can be seen by replacing by in Remark 4.4.)
4.4. Height of excursions
Excursions are walks in ending at altitude (where, as previously, time is encoded by the -axis, and altitude by the -axis). As in previous sections, let and be the final altitude and height of a walk, and let the random variable be the height of a walk of length conditioned to be an excursion, that is, . For Moran walks, we get the following behavior.
Theorem 4.9 (Distribution and moments of the height of Moran excursions).
The distribution of the height of excursions satisfies (for a uniform error term in )
(57) |
with (where stands for the fractional part of , and where stands for the floor function of ).
Proof 4.10.
As a Moran excursion necessarily ends by a reset, we have
(60) |
Thus, we have , , and , we can therefore directly recycle the results of Theorems 3.4, 4.3, and 4.7 to get the asymptotic distribution/mean/variance.
In this recycling, some care has to be brought while performing the substitution in the asymptotic formulas for the walks: indeed, this could impact intermediate asymptotic terms (smaller than the main asymptotic term, but larger than the error term); however, in our case, all is safe as we have
This result is a simple consequence of the combinatorially obvious identity (60), so this direct link between the asymptotics of walks and excursions holds in wider generality for any model of walks with resets for which the step set contains only positive steps.
4.5. Fourier series: bounds and infinite differentiability
In his seminal work [38], Knuth mentions at the end of his Section 3 that if one assumes that is equidistributed mod 1, then the sum is of “average 0”. Let us amend a little bit Knuth’s assertion. Indeed, Weyl’s criterion asserts that a sequence is equidistributed mod 1 if and only if, for any positive integer , we have
Considering this sum with and , and applying the Euler–Maclaurin formula to it, one gets that it does not converge to 0, and therefore is not equidistributed mod 1.
However, it is indeed true that the oscillating and are of mean value zero over their period (i.e., ; see Figure 7 on page 7), and that and are “almost” of mean value zero and that they possess small fluctuations. Let us give an explicit bound on their amplitude. To this aim, we first need to bound the digamma function777This is a rather misleading name: indeed, the digamma function is traditionally denoted by the letter psi (i.e., ), while it should logically be denoted by the Greek letter digamma (i.e., , a letter which looks like a big stack on a small , which later gave birth to the more familiar letter in the Latin alphabet). This paradox is due to the fact that Stirling, who introduced this function, did initially use the notation digamma , but later authors switched the notation to , while the initial name remained., defined by
(61) |
The function can be seen as an analytic continuation of harmonic numbers and satisfies . While several bounds for exist in the literature (see e.g. [52]), most of them are dedicated to (for example we have for ), so we now establish a lemma for (which we believe to be new, and which has its own interest beyond our application hereafter to bounds of Fourier series).
Lemma 4.11 (A bound for the digamma function on the imaginary axis).
For , we have
(62) |
which also implies the bound
Proof 4.12.
Using Euler’s representation of the gamma function as an infinite product, i.e.,
we get that its logarithmic derivative, , satisfies, for :
(63) |
We refer to [18, Section 1.1] for more details on these formulas. Now, setting (with ), and regrouping the imaginary and real parts gives
and thus, by the triangle inequality
(64) |
Here, note that for all , we have , and thus
Summing for from to , we obtain
So the first infinite sum in (64) is bounded by . For the second infinite sum, it is convenient to split it in the contribution from the summand for , which is bounded by
plus the remaining part (i.e., the sum of the terms for ):
Plugging these two bounds in (64) proves our lemma.
Equipped with the previous lemma, we can now give our bounds for and .
Proposition 4.13 (Uniform bounds for the oscillations).
The oscillating functions and are uniformly bounded by
(65) | ||||
(66) |
where
(67) |
For , we have more precisely
Proof 4.14.
Applying the triangle inequality on the definition of in (46), we get
(a quantity independent of , as ). Then, using the complement formula for the gamma function, we have (for ). Using this relation for (with ) together with the relation , we infer that
(68) |
Thus, for , this gives
(69) |
As, for , we have , we get
(70) | ||||
(71) |
Note that the more relaxed bound (71) is quite close to the stricter bound (69): e.g. for the bound (69) gives the upper bound (and one can numerically check that these first digits also constitute a lower bound), while the bound (71) gives the upper bound .
From this, we can establish the infinite differentiability of our fluctuations.
Theorem 4.15 (Fourier series infinite differentiability).
The Fourier series
(where ) are infinitely differentiable on .
Proof 4.16.
A Fourier series satisfies the Weierstrass -test if there exists a sequence such that (for all ) and converges. If and both satisfy the Weierstrass -test, then they converge absolutely and uniformly in , and .
Thus, by successive application of this -test, if the coefficients decay polynomially, i.e., we have , then is in (that is, times differentiable) and is in (that is, infinitely differentiable) if its coefficients decay faster than any polynomial rate. By Equation (68), the coefficients decay like , so is in . By Equation (72), the coefficients also decay like an exponential, so is in .
It is interesting to compare this smoothness result with the situation observed by Delange [16] in his seminal work on the sum of digits of in base (when is an integer). Therein, he proved an asymptotic behavior involving fluctuations dictated by a Fourier series, which can also be obtained by a Mellin transform approach, quite similarly to the road followed in our article. It appears that his Fourier series (already mentioned in Remark 4.5) has coefficients ; it is thus not surprising that the Delange series is nowhere differentiable, in sharp contrast with the smoothness of our Fourier series (see Figure 8).
This concludes our analysis of the height and the corresponding fluctuations.
|
|
|
5. Some results for the Moran model in dimension
5.1. Joint distribution of ages for the Moran model with
Moran processes are models of population evolution (or mutation transmission) where the population is of constant size (some individuals could die but are then immediately replaced by a new individual). Depending on the applications, several variants were considered in the literature starting with the seminal work of Moran himself [43, 44], up to more recent extensions (for example to spatially structured population [41].
Motivated by the model with resets of Itoh, Mahmoud, and Takahashi [35, 34], we now define the Moran model with individuals. It is a process parametrized by some probabilities and ’s such that , and which starts at time 0 with individuals of age . Then, at each new unit of time,
-
•
either, with probability , all survive (their age increases by 1),
-
•
either, with probability (for ), the -th individual dies (it is then replaced by a new -th individual of age 0), while the age of the surviving individuals increases by 1,
-
•
either, with probability , all die and are replaced by new individuals of age 0.
Now, we define the sequence of multivariate polynomials (for ) by the fact that the coefficient of in is the probability that, at time , the -th individual has age (for ). Accordingly, is the probability generating function associated to the above Moran model, where the time is encoded by the exponent of .
Theorem 5.1.
The probability generating function of the Moran model is a rational function, and it admits the closed form
(73) |
where the ’s are polynomials (given in the proof) in the ’s, , ’s, and where is the following polynomial of degree in :
(74) |
Proof 5.2.
The Moran model evolution is encoded by the following functional equation for the probability generating function :
(75) |
where means evaluated at .
To solve this single functional equation (which has unknowns888We temporarily count as unknown, even if it is obviously equal to , as is a probability generating function.), the trick is to transform it into a linear system of equations with… unknowns! Indeed, by substituting (in all the possible ways) in the functional equation (5.2), we get a system of equations.
Then, we encode this system by a matrix , where we cleverly (sic!) choose the order in which unknowns are associated to the lines/columns of . Let us define this order; to this aim consider the Cartesian product . For any pair of -tuples and from , one writes if the number of 1’s in is less than the number of 1’s in , or, when they have the same number of 1’s, if is smaller than in the lexicographical order induced by . For example, we have . Listing all the elements of in increasing order, we get a list of tuples . The matrix encoding the aforementioned system of equations is constructed such that the -th line of the matrix corresponds to the unknown and the -th column corresponds to the unknown .
With this order, the matrix is an upper triangular matrix (as each of the substitution of some ’s by some 1’s in Equation (5.2) leads from some tuple to larger tuples ), and thus the determinant of is the product of its diagonal terms:
(76) |
where we use Iverson’s bracket notation999This notation, , is 1 if the assertion is true, and 0 if not. It was introduced in the semantics of the language APL by its founder, Kenneth Iverson. It was later popularized in mathematics by Graham, Knuth, and Patashnik [27]..
As this determinant is not zero, this entails by Cramer’s rule that can be written as a rational function with denominator (note that, for some specific real values of and the ’s, it is not excluded that the numerator could have a shared factor with ). Of course, computing the determinant of each comatrix, and using the relation , we get symmetric polynomial expressions for the ’s occurring in (73), e.g.:
Note that the case , for (with ) was analyzed by Itoh and Mahmoud [34]: they proved that the age of each individual converges to a shifted geometric distribution, namely . They also show that the number of individuals of age at time converges to a Bernoulli distribution, namely .Our Theorem 5.1 constitutes a joint law version of these results, at discrete times, for generic ’s. For example, introducing , the coefficient gives the average number of individuals of age at time . (Note that the sum with the binomial coefficients has to be replaced by a sum over the subsets of if the ’s and the initial conditions for the are not symmetric.)
5.2. A multidimensional generalization of the Moran model
Interestingly, the same strategy of proof allows us to solve a wide generalization of the Moran model, where
-
•
with probability , all the individuals from the subset of die (they are then replaced by new individuals of age 0), while the age of each surviving individual increases by 1.
-
•
the process starts with individuals of any (possibly distinct) ages, encoded by a monomial .
This translates to the following single functional equation, involving unknowns:
(77) |
where .
Obviously, by taking , , , , and all other , the generalized model simplifies to the classical Moran model of Theorem 5.1. Another natural set of probabilities is , where is the number of elements in . It encodes the model where, at each unit of time, each individual dies with probability .
More generally, for any set of ’s, one gets the following result.
Theorem 5.3.
The probability generating function of the generalized Moran model is a rational function:
(78) |
where the ’s are polynomials in the ’s and ’s for , and where is the following polynomial of degree in :
(79) |
Note that, for this generalized model, the denominator is the same as in Theorem 5.1, and the ’s are a lifting of the ’s from Theorem 5.1, involving more terms and variables (namely, all the ’s). For these two models, these polynomials and are variants of symmetric functions. We comment more on this fact now.
Remark 5.4 (Links with bi-indexed families of symmetric functions).
Many problems related to lattice paths lead to generating functions expressible in terms of symmetric functions; this results from the kernel method, which involves a Vandermonde-like determinant, and thus leads to variants of Schur functions [4, 11, 6]. For the generalized Moran model we also get symmetric expressions, as the problem is by design symmetric, but in a more subtle way: one does not get formulas nicely expressible in terms of classical symmetric functions. This is due to the fact that we have to play with two distinct sets of variables (the ’s and the ’s), the occurrences of which are not fully independent. It appears that thesesubtle dependencies are well encoded by the MacMahon elementary symmetric functions, defined by . For example, we have . They allow us to provide more compact formulas for our generating functions, like . We plan to study these aspects in a forthcoming work. Note that these MacMahon symmetric functions also appear in problems a priori unrelated to our multidimensional Moran walks, see e.g. the articles of Gessel [25] and Rosas [48].
5.3. Application to the soliton wave model
The soliton wave model (as considered by Itoh, Mahmoud, and Takahashi [35]) is a stochastic system of particles encoding a unidirectional wave. The number of particles is constant during the full process: we have particles on which can only moves to the left as follows. At time , the initial configuration consists of particles, at -coordinates . Then, at each unit of time , uniformly at random, one of the particles jumps just to the left of the first particle (the wave front), thus leaving an empty space at its starting position:
![[Uncaptioned image]](https://cdn.awesomepapers.org/papers/3c6fcd07-1b7d-4659-b758-d301726306b9/x7.png)
Note that at time the location of the leftmost particle has thus -coordinate . See Figure 9 for an illustration of 6 iterations of this process, where, for drawing convenience, we shift the origin of the -axis after each step, so that the first particle is always at -coordinate 1.
Then, applying Theorem 5.3 to this model, we get the following proposition.
Proposition 5.5.
The joint distribution of the time/positions of the particles in the soliton wave model is given by Formula (78), by taking as initial condition , and, as probabilities of transition, and all other ; what is more, the denominator of thus simplifies to
where stands for the number of elements of the set .
Wave | Time | Length | |
|
0 | 4 | |
|
1 | 5 | |
|
2 | 6 | |
|
3 | 6 | |
|
4 | 4 | |
|
5 | 5 | |
|
6 | 6 |
Figure 9 also shows that this model has one degree of freedom, that is, the soliton wave model with particles can be modeled as interactive urns : the urn contains the number of white cells between the -th and -th blue particle. Accordingly, this interactive urn process starts with for all , and then, at each unit of time, we have one of the following events (with probability ):
-
•
and other urns are unchanged.
-
•
for : , (for ), , and remaining urns are unchanged.
-
•
and, for , .
The length of the soliton is then given by ; it can equivalently be viewed as the maximum of the -coordinates (at time ) of each particle.
6. Conclusion and future works
In this article, we considered several statistics (final altitude, waiting time, height) associated to walks with resets, for any given finite step set. For the case of the simplest non-trivial model (namely, for Moran walks), we prove that the asymptotic height exhibits some subtle behavior related to the discrete Gumbel distribution. In a forthcoming article, we plan to consider the asymptotic analysis of the height for more general walks.
In our formulas for walks of length , taking (and more generally ) as the probability of reset leads to models which can counterbalance the infinite negative drift of the initial model, and thus present a different type of asymptotic behavior. Studying these models and their phase transitions in more detail would be interesting.
In Section 5, we considered several multidimensional extensions of such walks, with applications to the soliton wave model, or to models in genetics. More multidimensional variants of Moran models allowing both positive and negative jumps (and with or without resets) can be handled using the approach presented in this article (see [1]). One interesting example is the one where each dimension evolves like a Motzkin path, this model was e.g. considered in the haploid Moran model [32], where the authors use a Markov chain approach, using duality/reversibility to establish links with Ornstein–Uhlenbeck processes. Note that even if one adds resets to such Motzkin-like models, one keeps nice links with continuous fractions associated to birth and death processes; see [20]. The analysis becomes much more complicated as soon as jumps of amplitude are allowed; in such cases, our approach based on the kernel method strikes again.
Another natural extension is to consider walks in the quarter plane with resets (a natural model of two queues evolving in parallel); even for walks with jumps of amplitude , the exact enumeration and the asymptotic behavior of the (maximal) height remain open. Other more ad hoc extensions consider some age-dependent probabilities ’s, then leading to partial differential equations for the corresponding generating functions. Some specific cases lead to closed-form solutions.
All these variants of Moran models are parametrized by the ’s. One can then turn to the tuning of several statistical tests: having some experimental data, it is natural to look for maximum likelihood estimators of the ’s, and to study if they are unbiased, sufficient, and consistent (for more on these notions, see e.g. [50]). In conclusion, the Moran model offers a large variety of interesting models, with many aspects to explore!
Acknowledgement: The work of the first author is supported by the Researchers Supporting Project RSPD2023R987 of King Saud University. We thank Hosam Mahmoud for introducing us to the Moran walks, and asking us about the distribution of their height. We are indebted to Rosa Orellana and Mike Zabrowski for pinpointing us that the bi-indexed symmetric functions which we introduced in Remark 5.4, and which already occurred in the literature under the name MacMahon symmetric functions. Last but not least, we also deeply thank the two referees for their kind detailed reports, which helped to improve several parts of this article.
References
- [1] Asma Althagafi. Maximum Age Attained by a Moran Population and Some Applications. PhD thesis, King Saoud University, 2023.
- [2] Stephen F. Altschul and Samuel Karlin. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc. Natl. Acad. Sci. USA, 87:2264–2268, 1990.
- [3] Axel Bacher. Generalized Dyck paths of bounded height. Sém. Lothar. Combin., 2023 (to appear).
- [4] Cyril Banderier. Combinatoire analytique des chemins et des cartes. PhD thesis, Univ. Paris VI, 2001.
- [5] Cyril Banderier and Philippe Flajolet. Basic analytic combinatorics of directed lattice paths. Theor. Comput. Sci., 281(1-2):37–80, 2002.
- [6] Cyril Banderier, Marie-Louise Lackner, and Michael Wallner. Latticepathology and symmetric functions. In Proceedings of AofA’2020, volume 159, pages 21:1–21:16. Leibniz International Proceedings in Informatics (LIPIcs), 2020.
- [7] Cyril Banderier and Pierre Nicodème. Bounded discrete walks. Discrete Math. Theor. Comput. Sci., AM:35–48, 2010.
- [8] Cyril Banderier and Michael Wallner. Lattice paths with catastrophes. Discrete Math. Theor. Comput. Sci., 19(1):32, 2017. Id/No 23.
- [9] Iddo Ben-Ari, Alexander Roitershtein, and Rinaldo B. Schinazi. A random walk with catastrophes. Electron. J. Probab., 24:21, 2019. Id/No 28.
- [10] Edward A. Bender and Zhicheng Gao. Part sizes of smooth supercritical compositional structures. Comb. Probab. Comput., 23(5):686–716, 2014.
- [11] Mireille Bousquet-Mélou. Discrete excursions. Sém. Lothar. Combin., 57:23 pp., 2008.
- [12] Mireille Bousquet-Mélou and Marko Petkovšek. Linear recurrences with constant coefficients: the multivariate case. Discrete Math., 225(1-3):51–75, 2000.
- [13] Arthur W. Burks, Herman H. Goldstine, and John von Neumann. Preliminary discussion of the logical design of an electronic computing instrument. Report to U.S. Army Ordnance Department, 1946. Reprinted in “Collected Works of John von Neumann”, vol. 5, pp. 34–79, The Macmillan Company, 1963.
- [14] Iva Chang, Alan Krinik, and Randall J. Swift. Birth-multiple catastrophe processes. Journal of Statistical Planning and Inference, 137(5):1544–1559, 2007.
- [15] Nicolaas Govert de Bruijn, Donald E. Knuth, and Stephen O. Rice. The average height of planted plane trees. In R. C. Read, editor, Graph Theory and Computing, pages 15–22. Academic Press, 1972.
- [16] Hubert Delange. Sur la fonction sommatoire de la fonction ’somme des chiffres’. Enseign. Math. (2), 21:31–47, 1975.
- [17] Louis Dumont. Algorithmes rapides pour le calcul symbolique de certaines intégrales de contour à paramètre. PhD thesis, Univ. Paris Saclay, 2016.
- [18] Arthur Erdélyi, W. Magnus, F. Oberhettinger, and Francesco G. Tricomi. Higher transcendental functions. Vol. I. McGraw-Hill Book Co., 1953. Bateman Manuscript Project.
- [19] Philippe Flajolet, Xavier Gourdon, and Philippe Dumas. Mellin transforms and asymptotics: Harmonic sums. Theor. Comput. Sci., 144(1-2):3–58, 1995.
- [20] Philippe Flajolet and Fabrice Guillemin. The formal theory of birth-and-death processes, lattice path combinatorics and continued fractions. Advances in Applied Probability, 32(3):750–778, 2000.
- [21] Philippe Flajolet, Mathieu Roux, and Brigitte Vallée. Digital trees and memoryless sources: from arithmetics to analysis. Discrete Math. Theor. Comput. Sci., AM:233–260, 2010.
- [22] Philippe Flajolet and Robert Sedgewick. Analytic Combinatorics. Cambridge University Press, 2009.
- [23] Ayla Gafni. Longest run of equal parts in a random integer composition. Discrete Math., 338(2):236–247, 2015.
- [24] Ilya B. Gertsbakh. Statistical Reliability Theory. Marcel Dekker, Inc., 1989.
- [25] Ira M. Gessel. Enumerative applications of symmetric functions. Sém. Lothar. Combin., B17a:17 pp., 1987.
- [26] Xavier Gourdon. Largest component in random combinatorial structures. Discrete Math., 180(1-3):185–209, 1998.
- [27] Ronald L. Graham, Donald E. Knuth, and Oren Patashnik. Concrete Mathematics. A Foundation for Computer Science. Addison-Wesley, 1994. Second edition (1st ed: 1988).
- [28] Jan Grandell. Aspects of Risk Theory. Springer, 1991.
- [29] Rosemary J. Harris and Hugo Touchette. Phase transitions in large deviations of reset processes. J. Phys. A, 50(10):10LT01, 13, 2017.
- [30] Clemens Heuberger and Helmut Prodinger. Carry propagation in signed digit representations. Eur. J. Comb., 24(3):293–320, 2003.
- [31] Clemens Heuberger, Sarah Selkirk, and Stephan Wagner. The distribution of the maximum protection number in random trees. arXiv, 2023.
- [32] Thierry Huillet and Martin Möhle. Duality and asymptotics for a class of nonneutral discrete Moran models. J. Appl. Probab., 46(3):866–893, 2009.
- [33] Blake Hunter, Alan Krinik, Chau Nguyen, Jennifer M. Switkes, and Hubertus F. Von Bremen. Gambler’s ruin with catastrophes and windfalls. Journal of Statistical Theory and Practice, 2008.
- [34] Yoshiaki Itoh and Hosam M. Mahmoud. Age statistics in the Moran population model. Stat. Probab. Lett., 74(1):21–30, 2005.
- [35] Yoshiaki Itoh, Hosam M. Mahmoud, and Daisuke Takahashi. A stochastic model for solitons. Random Struct. Algorithms, 24(1):51–64, 2004.
- [36] Svante Janson. Renewal theory in the analysis of tries and strings. Theor. Comput. Sci., 416:33–54, 2012.
- [37] Boris S. Kerner. Introduction to Modern Traffic Flow Theory and Control. The Long Road to Three-Phase Traffic Theory. Springer, 2009.
- [38] Donald E. Knuth. The average time for carry propagation. Nederl. Akad. Wet., Proc., Ser. A, 81:238–242, 1978.
- [39] Donald E. Knuth. The Art of Computer Programming. Vol. 3: Sorting and searching. Addison-Wesley, 1997 (2nd edition, 1st edition in 1973).
- [40] Lukasz Kusmierz, Satya N. Majumdar, Sanjib Sabhapandit, and Grégory Schehr. First order transition for the optimal search time of Lévy flights with resetting. Phys. Rev. Lett., 113:220602, 2014.
- [41] Erez Lieberman, Christoph Hauert, and Martin A. Nowak. Evolutionary dynamics on graphs. Nature, 433(7023):312–316, 2005.
- [42] Satya N. Majumdar, Sanjib Sabhapandit, and Grégory Schehr. Random walk with random resetting to the maximum position. Phys. Rev. E, 92(5):052126, 13, 2015.
- [43] Patrick A. P. Moran. Random processes in genetics. Proc. Camb. Philos. Soc., 54:60–71, 1958.
- [44] Patrick A. P. Moran. The Statistical Processes of Evolutionary Theory. Oxford University Press, 1962.
- [45] Nicholas Pippenger. Analysis of carry propagation in addition: an elementary approach. J. Algorithms, 42(2):317–333, 2002.
- [46] Helmut Prodinger and Stephan Wagner. Minimal and maximal plateau lengths in Motzkin paths. Discrete Math. Theor. Comput. Sci., AH:353–362, 2007.
- [47] Helmut Prodinger and Stephan Wagner. Bootstrapping and double-exponential limit laws. Discrete Math. Theor. Comput. Sci., 17(1):123–144, 2015.
- [48] Mercedes H. Rosas. MacMahon symmetric functions, the partition lattice, and Young subgroups. J. Comb. Theory, A, 96(2):326–340, 2001.
- [49] Bruno Salvy and Paul Zimmermann. GFUN: A Maple package for the manipulation of generating and holonomic functions in one variable. ACM Trans. Math. Softw., 20(2):163–177, 1994.
- [50] Aris Spanos. Probability Theory and Statistical Inference. Empirical Modeling with Observational Data. Cambridge University Press, 2019.
- [51] Wojciech Szpankowski and Vernon Rego. Yet another application of a binomial recurrence. Order statistics. Computing, 43(4):401–410, 1990.
- [52] Zhen-Hang Yang, Yu-Ming Chu, and Xiao-Hui Zhang. Sharp bounds for psi function. Appl. Math. Comput., 268:1055–1063, 2015.