∎

¹¹institutetext: Claude Godrèche ²²institutetext: Université Paris-Saclay, CNRS, CEA, Institut de Physique Théorique, 91191 Gif-sur-Yvette, France
²²email: [email protected]

Condensation and extremes for a fluctuating number of independent random variables

Claude Godrèche

(Received: date / Accepted: date)

Abstract

We address the question of condensation and extremes for three classes of intimately related stochastic processes: (a) random allocation models and zero-range processes, (b) tied-down renewal processes, (c) free renewal processes. While for the former class the number of components of the system is fixed, for the two other classes it is a fluctuating quantity. Studies of these topics are scattered in the literature and usually dressed up in other clothing. We give a stripped-down account of the subject in the language of sums of independent random variables in order to free ourselves of the consideration of particular models and highlight the essentials. Besides giving a unified presentation of the theory, this work investigates facets so far unexplored in previous studies. Specifically, we show how the study of the class of random allocation models and zero-range processes can serve as a backdrop for the study of the two other classes of processes central to the present work—tied-down and free renewal processes. We then present new insights on the extreme value statistics of these three classes of processes which allow a deeper understanding of the mechanism of condensation and the quantitative analysis of the fluctuations of the condensate.

Keywords:

Condensationextremesrenewal processeszero-range processessubexponentiality

^†^†journal: Journal of Statistical Physics

1 Introduction

It is well known that $n$ independent and identically distributed (iid) positive random variables conditioned by an atypical value of their sum exhibit the phenomenon of condensation, whereby one of the summands dominates upon the others, when their common distribution is subexponential (decaying more slowly than an exponential at large values of its argument).

Let $X_{1},X_{2},\dots,X_{n}$ be these $n$ random variables, henceforth taken discrete with positive integer values, whose common distribution is denoted by $f(k)=\mathop{\rm Prob}\nolimits(X=k)$ . Hereafter we consider the particular case of a subexponential distribution with asymptotic power-law decay¹¹1 In the further course of this work, the symbol $\approx$ stands for asymptotic equivalence; the symbol $\sim$ means either ‘of the order of’, or ‘with exponential accuracy’, depending on the context.

f(k)\approx\frac{c}{k^{1+{\theta}}},

(1.1)

where the index ${\theta}$ and the tail parameter $c$ are both positive. Assume—for the time being—that the first moment $\langle X\rangle$ is finite (hence ${\theta}>1$ ) and that the sum of these random variables,

S_{n}=\sum_{i=1}^{n}X_{i},

(1.2)

is conditioned to take the atypically large value $L>\langle S_{n}\rangle=n\langle X\rangle$ . In the present context the phenomenon of condensation has a simple pictorial representation. Let us consider the partial sums $S_{1},S_{2},\dots$ as the successive positions of a random walk whose steps are the summands $X_{i}$ . Such a representation is used in figure 1, which depicts six different paths of this walk, conditioned by a large, atypical value of its final position $S_{n}$ after $n$ steps. The steps have distribution (1.1) with ${\theta}=3/2$ (see the caption for details). As can be seen on this figure, for most of the paths there is a single big step bearing the excess difference $\Delta=L-n\langle X\rangle$ . In the thermodynamic limit, $L\to\infty$ , $n\to\infty$ with $\rho=L/n$ fixed, the distribution of the size of this big step becomes narrow around $\Delta$ . This picture gives the gist of the phenomenon of condensation.

The two ingredients responsible for such a phenomenon are: (i) subexponentiality of the distribution $f(k)$ , and: (ii) conditioning by an atypically large value of the sum $S_{n}$ . In contrast, keeping the same distribution (1.1), but conditioning the sum $S_{n}$ to be less than or equal to $n\langle X\rangle$ , yields a ‘democratic’ situation where all the steps $X_{i}$ are on the same footing, sharing the—now negative—excess difference $\Delta$ . Otherwise stated, in this situation, the path responds ‘elastically’ to the conditioning.

When the distribution $f(k)$ is decaying exponentially (e.g., a geometric distribution) the paths responds ‘elastically’ in all cases, i.e., regardless of whether the sum $S_{n}$ is conditioned to take an atypical value larger or smaller than $n\langle X\rangle$ . In both cases all the steps $X_{i}$ are on the same footing, sharing the excess difference $\Delta$ (which is now either positive or negative).

Refer to caption — Figure 1: Pictorial illustration of the phenomenon of condensation for the random allocation models and zrp class. This figure depicts six paths of a random walk whose positions are given by the partial sums $S_{1},S_{2},\dots$ . The distribution of the steps $X_{i}$ is given by $f(k)=3k^{-5/2}/2$ ( $k>1$ is taken continuous), for which $\langle X\rangle=3$ . The random walk is conditioned to end at position $L=6000$ at time $n=500$ . For most of the paths one can observe the occurrence of a big step whose magnitude fluctuates around $\Delta=L-\langle S_{n}\rangle=4500$ . More rarely, the excess $\Delta$ is shared by two big steps, as for the green path.

This scenario of condensation has been investigated in great detail and is basically understood. It is for example encountered in random allocation models where $n$ sites (or boxes) contain altogether $L$ particles, the $X_{i}$ representing the occupations of these sites burda ; burda2 ; janson . This situation in turn accounts for the stationary state of dynamical urn models such as zero range processes (zrp) or variants spitzer ; andjel ; camia ; evans2000 ; jeon ; gl2002 ; cg2003 ; gross ; hanney ; gl2005 ; lux ; maj2 ; ferrari ; maj3 ; armendariz2009 ; armendariz2011 ; armendariz2013 ; armendariz2017 ; cg2019 . For short we shall refer to this class of models as the class of random allocation models and zrp. Condensation means that one of the sites contains a macroscopic fraction of all the particles.

Here we shall be concerned by a different situation where the number of random variables $X_{1},X_{2},\dots,$ is itself a random variable, henceforth denoted by $N_{L}$ and defined by conditioning the sum,

S_{N_{L}}=\sum_{i=1}^{N_{L}}X_{i},

(1.3)

to satisfy either the inequality

S_{N_{L}}<L<S_{N_{L}+1},

(1.4)

or the equality

S_{N_{L}}=L,

(1.5)

where $L$ is a given positive integer number, see figure 2. These conditions are imposed irrespectively of whether the mean $\langle X\rangle$ is finite or not.

The process conditioned by the inequality (1.4) defines a free renewal process feller ; doob ; smith ; cox , the process conditioned by the equality (1.5) defines a tied-down renewal process wendel ; wendel1 ; wendel2 . In the former case the process is pinned at the origin, in the latter case it is also pinned at the end point. For both, the random variables $X_{i}$ are the sizes of the iid (spatial or temporal) intervals between two renewals. Using the temporal language, the sum $S_{N_{L}}$ is the time of occurrence of the last renewal before or at time $L$ . The last, unfinished, interval $B_{L}=L-S_{N_{L}}$ is known as the backward recurrence time in renewal theory. Tied-down renewal processes (tdrp) are special because the pinning condition (1.5) imposes $B_{L}=0$ .

A simple implementation of a tdrp is provided by the Bernoulli bridge, or tied-down random walk, made of $\pm 1$ steps, starting from the origin and ending at the origin at time $2L$ wendel ; wendel1 ; labarbe . The sizes of the intervals between the successive passages by the origin of the walk (where each tick mark on the $x-$ axis represents two units of time) represent the random variables $X_{i}$ , as depicted in figure 3. The continuum limit of the tied-down random walk is the Brownian bridge, also known as tied-down Brownian motion or else pinned Brownian motion verv .

For renewal processes (both free or tied-down) with a subexponential distribution $f(k)$ , we shall show that, by weighting the configurations according to the number $N_{L}$ of summands, a phase transition occurs, as $L\to\infty$ , when the positive weight parameter $w$ conjugate to $N_{L}$ , varies from larger values, favouring configurations with a large number of summands, to smaller ones, favouring atypical configurations with a smaller number of summands. In the former case the weight parameter $w$ is to be interpreted as a reward, in the latter case as a penalty.

Characterising this transition is the aim of the present work, with main focus on the quantitative analysis of the fluctuations of the condensate. Again, the occurrence of the phenomenon of condensation is due to: (i) subexponentiality of the distribution $f(k)$ , and: (ii) atypicality of the configurations.

Tied-down renewal processes fall into the class of linear systems considered by Fisher fisher . The latter are defined as one-dimensional chains of total length $L$ , made up, e.g., of alternating intervals of two kinds, $A$ and $B$ . This class encompasses the Poland-Scheraga model poland ; poland2 , consisting of an alternating sequence of straight paths $A$ and loops $B$ , wetting models, where $A$ and $B$ represent two phases, etc. If the direction of the chain is taken as a time axis, the loops in the perpendicular direction can be seen as random walks. The Bernoulli bridge or tied-down random walk of figure 3 is a natural implementation of this situation, where there is only one kind of intervals, say the loops $B$ , representing the intervals between two passages at the origin of the walk. In the same vein, a variant of the random allocation model defined in burda considers the case where the number of sites is varying burda3 , with an occupation variable $X$ starting at $k=1$ . This model, as well as the spin domain model considered in bar2 ; bar3 ; barma are examples of linear systems with only one kind of intervals. Both models are actually equivalent and are just particular instances of the tdrp considered in the present work. Let us finally mention the random walk (or polymer) models considered in gia1 ; gia2 which are free or tied-down renewal processes with a penalty (or reward) at each renewal events, as in the present work. In these models the condensation transition is interpreted as the transition between a localised phase and a delocalised one gia1 ; gia2 . For the example of figure 3, a localised configuration corresponds to many contacts of the walk with the origin, while a delocalised one corresponds to the presence of a macroscopic excursion.

Renewal theory is a classic in probability studies. It is also ubiquitous in statistical physics and has a wide range of applicability (see examples in gl2001 ; gms2015 ; cohen ; barkai2003 ). Yet, besides gia1 ; gia2 , which give rigorous mathematical results, studies of weighted free renewal processes are scarce. In particular, there are no available detailed characterisation of the condensation phenomenon for these processes in the existing literature, nor considerations on the statistics of extremes in the condensed phase.

We now describe the organisation of the paper, with further details on the literature on the subject.

The text is composed of three parts, which are respectively sections 2 and 3, dealing with the class of random allocation models and zrp, sections 4 to 8, dealing with tdrp, and finally, sections 9 to 11, dealing with free renewal processes. These three parts are both conceptually and analytically related.

Section 2 is a short presentation of how condensation arises for random allocation models and zrp. Subsection 2.1 gives the basic formalism. Equation (2.20) introduces a simple remark on the distribution of the maximum in the second half ( $k>L/2$ ), which will turn out to be instrumental in section 3, and which will be generalised in the later sections on renewal processes. Subsection 2.2 summarises the main features of the phenomenon of condensation in the thermodynamic limit. This classical topic has been much investigated in the past, both in statistical physics burda ; burda2 ; camia ; evans2000 ; gl2002 ; cg2003 ; gross ; hanney ; gl2005 ; lux ; maj2 ; maj3 and in mathematics jeon ; ferrari ; armendariz2009 ; armendariz2011 ; armendariz2013 ; armendariz2017 ; landim . The summary given in this section relies on the short review cg2019 to which we refer the reader for further bibliographical references. A mathematical review of a number of aspects of the subject and related matters can be found in janson .

Section 3, which will serve as a backdrop for the study contained in the two other parts, contains novel aspects of the phenomenon of condensation for the class of models at hand. Namely, instead of considering the thermodynamic limit $L\to\infty$ , $n\to\infty$ with fixed ratio $\rho=L/n$ , we investigate the situation where the number of summands $n$ is kept fixed and the value of the sum $S_{n}=L$ increases to infinity. Such a framework is precisely that considered in ferrari . In this reference it is shown that for $n$ fixed, $L\to\infty$ , assuming that $\rho_{c}=\langle X\rangle$ exists, i.e., ${\theta}>1$ , if the largest summand is removed, the measure on the remaining summands converges to the product measure with density $\rho_{c}$ , a feature which is apparent on figure 1. Therefore the largest summand is the unique condensate, with size $L-(n-1)\langle X\rangle$ . As emphasised in ferrari ‘this phenomenon is a combinatorial fact that can be observed without making the number of sites grow to infinity’. Simply stated, there is total condensation in this case, in the sense that the condensate essentially bears the totality of the $L$ particles. We shall prove this result by elementary means (see (3.8)) and extend it to the case where ${\theta}<1$ , thus proving that even though the first moment $\rho_{c}=\langle X\rangle$ does not exist, there is however (total) condensation²²2The fact that condensation also occurs when ${\theta}<1$ in the present context has been previously mentioned in landim . I am indebted to S Grosskinsky for pointing this reference to me.. If ${\theta}<1$ , the correction of the mean largest summand to $L$ —in other words the fluctuations of the condensate—scales as $L^{1-{\theta}}$ , with a known amplitude, given in (3.6).

It turns out that this scenario of (total) condensation is precisely that prevailing for the two other processes studied in the present work, namely, tied-down and free renewal processes. This scenario will be the red thread for the rest of the paper, with all the complications introduced by a now fluctuating number of summands. This red thread in particular links the three figures 5, 9 and 11, and equations (3.6), (3.8), (8.31), (8.32), (11.27) and (11.28), which are the central results of the present study.

Sections 4 to 8 are devoted to tdrp with a power-law distribution of summands (1.1). Section 4 gives a systematic presentation of the formalism, valid for any arbitrary distribution $f(k)$ , followed, in sections 6 to 8 by the analysis of the behaviour of the system in the different phases, when $f(k)$ is a subexponential distribution of the form (1.1). As said in the abstract, these topics are scattered in the literature burda3 ; bar2 ; bar3 ; barma . The analyses presented in sections 6 to 8 are comprehensive and go deeper than previous studies, especially in the description of the condensation phenomenon and in the analysis of the statistics of extremes, as detailed in section 8.

Likewise, sections 9 to 11, devoted to free renewal processes with power-law distribution of summands (1.1), give a thorough analysis both of the formalism and of the phenomenon of condensation. It turns out that this case, which is more generic than that of tdrp since the process is not pinned at the end point $L$ , is yet more complicated to analyse, the reason being that besides the intervals $X_{i}$ , the last interval $B_{L}$ , depicted in figure 2, enters the analysis.

Section 12, together with tables 1 and 2, gives a summary of the present study.

2 Condensation for random allocation models and ZRP

The key quantity for the study of the processes described above (random allocation models and zrp, free and tied-down renewal processes) is the statistical weight of a configuration, or in the language of the random walk of figure 1, the statistical weight of a path.

The choice of conventions on the initial values of $n$ , $L$ , $k$ is a matter of convenience which depends on the kind of reality that we wish to describe, as will appear shortly.

We start by giving, in §2.1, some elements of the formalism for random allocation models and zrp, where $f(k)=\mathop{\rm Prob}\nolimits(X=k)$ is any arbitrary distribution of the positive random variable $X$ . In the rest of section 2, $f(k)$ has a power-law tail (1.1).

2.1 General formalism

2.1.1 Statistical weight of a configuration

Let $X_{1},X_{2},\dots,X_{n}$ be $n$ positive iid integer random variables with sum $S_{n}$ conditioned to be equal to $L$ . The joint conditional probability associated to a configuration $\{X_{1}=k_{1},\dots,X_{n}=k_{n}\}$ , with $S_{n}=L$ given, reads³³3For the sake of simplicity we restrict the study to the case where $f(k)$ is a normalisable probability distribution.

	$\displaystyle p(k_{1},\dots,k_{n}\|L)$	$\displaystyle=$	$\displaystyle\mathop{\rm Prob}\nolimits(X_{1}=k_{1},\dots,X_{n}=k_{n}\|S_{n}=L)$		(2.1)
		$\displaystyle=$	$\displaystyle\frac{1}{Z_{n}(L)}f(k_{1})\ldots f(k_{n})\,\delta\Big{(}\sum_{i=1}^{n}k_{i},L\Big{)},$		(2.1)

where $\delta(.,.)$ is the Kronecker delta, and where the denominator, whose presence stems from the constraint $S_{n}=L$ , is the partition function

	$\displaystyle Z_{n}(L)$	$\displaystyle=$	$\displaystyle\sum_{\{k_{i}\}}f(k_{1})\dots f(k_{n})\delta\Big{(}\sum_{i=1}^{n}k_{i},L\Big{)}=(f\star)^{n}(L)$		(2.2)
		$\displaystyle=$	$\displaystyle\langle\delta(S_{n},L)\rangle=\mathop{\rm Prob}\nolimits(S_{n}=L).$		(2.2)

So $Z_{n}(L)$ is another notation for the distribution of the sum $S_{n}$ .

The physical picture associated to these definitions correspond to a system of $n$ sites (or boxes), $L$ particles in total, and where the summands $X_{i}$ are the occupation numbers of these sites, i.e., the number of particles on each of them. Since these sites can be empty, the occupation probability $f(0)$ is non zero in general. It is therefore natural to initialise $L$ to 0. In particular, the probability that the sum of occupations be zero is that all sites are empty, i.e.,

Z_{n}(0)=f(0)^{n}.

(2.3)

It also turns out to be convenient to start $n$ at 0, and set

Z_{0}(L)=\delta(L,0),

(2.4)

which serves as an initial condition for the recursion

Z_{n}(L)=\sum_{k=0}^{L}f(k)Z_{n-1}(L-k).

(2.5)

As can be seen either from (2.2) or (2.5), the generating function of $Z_{n}(L)$ with respect to $L$ yields

\tilde{Z}_{n}(z)=\sum_{L\geq 0}z^{L}Z_{n}(L)=\tilde{f}(z)^{n},

(2.6)

where the generating function of $f(k)$ with respect to $k$ is

\tilde{f}(z)=\sum_{k\geq 0}z^{k}f(k).

(2.7)

The marginal distribution of the occupation of a generic site, say site $1$ , is

\pi_{n}(k|L)=\mathop{\rm Prob}\nolimits(X_{1}=k|S_{n}=L)=\langle\delta(X_{1},k)\rangle=\frac{f(k)Z_{n-1}(L-k)}{Z_{n}(L)},

(2.8)

where $\langle\cdot\rangle$ is the average with respect to (2.1). The mean conditional occupation is

\langle X|L\rangle=\sum_{k\geq 0}k\pi_{n}(k|L)=\frac{L}{n}=\rho.

(2.9)

2.1.2 Distribution of the largest occupation

Condensation corresponds to the presence of a site with a macroscopic occupation. We are therefore led to investigate the statistics of the largest occupation. This topic has been discussed in janson ; jeon ; gross ; gl2005 ; maj3 ; armendariz2009 ; armendariz2011 ; cg2019 . We briefly revisit this topic and supplement it with equation (2.20) at the end of this subsection which sheds some new light on the subject and lay the ground for the parallel study of renewal processes.

Let $X_{\rm max}$ be the largest summand (or occupation) under the conditioning $S_{n}=L$ ,

X_{\rm max}={\rm max}(X_{1},\dots,X_{n}).

(2.10)

The distribution function of this variable is

F_{n}(k|L)=\mathop{\rm Prob}\nolimits(X_{{\rm max}}\leq k|S_{n}=L)=\frac{\mathop{\rm Prob}\nolimits(X_{{\rm max}}\leq k,S_{n}=L)}{Z_{n}(L)},

(2.11)

whose numerator is

F_{n}(k|L)_{|{\rm num}}=\mathop{\rm Prob}\nolimits(X_{{\rm max}}\leq k,S_{n}=L)=\sum_{k_{1}=0}^{k}f(k_{1})\dots\sum_{k_{n}=0}^{k}f(k_{n})\delta\Big{(}\sum_{i=1}^{n}k_{i},L\Big{)}.

(2.12)

The generating function of the latter reads

\sum_{L\geq 0}z^{L}F_{n}(k|L)_{|{\rm num}}=\prod_{i=1}^{n}\Big{(}\sum_{k_{i}=0}^{k}f(k_{i})z^{k_{i}}\Big{)}=\tilde{f}(z,k)^{n},

(2.13)

where

\tilde{f}(z,k)=\sum_{j=0}^{k}z^{j}f(j).

(2.14)

The distribution of the largest occupation is thus given by the difference

p_{n}(k|L)=\mathop{\rm Prob}\nolimits(X_{{\rm max}}=k|S_{n}=L)=F_{n}(k|L)-F_{n}(k-1|L),

(2.15)

where

F_{n}(0|L)_{|{\rm num}}=f(0)^{n}\delta(L,0).

(2.16)

Its generating function is

\sum_{L\geq 0}z^{L}p_{n}(k|L)_{|{\rm num}}=\tilde{f}(z,k)^{n}-\tilde{f}(z,k-1)^{n}.

(2.17)

The numerator (2.12) obeys the recursion

F_{n}(k|L)_{|{\rm num}}=\sum_{j=0}^{\min(k,L)}f(j)\,F_{n-1}(k|L-j)_{|{\rm num}},

(2.18)

with initial condition

F_{0}(k|L)_{|{\rm num}}=\delta(L,0).

(2.19)

Let us note that if the occupation number $X_{1}$ is larger than $L/2$ , then it is necessarily the largest one, $X_{\rm max}$ . If so, the probability distribution of the latter, $p_{n}(k|L)$ , is identical to $n\pi_{n}(k|L)$ , since there are $n$ possible choices of the generic summand $X_{1}$ . Denoting the restriction of $p_{n}(k|L)$ to the range $k>L/2$ by $q_{n}(k|L)$ , we thus have

q_{n}(k|L)=n\pi_{n}(k|L)=\frac{nf(k)Z_{n-1}(L-k)}{Z_{n}(L)}.

(2.20)

We shall see later that this relation, as simple as it may seem, is instrumental for the analysis of the fluctuations of the condensate and extends naturally to the case of tdrp or free renewal processes. When $k>L/2$ we note that

F_{n}(k|L-k)_{|{\rm num}}=Z_{n}(L-k),

(2.21)

since $X_{{\rm max}}$ is necessarily less than $k$ (with $k>L/2$ ) when $S_{n}=L-k$ .

2.2 Phenomenology of condensation in the thermodynamic limit

This subsection is a reminder of well-known facts on the phenomenon of condensation for a thermodynamic system with a large number $n$ of sites and large total occupation $L$ , at fixed density $\rho=L/n$ . More detailed accounts or complements on this topic can be found, e.g., in burda ; burda2 ; janson ; gross ; hanney ; lux ; maj2 ; maj3 ; armendariz2009 ; armendariz2011 ; cg2019 . with further bibliographical references contained in the last reference.

This reminder will help emphasising the differences between the scenario of condensation in the thermodynamic limit, described in the present subsection, with another scenario of condensation, to be described in section 3, where $L$ is still large, but $n$ (the number of summands or sites) is kept fixed. In the latter regime, condensation will turn out to be total, with a condensed fraction $X_{\rm max}/L$ asymptotically equal to unity.

For the time being, we consider the situation where $n$ and $L$ are both large, with $\rho=L/n$ kept fixed, assuming that $f(k)$ has a power-law tail (1.1) and that $\langle X\rangle=\rho_{c}$ is finite ( ${\theta}>1$ ).

2.2.1 Regimes for the single occupation distribution

Evidence for the existence of a condensate, i.e., a site with a macroscopic occupation, is demonstrated by the behaviour of the single occupation distribution (2.8). There are three regimes to consider, according to the respective values of $\rho$ and $\rho_{c}$ .

1. Subcritical regime ( $\rho<\rho_{c}$ )
The asymptotic estimate of the partition function $Z_{n}(L)$ is given by the saddle-point method

Z_{n}(L)=\oint\frac{{\rm d}z}{2\pi{\rm i}z^{L+1}}\,\tilde{f}(z)^{n}\sim\frac{\tilde{f}(z_{0})^{n}}{z_{0}^{L}},

(2.22)

where $z_{0}$ obeys the saddle-point (sp) equation

\frac{z_{0}\tilde{f}^{\prime}(z_{0})}{\tilde{f}(z_{0})}=\rho.

(2.23)

This equation has a solution $z_{0}(\rho)$ for any $\rho<\rho_{c}$ . It follows that

\pi_{n}(k|L)_{\rm sp}\approx\frac{z_{0}^{k}f(k)}{\tilde{f}(z_{0})},

(2.24)

which is no longer dependent on $L$ and $n$ separately and only depends on their ratio $\rho$ . Note that (2.23) and (2.24) entail that

\langle X|L\rangle_{\rm sp}=\sum_{k\geq 0}k\pi_{n}(k|L)_{\rm sp}\approx\frac{z_{0}\tilde{f}^{\prime}(z_{0})}{\tilde{f}(z_{0})}=\rho,

(2.25)

consistently with (2.9). In this regime, the system is made of a fluid of independent particles with common distribution (2.24).

2. Critical regime ( $\rho=\rho_{c}$ )
A phase transition occurs when the saddle-point value $z_{0}$ reaches the maximum value of $z$ , equal to one, where $\tilde{f}(z)$ is singular, with a branch cut chosen to be on the negative axis. The bulk of the partition function is given by the generalised central limit theorem and

\pi_{n}(k|L)\approx f(k),

(2.26)

up to finite-size corrections. At criticality the equality $\langle X|L\rangle=\langle X\rangle$ holds identically thanks to (2.9). In this regime, the system is made of a critical fluid of independent particles with common distribution (2.26).

3. Supercritical regime ( $\rho>\rho_{c}$ )
In this regime the saddle-point equation (2.23) can no longer be satisfied because $z_{0}$ sticks to the head of the cut of $\tilde{f}(z)$ . The excess difference,

\Delta=L-n\langle X\rangle=n(\rho-\rho_{c}),

(2.27)

instead of being equally shared by all the sites, is, with high probability, accommodated by a single site, the condensate. The partition function $Z_{n}(L)$ is asymptotically given by its right tail (see cg2019 for more details),

Z_{n}(L)\mathrel{\mathop{\approx}\limits_{n\to\infty}^{L=n\rho}}\frac{nc}{\Delta^{1+{\theta}}}.

(2.28)

In the supercritical regime, the marginal distribution $\pi_{n}(k|L)$ has different behaviours in the three regions of values of the occupation variable.

(a) The critical background corresponds to values of $k$ finite, for which (2.26) holds again. The main contribution to the total weight comes from this region.

(b) The condensate is located in the region $k\approx\Delta$ (i.e., the difference $\Delta-k$ is subextensive). The ratio of $f(k)\approx c/\Delta^{1+\theta}$ to $Z_{n}(L)$ , given by (2.28), is asymptotically equal to

\frac{f(k)}{Z_{n}(L)}\approx\frac{c/\Delta^{1/\theta}}{nc/\Delta^{1/\theta}}=\frac{1}{n}.

(2.29)

On the other hand, $Z_{n-1}(L-k)$ , is given by its bulk since $L-k\approx n\rho_{c}$ . Hence, if $1<\theta<2$ ,

\pi_{n}(k|L)_{|{\rm cond}}\approx\frac{1}{n}Z_{n-1}(L-k)\approx\frac{1}{n}\frac{1}{n^{1/\theta}}\mathcal{L}_{\theta,c}\left(\frac{\Delta-k}{n^{1/\theta}}\right),

(2.30)

where $\mathcal{L}_{\theta,c}$ is the stable Lévy distribution of index ${\theta}$ , asymmetry parameter $\beta=1$ , and tail parameter $c$ gnedenkoK , while, if $\theta>2$ ,

\pi_{n}(k|L)_{|{\rm cond}}\approx\frac{1}{n}Z_{n-1}(L-k)\approx\frac{1}{n}\frac{1}{n^{1/2}}\mathcal{G}\left(\frac{\Delta-k}{n^{1/2}}\right),

(2.31)

where $\mathcal{G}$ is the Gaussian distribution gnedenkoK . These expressions describe the bulk of the fluctuating condensate which manifests itself by a hump in the marginal distribution $\pi_{n}(k|L)$ , in the neighbourhood of $k\approx\Delta$ , visible on figure 4. The weight of this region is obtained from (2.30) or (2.31), according to the value of ${\theta}$ , as

\mathop{\rm Prob}\nolimits(X_{1}\in\mathrm{cond})=\sum_{k\in\mathrm{hump}}\pi_{n}(k|L)_{|{\rm cond}}\approx\frac{1}{n},

(2.32)

which demonstrates that the excess difference $\Delta$ is typically borne by only one summand.

This hump becomes peaked in the thermodynamic limit. For a finite system, most often there is a single condensate, i.e., a site with a macroscopic occupation, while more rarely there are two sites with macroscopic occupations, both of order $L$ . This situation corresponds to the dip region, described next.

(c) The range of values of $k$ such that $k$ and $\Delta-k$ are large and comparable, interpolates between the critical part of $\pi_{n}(k|L)$ , for $k$ or order $1$ , and the condensate, for $k$ close to $\Delta$ . It corresponds to the dip region on figure 4. In this region, $Z_{n-1}(L-k)$ is given by its right tail (2.28). So, for any $\theta>1$ ,

\pi_{n}(k|L)_{|{\rm dip}}\approx c\left[\frac{\Delta}{k(\Delta-k)}\right]^{1+\theta}\approx\frac{f(k)f(\Delta-k)}{f(\Delta)}.

(2.33)

The interpretation of this result is that in the dip region typical configurations where one summand takes the value $k$ are such that the remaining $\Delta-k$ excess difference is borne by a single other summand. The dip region is therefore dominated by rare configurations where the excess difference is shared by two summands gl2005 . An example of such a configuration is the green path in figure 1.

Setting $k={\lambda}\Delta$ in (2.33) and introducing a cutoff $\Lambda={\epsilon}\Delta$ , the weight of these configurations can be estimated as

\mathop{\rm Prob}\nolimits(X_{1}\in\mathrm{dip})=\sum_{k={\epsilon}\Delta}^{(1-{\epsilon})\Delta}\pi_{n}(k|L)_{|{\rm dip}}\sim\Delta^{-\theta}\sim n^{-\theta}.

(2.34)

The relative weights of the dip (2.34) and condensate (2.32) regions is therefore of order $n^{-(\theta-1)}$ , i.e., the weight of events where the condensate is broken into two pieces of order $n$ is subleading with respect to events with a single big summand.

2.2.2 Statistics of the largest summand in the condensed phase

In view of (2.32) and the following equations (2.33) and (2.34), we infer that, for $k$ larger than $\Delta/2$ , which is the centre of the dip, the excess difference $\Delta$ is typically borne by only one summand—namely the condensate $X_{\rm max}$ —thus

p_{n}(k|L)\mathrel{\mathop{\approx}\limits_{k>\Delta/2}}n\pi_{n}(k|L).

(2.35)

According to (2.20) we know that the two sides of this equation are actually identical for $k>L/2$ , for any finite values of $n$ and $L$ . Note however that, while $L/2$ is always larger than $\Delta/2$ , it can be smaller or larger than $\Delta$ depending on whether $\rho$ is larger or smaller than $2\rho_{c}$ . The significance of (2.35) is that the equality (2.20), valid for $k>L/2$ , extends asymptotically to the entire region $k>\Delta/2$ .

More precise statements have been given on the asymptotic distribution of the largest summand $X_{\rm max}$ in maj3 ; armendariz2009 ; armendariz2011 ; janson . The result is that, if $L=n\rho$ , $\rho>\rho_{c}$ , $n\to\infty$ , the rescaled variable $n^{-1/\alpha}(\Delta-X_{\rm max})$ converges to a stable law of index $\alpha$ , with $\alpha=\theta$ if $\theta<2$ , or $\alpha=2$ if $\theta>2$ . This means that, asymptotically, the probability distribution of $X_{\rm max}$ coincides, up to a factor $n$ , with the estimates of the marginal density in the condensate region ( $\Delta-k\sim n^{1/\alpha}$ ), that is with (2.30) or (2.31) according to the value of $\theta$ , which is precisely the content of (2.35).

On the other hand, denoting the $r-$ th largest summand by $X^{(r)}$ ( $r=1,\dots,n$ ), with $X^{(1)}\equiv X_{\rm max}$ , the distributions of these ranked summands, denoted by $p_{n}^{(r)}(k|L)$ with $p_{n}^{(1)}(k|L)\equiv p_{n}(k|L)$ , sum up exactly to

p_{n}(k|L)+\sum_{r=2}^{n}p_{n}^{(r)}(k|L)=n\pi_{n}(k|L).

(2.36)

Thus according to (2.35) the sum upon $r\geq 2$ in the left side of (2.36) is negligible for $k>\Delta/2$ .

As shown in armendariz2009 ; armendariz2011 ; janson , the distribution of the second largest summand, $X^{(2)}$ , is asymptotically Fréchet, and the subsequent ones, $X^{(r)}$ $(r\geq 2)$ , are the order statistics of $n-1$ iid random variables $X_{i}$ with distribution $f(k)$ , which amounts to saying that, in the supercritical regime, the dependency between the summands $X_{i}$ introduced by the conditioning goes asymptotically in the condensate $X_{\rm max}$ .

Since $X_{\rm max}$ typically scales as $n$ , while $X^{(2)},X^{(3)},\dots$ typically scale as $n^{1/\theta}$ , the condensate is increasingly separated from the background as $n$ increases, leaving space to the dip region ( $k$ and $\Delta-k$ large and comparable). We know from the analysis made above (see discussion following (2.33)) that this region is dominated by configurations where the excess difference is shared by two summands, namely $X_{\rm max}$ and $X^{(2)}$ , so

p_{n}(k|L)+p_{n}^{(2)}(k|L)\mathrel{\mathop{\approx}\limits_{k\in\mathrm{dip}}}n\pi_{n}(k|L)|_{\rm dip},

(2.37)

and that the contributions of these events to $n\pi_{n}(k|L)$ are of relative order $n^{-(\theta-1)}$ . To the right of $\Delta/2$ the predominant contribution to the sum on the left side of (2.37) comes from $p_{n}(k|L)$ , to the left of $\Delta/2$ it comes from $p_{n}^{(2)}(k|L)$ .

An illustration

Figure 4 depicts a comparison between $n\pi_{n}(k|L)$ , obtained from (2.8), and $p_{n}(k|L)$ obtained from (2.17), on the following example, defined by the normalised distribution

f(k)=\frac{1}{\zeta(1+{\theta})}\frac{1}{(k+1)^{1+{\theta}}},\quad(k\geq 0),

(2.38)

where $\zeta(s)=\sum_{s\geq 1}1/n^{s}$ is the Riemann zeta function. This model has been introduced in burda , then further investigated in camia ; burda2 ; glzeta .

In this figure, ${\theta}=3$ , $\rho_{c}\approx 0.1106$ , $n=600$ , $L=168$ . This choice of parameters corresponds to a density $\rho=0.28$ slightly larger than $2\rho_{c}$ , where $\Delta$ and $L/2$ coincide. The top of the hump is approximately located at $\Delta\approx 102$ and the minimum of the dip is a bit less than $\Delta/2$ . These curves are practically indiscernible as soon as $k\approx 58$ , which is less than $L/2=84$ , indicated by the vertical dotted line on the figure, from which the identity becomes exact.

Finally, let us compute the mean condensed fraction $X_{\rm max}/L$ . For ${\theta}>2$ , using (2.31), it can be estimated as

	$\displaystyle\frac{\langle X_{\rm max}\rangle}{L}$	$\displaystyle=$	$\displaystyle\frac{1}{L}\sum_{k=0}^{L}k\,p_{n}(k\|L)\approx\frac{1}{L}\sum_{k=\Delta/2}^{L}k\,n\,\pi_{n}(k\|L)$		(2.39)
		$\displaystyle\approx$	$\displaystyle\frac{1}{L}\frac{1}{n^{1/2}\sqrt{2\pi}}\int_{-\infty}^{\infty}{\rm d}k\,k\,{\rm e}^{-\frac{1}{2}\left(\frac{\Delta-k}{n^{1/2}}\right)^{2}}=1-\frac{\rho_{c}}{\rho}=\frac{\Delta}{L}.$		(2.39)

The same result holds if $1<{\theta}<2$ , using (2.30).

As $\rho$ increases, the peak of the condensate moves towards the right end $L$ , hence if $\rho\gg\rho_{c}$ the condensed fraction tends to unity, corresponding to total condensation. As detailed in section 3, this scenario still holds when $n$ is kept fixed.

3 Phenomenon of total condensation when $n$ is kept fixed and $L\to\infty$

As seen above, if $\rho\gg\rho_{c}$ , condensation becomes total, and the peak of the condensate is asymptotically located at $L$ . As we now show, this still holds true if the number of summands $n$ is kept fixed, and $L$ is large, irrespective of the existence of a first moment $\rho_{c}=\langle X\rangle$ , or in other words, irrespective of whether ${\theta}$ is smaller or larger than one. Existence of condensation in such a situation has been pointed out in ferrari ; landim . The quantitative characterisation of this phenomenon is the aim of this section.

3.1 An illustration

We start by giving an illustration of the phenomenon on the following example, corresponding to a tail index ${\theta}=1/2$ for the decay of $f(k)$ ,

\tilde{f}(z)=\frac{1-\sqrt{1-z}}{z},

(3.1)

which entails that

f(k)=\frac{1}{2^{2k+1}}\frac{(2k)!}{k!(k+1)!}\mathrel{\mathop{\approx}\limits_{k\to\infty}}\frac{c}{k^{3/2}},

(3.2)

with $c=1/(2\sqrt{\pi})$ , and

Z_{n}(L)=\frac{n}{2^{2L+n}(2L+n)}\left(2L+n\atop L\right)\mathrel{\mathop{\approx}\limits_{L\to\infty}^{n\ \mathrm{fixed}}}\frac{nc}{L^{3/2}},

(3.3)

for $n$ kept fixed, $L$ large, which is a different regime from that leading to (2.28). In the general case, whenever $f(k)$ obeys (1.1), the asymptotic estimate (3.3) becomes

Z_{n}(L)\mathrel{\mathop{\approx}\limits_{L\to\infty}^{n\ \mathrm{fixed}}}\frac{nc}{L^{1+{\theta}}},

(3.4)

as can be deduced from (2.6).

Figure 5 depicts a comparison of $n\pi_{n}(k|L)$ with $p_{n}(k|L)$ for this example with $L=60$ , $n=4$ . These two quantities are identical for $k>L/2$ , The inset highlights the existence of a cusp at $L/2$ . The distribution of the maximum $p_{n}(k|L)$ is significantly depressed for $k<L/2$ . It vanishes identically for $k<L/4$ (more generally $L/n$ ). These features are analysed in the two subsections below.

3.2 Fluctuations of the condensate

Let us turn to the general case. A measure of the fluctuations of the condensate is provided by the width of the peak of the maximum, i.e., the mass outside the condensate. It can be estimated by the sum (see §3.3 below for details)

L-\langle X_{\rm max}\rangle=\sum_{\ell=0}^{L-1}\ell\,p_{n}(L-\ell|L)\approx\sum_{\ell=0}^{L/2-1}\ell\,q_{n}(L-\ell|L)\approx\sum_{\ell=0}^{L/2-1}\ell\,n\pi_{n}(L-\ell|L),

(3.5)

where $\ell=L-k$ . The dominant contribution to this sum depends on whether ${\theta}$ is smaller or larger than one.

$\bullet$ If ${\theta}<1$ , the dominant contribution comes from values of $\ell=L-k$ comparable to $L$ . Setting $\ell=\lambda L$ in (3.5), we have (see §3.3 below for details)

L-\langle X_{\rm max}\rangle\approx(n-1)cL^{1-{\theta}}\int_{0}^{1/2}{\rm d}{\lambda}\frac{{\lambda}}{[({\lambda}(1-{\lambda})]^{1+{\theta}}}\approx(n-1)c\,\mathrm{B}_{\frac{1}{2}}\Big{(}1-{\theta},-{\theta}\Big{)}L^{1-{\theta}},

(3.6)

where the incomplete beta function is defined as

\mathrm{B}_{x}(a,b)=\int_{0}^{x}{\rm d}t\,t^{a-1}(1-t)^{b-1}.

(3.7)

For example, if ${\theta}=1/2$ , we have $\mathrm{B}_{\frac{1}{2}}\Big{(}1-{\theta},-{\theta}\Big{)}=2$ .

$\bullet$ If ${\theta}>1$ , the main contribution comes from finite values of $\ell$ ,

	$\displaystyle L-\langle X_{\rm max}\rangle$	$\displaystyle\approx$	$\displaystyle n\sum_{\ell=0}^{L/2}\ell\,\frac{f(L-\ell)Z_{n-1}(\ell)}{Z_{n}(L)}\approx\sum_{\ell=0}^{L/2}\ell\,Z_{n-1}(\ell)$		(3.8)
		$\displaystyle\approx$	$\displaystyle(n-1)\sum_{\ell=0}^{L/2}\ell\,f(\ell)\to(n-1)\sum_{\ell=0}^{\infty}\ell f(\ell)=(n-1)\langle X\rangle.$		(3.8)

This last result (3.8) has a simple interpretation. It says that the correction $L-\langle X_{\rm max}\rangle$ comes from the $n-1$ sites of the fluid, each with mean occupation $\langle X\rangle=\rho_{c}$ , in accordance with the prediction made in ferrari and recalled in the introduction.

3.3 A finer analysis

Let us now add some more details on the derivations made above. The aim of this subsection is to give a detailed analysis of the distributions in the various regimes, in order to eventually compute the corrections to the scaling expressions predicted in (3.6) and (3.8) above.

We start with the discussion of the regimes for the single occupation distribution $\mathop{\rm Prob}\nolimits(X_{1}=k|S_{n}=L)=\pi_{n}(k|L)$ . There are such three regimes to consider (see figure 5):

Downhill region. For $X_{1}=k$ finite, using (3.4), we have

\pi_{n}(k|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}\frac{n-1}{n}f(k),

(3.9)

reflecting the fact that, with probability $(n-1)/n$ , a randomly chosen site belongs to the fluid.

Introducing a cutoff $\Lambda$ , such that $1\ll\Lambda\ll L$ , the weight of this region can be estimated by the sum

\sum_{k=0}^{\Lambda}\pi_{n}(k|L)\approx\sum_{k=0}^{\Lambda}\frac{n-1}{n}f(k)\mathrel{\mathop{\to}\limits_{L\to\infty}}\sum_{k=0}^{\infty}\frac{n-1}{n}f(k)=1-\frac{1}{n}.

(3.10)

Dip region. In the dip region, where $k$ and $L-k$ are simultaneously large and comparable, setting $k={\lambda}L$ in (2.8) where $0<{\lambda}<1$ , and using (3.4), yields the estimate

\pi_{n}(k|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}\frac{n-1}{n}\frac{f(k)f(L-k)}{f(L)}\approx\frac{n-1}{n}\frac{c}{L^{1+{\theta}}}\frac{1}{[{\lambda}(1-{\lambda})]^{1+{\theta}}},

(3.11)

In this region the distribution is therefore U-shaped: the most probable configurations are those where almost all the particles are located on one of two sites. The dip centred around $k=L/2$ becomes deeper and deeper with $L$ .

The weight of this region reads, choosing $\Lambda={\epsilon}L$ ,

\sum_{k=\Lambda}^{L-\Lambda}\pi_{n}(k|L)\approx\frac{n-1}{n}cL^{-{\theta}}\int_{\epsilon}^{1-{\epsilon}}\frac{{\rm d}{\lambda}}{[{\lambda}(1-{\lambda})]^{1+{\theta}}}.

(3.12)

Uphill region. The condensate region corresponds to $\ell=L-k$ finite, where (2.8) simplifies into

\pi_{n}(L-\ell|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}\frac{1}{n}Z_{n-1}(\ell),

(3.13)

as in (2.30) or (2.31). The weight of this uphill region can be estimated as

\sum_{\ell=0}^{\Lambda}\pi_{n}(L-\ell|L)\approx\frac{1}{n}\sum_{\ell=0}^{\Lambda}Z_{n-1}(\ell)\to\frac{1}{n}\sum_{\ell=0}^{\infty}Z_{n-1}(\ell)=\frac{1}{n},

(3.14)

where $\Lambda$ is yet another cutoff, and where the last step is obtained by setting $z=1$ in the expression of the generating function (2.6). Thus, as seen in (3.10) and (3.14) the weights of the downhill and uphill regions add up to one, in line with the fact that the contribution of the dip region is subdominant, as shown in (3.12) above.

We now proceed to the discussion of the regimes for the distribution of the maximum, $p_{n}(k|L)$ . There are again three regimes to consider, that we describe in turn, from right to left in figure 5. In the uphill and dip regions, such that $X_{\rm max}=k>L/2$ , $p_{n}(k|L)$ is denoted by $q_{n}(k|L)=n\pi_{n}(k|L)$ (see (2.20)), whose estimates follow from those of $\pi_{n}(k|L)$ seen above.

1.

Uphill region. For $\ell$ finite, using (3.13), we have

$q_{n}(L-\ell|L)\approx Z_{n-1}(\ell),$ (3.15)

with weight equal to 1 up to the subleading corrections detailed below. The interpretation of (3.15) is that, asymptotically, the difference between $L$ and $X_{\rm max}$ has the same distribution as the sum of $n-1$ iid random variables, the latter composing the fluid,

$L-X_{\rm max}\mathrel{\mathop{\approx}\limits_{L\to\infty}}\ \mathrel{\mathop{\underbrace{\sum_{i=1}^{n-1}X_{i}}}\limits_{\rm fluid}}.$ (3.16)

Dip region. For $L/2<k\sim L-k$ , we have, according to (3.11),

q_{n}(k|L)\approx\frac{(n-1)f(k)f(L-k)}{f(L)}\approx\frac{(n-1)\,c}{L^{1+{\theta}}}\frac{1}{[{\lambda}(1-{\lambda})]^{1+{\theta}}}.

(3.17)

The weight of this region therefore scales as $L^{-{\theta}}$ , as seen in (3.12).

Left region. For $k\leq L/2$ , the weight of this region is subdominant with respect to that of the two previous ones. A simple argument shows that

\mathop{\rm Prob}\nolimits(X_{\rm max}\leq L/2|S_{n}=L)=F_{n}(L/2|L)\sim L^{-\beta},\qquad\beta=\left\{\begin{array}[]{ll}2{\theta}&\textrm{if }{\theta}\leq 1\vspace{4pt}\\ 1+{\theta}&\textrm{if }{\theta}>1,\end{array}\right.

(3.18)

where $F_{n}$ is defined in (2.11).

The argument leading to (3.18) and the prediction of the amplitude $\lim_{L\to\infty}F_{n}(L/2|L)L^{\beta}$ are given in appendix A. In appendix B, we give an exact calculation of the weight of the left region when $f(k)$ is the continuous Lévy $\frac{1}{2}$ stable density.

Equation (3.18) eventually justifies the approximation made in (3.5), where the contribution of the left region to the sum $\sum_{\ell=0}^{L-1}\ell\,p_{n}(L-\ell|L)$ was neglected. In view of (3.18), this contribution is $O(L^{1-\beta})$ , thus subdominant by a factor $L^{-{\theta}}$ with respect to the first correction—respectively $O(L^{1-{\theta}})$ if ${\theta}<1$ (see (3.6)), or $O(1)$ for ${\theta}>1$ (see (3.8))—whether ${\theta}$ is smaller of larger than unity.

There is actually a hierarchy of weights for the distribution of the maximum in the successive regions $(L/3,L/2)$ , $(L/4,L/3)$ and so on. This can be intuitively grasped as follows.

1.

If $L/3<X_{\rm max}\leq L/2$ , the total ‘mass’ $L$ is dominantly shared by two summands. The weight of this rare event scales as in (3.18).
2.

Then if $L/4<X_{\rm max}\leq L/3$ , the total ‘mass’ $L$ is dominantly shared by three summands, which is a still rarer event, and so on.
3.

Finally, if $X_{\rm max}<L/n$ , the probability $p_{n}(k|L)$ vanishes since it is no longer possible to divide the ‘mass’ $L$ into $n$ pieces, all less than $L/n$ .

This hierarchy is reflected by the presence of cusps in the distribution of the maximum at $L/2,L/3$ , $L/4\dots$ (see appendix A).

All the discussion given in the present section is a preparation for subsections 8.3 and 11.4, where figures 9 and 11 are to be compared to figure 5, and equations (8.31), (8.32), (11.27) and (11.28) are to be compared to equations (3.6) and (3.8).

4 General statements on tied-down renewal processes

The random variables $X_{i}$ now represent the sizes of (spatial or temporal) intervals, that we take strictly positive, hence

f(0)=0.

(4.1)

In the temporal language $L$ is the total duration of the process, in the spatial language it is the length of the system. To each interval (equivalently, to each renewal event) is associated a positive weight $w$ , to be interpreted as a reward if $w>1$ or a penalty if $w<1$ . In the models considered in burda3 ; bar2 ; bar3 ; barma , $w$ has the interpretation of the ratio $y/y_{c}$ , where $y$ is a fugacity, and $y_{c}$ its value at criticality.

In the present section, $f(k)=\mathop{\rm Prob}\nolimits(X=k)$ is any arbitrary distribution of the positive random variable $X$ . Later on, in sections 5, 6, 7 and 8, $f(k)$ will obey the form (1.1).

4.1 Joint distribution

The probability of the configuration $\{X_{1}=k_{1},\dots,X_{N_{L}}=k_{n},N_{L}=n\}$ , given that $S_{N_{L}}=L$ , reads

	$\displaystyle p(k_{1},\dots,k_{n},n\|L)$	$\displaystyle=$	$\displaystyle\mathop{\rm Prob}\nolimits(X_{1}=k_{1},\dots,X_{N_{L}}=k_{n},N_{L}=n\|S_{N_{L}}=L)$		(4.2)
		$\displaystyle=$	$\displaystyle\frac{1}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}w^{n}f(k_{1})\dots f(k_{n})\delta\Big{(}\sum_{i=1}^{n}k_{i},L\Big{)},$		(4.2)

where the denominator is the tied-down partition function

	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$	$\displaystyle=$	$\displaystyle\sum_{n\geq 0}w^{n}\sum_{\{k_{i}\}}f(k_{1})\dots f(k_{n})\delta\Big{(}\sum_{i=1}^{n}k_{i},L\Big{)}$		(4.3)
		$\displaystyle=$	$\displaystyle\sum_{n\geq 0}w^{n}Z_{n}(L)=\delta(L,0)+\sum_{n\geq 1}w^{n}(f\star)^{n}(L).$		(4.3)

The probability $Z_{n}(L)$ is still defined as in (2.2), except for the change of the initial value (4.1) of $f(k)$ , which entails that $Z_{n}(L)$ is only defined for $n\leq L$ . The first term $\delta(L,0)$ follows from (2.4). The first values of $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ are

	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{td}}}(w,0)=1,\quad Z^{\mathop{\scriptstyle\mathrm{td}}}(w,1)=wf(1),\quad Z^{\mathop{\scriptstyle\mathrm{td}}}(w,2)=wf(2)+w^{2}f(1)^{2},$
	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{td}}}(w,3)=wf(3)+2w^{2}f(1)f(2)+w^{3}f(1)^{3},$		(4.4)

and so on. The generating function of $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ with respect to $L$ is

\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)=\sum_{L\geq 0}z^{L}Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)=\sum_{n\geq 0}w^{n}\tilde{f}(z)^{n}=\frac{1}{1-w\tilde{f}(z)}.

(4.5)

Note that $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ can be seen as the grand canonical partition function of the system with respect to $N_{L}$ .

For $w=1$ , the tied-down partition function,

Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)=\sum_{n\geq 0}\mathop{\rm Prob}\nolimits(S_{n}=L)=\mathop{\rm Prob}\nolimits(S_{N_{L}}=L)=\langle\delta(S_{N_{L}},L)\rangle,

(4.6)

is the probability that a renewal occurs at $L$ .

Finally, we note that

p(k_{1},\dots,k_{n},n|L)_{|{\rm num}}=w^{n}p_{n}(k_{1},\dots,k_{n}|L)_{|{\rm num}},

(4.7)

hence

p(k_{1},\dots,k_{n},n|L)=p_{n}(k_{1},\dots,k_{n}|L)\mathrm{p}_{n}(L),

(4.8)

where $p_{n}(\{k_{i}\}|L)$ and $\mathrm{p}_{n}(L)$ are respectively defined in (2.1) and (4.9).

4.2 Distribution of the number of intervals

The distribution of the number of intervals is obtained by summing the distribution (4.2) upon all variables except $n$ ,

\mathrm{p}_{n}(L)=\mathop{\rm Prob}\nolimits(N_{L}=n)=\frac{w^{n}Z_{n}(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}.

(4.9)

For instance, taking the successive terms of (4.3) divided by $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ yields

\mathrm{p}_{0}(L)=\frac{\delta(L,0)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)},\quad\mathrm{p}_{1}(L)=\frac{wf(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)},\quad\mathrm{p}_{2}(L)=\frac{w^{2}\sum_{k_{1}}f(k_{1})f(L-k_{1})}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)},\dots

(4.10)

and more generally, for $n\geq 1$ ,

\mathrm{p}_{n}(L)=\frac{w^{n}(f\star)^{n}(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}=\frac{w^{n}\left[\tilde{f}(z)^{n}\right]_{L}}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)},

(4.11)

where the notation $[\cdot]_{L}$ stands for the $L-$ th coefficient of the series inside the brackets. Hence the generating function with respect to $L$ of the numerator of (4.9) reads

\sum_{L\geq 0}z^{L}\mathrm{p}_{n}(L)_{|{\rm num}}=w^{n}\tilde{f}(z)^{n}.

(4.12)

Taking the sum of the right side upon $n\geq 0$ yields back $\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)$ given in (4.5).

The first moment of this distribution is by definition

\langle N_{L}\rangle=\sum_{n\geq 1}n\,\mathrm{p}_{n}(L).

(4.13)

The generating function of its numerator reads, using (4.12)

\sum_{L\geq 0}z^{L}\langle N_{L}\rangle_{|{\rm num}}=\sum_{n\geq 1}n(w\tilde{f}(z))^{n}=\frac{w\tilde{f}(z)}{\big{(}1-w\tilde{f}(z)\big{)}^{2}}=w\frac{{\rm d}\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)}{{\rm d}w},

(4.14)

hence

\langle N_{L}\rangle=w\,\frac{{\rm d}\ln Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}{{\rm d}w},

(4.15)

as expected in the grand canonical ensemble with respect to $N_{L}$ . Alternatively, since

\sum_{L\geq 0}z^{L}\langle N_{L}\rangle_{|{\rm num}}=\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)^{2}-\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z),

(4.16)

we have

\langle N_{L}\rangle_{|{\rm num}}=(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)-Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L).

(4.17)

More generally, the generating function of the moments of $N_{L}$ is given by

\langle v^{N_{L}}\rangle=\sum_{n\geq 0}v^{n}\mathrm{p}_{n}(L).

(4.18)

Taking the generating function with respect to $L$ of the numerator of this expression, using (4.12),

\sum_{L\geq 0}z^{L}\langle v^{N_{L}}\rangle_{|{\rm num}}=\sum_{n\geq 0}v^{n}(w\tilde{f}(z))^{n}=\frac{1}{1-vw\tilde{f}(z)}=\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(vw,z),

(4.19)

yields

\langle v^{N_{L}}\rangle=\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(vw,L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}.

(4.20)

Likewise, the inverse moment $\langle 1/N_{L}\rangle$ is

\Big{\langle}\frac{1}{N_{L}}\Big{\rangle}=\sum_{n\geq 1}\frac{\mathrm{p}_{n}(L)}{n}=\frac{1}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\sum_{n\geq 1}\frac{\left[[w\tilde{f}(z)]^{n}\right]_{L}}{n}=\frac{1}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\left[-\ln(1-w\tilde{f}(z))\right]_{L}.

(4.21)

4.3 Single interval distribution

The marginal distribution of one of the summands, say $X_{1}$ , is by definition

\pi(k|L)=\mathop{\rm Prob}\nolimits(X_{1}=k|S_{N_{L}}=L)=\langle\delta(X_{1},k)\rangle,

(4.22)

where $\langle\cdot\rangle$ is the average with respect to (4.2), with a summation upon the variables $k_{1},\dots,k_{n}$ (with $1\leq k\leq L$ ) and $n\geq 1$ , resulting in

$\displaystyle\pi(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{n\geq 1}\sum_{k_{1}}\delta(k_{1},k)wf(k_{1})\sum_{{k_{2},\dots}}w^{n-1}f(k_{2})\dots f(k_{n})\delta\Big{(}k_{1}+\sum_{i=2}^{n}k_{i},L\Big{)}$	(4.23)
	$\displaystyle=$	$\displaystyle\sum_{k_{1}}\delta(k_{1},k)wf(k_{1})\delta(k_{1},L)+\sum_{k_{1},k_{2}}\delta(k_{1},k)w^{2}f(k_{1})f(k_{2})\delta(k_{1}+k_{2},L)+\cdots$
	$\displaystyle=$	$\displaystyle wf(k)\delta(k,L)+wf(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)\big{(}1-\delta(k,L)\big{)}.$

Finally

\pi(k|L)=\mathrel{\mathop{\underbrace{\frac{wf(k)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\delta(k,L)}}\limits_{p(k,1|L)}}+wf(k)\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\big{(}1-\delta(k,L)\big{)},

(4.24)

where the first term corresponds to $n=1$ , i.e.,

\pi(L|L)=\frac{wf(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}=\mathop{\rm Prob}\nolimits(N_{L}=1).

(4.25)

Also, since $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,0)=1$ , (4.24) can be more compactly written as

\pi(k|L)=\frac{wf(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}.

(4.26)

The generating function of the numerator of (4.26) with respect to $L$ yields

\sum_{L\geq k}z^{L}\pi(k|L)_{|{\rm num}}=wz^{k}f(k)\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)=\frac{wz^{k}f(k)}{1-w\tilde{f}(z)}.

(4.27)

Summing (4.26) upon $k$ we obtain

Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)=\sum_{k=1}^{L}wf(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k),\quad L\geq 1,

(4.28)

which can also be obtained by multiplying the recursion (2.5) for $Z_{n}(L)$ by $w^{n}$ and summing on $n$ .

Remarks

1. An alternative route to (4.26) is as follows. We have

$\displaystyle\pi(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{n\geq 1}w^{n}\sum_{\{k_{i}\}}\delta(k_{1},k)p_{n}(\{k_{i}\}\|L)_{\|{\rm num}}=\sum_{n\geq 1}w^{n}\pi_{n}(k\|L)_{\|{\rm num}}$	(4.29)
	$\displaystyle=$	$\displaystyle\sum_{n\geq 1}w^{n}f(k)Z_{n-1}(L-k)=wf(k)\sum_{n\geq 0}w^{n}Z_{n}(L-k)$
	$\displaystyle=$	$\displaystyle wf(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)_{\|{\rm num}}.$

2. We also note that

\pi(k|L)=\sum_{n\geq 1}\pi_{n}(k|L)\mathrm{p}_{n}(L),

(4.30)

which is a simple consequence of (4.8).

4.4 Mean interval $\langle X|L\rangle$

This is, by definition,

\langle X|L\rangle=\sum_{k\geq 1}k\pi(k|L).

(4.31)

Multiplying (4.27) by $k$ and summing upon $k$ yields

\sum_{L\geq 1}z^{L}\langle X|L\rangle_{|{\rm num}}=\frac{wz\tilde{f}^{\prime}(z)}{1-w\tilde{f}(z)},

(4.32)

which can also be obtained by taking the derivative with respect to $z$ of the expression for the inverse moment $\langle 1/N_{L}\rangle$ . Indeed,

\langle X|L\rangle_{|{\rm num}}=\Big{\langle}\frac{L}{N_{L}}\Big{\rangle}_{|{\rm num}}=L\left[-\ln(1-w\tilde{f}(z))\right]_{L},

(4.33)

then, taking the generating function of the right side gives

	$\displaystyle\sum_{L\geq 0}z^{L}\langle X\|L\rangle_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{L\geq 1}z^{L}L\left[-\ln(1-w\tilde{f}(z))\right]_{L}$		(4.34)
		$\displaystyle=$	$\displaystyle z\frac{{\rm d}}{{\rm d}z}\left(-\ln(1-w\tilde{f}(z))\right)=\frac{wz\tilde{f}^{\prime}(z)}{1-w\tilde{f}(z)}.$		(4.34)

4.5 The longest interval

By definition, the longest interval is

X_{{\rm max}}={\rm max}(X_{1},\dots,X_{N_{L}}).

(4.35)

Its distribution function is defined as

F(k|L)=\mathop{\rm Prob}\nolimits(X_{{\rm max}}\leq k|L)=\sum_{n\geq 0}\sum_{k_{1}=1}^{k}\dots\sum_{k_{n}=1}^{k}p(\{k_{i}\},n|L)=\frac{F(k|L)_{|{\rm num}}}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)},

(4.36)

with initial value

F(k|0)_{|{\rm num}}=1.

(4.37)

The numerator in (4.36) reads

	$\displaystyle F(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{n\geq 0}w^{n}\sum_{k_{1}=1}^{k}\dots\sum_{k_{n}=1}^{k}p_{n}(\{k_{i}\}\|L)_{\|{\rm num}}$		(4.38)
		$\displaystyle=$	$\displaystyle\sum_{n\geq 0}w^{n}F_{n}(k\|L)_{\|{\rm num}},$		(4.38)

where $F_{n}(k|L)_{|{\rm num}}$ is defined in (2.12). Note that $F(L|L)_{|{\rm num}}=Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ , hence $F(L|L)=1$ . The generating function of the numerator is

	$\displaystyle\sum_{L\geq 0}z^{L}F(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle 1+\sum_{n\geq 1}\prod_{i=1}^{n}\Big{(}\sum_{k_{i}=1}^{k}wf(k_{i})z^{k_{i}}\Big{)}$		(4.39)
		$\displaystyle=$	$\displaystyle 1+\sum_{n\geq 1}\Big{(}w\tilde{f}(z,k)\Big{)}^{n}=\frac{1}{1-w\tilde{f}(z,k)},$		(4.39)

where

\tilde{f}(z,k)=\sum_{j=1}^{k}z^{j}f(j).

(4.40)

The numerator obeys the recursion (renewal) equation, which generalises the Rosén-Wendel’s result (2.4) of wendel1 ,

F(k|L)_{|{\rm num}}=\sum_{j=1}^{\min(k,L)}wf(j)\,F(k|L-j)_{|{\rm num}},

(4.41)

with initial condition (4.37).

The distribution of $X_{\rm max}$ is given by the difference

p(k|L)=\mathop{\rm Prob}\nolimits(X_{{\rm max}}=k)=F(k|L)-F(k-1|L),

(4.42)

where $F(0|L)=\delta(L,0)$ , with generating function

	$\displaystyle\sum_{L\geq 0}z^{L}p(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\frac{1}{1-w\tilde{f}(z,k)}-\frac{1}{1-w\tilde{f}(z,k-1)}$		(4.43)
		$\displaystyle=$	$\displaystyle\frac{wz^{k}f(k)}{[1-w\tilde{f}(z,k)][1-w\tilde{f}(z,k-1)]}.$		(4.43)

Its end point value is the same as $\pi(L|L)$ (4.25), i.e.,

p(L|L)=\mathop{\rm Prob}\nolimits(N_{L}=1)=\frac{wf(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}.

(4.44)

Note that (4.43) can be obtained by multiplying (2.17) by $w^{n}$ and summing on $n$ . In other words,

p(k|L)_{|{\rm num}}=\sum_{n\geq 1}w^{n}p_{n}(k|L)_{|{\rm num}},

(4.45)

as can also be inferred from (4.38). And therefore (see (4.8))

p(k|L)=\sum_{n\geq 1}p_{n}(k|L)\mathrm{p}_{n}(L).

(4.46)

When $X_{\rm max}=k>L/2$ , the longest interval is unique. Denoting the restriction of $p(k|L)$ to the range $k>L/2$ by $q(k|L)$ , and generalising the reasoning made in wendel ; wendel1 we can decompose a configuration into three contributions to obtain

	$\displaystyle q(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{i=0}^{L-k}Z^{\mathop{\scriptstyle\mathrm{td}}}(w,i)wf(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k-i)$		(4.47)
		$\displaystyle=$	$\displaystyle wf(k)(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L-k),$		(4.47)

which, using (4.17), can be alternatively written as

q(k|L)=\frac{wf(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}(1+\langle N_{L-k}\rangle),

(4.48)

that is (see (4.26)),

q(k|L)=\pi(k|L)(1+\langle N_{L-k}\rangle).

(4.49)

For $k=L$ , (4.44) with (4.25) is recovered.

Remarks

1. An alternative route to (4.49) can be inferred from (4.45) as follows. We have

q(k|L)_{|{\rm num}}=\sum_{n\geq 1}w^{n}q_{n}(k|L)_{|{\rm num}}=\sum_{n\geq 1}w^{n}n\pi_{n}(k|L)_{|{\rm num}},

(4.50)

where $\pi_{n}(k|L)$ is given in (2.8). So

$\displaystyle q(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle wf(k)\sum_{n\geq 1}nw^{n-1}Z_{n-1}(L-k)$	(4.51)
	$\displaystyle=$	$\displaystyle wf(k)\sum_{n\geq 0}(w^{n}Z_{n}(L-k)+nw^{n}Z_{n}(L-k))$
	$\displaystyle=$	$\displaystyle wf(k)(Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)+\langle N_{L-k}\rangle_{\|{\rm num}}),$

which, after division by $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ , yields (4.49).

2. Comparison between (4.51) and (4.47) entails the equality

(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)=\sum_{n\geq 1}nw^{n-1}Z_{n-1}(L),

(4.52)

which can also be checked directly by taking the generating functions of both sides,

\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)^{2}=\sum_{n\geq 1}nw^{n-1}\tilde{f}(z)^{n-1}=\frac{1}{(1-\tilde{w}f(z))^{2}}.

(4.53)

3. From (2.21) we infer that, if $k>L/2$ ,

F^{\mathop{\scriptstyle\mathrm{td}}}(k|L-k)_{|{\rm num}}=Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k).

(4.54)

5 Phase transition for tied-down renewal processes

In this section and in the following sections 6, 7 and 8 the distribution $f(k)=\mathop{\rm Prob}\nolimits(X=k)$ is taken subexponential with asymptotic power-law decay (1.1). Before discussing the phase diagram of the process, we give some illustrative examples of such distributions.

5.1 Illustrative examples

In the sequel, we shall illustrate the general results derived for tdrp in the current section, and for free renewal processes in section 9, on the following examples.

Example 1. This first example corresponds to the tied-down random walk of figure 3 on which we come back in more detail. The distribution of the size of intervals, $f(k)$ , representing the probability of first return at the origin of the walk after $2k$ steps, or equivalently after $k$ tick marks on figure 3, reads

f(k)=\frac{1}{2^{2k-1}}\frac{(2k-2)!}{(k-1)!k!}=\frac{\Gamma(k-1/2)}{2\sqrt{\pi}\Gamma(k+1)}\approx\frac{1}{2\sqrt{\pi}k^{3/2}},

(5.1)

since the number of such walks is equal to $(2k-2)!/[(k-1)!k!]$ . Its generating function reads

\tilde{f}(z)=1-\sqrt{1-z}.

(5.2)

The partition function (4.6) for $w=1$ represents the probability that the walk returns at the origin after $2L$ steps, or equivalently after $L$ tick marks,

Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)=\frac{1}{2^{2L}}\left(2L\atop L\right)\approx\frac{1}{\sqrt{\pi L}},

(5.3)

since the number of such walks is equal to $(2L)!/(L!)^{2}$ . Its generating function reads

\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(1,z)=\frac{1}{\sqrt{1-z}}.

(5.4)

Note that

f(L)=Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L-1)-Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L).

(5.5)

The partition function $Z_{n}(L)$ is explicit for this case,

Z_{n}(L)=\frac{n}{2^{2L-n}}\frac{(2L-n-1)!}{L!(L-n)!}\approx\frac{n}{2\sqrt{\pi}L^{3/2}},

(5.6)

with $n\leq L$ .

Example 2. This second example is defined for any ${\theta}>0$ by

f(k)=\frac{1}{\zeta(1+{\theta})}\frac{1}{k^{1+{\theta}}},\quad k>0.

(5.7)

\tilde{f}(z)=\frac{\mathrm{Li}_{1+{\theta}}(z)}{\zeta(1+{\theta})},

(5.8)

where $\mathrm{Li}_{s}(z)=\sum_{k\geq 1}z^{k}/k^{s}$ is the polylogarithm function. If ${\theta}>1$ the mean $\langle X\rangle=\zeta({\theta})/\zeta(1+{\theta})$ . This is the distribution used, e.g., in burda ; burda3 ; bar2 ; bar3 ; barma ; glzeta .

5.2 Phase diagram

Demonstrating the existence of a phase transition in the model defined by (4.2), with distribution $f(k)$ given by (1.1), when $w$ crosses the value one, is a classical subject. This model is a particular instance of a linear system, as described in fisher , where the mechanism of the transition is explained in simple terms. This transition is also studied in burda3 ; bar2 ; bar3 ; barma for Example 2 (see (5.7)). Let us first analyse the large $L$ behaviour of $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ . Recalling (4.5) we have, for a contour encircling the origin,

Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)=\oint\frac{{\rm d}z}{2\pi{\rm i}}\frac{\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)}{z^{L+1}}=\oint\frac{{\rm d}z}{2\pi{\rm i}\,z^{L+1}}\frac{1}{1-w\tilde{f}(z)}.

(5.9)

Since $\tilde{f}(z)$ is monotonically increasing for $z\in(0,1)$ the denominator of $\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)$ , $1-w\tilde{f}(z)$ , is monotonically decreasing between 1 and $1-w$ .

Disordered phase. If $w>1$ , the denominator vanishes for $z=z_{0}<1$ such that $w\tilde{f}(z_{0})=1$ , hence $\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)$ has a pole at $z_{0}$ , and therefore $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ is exponentially increasing,

Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)\approx\frac{z_{0}^{-L}}{wz_{0}\tilde{f}^{\prime}(z_{0})}.

(5.10)

Critical regime. If $w=1$ , then $z_{0}=1$ . The asymptotic estimates of $Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)$ are given in (7.5) and (7.23).

Condensed phase. If $w<1$ , the denominator $1-w\tilde{f}(z)$ has no zero, but it is singular for $z=z_{0}=1$ (which is the singularity of $\tilde{f}(z)$ ). Hence $z_{0}$ sticks to 1. The asymptotic estimate of $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ is given in (8.1).

This is the switch mechanism of Fisher fisher : the condition determining $z_{0}$ switches from $z_{0}$ being the smallest root of the equation $1-w\tilde{f}(z)=0$ to being the closest real singularity of $\tilde{f}(z)$ , which is a cut at $z=z_{0}=1$ . This non analytical switch signals the phase transition. The free energy density fisher

\mathbf{f}=\lim_{L\to\infty}-\frac{1}{L}\ln Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)=\ln z_{0},

(5.11)

therefore vanishes when $w\leq 1$ . The three cases above are successively reviewed in the next sections.

6 Disordered phase ( $w>1$ ) for tied-down renewal processes

The asymptotic expression at large $L$ of the distribution of the size of a generic interval is obtained by carrying (5.10) in (4.26), which leads to

\pi(k|L)\approx wf(k)z_{0}^{k}=wf(k){\rm e}^{-k/\xi},\qquad\xi=\frac{1}{|\ln z_{0}|},

(6.1)

where $\xi$ is the correlation length, divergent at the transition. This expression is independent of $L$ and normalised, since summing on $k$ restores $w\tilde{f}(z_{0})=1$ . This exponentially decaying distribution has a finite mean,

\langle X|L\rangle\approx wz_{0}\tilde{f}^{\prime}(z_{0}),

(6.2)

an expression which can also be inferred from (4.32). Thus (5.10) can be recast as

Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)\approx\frac{z_{0}^{-L}}{\langle X|L\rangle}.

(6.3)

The distribution of $N_{L}$ is given by (4.9)

\mathrm{p}_{n}(L)=\frac{w^{n}Z_{n}(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\approx w^{n}Z_{n}(L)\,w\tilde{f}^{\prime}(z_{0})z_{0}^{L+1}.

(6.4)

This distribution obeys the central limit theorem, as illustrated on the example below. Using (4.15), we obtain the asymptotic expression of $\langle N_{L}\rangle$ ,

\langle N_{L}\rangle\approx-L\frac{w}{z_{0}}\frac{{\rm d}z_{0}}{{\rm d}w}\approx\frac{L}{\langle X|L\rangle},

(6.5)

which means that

\frac{1}{\langle N_{L}\rangle}\approx\left\langle\frac{1}{N_{L}}\right\rangle.

(6.6)

Let us denote the density of points (or intervals) for a finite system as

\nu_{L}=\frac{\langle N_{L}\rangle}{L},

(6.7)

then, asymptotically, we have

\nu=\lim_{L\to\infty}\nu_{L}=\lim_{L\to\infty}\frac{\langle N_{L}\rangle}{L}=\lim_{L\to\infty}\frac{1}{\langle X|L\rangle}.

(6.8)

We illustrate these general statements on Example 1 (see (5.1)), for which $z_{0}$ is explicit,

z_{0}=\frac{2w-1}{w^{2}},

(6.9)

hence

\xi\approx(w-1)^{-2},

(6.10)

and the following asymptotic expressions hold,

$\displaystyle Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$	$\displaystyle\approx$	$\displaystyle\frac{2(w-1)w^{2L}}{(2w-1)^{L+1}},$
$\displaystyle\langle X\|L\rangle$	$\displaystyle\mathrel{\mathop{\to}\limits_{L\to\infty}}$	$\displaystyle\frac{2w-1}{2(w-1)},$
$\displaystyle\langle N_{L}\rangle$	$\displaystyle\approx$	$\displaystyle\frac{L}{\langle X\|L\rangle}+\frac{w}{(w-1)(2w-1)},$
$\displaystyle\mathop{\rm Var}\nolimits N_{L}$	$\displaystyle\approx$	$\displaystyle L\frac{2w}{(2w-1)^{2}}-\frac{w(2w^{2}-1)}{(2w-1)^{2}(w-1)^{2}},$
$\displaystyle\mathrm{p}_{n}(L)$	$\displaystyle\approx$	$\displaystyle\frac{1}{\sqrt{2\pi\mathop{\rm Var}\nolimits{N_{L}}}}\exp\Big{(}-\frac{(n-\langle N_{L}\rangle)^{2}}{2\mathop{\rm Var}\nolimits{N_{L}}}\Big{)},$
$\displaystyle\nu$	$\displaystyle=$	$\displaystyle\frac{2(w-1)}{2w-1}.$	(6.11)

Figure 6 depicts a comparison between the exact finite-size expression of the density $\nu_{L}$ obtained by means of (4.14) for $L=1000$ as a function of $w$ , and the asymptotic expression (6.11). It vanishes at the transition $w=1$ , where the system becomes critical.

More generally, if ${\theta}<1$ , close to the transition, we get

\nu\sim(w-1)^{1/{\theta}-1},

(6.12)

as can be easily inferred by means of the expansion (7.1), a result already present in burda3 , later recovered in bar2 .

If ${\theta}>1$ , the density $\nu$ tends to $1/\langle X\rangle$ when $w\to 1$ , as can be seen on (6.2) and (6.8), using the expansion (7.2) (see also (7.24)). The density vanishes in the condensed phase since $\langle N_{L}\rangle$ is finite (see (8.5)), it is therefore discontinuous at the transition, as noted in burda3 ; bar2 . Likewise, it is easy to see that

\xi^{-1}\sim\left\{\begin{array}[]{ll}(w-1)^{1/{\theta}}&{\theta}<1\vspace{12pt}\\ w-1&{\theta}>1.\end{array}\right.

(6.13)

The correlation length diverges at the transition, while the order parameter $\nu$ is either continuous ( ${\theta}<1$ ) or discontinuous ( ${\theta}>1$ ), as seen above. The transition is therefore of mixed order burda3 ; bar2 .

Finally we note that the intervals $X_{i}$ behave essentially as iid random variables, with distribution (6.1), hence the statistics of the longest interval belongs to the Gumbel class gumbel ; gnedenko . This is detailed on Example 2 in bar3 .

7 Critical regime ( $w=1$ ) for tied-down renewal processes

In this regime, the behaviour of the quantities of interest strongly depends on whether the index ${\theta}$ is smaller or larger than unity. The discussion below is organised accordingly. Part of the material of this section can be found in more details in wendel ; wendel1 and is also addressed in bar3 ; barma . Here we summarise these former studies and complement them by a detailed analysis of the distribution of the number of intervals $N_{L}$ and of the distribution $\pi(k|L)$ of the size of a generic interval. We also come back on the distribution of the longest interval.

If $w=1$ the singularity is at $z=1$ , or, setting $z={\rm e}^{-s}$ , at $s=0$ . The generating function $\tilde{f}(z)$ becomes the Laplace transform $\hat{f}(s)$ which has the expansion

	$\displaystyle\hat{f}(s)\mathrel{\mathop{\approx}\limits_{s\to 0}}1-\|a\|s^{\theta},\qquad$	$\displaystyle{\theta}<1$			(7.1)
	$\displaystyle\hat{f}(s)\mathrel{\mathop{\approx}\limits_{s\to 0}}1-s\langle X\rangle+\cdots+as^{\theta},\qquad$	$\displaystyle{\theta}>1$			(7.2)

with

a=c\,\Gamma(-{\theta})={\theta}\Gamma(-{\theta})k_{0}^{\theta},

(7.3)

i.e., $c={\theta}k_{0}^{\theta}$ , where $k_{0}$ is a microscopic scale, defined as

g(k)=\sum_{j>k}f(j)\mathrel{\mathop{\approx}\limits_{k\to\infty}}\Big{(}\frac{k_{0}}{k}\Big{)}^{\theta}.

(7.4)

The parameter $a$ is negative if $0<\theta<1$ , positive if $1<\theta<2$ , and so on. For instance, $\Gamma(-1/2)=-2\sqrt{\pi}$ , $\Gamma(-3/2)=4\sqrt{\pi}/3$ , $\Gamma(-5/2)=-8\sqrt{\pi}/15$ , and so on.

7.1 Distribution $f(k)$ with index ${\theta}<1$

Since $\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(1,z)=1/(1-\tilde{f}(z))$ , in Laplace space we have $\hat{Z}^{\mathop{\scriptstyle\mathrm{td}}}(1,s)\approx 1/as^{{\theta}}$ which yields the expression of the partition function (see (4.6) in wendel1 )

Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}\frac{{\theta}\sin\pi{\theta}}{\pi c}L^{{\theta}-1}.

(7.5)

For instance, setting ${\theta}=1/2$ and $c=1/(2\sqrt{\pi})$ restores (5.3).

7.1.1 The number of intervals

We have (see (4.10) in wendel1 ),

\langle N_{L}\rangle\approx\frac{A({\theta})}{c}L^{\theta},\qquad A({\theta})=\frac{\Gamma(1+{\theta})}{\Gamma(1-{\theta})\Gamma(2{\theta})},

(7.6)

which can be easily deduced from (4.14). For the specific case of Example 1 (see (5.1)), we have the exact result (see (2.47) in wendel1 )

\langle N_{L}\rangle=\frac{1}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}-1=\frac{2^{2L}}{\left(2L\atop L\right)}-1\approx\sqrt{\pi L},

(7.7)

which is in agreement with (7.6), with ${\theta}=1/2,c=1/(2\sqrt{\pi})$ . We know from (4.9) that the distribution of $N_{L}$ is given by the ratio

\mathrm{p}_{n}(L)=\frac{Z_{n}(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}.

(7.8)

For $n$ and $L$ large $\mathrm{p}_{n}(L)$ has a scaling form. On the one hand, according to the generalised central limit theorem, the scaling form of the numerator is given by

Z_{n}(L)\approx\frac{1}{n^{1/{\theta}}}\mathcal{L}_{{\theta},c}\left(\frac{L}{n^{1/{\theta}}}\right),

(7.9)

where $\mathcal{L}_{{\theta},c}$ is the density of the stable law of index ${\theta}$ , tail parameter $c$ and asymmetry parameter $\beta=1$ (see e.g., cg2019 ). Then using (7.5), we get, with $u=L/n^{1/{\theta}}$ ,

\mathrm{p}_{n}(L)\approx\frac{\pi c}{{\theta}\sin\pi{\theta}}\frac{1}{L^{\theta}}\frac{L}{n^{1/{\theta}}}\mathcal{L}_{{\theta},c}\left(\frac{L}{n^{1/{\theta}}}\right)\approx\frac{\pi c}{{\theta}\sin\pi{\theta}}\frac{1}{L^{\theta}}u\mathcal{L}_{{\theta},c}(u).

(7.10)

The Lévy distribution of index ${\theta}=1/2$ has the explicit expression

\mathcal{L}_{1/2,c}(u)=\frac{c\,{\rm e}^{-\pi c^{2}/u}}{u^{3/2}},

(7.11)

hence (see (2.49) in wendel1 ), for Example 1 (see (5.1)),

\mathrm{p}_{n}(L)\approx\frac{v}{2\sqrt{L}}{\rm e}^{-v^{2}/4},\quad v=\frac{1}{\sqrt{u}}=\frac{n}{\sqrt{L}}.

(7.12)

Moreover, for this example, for $n$ and $L$ finite, $\mathrm{p}_{n}(L)$ is explicit since both $Z_{n}(L)$ given by (5.6) and $Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)$ , given by (5.3), are known explicitly.

7.1.2 Single interval distribution

Two regimes are to be considered.

(i) In all regimes where $\ell=L-k$ is large, using (7.5) we have

\pi(k|L)=f(k)\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L-k)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx f(k)\left(1-\frac{k}{L}\right)^{{\theta}-1}.

(7.13)

For instance, if $1\ll k\ll L$ ,

\pi(k|L)\approx f(k)\approx\frac{c}{k^{1+{\theta}}},

(7.14)

while if $k={\lambda}L$ , with ${\lambda}\in(0,1)$ ,

\pi(k|L)\approx\frac{c}{{\lambda}^{1+{\theta}}(1-{\lambda})^{1-{\theta}}}\frac{1}{L^{1+{\theta}}},

(7.15)

which is minimum at ${\lambda}=(1+{\theta})/2$ .

(ii) On the other hand, if $\ell=L-k=O(1)$ ,

\pi(L-\ell|L)=f(L-\ell)\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,\ell)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx\frac{\pi c^{2}Z^{\mathop{\scriptstyle\mathrm{td}}}(1,\ell)}{{\theta}\sin\pi{\theta}}\frac{1}{L^{2{\theta}}}.

(7.16)

In particular, for $k=L$ ,

\pi(L|L)=\frac{f(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx\frac{\pi c^{2}}{{\theta}\sin\pi{\theta}}\frac{1}{L^{2{\theta}}}.

(7.17)

In this regime the ratio of $\pi(k|L)$ to the estimate (7.13) reads

\frac{\pi(L-\ell|L)}{f(L-\ell)(1-k/L)^{{\theta}-1}}=\frac{\pi c\,\ell^{1-{\theta}}Z^{\mathop{\scriptstyle\mathrm{td}}}(1,\ell)}{{\theta}\sin\pi{\theta}},

(7.18)

which tends to one when $\ell$ becomes large, if one refers to (7.5).

All these results can be illustrated on Example 1 (see (5.1)). For instance figure 7 gives a comparison between the exact expression of $\pi(k|L)$ computed for $L=100$ by means of the middle expression in (7.13) and its asymptotic form given by the rightmost expression in (7.13).

The generating function of the mean interval $\langle X|L\rangle$ given in (4.32) yields the estimate, in Laplace space,

\sum_{L\geq 1}z^{L}\langle X|L\rangle_{|{\rm num}}\approx-\frac{\hat{f}^{\prime}(s)}{1-\hat{f}(s)}\approx\frac{{\theta}}{s},

(7.19)

hence (see (4.18) in wendel1 )

\langle X|L\rangle\approx\frac{{\theta}}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx\frac{\pi c}{\sin\pi{\theta}}L^{1-{\theta}}.

(7.20)

This result can be recovered by taking the average of the estimate (7.13). It predicts correctly that the product $\langle X|L\rangle\langle N_{L}\rangle\sim L$ . For Example 1, (see (5.1)), the computation leads to the exact result (see (2.58) in wendel1 )

\langle X|L\rangle=\frac{1}{2Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx\frac{\sqrt{\pi L}}{2}.

(7.21)

7.1.3 The longest interval

The study of the statistics of the longest interval for the critical case, including the scaling analysis of $p(k|L)$ for ${\theta}<1$ , is done in wendel ; wendel1 (see also bar3 for a study of Example 2).

For $k>L/2$ , in contrast with the case of random allocation models and zrp where $p(k|L)=q(k|L)=n\pi(k|L)$ (see §2.1.2), the ‘enhancement factor’ $n$ is now replaced by the factor $1+\langle N_{L-k}\rangle$ (see (4.49)), which is equal to one for $k=L$ , where $p(L|L)=\pi(L|L)$ are equal, see (4.44). Using (7.6) for $\langle N_{L}\rangle$ , and (7.13) for $\pi(k|L)$ in (4.49) allows to recover the universal scaling expression valid for $k>L/2$ (see equation (4.48) in wendel1 ), in the limit where $1\ll k\sim L$ , with $r=k/L$ kept fixed,

q(k|L)\approx\frac{1}{L}\frac{A({\theta})}{r^{1+{\theta}}(1-r)^{1-2{\theta}}},

(7.22)

where $A({\theta})$ is given in (7.6).

Though there is no condensation at criticality, some features are precursors of this phenomenon. For instance, the mean longest interval $\langle X_{\rm max}\rangle$ scales as $L$ while the typical interval $\langle X|L\rangle$ scales as $L^{1-{\theta}}$ . However, not only $X_{\rm max}\equiv X^{(1)}$ scales as $L$ but also all the following maxima $X^{(r)}$ ( $k=2,3,\dots$ ) do so wendel ; wendel1 . Moreover $X_{\rm max}$ continues to fluctuate when $L\to\infty$ while for genuine condensation as in section 2 above or in section 8 below, its distribution is peaked. Finally, the dominant contribution to the weight of $\pi(k|L)$ comes from values of $k$ less than a small cutoff.

7.2 Distribution $f(k)$ with index ${\theta}>1$

For ${\theta}>1$ , we have, using (4.5) (see (4.73) in wendel1 ),

Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)\approx\frac{1}{\langle X\rangle}+\frac{c}{{\theta}({\theta}-1)\langle X\rangle^{2}}L^{1-{\theta}}.

(7.23)

The average value of $N_{L}$ is obtained by means of (4.14)⁴⁴4Equation (7.24) corrects the inaccurate expression (4.74) given in wendel1 for this quantity.

\langle N_{L}\rangle\approx\left\{\begin{array}[]{ll}\frac{L}{\langle X\rangle}+\frac{c}{({\theta}-1)(2-{\theta})\langle X\rangle^{2}}L^{2-{\theta}}&1<{\theta}<2\vspace{12pt}\\ \frac{L}{\langle X\rangle}+\frac{\mathop{\rm Var}\nolimits{X}}{\langle X\rangle^{2}}&{\theta}>2\end{array}\right.

(7.24)

The subleading correction in the second line (i.e., for ${\theta}>2$ ) is given by the correction term of the first line which is now negative and decreasing. The distribution of $N_{L}$ reads

\mathrm{p}_{n}(L)=\frac{Z_{n}(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx\langle X\rangle Z_{n}(L).

(7.25)

The asymptotic estimate for $\langle X|L\rangle$ is obtained by analysing (4.32), yielding for ${\theta}>1$ ,

\langle X|L\rangle\approx\langle X\rangle-\frac{c}{{\theta}-1}L^{1-{\theta}},

(7.26)

which is the same expression as for a free renewal process gl2001 . The single interval distribution has the form

\pi(k|L)=f(k)\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L-k)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\mathrel{\mathop{\approx}\limits_{L\to\infty}}f(k),

(7.27)

except for $L-k$ finite, where in particular,

\pi(L|L)=\frac{f(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)}\approx f(L)\langle X\rangle.

(7.28)

The distribution of the longest interval is analysed in wendel1 (see also bar3 for the case of Example 2 defined in (5.7)). The result is

F(k|L)\approx{\rm e}^{-[L/\langle X\rangle](k_{0}/k)^{\theta}},

(7.29)

where $k_{0}$ is related to the tail coefficient by $c={\theta}k_{0}^{\theta}$ . Setting

X_{\rm max}=k_{0}\left(\frac{L}{\langle X\rangle}\right)^{1/\theta}Y_{L},

(7.30)

we have, as $L\to\infty$ , $Y_{L}\to Y^{F}$ , with limiting distribution

\mathop{\rm Prob}\nolimits(Y^{F}<x)={\rm e}^{-1/x^{\theta}},

(7.31)

which is the Fréchet law frechet ; gnedenko . Therefore

\langle X_{{\rm max}}\rangle\approx k_{0}\left(\frac{L}{\langle X\rangle}\right)^{1/\theta}\underbrace{\langle Y^{F}\rangle}_{\Gamma(1-1/\theta)},

(7.32)

as was already the case for free renewal processes gms2015 .

8 Condensed phase ( $w<1$ ) for tied-down renewal processes

The aim of this section—central in the present work—is to investigate the statistics of the number of intervals and characterise the fluctuations of the condensate. We start by analysing the large $L$ behaviour of the quantities of interest which are functions of $L$ only (partition function, moments and distribution of $N_{L}$ ). We then investigate the regimes for the distributions of the size of a generic interval, $\pi(k|L)$ , and of the longest one, $p(k|L)$ . Related material can be found in gia1 ; bar3 ; barma .

8.1 Asymptotic estimates at large $L$

Starting from (4.5) and linearising with respect to the singular part, we obtain, when $L\to\infty$ , for any value of ${\theta}$ ,

Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)\approx\frac{w}{(1-w)^{2}}\frac{c}{L^{1+{\theta}}}\approx\frac{w}{(1-w)^{2}}f(L).

(8.1)

Alternatively, it suffices to notice that, for $n$ fixed and $L$ large, for any subexponential distribution chistyakov ,

Z_{n}(L)\approx nf(L),

(8.2)

as for example in (5.6), which entails

Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)=\sum_{n\geq 0}w^{n}Z_{n}(L)\approx\sum_{n\geq 0}w^{n}nf(L),

(8.3)

hence restores (8.1). Likewise, we find

(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)\approx\frac{2w}{(1-w)^{3}}f(L).

(8.4)

If one substitutes (8.1) in the expression of the mean $\langle N_{L}\rangle$ (4.15), we obtain, the superuniversal result, independent of ${\theta}$ ,

\langle N_{L}\rangle\mathrel{\mathop{\to}\limits_{L\to\infty}}\frac{1+w}{1-w}.

(8.5)

This result can be found alternatively using (4.17) together with (8.1) and (8.4), or else using (8.7) below.

More generally, this superuniversality also holds for the asymptotic distribution of $N_{L}$ . Using (4.20), the latter reads

\langle v^{N_{L}}\rangle=\sum_{n\geq 0}v^{n}\mathrm{p}_{n}(L)=\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(vw,L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\approx\frac{y(1-w)^{2}}{(1-vw)^{2}},

(8.6)

hence extracting the coefficient of order $n$ in $y$ of this expression leads to the asymptotic distribution, independent of ${\theta}$ ,

\mathrm{p}_{n}(L)\mathrel{\mathop{\to}\limits_{L\to\infty}}\mathrm{p}_{n}=n(1-w)^{2}w^{n-1}.

(8.7)

This distribution is depicted in figure 8. The same result can be found by noting that

\mathrm{p}_{n}(L)=\frac{w^{n}Z_{n}(L)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)}\approx\frac{Z_{n}(L)}{f(L)}(1-w)^{2}w^{n-1}\mathrel{\mathop{\to}\limits_{L\to\infty}}n(1-w)^{2}w^{n-1},

(8.8)

using (8.2) again. The interpretation of (8.7) is simple: $N_{L}-1$ is the sum of two independent geometric random variables (see (11.4)), which represent the fluid on either side of the remaining interval, which is the condensate.

The inverse moment $\langle 1/N_{L}\rangle$ can be obtained by using (8.7) above,

\Big{\langle}\frac{1}{N_{L}}\Big{\rangle}\to 1-w.

(8.9)

As a consequence of (8.9) we have, for any value of ${\theta}$ ,

\langle X|L\rangle\approx(1-w)L,

(8.10)

which can also be deduced from the asymptotic analysis of (4.32).

8.2 Regimes for the single interval distribution

For $L$ large, the asymptotic expression of the single interval distribution is obtained by substituting (8.1) in (4.24), leading to

\pi(k|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}(1-w)^{2}f(k)\frac{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-k)}{f(L)}.

(8.11)

Figure 9 depicts the distribution $\pi(k|L)$ (together with the distribution of the longest interval $p(k|L)$ , see §8.3 below), for $L=60$ and $w=0.6$ computed from (4.26), with Example 1 (see (5.1)). As can be seen on this figure, there are three distinct regions for $\pi(k|L)$ , namely, from left to right, a downhill region, followed by a long dip region, then an uphill region which accounts for the fluctuations of the condensate.

Let us discuss the behaviour of $\pi(k|L)$ given by (8.11) in each of these regions successively.

1.

Downhill region. For $k$ finite we have, using again (8.1),

$\pi(k|L)\approx wf(k).$ (8.12)

Introducing a cutoff $\Lambda$ , such that $1\ll\Lambda\ll L$ , the weight of this downhill region can be estimated as

$\sum_{k=1}^{\Lambda}\pi(k|L)\approx\sum_{k=1}^{\Lambda}wf(k)\to\sum_{k=1}^{\infty}wf(k)=w.$ (8.13)

Dip region. In the dip region, where $k$ and $L-k$ are simultaneously large, setting $k={\lambda}L$ in (8.12) ( $0<{\lambda}<1$ ) and using (8.1) yields the estimate

\pi(k|L)\approx wf(k)\frac{f(L-k)}{f(L)}\approx\frac{w}{[{\lambda}(1-{\lambda})]^{1+{\theta}}}\frac{c}{L^{1+{\theta}}}.

(8.14)

The weight of the dip region can be estimated using (8.14), as

\sum_{k=\Lambda}^{L-\Lambda}\pi(k|L)\approx L^{-{\theta}}w\,c\int_{{\epsilon}}^{1-{\epsilon}}\frac{{\rm d}{\lambda}}{[{\lambda}(1-{\lambda})]^{1+{\theta}}},

(8.15)

where, for the sake of simplicity, we chose $\Lambda={\epsilon}L$ . The two downhill and uphill regions are therefore well separated by the dip region, as is conspicuous on figure 9. Note the similarity between (8.15) and (3.12), with the correspondence

1-\frac{1}{n}\hookrightarrow 1-\left\langle\frac{1}{N_{L}}\right\rangle=w.

(8.16)

Uphill region. The uphill region corresponds to $\ell=L-k$ finite, where (8.11) simplifies into

\pi(L-\ell|L)\approx(1-w)^{2}Z^{\mathop{\scriptstyle\mathrm{td}}}(w,\ell).

(8.17)

The weight of this region can be estimated as

	$\displaystyle\sum_{\ell=0}^{\Lambda}\pi(L-\ell\|L)$	$\displaystyle\approx$	$\displaystyle(1-w)^{2}\sum_{\ell=0}^{\Lambda}Z^{\mathop{\scriptstyle\mathrm{td}}}(w,\ell)$		(8.18)
		$\displaystyle\to$	$\displaystyle(1-w)^{2}\sum_{\ell=0}^{\infty}Z^{\mathop{\scriptstyle\mathrm{td}}}(w,\ell)=1-w$		(8.18)

where the last step is obtained by setting $z=1$ in the expression of the generating function (4.5). The right side of (8.18), $1-w$ , is precisely the limiting value of $\langle 1/N_{L}\rangle$ , see (8.9). This result is therefore the analogue of (3.14) with the correspondence given in (8.16).

In view of (8.13) and (8.18) we conclude that the weights of the downhill and uphill regions add to one, in line with the fact that the contribution of the dip region is subdominant, as shown above.

8.3 Regimes for the distribution of the longest interval

As can be seen on figure 9, there are two main regions for the distribution of the maximum, $p(k|L)$ . For $k\leq L/2$ , the contribution of $p(k|L)$ to the total weight is vanishingly small. The argument is the same as in §3.3. Hence we restrict the rest of the discussion to the region $(L/2<k\leq L)$ , where $p(k|L)$ has the simpler expression $q(k|L)$ given by (4.47). Using (8.1), its asymptotic estimate is

q(k|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}(1-w)^{2}f(k)\frac{(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L-k)}{f(L)},

(8.19)

or, equivalently, for $\ell=L-k\in(0,L/2-1)$ ,

\displaystyle q(L-\ell|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}(1-w)^{2}f(L-\ell)\frac{(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell)}{f(L)}.

(8.20)

Let us note that the ratio

r(\ell)=\frac{q(L-\ell|L)}{\pi(L-\ell|L)}=\frac{(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell)}{Z^{\mathop{\scriptstyle\mathrm{td}}}(w,\ell)}=1+\langle N_{\ell}\rangle

(8.21)

is an increasing function of $\ell$ , with first values

r(0)=1,\quad r(1)=2,\quad r(2)=\frac{3wf(1)^{2}+2f(2)}{wf(1)^{2}+f(2)},

(8.22)

and so on, reaching the limit, for large $\ell$ ,

r(\ell)\to\frac{2}{1-w}.

(8.23)

The inset in figure 9 depicts the prediction $q(k|L)=\pi(k|L)(1+\langle N_{L-k}\rangle)$ (see (4.49)) which coincides perfectly with $p(k|L)$ in the second half $k>L/2$ . This graph is very similar to the inset in figure 5.

We now discuss the behaviour of $q(L-\ell|L)$ in the two regions of interest.

1.

For $\ell$ finite, we have, see (8.20),

$q(L-\ell|L)\approx(1-w)^{2}(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell).$ (8.24)

In particular for $\ell=0$ ,

$p(L|L)=q(L|L)=\pi(L|L)=\mathop{\rm Prob}\nolimits(N_{L}=1)\approx(1-w)^{2}.$ (8.25)

If $1\ll\ell\ll L$ , (8.24) simplifies to

$q(L-\ell|L)\approx\frac{2w}{1-w}f(\ell).$ (8.26)

If $\ell$ et $L-\ell$ are simultaneously large, with $\ell={\lambda}L$ , we have

q(L-\ell|L)\approx\frac{2w}{1-w}f(\ell)\frac{f(L-\ell)}{f(L)}\approx\frac{2w}{1-w}\frac{1}{[{\lambda}(1-{\lambda})]^{1+{\theta}}}\frac{c}{L^{1+{\theta}}},

(8.27)

which is proportional to (8.14), with ratio $2/(1-w)$ . Note the similarity of (8.27) with (3.17) with the correspondence

n-1\hookrightarrow\langle N_{L}\rangle-1=\frac{2w}{1-w}.

(8.28)

The weight under the peak of the condensate tends to unity,

	$\displaystyle\sum_{\ell=0}^{\Lambda}q(L-\ell\|L)$	$\displaystyle\approx$	$\displaystyle(1-w)^{2}\sum_{\ell=0}^{\Lambda}(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell)$		(8.29)
		$\displaystyle\to$	$\displaystyle(1-w)^{2}\sum_{\ell=0}^{\infty}(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell)=1,$		(8.29)

by a computation similar to (8.18), using now the fact that the last sum is equal to $1/(1-w)^{2}$ , as can be seen by setting $z=1$ in the expression of the generating function $\tilde{Z}^{\mathop{\scriptstyle\mathrm{td}}}(w,z)^{2}$ .

We can further analyse the fluctuations of the condensate by considering the width of the peak,

L-\langle X_{\rm max}\rangle=\sum_{\ell=0}^{L-1}\ell\,q(L-\ell|L)\approx\sum_{\ell=0}^{L/2}\ell\,q(L-\ell|L).

(8.30)

The dominant contribution to this sum depends on whether ${\theta}$ is smaller or larger than one.

$\bullet$ If ${\theta}<1$ , the dominant contribution comes from (8.27), i.e., for $\ell$ comparable to $L$ ,

	$\displaystyle L-\langle X_{\rm max}\rangle$	$\displaystyle\approx$	$\displaystyle\frac{2wc}{1-w}L^{1-{\theta}}\int_{0}^{1/2}{\rm d}{\lambda}\frac{{\lambda}}{[({\lambda}(1-{\lambda})]^{1+{\theta}}}$		(8.31)
		$\displaystyle\approx$	$\displaystyle\frac{2wc\,}{1-w}\mathrm{B}_{\frac{1}{2}}\Big{(}1-{\theta},-{\theta}\Big{)}L^{1-{\theta}},$		(8.31)

where $\mathrm{B}(\cdot)$ is the incomplete beta function, which, for example, is equal to $2$ for ${\theta}=1/2$ .

$\bullet$ If ${\theta}>1$ , the main contribution comes from (8.24),

	$\displaystyle L-\langle X_{\rm max}\rangle$	$\displaystyle\approx$	$\displaystyle(1-w)^{2}\sum_{\ell=0}^{\Lambda}\ell(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell)$		(8.32)
		$\displaystyle\to$	$\displaystyle(1-w)^{2}\sum_{\ell=0}^{\infty}\ell(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,\ell)\to\frac{2w}{1-w}\langle X\rangle,$		(8.32)

using (8.4). This last result (8.32) has a simple interpretation. It says that the correction $L-\langle X_{\rm max}\rangle$ is made of $\langle N_{L}\rangle-1=2w/(1-w)$ intervals, of mean length $\langle X\rangle$ . It is therefore the perfect parallel of the result (3.8). Likewise, (8.31) is the perfect parallel of (3.6). In the present case the expression in the right side of this equation is proportional to $\langle N_{L}\rangle-1$ times the critical mean interval $\langle X|L\rangle$ , given in (7.20).

8.4 Discussion

In view of the above analysis the following picture emerges. The ‘contrast’ between the dip and the condensate region increases with $L$ , i.e., the dip centred around $L/2$ becomes deeper and deeper as $L^{-1-{\theta}}$ (see (8.14)) relatively to the height of the peak, which is of order one. An estimate of the contribution of the condensate to the total weight can thus be operationally obtained by summing $\pi(L-\ell|L)$ for $\ell$ in the layer $(0,\Lambda)$ . This sum is asymptotically equal to $1-w$ , according to (8.18), which turns out to be also the asymptotic estimate of $\langle 1/N_{L}\rangle$ . The interpretation of this result is clear. When $X_{1}$ is larger than $L/2$ , this interval is necessarily the longest one, i.e., $X_{1}=X_{\rm max}$ . Furthermore, since this interval is chosen amongst $N_{L}$ intervals, we expect that, in average,

\sum_{\ell=0}^{\Lambda}\pi(L-\ell|L)\approx\mathrel{\mathop{\underbrace{\left\langle\frac{1}{N_{L}}\right\rangle}}\limits_{1-w}}\ \mathrel{\mathop{\underbrace{\sum_{\ell=0}^{\Lambda}p(L-\ell|L)}}\limits_{\approx 1}}.

(8.33)

But the sum on the right side, namely the weight of $X_{\rm max}$ in the same layer is asymptotically equal to one. This simple heuristic reasoning therefore recovers (8.18).

In the condensed phase ( $w<1,L\to\infty$ ) the number of intervals is finite and fluctuates around its mean, which is a superuniversal constant independent of ${\theta}$ . This situation is akin to the case of random allocation models and zrp, when $L\to\infty$ and $n$ is kept fixed. Note that, for the latter, results were independent of the value of ${\theta}$ , too. In both situations condensation is total, the condensed fraction is asymptotically equal to unity.

Table 1 summarises the results found in sections 6, 7 and 8, which demonstrate a large degree of universality.

Table 1: Dominant asymptotic behaviours at large

L

for tied-down renewal processes with power-law distribution (1.1) for

f(k)

, in the different phases.

	disordered	critical ${\theta}<1$	critical ${\theta}>1$	condensed
$\langle N_{L}\rangle$	$\frac{L}{\langle X\|L\rangle}$	$L^{\theta}$	$\frac{L}{\langle X\rangle}$	$\frac{1+w}{1-w}$
$\langle X\|L\rangle$	$\mathrm{constant}$	$L^{1-{\theta}}$	$\langle X\rangle$	$(1-w)L$
$\langle X_{\rm max}\rangle$	$\ln L$	$L$	$L^{1/{\theta}}$	$L$
$Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$	${\rm e}^{L/\xi}$	$L^{{\theta}-1}$	$\frac{1}{\langle X\rangle}$	$L^{-1-{\theta}}$

9 General statements on free renewal processes

We now turn to the case of free renewal processes. The random number $N_{L}$ of intervals up to $L$ is defined through the condition (1.4), $S_{N_{L}}<L<S_{N_{L}+1}$ . The size of the current (unfinished) interval—named the backward recurrence time in renewal theory—is denoted by $B_{L}=L-S_{N_{L}}$ , see figure 2. As will appear shortly, free renewal processes are more complicated to analyse than tdrp, essentially because now there are two kinds of intervals to consider, the intervals $X_{i}$ on one hand, and the last unfinished interval $B_{L}$ , on the other hand.

In the present section, $f(k)=\mathop{\rm Prob}\nolimits(X=k)$ is any arbitrary distribution of the positive random variable $X$ . Later on, in sections 10 and 11, $f(k)$ will obey the form (1.1).

9.1 Joint distribution

As for tdrp a weight $w$ is attached to each interval. The joint probability of the configuration $\{X_{1}=k_{1},\dots,X_{N_{L}}=k_{n},B_{L}=b,N_{L}=n\}$ , reads

	$\displaystyle p(k_{1},\ldots,k_{n},b,n\|L)$	$\displaystyle=$	$\displaystyle\mathop{\rm Prob}\nolimits(\{X_{i}=k_{i}\},B_{L}=b,N_{L}=n)$		(9.1)
		$\displaystyle=$	$\displaystyle\frac{1}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}w^{n}f(k_{1})\ldots f(k_{n})\,g(b)\,\delta\big{(}\sum_{i=1}^{n}k_{i}+b,L\big{)},$		(9.1)

where $g(b)$ is the tail probability (or complementary distribution function) defined in (7.4),

g(b)=\mathop{\rm Prob}\nolimits(X>b)=\sum_{k>b}f(k)=f(b+1)+f(b+2)+\cdots

(9.2)

As mentioned in section 4, since the summands $X_{i}$ have the interpretation of the sizes of the intervals, we take $f(0)=0$ . For $n=0$ ,

p(\{\},b,0|L)_{|{\rm num}}=g(b)\delta(b,L),

(9.3)

corresponding to the event of no renewal occurring between $0$ and $L$ , i.e., $B_{L}=L$ , and where $\{\}$ means empty. The generating function of $g(b)$ is

\tilde{g}(z)=\sum_{b\geq 0}z^{b}g(b)=\frac{1}{1-z}-\sum_{b\geq 1}z^{b}\sum_{k=1}^{b}f(k)=\frac{1-\tilde{f}(z)}{1-z},

(9.4)

with $\tilde{g}(0)=g(0)=1$ .

The denominator of (9.1) is the free partition function, obtained by summing on $n$ , on the $k_{i}$ and on $b$ ,

$\displaystyle Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$	$\displaystyle=$	$\displaystyle\sum_{n\geq 0}\sum_{b\geq 0}\sum_{\{k_{i}\}}\,p(k_{1},\ldots,k_{n},b,n\|L)$	(9.5)
	$\displaystyle=$	$\displaystyle\sum_{n\geq 0}w^{n}\sum_{b\geq 0}\sum_{\{k_{i}\}}f(k_{1})\cdots f(k_{n})g(b)\delta\big{(}\sum_{i=1}^{n}k_{i}+b,L\big{)}$
	$\displaystyle=$	$\displaystyle\sum_{b\geq 0}g(b)\Big{[}\delta(b,L)+\sum_{k_{1}}wf(k_{1})\delta(k_{1}+b,L)$
	$\displaystyle+$	$\displaystyle\sum_{k_{1},k_{2}}w^{2}f(k_{1})f(k_{2})\delta(k_{1}+k_{2}+b,L)+\dots\Big{]}$
	$\displaystyle=$	$\displaystyle\,\mathrel{\mathop{\underbrace{g(L)}}\limits_{n=0}}+\mathrel{\mathop{\underbrace{w\,g\star f}}\limits_{n=1}}+\mathrel{\mathop{\underbrace{w^{2}\,g\star f\star f}}\limits_{n=2}}+\dots=\sum_{n\geq 0}w^{n}\big{(}g\star(f\star)^{n}\big{)}(L).$

For instance,

	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{f}}}(w,0)=1,\qquad Z^{\mathop{\scriptstyle\mathrm{f}}}(w,1)=g(1)+wf(1),\qquad$		(9.6)
	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{f}}}(w,2)=g(2)+wf(2)+wf(1)g(1)+w^{2}f(1)^{2},$		(9.7)

and so on. From (9.5) we have

\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)=\sum_{L\geq 0}z^{L}Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)=\sum_{n\geq 0}\big{(}w\tilde{f}(z)\big{)}^{n}\tilde{g}(z)=\frac{\tilde{g}(z)}{1-w\tilde{f}(z)}.

(9.8)

Let us note that

Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)=\sum_{n\geq 0}w^{n}\sum_{b\geq 0}\mathop{\rm Prob}\nolimits(S_{n}=L-b)g(b)=(g\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L),

(9.9)

where $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L)$ is the partition function (4.3) for tdrp.

The case of usual (unweighted) free renewal processes is recovered by setting $w=1$ in these expressions. This yields $Z^{\mathop{\scriptstyle\mathrm{f}}}(1,L)=1$ , as can be seen from (9.8), and the joint probability distribution (9.1) simplifies accordingly.

9.2 Distribution of the number of intervals

As for tdrp we denote this distribution as⁵⁵5Whenever no ambiguity arises, we use the same notations for the observables of the tied-down and free renewal processes. Otherwise, when necessary, we add a superscript, as e.g., for $Z^{\mathop{\scriptstyle\mathrm{td}}}$ , $Z^{\mathop{\scriptstyle\mathrm{f}}}$ or in (9.18).

\mathrm{p}_{n}(L)=\mathop{\rm Prob}\nolimits(N_{L}=n).

(9.10)

We read on the successive terms of $Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$ that

\mathrm{p}_{0}(L)=\frac{g(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}=\mathop{\rm Prob}\nolimits(B_{L}=L),\quad\mathrm{p}_{1}(L)=\frac{w(g\star f)(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)},

(9.11)

and so on. More generally,

\mathrm{p}_{n}(L)=\frac{w^{n}\big{(}g\star(f\star)^{n}\big{)}(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}=\frac{w^{n}(g\star Z_{n})(L)}{(g\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)}.

(9.12)

Summing (9.1) on $b$ and on the $k_{i}$ , and taking the generating function with respect to $L$ yields

\sum_{L\geq 0}z^{L}\mathrm{p}_{n}(L)_{|{\rm num}}=\Big{(}w\tilde{f}(z)\Big{)}^{n}\tilde{g}(z),

(9.13)

to be compared to (9.8). Therefore

\sum_{L\geq 0}z^{L}\langle N_{L}\rangle_{|{\rm num}}=\frac{w\tilde{f}(z)\tilde{g}(z)}{(1-w\tilde{f}(z))^{2}}=w\frac{{\rm d}}{{\rm d}w}\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z),

(9.14)

and

\langle N_{L}\rangle=w\frac{{\rm d}\ln Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}{{\rm d}w},

(9.15)

as for tdrp, see (4.15). More generally,

\sum_{L\geq 0}z^{L}\left\langle v^{N_{L}}\right\rangle_{|{\rm num}}=\sum_{n\geq 0}v^{n}\Big{(}w\tilde{f}(z)\Big{)}^{n}\tilde{g}(z)=\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(vw,z),

(9.16)

so we obtain, as for tdrp, see (4.20),

\left\langle v^{N_{L}}\right\rangle=\sum_{n\geq 0}v^{n}\mathrm{p}_{n}(L)=\frac{Z^{\mathop{\scriptstyle\mathrm{f}}}(vw,L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}.

(9.17)

Finally, comparing (9.14) to (4.14), we note the relationship between the free and tied-down cases,

\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L}\rangle_{|{\rm num}}=\sum_{k=0}^{L}g(k)\langle N^{\mathop{\scriptstyle\mathrm{td}}}_{L-k}\rangle_{|{\rm num}},

(9.18)

hence using (4.17), we have

\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L}\rangle_{|{\rm num}}=(g\star Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)-Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L),

(9.19)

or, equivalently,

(g\star Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)=Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)(1+\langle N_{L}\rangle).

(9.20)

9.3 Distribution of $S_{N_{L}}$

We recall that this quantity is the sum of the $N_{L}$ intervals before $L$ , see (1.3) and figure 2. By definition,

\mathop{\rm Prob}\nolimits(S_{N_{L}}=j)=\langle\delta(S_{N_{L}},j)\rangle=\sum_{n\geq 0}\sum_{b\geq 0}\sum_{\{k_{i}\}}\delta(S_{N_{L}},j)p(k_{1},\ldots,k_{n},b,n|L).

(9.21)

Thus, using (9.1), we have

\sum_{L\geq 0}z^{L}\sum_{j=0}^{L}x^{j}\mathop{\rm Prob}\nolimits(S_{N_{L}}=j)_{|{\rm num}}=\frac{\tilde{g}(z)}{1-w\tilde{f}(xz)},

(9.22)

which generalises the expression for this quantity when $w=1$ gl2001 . By derivation with respect to $x$ then setting $x=1$ , leads to

\sum_{L\geq 0}z^{L}\langle S_{N_{L}}\rangle_{|{\rm num}}=\frac{wz\tilde{f}^{\prime}(z)\tilde{g}(z)}{(1-w\tilde{f}(z))^{2}},

(9.23)

whose summation with (9.31) below leads to the equality

\sum_{L\geq 0}z^{L}\big{(}\langle S_{N_{L}}\rangle_{|{\rm num}}+\langle B_{L}\rangle_{|{\rm num}}\big{)}=z\frac{{\rm d}\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)}{{\rm d}z},

(9.24)

which expresses the sum rule

\langle S_{N_{L}}\rangle+\langle B_{L}\rangle=L.

(9.25)

The asymptotic behaviours of these quantities for $w=1$ are simple gl2001 . If ${\theta}<1$ , then $\langle S_{N_{L}}\rangle\approx{\theta}L$ , $\langle B_{L}\rangle\approx(1-{\theta})L$ . If $1<{\theta}<2$ , then $\langle S_{N_{L}}\rangle\approx L-cL^{2-{\theta}}/[{\theta}(2-{\theta})\langle X\rangle]$ , and $\langle B_{L}\rangle$ follows by difference.

9.4 Distribution of $B_{L}$

The distribution of $B_{L}$ is obtained by summing (9.1) on the $k_{i}$ and on $n$

\mathop{\rm Prob}\nolimits(B_{L}=b)=\frac{1}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}g(b)\sum_{n\geq 0}w^{n}\sum_{\{k_{i}\geq 1\}}f(k_{1})\cdots f(k_{n})\delta\Big{(}\sum_{i=1}^{n}k_{i}+b,L\Big{)}.

(9.26)

This entails that

\mathop{\rm Prob}\nolimits(B_{L}=b)_{|{\rm num}}=g(b)\sum_{n\geq 0}w^{n}\mathop{\rm Prob}\nolimits(S_{n}=L-b)=g(b)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-b),

(9.27)

hence, using (9.9)

\mathop{\rm Prob}\nolimits(B_{L}=b)=\frac{g(b)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-b)}{(g\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L)}.

(9.28)

The generating function with respect to $L$ of (9.27) reads

\sum_{L\geq 0}z^{L}\mathop{\rm Prob}\nolimits(B_{L}=b)_{|{\rm num}}=\frac{z^{b}g(b)}{1-w\tilde{f}(z)},

(9.29)

which summed upon $b$ gives $\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)$ back. Taking now the generating function of (9.29) with respect to $b$ yields

\sum_{L\geq 0}z^{L}\sum_{b\geq 0}y^{b}\mathop{\rm Prob}\nolimits(B_{L}=b)_{|{\rm num}}=\frac{\tilde{g}(yz)}{1-w\tilde{f}(z)}.

(9.30)

The mean $\langle B_{L}\rangle$ ensues by taking the derivative of the right side of this equation with respect to $y$ and setting $y$ to one,

\sum_{L\geq 0}z^{L}\langle B_{L}\rangle_{|{\rm num}}=\frac{z\tilde{g}^{\prime}(z)}{1-w\tilde{f}(z)}.

(9.31)

9.5 Single interval distribution

As for tdrp, the single interval distribution,

\pi(k|L)=\langle\delta(X_{1},k)\rangle,

(9.32)

is obtained by summing $p(k_{1},\dots,k_{n},b,n|L)$ on $k_{1},\dots,k_{n}$ , $b$ and $n\geq 1$

$\displaystyle\pi(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{b\geq 0}g(b)\Big{[}\sum_{k_{1}}\delta(k_{1},k)wf(k_{1})\delta(k_{1}+b,L)$	(9.33)
	$\displaystyle+$	$\displaystyle\sum_{k_{1},k_{2}}\delta(k_{1},k)w^{2}f(k_{1})f(k_{2})\delta(k_{1}+k_{2}+b,L)+\cdots\Big{]}$
	$\displaystyle=$	$\displaystyle\sum_{b\geq 0}g(b)\Big{[}wf(k)\delta(k+b,L)+\sum_{k_{2}}w^{2}f(k)f(k_{2})\delta(k+k_{2}+b,L)+\dots\Big{]}$
	$\displaystyle=$	$\displaystyle wf(k)Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k).$

\pi(k|L)=\frac{wf(k)Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}.

(9.34)

The generating function of the numerator is therefore

\displaystyle\sum_{L\geq 0}z^{L}\pi(k|L)_{|{\rm num}}=wz^{k}f(k)\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)=\frac{wz^{k}f(k)\tilde{g}(z)}{1-w\tilde{f}(z)},

(9.35)

to be compared to (4.32). Though (9.34) is formally identical to (4.26) the normalisations of these two distributions are different. Indeed,

\displaystyle\sum_{L\geq 0}z^{L}\sum_{k=1}^{L}\pi(k|L)_{|{\rm num}}=w\tilde{f}(z)\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)=\frac{w\tilde{f}(z)\tilde{g}(z)}{1-w\tilde{f}(z)}=\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)-\tilde{g}(z),

(9.36)

which means that

1-\sum_{k=1}^{L}\pi(k|L)=\frac{g(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}=\mathrm{p}_{0}(L)=\mathop{\rm Prob}\nolimits(B_{L}=L).

(9.37)

In other words

\sum_{k=1}^{L}\mathop{\rm Prob}\nolimits(X=k|L)+\mathop{\rm Prob}\nolimits(B_{L}=L)=1.

(9.38)

The distribution $\pi(k|L)$ is thus defective. The recursion relation for $Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$ follows from (9.33) and (9.37)

Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)=g(L)+\sum_{k=1}^{L}wf(k)Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k).

(9.39)

At the end point, $k=L$ , we have

\pi(L|L)=\frac{wf(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)},

(9.40)

which corresponds to the event $\{X_{1}=L,B_{L}=0,N_{L}=1\}$ , and which is formally the same as (4.25).

Both sides of (9.39) are equal to unity if $w=1$ . Note that the computation of $\pi(k|L)$ for $w=1$ is given in gl2001 , yielding $\pi(k|L)=f(k)$ with $k\leq L$ , which shows that this distribution is already defective for $w=1$ , with $\sum_{k=1}^{L}\pi(k|L)=1-g(L)$ .

9.6 Mean interval $\langle X|L\rangle$

We proceed as in §4.4. The mean interval is, by definition,

\langle X|L\rangle=\sum_{k\geq 1}k\pi(k|L).

(9.41)

Multiplying (9.35) by $k$ and summing upon $k$ yields

\sum_{L\geq 1}z^{L}\langle X|L\rangle_{|{\rm num}}=\frac{wz\tilde{f}^{\prime}(z)\tilde{g}(z)}{1-w\tilde{f}(z)},

(9.42)

to be compared to (4.32) for tdrp.

9.7 The longest interval

In the present case the longest interval is defined as

X_{\rm max}={\rm max}(X_{1},X_{2},\dots,B_{L}).

(9.43)

Its distribution function is

F(k|L)=\mathop{\rm Prob}\nolimits(X_{{\rm max}}\leq k|L)=\sum_{n\geq 0}\sum_{b=0}^{k}\sum_{k_{1}=1}^{k}\dots\sum_{k_{n}=1}^{k}p(\{k_{i}\},b,n|L)=\frac{F(k|L)_{|{\rm num}}}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)},

(9.44)

with initial value

F(k|0)_{|{\rm num}}=1.

(9.45)

As for tdrp, $F(L|L)_{|{\rm num}}=Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$ , hence $F(L|L)=1$ . The generating function of the numerator is

	$\displaystyle\sum_{L\geq 0}z^{L}F(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\tilde{g}(z,k)\Bigg{(}1+\sum_{n\geq 1}\prod_{i=1}^{n}\Big{(}\sum_{k_{i}=1}^{k}wf(k_{i})z^{k_{i}}\Big{)}\Bigg{)}$		(9.46)
		$\displaystyle=$	$\displaystyle\tilde{g}(z,k)\Bigg{(}1+\sum_{n\geq 1}\Big{(}w\tilde{f}(z,k)\Big{)}^{n}\Bigg{)}=\frac{\tilde{g}(z,k)}{1-w\tilde{f}(z,k)},$		(9.46)

where

\tilde{f}(z,k)=\sum_{j=1}^{k}z^{j}f(j),\qquad\tilde{g}(z,k)=\sum_{b=0}^{k}z^{b}g(b),

(9.47)

are related by

1-\tilde{f}(z,k)=z^{k+1}g(k)+(1-z)\tilde{g}(z,k).

(9.48)

Note that

	$\displaystyle F^{\mathop{\scriptstyle\mathrm{f}}}(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\sum_{b=0}^{k}g(b)F^{\mathop{\scriptstyle\mathrm{td}}}(k\|L-b)_{\|{\rm num}}$		(9.49)
		$\displaystyle=$	$\displaystyle\sum_{b=0}^{k}g(b)\sum_{n=0}^{L}w^{n}F_{n}(k\|L-b)_{\|{\rm num}},$		(9.49)

where we used (4.38) in the last step.

The distribution of $X_{\rm max}$ is given by the difference

p(k|L)=\mathop{\rm Prob}\nolimits(X_{{\rm max}}=k)=F(k|L)-F(k-1|L),

(9.50)

where $F(0|L)=\delta(L,0)$ , with generating function

	$\displaystyle\sum_{L\geq 0}z^{L}p(k\|L)_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\tilde{g}(z,k)\Big{(}\frac{1}{1-w\tilde{f}(z,k)}-\frac{1}{1-w\tilde{f}(z,k-1)}\Big{)}$		(9.51)
		$\displaystyle=$	$\displaystyle\frac{wz^{k}f(k)\tilde{g}(z,k)}{[1-w\tilde{f}(z,k)][1-w\tilde{f}(z,k-1)]}.$		(9.51)

At the end point, $k=L$ , we have

p(L|L)=\frac{wf(L)+g(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}=\pi(L|L)+\mathop{\rm Prob}\nolimits(B_{L}=L),

(9.52)

where the last two terms correspond respectively to the events $\{X_{1}=L,B_{L}=0,N_{L}=1\}$ , cf (9.40) and $\{B_{L}=L,N_{L}=0\}$ , cf (9.11). The mean is given by the sum

\langle X_{\rm max}\rangle=\sum_{k=0}^{L}\Big{(}1-F(k|L)\Big{)}=L-\sum_{k=1}^{L-1}F(k|L),

(9.53)

which implies a relation between the generating functions

	$\displaystyle\sum_{L\geq 0}z^{L}{\langle X_{\rm max}\rangle}_{\|{\rm num}}=\sum_{k\geq 0}\Big{(}\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)-\tilde{F}(k,z)_{\|{\rm num}}\Big{)}$		(9.54)
	$\displaystyle=\sum_{k\geq 0}\Big{(}\frac{\tilde{g}(z)}{1-w\tilde{f}(z)}-\frac{\tilde{g}(z,k)}{1-w\tilde{f}(z,k)}\Big{)},$		(9.55)

where $\tilde{F}(k,z)_{|{\rm num}}$ is given by (9.46).

Denoting again the restriction of $p(k|L)$ to $k>L/2$ by $q(k|L)$ , we can obtain an expression of this quantity by a similar reasoning as done for (4.47) in §4.5. We thus obtain

	$\displaystyle q(k\|L)$	$\displaystyle=$	$\displaystyle\mathop{\rm Prob}\nolimits(B_{L}=k)+\frac{wf(k)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}\sum_{b=0}^{L-k}g(b)(Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}})(w,L-k-b)$		(9.56)
		$\displaystyle=$	$\displaystyle\mathop{\rm Prob}\nolimits(B_{L}=k)+\frac{wf(k)\big{(}g\star Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}}\big{)}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)},$		(9.56)

expressing the fact that the longest interval can be either $B_{L}$ or a generic interval $X_{i}$ . Equivalently, using (9.20), this reads

\displaystyle q(k|L)=\mathop{\rm Prob}\nolimits(B_{L}=k)+\frac{wf(k)Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}(1+\langle N_{L-k}\rangle),

(9.57)

or else

\displaystyle q(k|L)=\mathop{\rm Prob}\nolimits(B_{L}=k)+\pi(k|L)(1+\langle N_{L-k}\rangle),

(9.58)

to be compared to (4.48) and (4.49). For $k=L$ , (9.52) is recovered.

Remarks

1. We can prove (9.56) otherwise. We start from the first line of (9.49) and take the discrete derivative with respect to $k$ of both sides, which yields

\Delta_{k}F^{\mathop{\scriptstyle\mathrm{f}}}(k|L)_{|{\rm num}}=g(k)F^{\mathop{\scriptstyle\mathrm{td}}}(k|L-k)_{|{\rm num}}+\sum_{b=0}^{k}g(b)\Delta_{k}F^{\mathop{\scriptstyle\mathrm{td}}}(k|L-b)_{|{\rm num}}.

(9.59)

If $k>L/2$ , using (4.54), we recognise the numerator of the first term of (9.56) in the first term of this equation. Likewise, if $k>L/2$ , it can be shown that the second term of the above equation is equal to the numerator of the second term of (9.56).

2. From (4.54) we infer that, if $k>L/2$ ,

F^{\mathop{\scriptstyle\mathrm{f}}}(k|L-k)_{|{\rm num}}=Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k).

(9.60)

Lastly, consider the probability ${Q_{L}}$ that the last unfinished interval is the longest one, that is

{Q_{L}}=\mathop{\rm Prob}\nolimits(B_{L}\geq{\rm max}(X_{1},\dots,X_{N_{L}}))=\sum_{n\geq 0}\sum_{b\geq 0}\sum_{k_{1}=1}^{b}\dots\sum_{k_{n}=1}^{b}p(\{k_{i}\},b,n|L).

(9.61)

The generating function with respect to $L$ of its numerator reads

\sum_{L\geq 0}z^{L}{Q_{L}}_{|{\rm num}}=\sum_{b\geq 0}\frac{z^{b}g(b)}{1-w\tilde{f}(z,b)},

(9.62)

hence

	$\displaystyle\tilde{Z}^{\mathop{\scriptstyle\mathrm{f}}}(w,z)-\sum_{L\geq 0}z^{L}{Q_{L}}_{\|{\rm num}}$	$\displaystyle=$	$\displaystyle\frac{\tilde{g}(z)}{1-w\tilde{f}}-\sum_{b\geq 0}\frac{z^{b}g(b)}{1-w\tilde{f}(z,b)}$		(9.63)
		$\displaystyle=$	$\displaystyle\sum_{b\geq 0}z^{b}g(b)\Big{(}\frac{1}{1-w\tilde{f}}-\frac{1}{1-w\tilde{f}(z,b)}\Big{)}.$		(9.64)

10 Critical regime ( $w=1$ ) for free renewal processes

In this section and in the following one (section 11) we specialise the discussion to the case of a subexponential distribution $f(k)=\mathop{\rm Prob}\nolimits(X=k)$ with asymptotic power-law decay (1.1).

The critical regime is thoroughly described in gl2001 ; gms2015 and builds upon previous studies feller ; dynkin ; lamperti58 ; lamperti61 . The results are summarised in table 2, which also presents the main outcomes for the disordered regime ( $w>1$ ).

The initial analysis of the distribution of the longest interval is due to Lamperti lamperti61 . Let us just recover the universal asymptotic expression of $q(k|L)$ , for $1\ll k\sim L$ , with $r=k/L$ fixed, when ${\theta}<1$ , given in lamperti61 ,

q(k|L)\approx\frac{1}{L}\frac{\sin\pi{\theta}}{\pi}\frac{1}{r^{1+{\theta}}(1-r)^{1-{\theta}}}.

(10.1)

This result can be simply inferred from (9.58). For the first term of (9.58), we obtain for $L$ large, using (7.5),

\mathop{\rm Prob}\nolimits(B_{L}=k)=g(k)Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L-k)\mathrel{\mathop{\approx}\limits_{L\to\infty}}\frac{\sin\pi{\theta}}{\pi}\frac{1}{k^{\theta}(L-k)^{1-{\theta}}},

(10.2)

i.e., an arcsine law in the variable $r=k/L$ , which is a well-known result dynkin ; feller ; gl2001 . For the second term of (9.58), we need

\langle N_{L-k}\rangle\mathrel{\mathop{\approx}\limits_{L\to\infty}}\frac{\sin\pi{\theta}}{\pi c}(L-k)^{\theta},

(10.3)

which is obtained using (9.14). Adding the two terms of (9.58) yields (10.1).

11 Condensed phase ( $w<1$ ) for free renewal processes

We now focus on the case of most interest, namely the condensed phase, $w<1$ , for subexponential distributions (1.1). As in section 8, we investigate the statistics of the number of intervals, the single-interval distribution and we characterise the fluctuations of the condensate. We also address the statistics of the last interval $B_{L}$ .

11.1 Asymptotic estimates at large $L$

The asymptotic analysis of (9.8) yields, for large $L$ ,

	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$	$\displaystyle\approx$	$\displaystyle\frac{g(L)}{1-w}\Big{(}1-\frac{wc\Gamma(1-{\theta})^{2}}{(1-w)\Gamma(1-2{\theta}){\theta}L^{\theta}}\Big{)},\quad{\theta}<1$		(11.1)
	$\displaystyle Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$	$\displaystyle\approx$	$\displaystyle\frac{g(L)}{1-w}\Big{(}1+\frac{2w{\theta}\langle X\rangle}{(1-w)L}\Big{)},\quad{\theta}>1.$		(11.2)

where $g(\cdot)$ is defined in (7.4). As a consequence, (9.17) yields

\left\langle v^{N_{L}}\right\rangle\mathrel{\mathop{\to}\limits_{L\to\infty}}\frac{1-w}{1-vw},

(11.3)

leading asymptotically to a geometric distribution for $N_{L}$ ,

\mathop{\rm Prob}\nolimits(N_{L}=n)\to\mathrm{p}_{n}=(1-w)w^{n},

(11.4)

independent of ${\theta}$ , from which entails, in the same limit,

\langle N_{L}\rangle\to\frac{w}{1-w}.

(11.5)

For $L$ large but finite, we find, using (9.15) and (11.1) or (11.2),

	$\displaystyle\langle N_{L}\rangle\approx\frac{w}{1-w}\Big{(}1-\frac{c}{1-w}\frac{{\theta}\,\Gamma(1-{\theta})^{2}}{\Gamma(1-2{\theta})L^{\theta}}\Big{)},\quad{\theta}<1.$
	$\displaystyle\langle N_{L}\rangle\approx\frac{w}{1-w}\Big{(}1+\frac{2{\theta}\langle X\rangle}{(1-w)L}\Big{)},\quad{\theta}>1.$		(11.6)

The asymptotic estimate of the mean interval can be obtained from the analysis of (9.42),

	$\displaystyle\langle X\|L\rangle$	$\displaystyle\approx$	$\displaystyle\frac{wc\Gamma(1-{\theta})^{2}}{\Gamma(2-2{\theta})}L^{1-{\theta}},\quad{\theta}<1,$		(11.7)
	$\displaystyle\langle X\|L\rangle$	$\displaystyle\approx$	$\displaystyle w(1+{\theta})\langle X\rangle,\quad{\theta}>1.$		(11.8)

These expressions can also be obtained by means of the marginal distribution $\pi(k|L)$ , see below. Likewise, the asymptotic estimate of the mean sum can be obtained from the analysis of (9.23),

	$\displaystyle\langle S_{N_{L}}\rangle$	$\displaystyle\approx$	$\displaystyle\frac{wc}{1-w}\frac{\Gamma(1-{\theta})^{2}}{\Gamma(2-2{\theta})}L^{1-{\theta}},\qquad{\theta}<1,$		(11.9)
	$\displaystyle\langle S_{N_{L}}\rangle$	$\displaystyle\approx$	$\displaystyle\frac{w}{1-w}(1+{\theta})\langle X\rangle,\qquad{\theta}>1,$		(11.10)

hence

	$\displaystyle\langle B_{L}\rangle$	$\displaystyle\approx$	$\displaystyle L-\frac{wc}{1-w}\frac{\Gamma(1-{\theta})^{2}}{\Gamma(2-2{\theta})}L^{1-{\theta}},\qquad{\theta}<1,$		(11.11)
	$\displaystyle\langle B_{L}\rangle$	$\displaystyle\approx$	$\displaystyle L-\frac{w}{1-w}(1+{\theta})\langle X\rangle,\qquad{\theta}>1.$		(11.12)

11.2 Regimes for the single interval distribution

We proceed as was done for tdrp (see section 8.2). Using (11.1) we have the estimate, at large $L$ ,

\pi(k|L)\mathrel{\mathop{\approx}\limits_{L\to\infty}}w(1-w)\frac{f(k)Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k)}{g(L)}.

(11.13)

Figure 10 depicts the distribution $\pi(k|L)$ (together with the distribution $\mathop{\rm Prob}\nolimits(B_{L}=k)$ , see §11.3 below), for $L=60$ and $w=0.8$ computed with Example 1 (see (5.1)). As can be seen on this figure, there are three distinct regions for $\pi(k|L)$ , that we consider in turn.

1.

Downhill region. For $k$ finite, using (11.1) again, we have

$\pi(k|L)\approx\frac{wf(k)g(L-k)}{g(L)}\approx wf(k).$ (11.14)

Dip region. When $k$ and $L-k$ are simultaneously large, setting $k={\lambda}L$ in (11.13) ( $0<{\lambda}<1$ ) yields the estimate, at large $L$ ,

\pi(k|L)\approx\frac{wf(k)g(L-k)}{g(L)}\approx\frac{1}{{\lambda}^{1+{\theta}}(1-{\lambda})^{\theta}}\frac{wc}{L^{1+{\theta}}}\approx\frac{1}{{\lambda}^{1+{\theta}}(1-{\lambda})^{\theta}}wf(L),

(11.15)

with a dip centred around $k_{\min}=L(1+{\theta})/(1+2{\theta})$ .

3.

In the region corresponding to $L-k$ finite, (11.13) simplifies into

$\pi(k|L)\approx\frac{w(1-w){\theta}}{L}Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L-k).$ (11.16)

In particular, for $k=L$ , $\pi(L|L)\approx w(1-w){\theta}/L$ , cf (9.40).

The weight of the downhill region is found to be asymptotically equal to $w$ , using the same reasoning as in §8.2. The complement is borne by $g(L)/Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)\approx 1-w$ , see (9.37) and (11.21). The two other regions therefore do not contribute to the total weight, asymptotically.

In order to complete the picture we now investigate the distribution of $B_{L}$ , the last unfinished interval.

11.3 Regimes for the distribution of $B_{L}$

According to (9.27), and in view of (11.1), for $L$ large we have

\mathop{\rm Prob}\nolimits(B_{L}=b)\approx(1-w)\frac{g(b)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-b)}{g(L)}.

(11.17)

Let us discuss the different regimes of this expression according to the magnitude of $b$ .

1.

If $b$ is finite, the asymptotic estimate of $Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-b)$ is given by (8.1), hence (11.17) becomes

$\mathop{\rm Prob}\nolimits(B_{L}=b)\approx\frac{w}{1-w}\frac{g(b)f(L-b)}{g(L)}.$ (11.18)

If $b\sim L$ , the same estimate (11.18) still holds, then setting $L-b={\lambda}L$ , we get

\mathop{\rm Prob}\nolimits(B_{L}=b)\approx\frac{w}{1-w}\frac{g(b)f(L-b)}{g(L)}\approx\frac{wc}{1-w}\frac{L^{-1-{\theta}}}{{\lambda}^{1+{\theta}}(1-{\lambda})^{{\theta}}},

(11.19)

which has its minimum at $k_{\min}=L\,{\theta}/(1+2{\theta})$ .

3.

If $L-b$ is finite, (11.17) becomes

$\mathop{\rm Prob}\nolimits(B_{L}=b)\approx(1-w)Z^{\mathop{\scriptstyle\mathrm{td}}}(w,L-b),$ (11.20)

in particular, see (9.37),

$\mathop{\rm Prob}\nolimits(B_{L}=L)=\mathrm{p}_{0}(L)=\frac{g(L)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}\to 1-w.$ (11.21)

Let us estimate, for later use, the probability that $B_{L}$ is less than $L/2$ . The result depends on the value of ${\theta}$ .

$\bullet$ If ${\theta}<1$ , using (11.19), we have

\mathop{\rm Prob}\nolimits(B_{L}\leq L/2)\approx\frac{wc}{1-w}L^{-{\theta}}\int_{1/2}^{1}\frac{{\rm d}{\lambda}}{{\lambda}^{1+{\theta}}(1-{\lambda})^{{\theta}}}\approx\frac{wc}{1-w}\mathrm{B}_{\frac{1}{2}}\Big{(}1-{\theta},-{\theta}\Big{)}L^{-{\theta}}.

(11.22)

$\bullet$ If ${\theta}>1$ , using (11.18), we have

\mathop{\rm Prob}\nolimits(B_{L}\leq L/2)\approx\frac{w\,{\theta}}{(1-w)L}\sum_{b=0}^{L/2}g(b)\approx\frac{w\,{\theta}}{(1-w)L}\sum_{b=0}^{L}g(b)\approx\frac{w\,{\theta}\langle X\rangle}{(1-w)L}.

(11.23)

11.4 Regimes for the distribution of the longest interval

The bulk of the distribution of $X_{\rm max}$ lies in the region $k>L/2$ , and is therefore given by $q(k|L)$ , see (9.56) or (9.58).

We start by giving an illustration. The distribution $p(k|L)$ of $X_{\rm max}$ on the whole interval $(0,L)$ is depicted in figure 11 for Example 1 (see (5.1)). The restriction of this distribution to the second half $k>L/2$ , that is $q(k|L)$ , computed from (9.58) is also depicted. The corresponding figure for ${\theta}>1$ is qualitatively alike.

Let us now compare the respective contributions of each of the two terms in (9.56) to the total weight in the region $k>L/2$ . The first term is investigated in §11.3 above. The asymptotic estimate of the second term is as follows. We start with the case ${\theta}<1$ .

$\bullet$ If ${\theta}<1$ , we consider two regimes.
(i) The main contribution of the second term to the total weight comes from the regime $L-k\sim L$ . Using the asymptotic estimate for large $L$ ,

\big{(}g\star Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}}\big{)}(w,L)\approx\frac{g(L)}{(1-w)^{2}}\Big{(}1-\frac{2wc}{1-w}\frac{{\theta}\Gamma(-{\theta})^{2}}{\Gamma(1-2{\theta})L^{\theta}}\Big{)},

(11.24)

and setting $L-k={\lambda}L$ , the second term reads

\frac{wf(k)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}\frac{c}{(1-w)^{2}{\theta}(L-k)^{\theta}}\approx\frac{wc}{1-w}\frac{L^{-1-{\theta}}}{{\lambda}^{{\theta}}(1-{\lambda})^{1+{\theta}}},

(11.25)

which is similar to (11.19). Summing this expression upon ${\lambda}$ from $0$ to $1/2$ yields (11.22), that is $\mathop{\rm Prob}\nolimits(B_{L}\leq L/2)$ . Therefore adding this contribution to the first one, namely $\mathop{\rm Prob}\nolimits(B_{L}>1/2)$ , gives unity, up to small corrections, in agreement with the fact that the weight of $p(k|L)$ in the left domain $k<L/2$ is negligible.

(ii) If $L-k$ is finite, then the second term reads

\frac{wf(k)\big{(}g\star Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}}\big{)}(w,L-k)}{Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)}\approx\frac{(1-w)w{\theta}}{L}\big{(}g\star Z^{\mathop{\scriptstyle\mathrm{td}}}\star Z^{\mathop{\scriptstyle\mathrm{td}}}\big{)}(w,L-k),

(11.26)

which is subdominant compared to (11.20).

In order to get the correction of $\langle X_{\rm max}\rangle$ to $L$ , we take the average of (9.56), by integrating each of the terms from $0$ to $1/2$ upon ${\lambda}$ , using (11.19) and (11.25). Adding the contributions coming from the two terms, we finally obtain for the dominant correction,

	$\displaystyle L-\langle X_{\rm max}\rangle$	$\displaystyle\approx$	$\displaystyle\frac{wc\,}{1-w}\left(\mathrm{B}_{\frac{1}{2}}(1-{\theta},1-{\theta})+\mathrm{B}_{\frac{1}{2}}(2-{\theta},-{\theta})\right)L^{1-{\theta}}$		(11.27)
		$\displaystyle=$	$\displaystyle\frac{wc\,}{1-w}\mathrm{B}_{\frac{1}{2}}\Big{(}1-{\theta},-{\theta}\Big{)}L^{1-{\theta}},$		(11.27)

which has the same structure as (8.31) or (3.6). The comment made below (8.32) also holds here. In the present case the expression in the right side of (11.27) is proportional to $\langle N_{L}\rangle=w/(1-w)$ times the critical mean interval $\langle X|L\rangle\sim L^{1-{\theta}}$ , as given in gl2001 .

$\bullet$ Likewise, for ${\theta}>1$ , the weight of the first term in (9.56) dominates upon the second one, and we find for the correction of the mean to $L$ ,

L-\langle X_{\rm max}\rangle\approx\frac{w}{1-w}\langle X\rangle,

(11.28)

showing that this correction is made of $\langle N_{L}\rangle=w/(1-w)$ intervals of size $\langle X\rangle$ . This expression is therefore the perfect parallel of (8.32) or (3.8).

11.5 Probability for the last interval to be the longest one

Lastly, we investigate the behaviour of ${Q_{L}}$ defined in (9.61) as the probability that $B_{L}$ is the longest interval. An estimate of ${Q_{L}}$ for $w<1$ can be obtained by means of the inequality

1-{Q_{L}}\lesssim\mathop{\rm Prob}\nolimits(B_{L}\leq L/2).

(11.29)

In view of (11.22) and (11.27), we infer that asymptotically for $L$ large, if ${\theta}<1$ ,

{Q_{L}}\approx\frac{\langle X_{\rm max}\rangle}{L},

(11.30)

while, if ${\theta}>1$ , in view of (11.23) and (11.28),

1-Q_{L}\approx\frac{{\theta}}{L}(L-\langle X_{\rm max}\rangle).

(11.31)

In other words, for $w<1$ , $Q_{L}\to 1$ . At criticality, $w=1$ , $Q_{L}\to Q_{\infty}=0.626\dots$ , if ${\theta}<1$ , while if ${\theta}>1$ , $Q_{L}\sim L^{-(1-1/{\theta})}$ gms2015 . For $w>1$ , $Q_{L}\to 0$ .

Figure 12 depicts $Q_{L}$ as a function of $w$ for Example 1 (see (5.1)) and for three different sizes, crossing at the universal critical value $Q_{\infty}=0.626\dots$ for $w=1$ gms2015 and the data collapse obtained by using the scaling variable $x=(w-1)L^{1/2}$ .

Table 2 summarises the results found in section 11 and recapitulates the results for the two other phases (disordered and critical). This table demonstrates a large degree of universality of the results, as was the case of table 1, with which it should be put in perspective.

Table 2: Dominant asymptotic behaviours at large

L

for free renewal processes with power-law distribution (1.1) for

f(k)

, in the different phases. The results in columns 2 and 3 (critical phase) are taken from gl2001 ; gms2015 . In the last column,

\langle X|L\rangle\sim L^{1-{\theta}}

{\theta}<1

, or

\langle X|L\rangle\approx\mathrm{constant}

{\theta}>1

	disordered	critical ${\theta}<1$	critical ${\theta}>1$	condensed
$\langle N_{L}\rangle$	$\frac{L}{\langle X\rangle}$	$L^{\theta}$	$\frac{L}{\langle X\rangle}$	$\frac{w}{1-w}$
$\langle X\|L\rangle$	$\mathrm{constant}$	$L^{1-{\theta}}$	$\langle X\rangle$	$L^{1-{\theta}}$ or constant
$\langle B_{L}\rangle$	constant	$L$	$L^{2-{\theta}}$	$L$
$\langle X_{\rm max}\rangle$	$\ln L$	$L$	$L^{1/{\theta}}$	$L$
$Z^{\mathop{\scriptstyle\mathrm{f}}}(w,L)$	${\rm e}^{L/\xi}$	$1$	$1$	$L^{-{\theta}}$

12 Conclusion

Let us summarise the salient aspects of this study.

We first recalled the main features of the condensation transition taking place for random allocation models and zrp in the thermodynamic limit ( $L,n\to\infty$ with fixed ratio $\rho=L/n$ ), when the distribution of occupations is subexponential. These occupations are independent and identically distributed random variables conditioned by the value of their sum. The phase diagram is made of three phases: disordered, critical, and condensed. The critical line $\rho=\rho_{c}({\theta})$ , where ${\theta}>1$ , separates the disordered phase at low density from the condensed phase at high density. Condensation manifests itself by the occurrence, in the thermodynamic limit, of a unique site with macroscopic occupation. In the language of particles and boxes (or sites), the condensate is by definition the site with the largest occupation. In the language of sums of random variables used all throughout the present work, the condensate $X_{{\rm max}}$ is the unique summand with extensive value. In the thermodynamic limit, the fraction $X_{\rm max}/L$ no longer fluctuates and takes the asymptotic value $1-\rho_{c}/\rho$ .

A second scenario for the same class of models consists in taking the $L\to\infty$ limit keeping the number of sites (or summands) fixed. In this limit there is again a single extensive summand $X_{\rm max}$ , but now the fraction $X_{\rm max}/L$ tends to unity, which means that condensation is total. The novelty is that this occurs irrespective of the existence of a first moment $\langle X\rangle$ , or in other words, irrespective of whether ${\theta}$ is smaller or larger than one. If $L$ is large but finite, the distribution of $X_{\rm max}$ is peaked, with a width $L-\langle X_{\rm max}\rangle$ scaling as $L^{1-{\theta}}$ if ${\theta}<1$ , with a known amplitude, or asymptotically equal to $(n-1)\langle X\rangle$ , if ${\theta}>1$ . Note that, in contrast to the previous case, one can no longer speak of a phase transition, nor even of a phase, since the system is made of a finite number $n$ of summands (or sites).

This scenario is a good preparation for the study of condensation in free and tied-down renewal processes, with power-law distribution of intervals (1.1), which is the main motivation of the present work. Instead of particle occupations and sites one speaks in terms of renewal events and intervals, whose sizes sum up to a fixed value $L$ . The novelty—and complication—is that the number of these renewal events, or equivalently of intervals, $N_{L}$ , fluctuates. For instance these renewal points are the passages by the origin of a random walk, as depicted in figure 3. A weight $w$ is attached to each renewal event. In the language of random walks (or of polymer chains) $w$ represents the reward or penalty when the walk touches the origin fisher ; gia1 ; gia2 . A high value of $w$ favours configurations with a large number of intervals $N_{L}$ , i.e., a disordered phase—or localised phase in the language of random walks. A low value of $w$ favours configurations with a small number of intervals $N_{L}$ , i.e., a condensed phase—or delocalised phase in the language of random walks. It is therefore intuitively clear that the same scenario of total condensation as seen above should prevail, where now the driving force is no longer a change in the density, $\rho$ , but a change in the value of the weight $w$ attached to each interval (or summand). In this respect it is worth noting the similarity between equations (3.6), (8.31) and (11.27) on one hand, and the similarity between equations (3.8), (8.32) and (11.28) on the other hand.

It turns out that, in the condensed phase, when $L\to\infty$ , the distribution of the number of intervals, $N_{L}$ , is superuniversal, i.e., model independent, since it only depends on $w$ and not even on the index ${\theta}$ of the power-law decay (1.1). This distribution is geometric for free renewal processes, while it is the convolution of two such distributions for tdrp. More generally, an important distinction is to be made according to whether ${\theta}$ is less or larger than unity. In the first case the distribution $f(k)$ has no first moment, atypical events play a major role and the system becomes self-similar at criticality. In the second case the observables of interest depend on the first moment $\langle X\rangle$ , which is finite.

In closing, let us broaden the perspective. The phase transition occurring when $w$ passes through unity is second order for the density of intervals $\nu$ (defined in (6.8)) if ${\theta}<1$ , and first order if ${\theta}>1$ . On the other hand the correlation length diverges at the transition, see (6.13). The transition is therefore mixed order as was pointed out for the particular case of tdrp with Example 2 (see (5.7)) in burda3 ; bar2 . Furthermore, the magnetisation, defined as the alternating sum $m=(X_{1}-X_{2}+X_{3}-\cdots)/L$ , changes, when $L\to\infty$ , from the value $0$ in the disordered phase to $\pm 1$ in the condensed phase since condensation is total. More on this can be found in bar2 . If ${\theta}<1$ the distribution of the magnetisation at criticality is broad and self-similar, both for free lamperti58 ; gl2001 and tied-down renewal processes wendel2 . At criticality, for ${\theta}<1$ , the non stationary two-time (or two-space) correlation function is also self-similar, again for both processes gl2001 ; wendel2 .⁶⁶6After submission of the present work, a study devoted to the statistics of $X_{\rm max}$ in the range $(L/2,L)$ for tied-down or free renewal processes at criticality ( $w=1$ ) was presented in barkai . For the tdrp case the result (4.48) with $w=1$ is obtained. For the free renewal case, barkai predicts, if $w=1$ , $q(k|L)=g(k)(\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L-k}\rangle-\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L-k-1}\rangle)+f(k)(1+\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L-k}\rangle),$ (12.1) which is (9.57), with $w=1$ , noting that $Z^{\mathop{\scriptstyle\mathrm{td}}}(1,L)=\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L}\rangle-\langle N^{\mathop{\scriptstyle\mathrm{f}}}_{L-1}\rangle$ , as is clear by taking the generating functions of both sides.

Acknowledgements.

It is a pleasure to thank G Giacomin, M Loulakis and J-M Luck for enlightening discussions. I am also indebted to S Grosskinsky and S Janson for useful correspondence.

Appendix

Appendix A On equation (3.18)

Let us explain the argument leading to (3.18) and the origin of the hierarchical structure mentioned in §3.3⁷⁷7I am indebted to M Loulakis for sharing his comments on this part with me..

1.

In the uphill region where $L-X_{\rm max}$ is finite, $X_{\rm max}$ is the unique big summand and (3.16) holds. This property stems from the fact that when the sum of $n$ subexponential random variables is conditioned to a large value $L$ , all the dependence is absorbed by the maximum and the ensemble of $n-1$ smaller variables becomes asymptotically independent. This property, initially put forward by early workers, has been progressively refined in subsequent studies ferrari ; armendariz2011 ; janson .
2.

In the dip region, where $X_{\rm max}>L/2$ , since $L-X_{\rm max}$ gets large, the sum $\sum_{i=1}^{n-1}X_{i}$ becomes subjected to a large deviation event. This event will be realised by $X^{(2)}$ , the second largest summand, typically equal to $L-X_{\rm max}$ . We thus obtain (3.17).

One can now iterate the reasoning. If $X_{\rm max}=k\leq L/2$ , the difference $L-k\geq k$ cannot accommodate a single big summand $X^{(2)}=j$ since the latter should be less than $X_{\rm max}$ . Now

L-X_{\rm max}-X^{(2)}\mathrel{\mathop{\approx}\limits_{L\to\infty}}\sum_{i=1}^{n-2}X_{i}.

(A.1)

where the sum in the right side is subjected to a large deviation, which will be realised by a third large summand $X^{(3)}$ . Since $L-k-j$ should be less than $j$ , the constraint $j\geq(L-k)/2$ holds. Moreover $X^{(2)}\leq X_{\rm max}$ imposes the condition $(L-k)/2\leq k$ , i.e. $k\geq L/3$ . We are thus lead to the asymptotic estimate

\mathop{\rm Prob}\nolimits(X_{\rm max}=k|S_{n}=L)\approx\sum_{j=\frac{L-k}{2}}^{k}n(n-1)\frac{f(k)f(j)Z_{n-2}(L-k-j)}{Z_{n}(L)},

(A.2)

and therefore

\mathop{\rm Prob}\nolimits(L/3\leq X_{\rm max}\leq L/2|S_{n}=L)\mathrel{\mathop{\approx}\limits}\sum_{k=L/3}^{L/2}\ \sum_{j=\frac{L-k}{2}}^{k}n(n-1)\frac{f(k)f(j)Z_{n-2}(L-k-j)}{Z_{n}(L)}.

(A.3)

For ${\theta}<1$ , the analysis of this expression in the continuum limit leads to

\mathop{\rm Prob}\nolimits(L/3\leq X_{\rm max}\leq L/2|S_{n}=L)\approx(n-1)(n-2)c^{2}\frac{A({\theta})}{L^{2{\theta}}},

(A.4)

where the amplitude $A({\theta})$ is given by

A({\theta})=\int_{1/3}^{1/2}{\rm d}x\,\int_{\frac{1-x}{2}}^{x}{\rm d}y\,\frac{1}{[xy(1-x-y)]^{1+{\theta}}}.

(A.5)

For instance, $A(1/2)=2\pi$ , $A(1/3)=9\sqrt{3}\,\Gamma(2/3)^{3}/(4\pi)$ .

For ${\theta}>1$ , the analysis of (A.3) yields

\mathop{\rm Prob}\nolimits(L/3<X_{\rm max}<L/2|S_{n}=L)\approx\frac{(n-1)c\,2^{2+2{\theta}}}{L^{1+{\theta}}}\frac{(n-2)\langle X\rangle}{2},

(A.6)

for continuous random variables, and

\mathop{\rm Prob}\nolimits(L/3\leq X_{\rm max}\leq L/2|S_{n}=L)\approx\frac{(n-1)c\,2^{2+2{\theta}}}{L^{1+{\theta}}}\frac{(n-2)\langle X\rangle+1}{2},

(A.7)

for discrete ones.

One can iterate the reasoning leading to (A.3) and derive the weights of the successive sectors $(L/4,L/3)$ , $(L/5,L/4$ ), etc. For instance, for $X_{\rm max}$ in the interval $(L/4,L/3)$ , one finds

\mathop{\rm Prob}\nolimits(X_{\rm max}=k|S_{n}=L)\approx\sum_{j=\frac{L-k}{3}}^{k}\sum_{i=\frac{L-k-j}{2}}^{j}n(n-1)(n-2)\frac{f(k)f(j)f(i)Z_{n-3}(L-k-j-i)}{Z_{n}(L)},

(A.8)

which yields

\mathop{\rm Prob}\nolimits(L/4\leq X_{\rm max}\leq L/3|S_{n}=L)\sim L^{-\gamma},\qquad\gamma=\left\{\begin{array}[]{ll}3{\theta}&\textrm{if }{\theta}\leq 2\vspace{4pt}\\ 2(1+{\theta})&\textrm{if }{\theta}>2,\end{array}\right.

(A.9)

For example, if ${\theta}=1/2$ , one finds

\mathop{\rm Prob}\nolimits(L/4\leq X_{\rm max}\leq L/3|S_{n}=L)\approx(n-1)(n-2)(n-3)c^{3}\frac{\sqrt{2}\,\pi/3}{L^{3/2}}.

(A.10)

Appendix B Weight of the maximum in the left region for a Lévy $\frac{1}{2}$ stable law

We want to determine the weight of the maximum in the left region considered in §3.3,

P_{n}=\mathop{\rm Prob}\nolimits(X_{\rm max}\leq L/2|S_{n}=L)=\sum_{k=0}^{L/2}p_{n}(k|L)=1-\sum_{k=L/2+1}^{L}p_{n}(k|L)=1-\sum_{k=L/2+1}^{L}n\pi_{n}(k|L),

(B.1)

on the particular example of a a Lévy $\frac{1}{2}$ stable law. We use a continuum formalism where distributions are densities and variables are real numbers for the particular case where $f(k)$ is the Lévy $\frac{1}{2}$ stable density (7.11),

f(k)=\mathcal{L}_{\frac{1}{2},c}(k)=\frac{c}{k^{3/2}}{\rm e}^{-\pi c^{2}/k}.

(B.2)

Likewise, considering $L$ as a real number and $Z_{n}(L)$ as a density,

Z_{n}(L)=\mathcal{L}_{\frac{1}{2},nc}(k)=\frac{nc}{L^{3/2}}{\rm e}^{-n^{2}\pi c^{2}/L}.

(B.3)

Thus

\pi_{n}(k|L)=\frac{f(k)Z_{n-1}(L-k)}{Z_{n}(L)}

(B.4)

is explicit. Setting $k=L/t$ , we obtain

P_{n}=1-\frac{(n-1)c}{\sqrt{L}}\int_{1}^{2}{\rm d}t\,\frac{t\,{\rm e}^{-\frac{\pi c^{2}(n-t)^{2}}{L(t-1)}}}{(t-1)^{3/2}}.

(B.5)

Setting $c/\sqrt{L}=b/(2\sqrt{\pi})$ , we finally get

P_{n}=\frac{1}{2}\Bigg{(}2-n\mathop{\rm erfc}\Big{(}\frac{(n-2)b}{2}\Big{)}+(n-2){\rm e}^{(n-1)b^{2}}\mathop{\rm erfc}\Big{(}\frac{nb}{2}\Big{)}\Bigg{)}.

(B.6)

For $L$ large, expanding in powers of $b$ , we obtain

	$\displaystyle P_{n}$	$\displaystyle=$	$\displaystyle(n-1)(n-2)\Big{(}\frac{b^{2}}{2}-n\frac{b^{3}}{3\sqrt{\pi}}+(n-1)\frac{b^{4}}{4}+\cdots\Big{)}$		(B.7)
		$\displaystyle=$	$\displaystyle(n-1)(n-2)\Big{(}2\pi\frac{c^{2}}{L}-\frac{8\pi n}{3}\frac{c^{3}}{L^{3/2}}+(n-1)4\pi^{2}\frac{c^{4}}{L^{2}}+\cdots\Big{)}.$		(B.7)

For Example 1, $c=1/(2\sqrt{\pi})$ (see (5.1)), the first term of the expansion,

P_{n}\approx\frac{(n-1)(n-2)}{2L},

(B.8)

matches the predictions made in (A.4) and (A.5).

References

(1) Bialas P, Burda Z and Johnston D 1997 Nucl. Phys. B 493 505 Condensation in the Backgammon model
(2) Bialas P, Bogacz L, Burda Z and Johnston D 2000 Nucl. Phys. B 575 599 Finite size scaling of the balls in boxes model
(3) Janson S 2012 Prob. Surveys 9 103 Simply generated trees, conditioned Galton-Watson trees, random allocations and condensation
(4) Spitzer F 1970 Advances in Math. 5 246 Interaction of Markov Processes
(5) Andjel ED 1982 Ann. Prob. 10 525 Invariant measures for the zero-range process
(6) Drouffe JM, Godrèche C and Camia F 1998 J. Phys. A 31 L19 A simple stochastic model for the dynamics of condensation
(7) Evans MR 2000 Braz. J. Phys. 30 42 Phase transitions in one-dimensional nonequilibrium systems
(8) Jeon I, March P and Pittel B 2000 Ann. Probab. 28 1162 Size of the largest cluster under zero-range invariant measures
(9) Godrèche C and Luck JM 2002 J. Phys.: Condens. Matter 14 1601 Nonequilibrium dynamics of urn models
(10) Godrèche C 2003 J. Phys. A 36 6313 Dynamics of condensation in zero-range processes
(11) Grosskinsky S, Schütz GM and Spohn H 2003 J. Stat. Phys. 113 389 Condensation in the zero range process: stationary and dynamical properties
(12) Evans MR and Hanney T 2005 J. Phys. A 38 R195 Nonequilibrium Statistical Mechanics of the Zero-Range Process and Related Models
(13) Godrèche C and Luck J M 2005 J. Phys. A 38 7215 Dynamics of the condensate in zero-range processes
(14) Godrèche C 2007 Lect. Notes Phys. Springer 716 261 From urn models to zero-range processes: statics and dynamics
(15) Evans MR, Majumdar SN and Zia RKP 2006 J. Stat. Phys. 123 357 Canonical Analysis of Condensation in Factorised Steady States
(16) Ferrari PA, Landim C and Sisko VV 2007 J. Stat. Phys. 128 1153 Condensation for a Fixed Number of Independent Random Variables
(17) Evans MR and Majumdar SN 2008 J. Stat. Mech. P05004 Condensation and Extreme Value Statistics
(18) Armendariz I and Loulakis M 2009 Probab. Theory Relat. Fields 145 175 Thermodynamic limit for the invariant measures in supercritical zero range processes
(19) Armendariz I and Loulakis M 2011 Stochastic Process. Appl. 121 1138 Conditional distribution of heavy tailed random variables on large deviations of their sum
(20) Armendariz I, Grosskinsky S and Loulakis M 2013 Stochastic Process. Appl. 123 3466 Zero-range condensation at criticality
(21) Armendariz I, Grosskinsky S and Loulakis M 2017 Probab. Theory Relat. Fields 169 105 Metastability in a condensing zero-range process in the thermodynamic limit
(22) Godrèche C 2019 J. Stat. Mech. 063207 Condensation for random variables conditioned by the value of their sum
(23) Feller W 1968 1971 An Introduction to Probability Theory and its Applications volumes 1&2 (New York: Wiley)
(24) Doob JL 1948 Trans. Amer. Math. Soc. 63 422 Renewal theory from the point of view of the theory of probability
(25) Smith WL 1958 J. Roy. Statist. Soc. Ser. B 20 243 Renewal Theory and its Ramifications
(26) Cox DR 1962 Renewal theory (London: Methuen)
(27) Wendel JG 1964 Math. Scand. 14 21 Zero-Free Intervals of Semi-Stable Markov Processes
(28) Godrèche C 2017 J. Phys. A 50 195003 Longest interval between zeros of the tied-down random walk, the Brownian bridge and related renewal processes
(29) Godrèche C 2017 J. Stat. Mech. P073205 Two-time correlation and occupation time for the Brownian bridge and tied-down renewal processes
(30) Labarbe JM and Marckert JF 2007 Elect. Journ. Prob. 12 229 Asymptotics of Bernoulli random walks, bridges, excursions and meanders with a given number of peaks
(31) Vervaat W 1979 Ann. Probab. 7 143 A relation between Brownian bridge and Brownian excursion
(32) Fisher M 1984 J. Stat. Phys. 34 667 Walks, Walls, Wetting and Melting
(33) Poland D and Scheraga HA 1966 J. Chem. Phys. 45 1456 Phase Transitions in One Dimension and the Helix-Coil Transition in Polyamino Acids
(34) Poland D and Scheraga HA 1966 J. Chem. Phys. 45 1464 Occurrence of a Phase Transition in Nucleic Acid Models
(35) Bialas P, Burda Z and Johnston D 1999 Nucl. Phys. B 542 413 Phase diagram of the mean field model of simplicial gravity
(36) Bar A and Mukamel D 2014 J. Stat. Mech. P11001 Mixed order transition and condensation in an exactly soluble one dimensional spin model
(37) Bar A, Majumdar SN, Schehr G and Mukamel D 2016 Phys. Rev. 93 052130 Exact extreme-value statistics at mixed-order transitions
(38) Barma M, Majumdar SN and Mukamel D 2019 J. Phys. A 52 254001 Fluctuation-dominated phase ordering at a mixed order transition
(39) Giacomin G 2007 Random polymer models (Imperial College Press: World Scientific)
(40) Giacomin G 2011 Disorder and critical phenomena through basic probability models École d’été de probabilités de Saint-Flour Lect. Notes Math. Springer
(41) Godrèche C and Luck JM 2001 J. Stat. Phys. 104 489 Statistics of the occupation time of renewal processes
(42) Godrèche C Majumdar SN and Schehr G 2015 J. Stat. Mech. P03014 Statistics of the longest interval in renewal processes
(43) Bardou F, Bouchaud J P, Aspect A and Cohen-Tannoudji C 2002 Lévy Statistics and Laser Cooling (Cambridge: Cambridge University Press)
(44) Barkai E 2003 Phys. Rev. Lett. 90 104101 Aging in Subdiffusion Generated by a Deterministic Dynamical System
(45) J. Beltrán J and C. Landim C 2012 Probab. Theory Relat. Fields 152 781 Metastability of reversible condensed zero range processes on a finite set
(46) Gnedenko B V and Kolmogorov A 1954 Limit Distributions for Sums of Independent Random Variables (Cambridge, Mass: Addison-Wesley)
(47) Godrèche C and Luck JM 2001 Eur. Phys. J. B 23, 473 Nonequilibrium dynamics of the zeta urn model
(48) Gumbel E J 1958 Statistics of Extremes (New York: Columbia University Press)
(49) Gnedenko B V 1943 Ann. Math. 44 423 Sur la distribution limite du terme maximum d’une série aléatoire
(50) Fréchet M 1927 Ann. Soc. Polon. Math. 6 93 Sur la loi de probabilité de l’écart maximum
(51) Chistyakov VP 1964 Theor. Probab. Appl. 9 640 A theorem on sums of independent positive random variables and its applications to branching random processes
(52) Dynkin EB 1955 Izv. Akad. Nauk. SSSR Ser. Math. 19 247; 1961 Selected Translations Math. Stat. Prob. 1 171 Limit theorems for sums of independent random quantities
(53) Lamperti J 1958 Trans. Amer. Math. Soc. 88 380 An occupation time theorem for a class of stochastic processes
(54) Lamperti J 1961 Am. Math. Soc. 12 724 A contribution to renewal theory
(55) Höll M, Wang W and Barkai E arXiv:2006.06253 Extreme value theory for constrained physical systems

Condensation and extremes for a fluctuating number of independent random variables

Abstract

Keywords:

1 Introduction

2 Condensation for random allocation models and ZRP

2.1 General formalism

2.1.1 Statistical weight of a configuration

2.1.2 Distribution of the largest occupation

2.2 Phenomenology of condensation in the thermodynamic limit

2.2.1 Regimes for the single occupation distribution

2.2.2 Statistics of the largest summand in the condensed phase

An illustration

3 Phenomenon of total condensation when nn is kept fixed and L→∞L\to\infty

3.1 An illustration

3.2 Fluctuations of the condensate

3.3 A finer analysis

4 General statements on tied-down renewal processes

4.1 Joint distribution

4.2 Distribution of the number of intervals

4.3 Single interval distribution

Remarks

4.4 Mean interval ⟨X|L⟩\langle X|L\rangle

4.5 The longest interval

Remarks

5 Phase transition for tied-down renewal processes

5.1 Illustrative examples

5.2 Phase diagram

6 Disordered phase (w>1w>1) for tied-down renewal processes

7 Critical regime (w=1w=1) for tied-down renewal processes

7.1 Distribution f​(k)f(k) with index θ<1{\theta}<1

7.1.1 The number of intervals

7.1.2 Single interval distribution

7.1.3 The longest interval

7.2 Distribution f​(k)f(k) with index θ>1{\theta}>1

8 Condensed phase (w<1w<1) for tied-down renewal processes

8.1 Asymptotic estimates at large LL

8.2 Regimes for the single interval distribution

8.3 Regimes for the distribution of the longest interval

8.4 Discussion

9 General statements on free renewal processes

9.1 Joint distribution

9.2 Distribution of the number of intervals

9.3 Distribution of SNLS_{N_{L}}

9.4 Distribution of BLB_{L}

9.5 Single interval distribution

9.6 Mean interval ⟨X|L⟩\langle X|L\rangle

9.7 The longest interval

Remarks

10 Critical regime (w=1w=1) for free renewal processes

11 Condensed phase (w<1w<1) for free renewal processes

11.1 Asymptotic estimates at large LL

11.2 Regimes for the single interval distribution

11.3 Regimes for the distribution of BLB_{L}

11.4 Regimes for the distribution of the longest interval

11.5 Probability for the last interval to be the longest one

12 Conclusion

Acknowledgements.

Appendix

Appendix A On equation (3.18)

Appendix B Weight of the maximum in the left region for a Lévy 12\frac{1}{2} stable law

References

3 Phenomenon of total condensation when $n$ is kept fixed and $L\to\infty$

4.4 Mean interval $\langle X|L\rangle$

6 Disordered phase ( $w>1$ ) for tied-down renewal processes

7 Critical regime ( $w=1$ ) for tied-down renewal processes

7.1 Distribution $f(k)$ with index ${\theta}<1$

7.2 Distribution $f(k)$ with index ${\theta}>1$

8 Condensed phase ( $w<1$ ) for tied-down renewal processes

8.1 Asymptotic estimates at large $L$

9.3 Distribution of $S_{N_{L}}$

9.4 Distribution of $B_{L}$

9.6 Mean interval $\langle X|L\rangle$

10 Critical regime ( $w=1$ ) for free renewal processes

11 Condensed phase ( $w<1$ ) for free renewal processes

11.1 Asymptotic estimates at large $L$

11.3 Regimes for the distribution of $B_{L}$

Appendix B Weight of the maximum in the left region for a Lévy $\frac{1}{2}$ stable law