This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Large Deviations Theory of Increasing Returns

Simone Franchini and Riccardo Balzan Sapienza Università di Roma, Piazza A. Moro 1, 00185 Roma, Italy
Abstract

An influential theory of increasing returns has been proposed by the economist W. B. Arthur in the ’80s to explain the lock-in phenomenon between two competing commercial products. In the most simplified situation there are two competing products that gain customers according to a majority mechanism: each new customer arrives and asks which product they bought to a certain odd number of previous customers, and then buy the most shared product within this sample. It is known that one of these two companies reaches monopoly almost surely in the limit of infinite customers. Here we consider a generalization [G. Dosi, Y. Ermoliev, Y. Kaniovsky, J. Math. Econom. 23, 1–19 (1994)] where the new customer follows the indication of the sample with some probability, and buy the other product otherwise. Other than economy, this model can be reduced to the urn of Hill, Lane and Sudderth, and includes several models of physical interest as special cases, like the Elephant Random Walk, the Friedman’s urn and other generalized urn models. We provide a large deviation analysis of this model at the sample-path level, and give a formula that allows to find the most likely trajectories followed by the market share variable. Interestingly, in the parameter range where the lock-in phase is expected, we observe a whole region of convergence where the entropy cost is sub-linear. We also find a non-linear differential equation for the cumulant generating function of the market share variable, that can be studied with a suitable perturbations theory.

Part I Main results

I Introduction

It is known that certain economic markets - especially the technological ones - show increasing returns Arthur nature ; Arthur book ; Arthur , a positive feedback phenomenon where if a company gains some initial advantage (even small) is more likely to get even more in the future, eventually dominating the market share in the long run - this phase is also called lock-in into a monopolistic state. To understand the origin of this effect, a simplified market model has been introduced in the ’80s by the economist W. B. Arthur UM Arthur in the framework of its Increasing Returns theory (IRT) Arthur book ; Arthur . Let consider two competing companies that launch a new kind of product roughly at the same time (as practical example we could think about two smart phones in the early 2000s). Suppose that these products are roughly equivalent, such that there is no practical reason for choosing one over the other, we can imagine that a buyer will base his decision in part on personal opinions (personal tastes, ideologies, advertising, etc.) and in part on those of other people that already purchased one of the products. Then, let us consider a simplified situation in which the new customers are imperfectly informed about the products, so that they will make their choices by looking at the number of adopters who are already using it Arthur ; UM Arthur ; Dosi Ermoliev ; Dosi last . An alternative hypothesis that gives the same effect is to consider positive (or negative) externalities in adoption Dosi Ermoliev ; Dosi last . In both cases, we consider the additional rule that any new adopter will choose the technology used by the majority of the sample only with a certain probability, and the other technology otherwise Dosi Ermoliev ; Dosi last .

This scenario has been considered by G. Dosi, Y. Ermoliev and Y. Kaniovsky (DEK, 1994) Dosi Ermoliev , the proposed model is as follows: consider a binary vector that represents the individual choices of the customers,

XN:={XN,1,XN,2,,XN,N},X_{N}:=\{X_{N,1},\,X_{N,2},\,...\,,X_{N,N}\}, (1)

with XN,n{0,1}X_{N,n}\in\{0,1\} and NN potential size of the market. This vector represents the full history of the market evolution, from the first sell to full saturation, when the maximal number of customers is reached. The variable XN,nX_{N,n} represents the choice of the nn-th customer, we arbitrarily associate the value one to the first product and zero to the second. The total number of customers of the first product will therefore be

ΓN,n:=mnXN,m\Gamma_{N,n}:=\sum_{m\leq n}X_{N,m} (2)

the market share of the first product up to the nn-th customer is represented by the variable

xN,n:=ΓN,nn=1nmnXN,m.x_{N,n}:=\frac{\Gamma_{N,n}}{n}=\frac{1}{n}\sum_{m\leq n}X_{N,m}. (3)

Then, the choice of the next customer XN,n+1X_{N,n+1} is determined by the following rule: first, sample kk previous customers, where kk is an odd integer (this to avoid inconclusive outputs from the poll). Then, if the sample is found to have more customers that bought the first product, the variable XN,n+1X_{N,n+1} will be equal to one with a probability pp, and will be zero otherwise. On the other hand, if more customers owning the second product are found in the sample, XN,n+1X_{N,n+1} will be zero with probability pp, and one otherwise. Notice that the new customers follow the majority of the polled sample with probability pp, that in some sense quantifies the trust of the newcomers in the behavior of their predecessors: hereafter we will call pp trust parameter, although it may also reflect more practical constraints, such as a requirement for compatibility with the technology adopted by the polled customers. For p=1p=1 the DEK model describes a market where the customers always buy the product owned by the majority of the sample, i.e., the original version introduced by Arthur et al. (1983) and Arthur (1989) UM Arthur ; Arthur : some sample trajectories of this process for p=1p=1 and k=3k=3 are in Figure 2a of G. Dosi et al. (2017) Dosi last . Concerning the initial conditions, we will distinguish of two kinds: we introduce τ[0,1]\tau\in\left[0,1\right] the fraction of customers that made their choices already (market saturation parameter): in this paper we will consider an early start in the market at some fixed number of customers M<M<\infty, also called virgin market condition, that in the limit of infinite customers is equivalent to a debut in the market approximately at τ=0\tau=0 (and does not affect the LDT theory for NN\rightarrow\infty) and a late start M=τNM=\tau N (a product that enters in the market when the saturation is already macroscopic), that strongly influences the distribution of the final share also at the LDT level.

II Relation with HLS urns

In this paper we develop a Large Deviations theory (LDT) for the DEK model for any pp and kk by adapting results from the Hill Lane and Sudderth (HLS) urn model HLS1 ; HLS2 , a very general model for which a mathematically rigorous LDT has been recently developed Franchini , and that includes the DEK model as special case. An HLS urn process Pemantle review ; Mahmoud ; HLS1 ; HLS2 ; Pemantle Touch ; Franchini is a two color urn process controlled by a functional parameter π(x)\pi\left(x\right) that we call urn function (actually adoption function in Ref. Arthur ), where the new step XN,n+1X_{N,n+1} is one with probability π(xN,n)\pi\left(x_{N,n}\right) and zero otherwise. The relation between IRT and HLS urns is well known since the very beginning, in fact, this model has been introduced independently by HLS (1980) and then also by Arthur et al. (1983) within just three years. The urn function that describes the DEK model can be determined as follows: start with k=1k=1, the probability of extracting an owner of the first product is xx, then their total number will increase with probability

π1(x):=px+(1p)(1x)=(1p)+(2p1)x,\pi_{1}\left(x\right):=p\,x+\left(1-p\right)\left(1-x\right)=\left(1-p\right)+\left(2p-1\right)x, (4)

that is a linear urn function. In case k=3k=3: the probability of increasing the owners of the first product is that of extracting two positive and one negative, plus that of extracting three positive, that is Dosi Ermoliev

P3(x):=x3+3x2(1x)=3x22x3,P_{3}\left(x\right):=x^{3}+3x^{2}\left(1-x\right)=3x^{2}-2x^{3}, (5)

then, the corresponding urn function is Dosi Ermoliev

π3(x):=p(3x22x3)+(1p)(1(3x22x3))=(1p)+(2p1)(3x22x3)\pi_{3}\left(x\right):=p\,\left(3x^{2}-2x^{3}\right)+\left(1-p\right)\left(1-\left(3x^{2}-2x^{3}\right)\right)=\left(1-p\right)+\left(2p-1\right)\left(3x^{2}-2x^{3}\right) (6)

and cannot be reduced to the linear case k=1k=1. In general, the probability of finding a positive majority when extracting an odd number kk of steps is Dosi Ermoliev

Pk(x):=h>k/2k!h!(kh)!xh(1x)khP_{k}\left(x\right):=\sum_{h>k/2}\frac{k!}{h!\left(k-h\right)!}\,x^{h}\left(1-x\right)^{k-h} (7)

where the hh sum runs from (k+1)/2\left(k+1\right)/2 to kk. Follows that the urn function that describes a DEK model with k>2k>2 extractions per step is Dosi Ermoliev

πk(x):=pPk(x)+(1p)(1Pk(x))=(1p)+(2p1)Pk(x),\pi_{k}\left(x\right):=p\,P_{k}\left(x\right)+\left(1-p\right)\left(1-P_{k}\left(x\right)\right)=\left(1-p\right)+\left(2p-1\right)P_{k}\left(x\right), (8)

this is a kk-th degree polynomial, and is therefore non-linear for all non-trivial values of the trust parameter pp.

In case of a virgin market start, the convergence properties of the HLS urns with any continuous urn functions have been studied in HLS1 ; HLS2 ; UM Arthur ; Arthur ; Dosi Ermoliev ; Pemantle Touch ; Franchini , finding that the points of convergence of xN,Nx_{N,N} always belong to the set of solutions of

π(x)=x,\pi\left(x\right)=x, (9)

and that these solutions are stable only if the derivative of the urn function in those points is smaller than one, i.e., if the π(x)\pi\left(x\right) crosses xx from top to bottom (down-crossing). For the DEK model with k=1k=1, the urn function π1(x)\pi_{1}\left(x\right) crosses xx at 1/21/2 for any value of p<1p<1, and therefore 1/21/2 is the only possible point of convergence for the associated share xN,Nx_{N,N}, see Figure 1. This imply that xN,Nx_{N,N} converges to 1/21/2 almost surely

limNxN,N=1/2,a.s.\lim_{N\rightarrow\infty}x_{N,N}=1/2,\ a.s. (10)

for all values of p<1p<1 and of the initial condition xN,Mx_{N,M} a phase diagram for the DEK k=1k=1 is shown in Figure 3. This model does not show the lock-in phenomenon, although there is still a value of pp where the dynamics is expected to slow down (see Section VIII).

In the DEK with k>2k>2 one can see the appearance of the lock-in phase above some critical pcp_{c}. For k=3k=3 the Eq. (9) is a third degree equation, and can be solved with the well known formula. In general, we find tree solutions Dosi Ermoliev : x0=1/2x_{0}=1/2 and

2x±=1±6p52p1,2x_{\pm}=1\pm\sqrt{\frac{6p-5}{2p-1}}, (11)

the quantity inside the square root is positive for p1/2p\leq 1/2 and p5/6p\geq 5/6, but notice that x±[0,1]x_{\pm}\in\left[0,1\right] only if p[1/2,1]p\in\left[1/2,1\right], then, for pp below the critical value pc=5/6p_{c}=5/6 there is again a unique stable solution at 1/21/2 that crosses xx from top to bottom (down crossing), see Figure 2. Above pcp_{c} the function π\pi still crosses xx at the point 1/21/2, but it now does from bottom to top, i.e. it is an up-crossing and is therefore not stable. Notice that for p>pcp>p_{c} two new solutions x+x_{+} and xx_{-} appear, those are both down-crossings, and can be stable attractors for xN,Nx_{N,N}. Therefore, for pp above pcp_{c} there are two attractors separated by an unstable equilibrium point at x0=1/2x_{0}=1/2 Dosi Ermoliev . Notice that in the limit of infinite kk the probability of finding a majority of first product owners within the sample converges to

P(x):=θ(12x)P_{\infty}\left(x\right):=\theta\left(1-2x\right) (12)

ie, the urn function πk\pi_{k} converges to a step function

π(x):=(1p)+(2p1)θ(12x)\pi_{\infty}\left(x\right):=\left(1-p\right)+\left(2p-1\right)\theta\left(1-2x\right) (13)

that still crosses the diagonal at the point x0=1/2x_{0}=1/2 (from top to bottom) for p<pc=1/2p<p_{c}=1/2, and at x=px_{-}=p, x+=1px_{+}=1-p if the trust parameter is above pcp_{c}. Then also in the infinite kk limit there is a pcp_{c} above which we find the same region of the phase diagram that is observed for k=3k=3. In fact, the phase diagram shows the same structure for all k>2k>2, apart from different pcp_{c} and x±x_{\pm}. For this reason, we will concentrate our analysis to the cases k=1k=1 and k=3k=3.

Refer to caption
Figure 1: Example of linear urn function π1(x)\pi_{1}\left(x\right) for the DEK with k=1k=1, the memory parameter is p=5/8p=5/8. The urn function always down-crosses the diagonal at x0=1/2x_{0}=1/2, that is the only convergence point.
Refer to caption
Figure 2: Three examples of the urn function π3(x)\pi_{3}\left(x\right) for a generalized DEK with k=3k=3. The figure shows the urn functions for three non-trivial memory parameters, p2=19/24p_{2}=19/24, pc=5/6p_{c}=5/6 and p1=21/24p_{1}=21/24. Below pcp_{c} the urn function down-crosses the line xx at x0=1/2x_{0}=1/2, that is the only convergence point. For p>pcp>p_{c} the point x0x_{0} becomes an up-crossing (unstable equilibrium), and the urn function crosses the diagonal xx also in xx_{-} and x+x_{+}, that are both down-crossings and are the new stable attractors for the process xNx_{N}.

Summarizing, under virgin market condition the limit value of xN,Nx_{N,N} for p>pcp>p_{c} converges to the points xx_{-} and x+x_{+} almost surely for any initial condition xN,Mx_{N,M} with M<M<\infty (the phases for k=3k=3 are shown in the Figure 4) but since the urn functions that we are considering never touches zero or one at any x(0,1)x\in\left(0,1\right), for any initial condition xN,Mx_{N,M} that is fixed at MNM\ll N there is a strictly positive probability to reach the nearby of any other xx by gaining a finite number of customers at the beginning of the process, then in the limit NN\rightarrow\infty both points x±x_{\pm} carry some non-zero probability mass for any early start. Anyway, it can be shown that the probability mass of that point farther from xN,Mx_{N,M} will be exponentially suppressed as MM grows. Fixing the initial condition at some M=o(N)M=o\left(N\right) but still divergent in NN will suppress one of the two possibilities, and concentrate the probability mass in the attraction point x±x_{\pm} that is closest to the initial share xN,Mx_{N,M}. Concerning the case of late market entry at some M=τ0NM=\tau_{0}N, we discuss it in Section V, after introducing the optimal trajectories.

III Relation with other models

We remark that, apart from economic models, the theory of the HLS urns allows to put IRT in relation with many others interesting situations that can be embedded (or approximated) by this very general urn model: there is a number of computer science problems on preferential attachment, network growth etc. Pemantle review ; Mahmoud that can be studied in this framework. Here we list three that, in our opinion, are of special physical interest. Transfer of knowledge between these field would be certainly fruitful, and should be encouraged.

For example, the case k=1k=1 of the DEK is fully equivalent to another well known stochastic model, the Elephant Random Walk (ERW), a simple random walk where each new step is determined by selecting one of the previous, then going in the same direction with probability pp. This model appears to have been re-descovered independently by G. Schütz and S. Trimper in 2004 ERW shcutz trimper , ten years after the introduction of the DEK model, and has received much attention since then as a paradigmatic example of processes with long range memory. An important advancement in the understanding of this model was made in 2016, when E. Baur and J. Bertoin observed ERW UM Baur Berton that the ERW could be mapped exactly into a two color urn of the Friedman’s type (that is in fact equivalent to a linear HLS urn Pemantle review ; Mahmoud ; Franchini ) where at each time one ball is drawn from the urn, and then replaced together with a fixed numbers of new balls whose color depend on which was drawn. This finding allowed many quantities of interest to be studied from known results on these types of models, ERW UM Baur Berton ; Jack Harris however, this analogy cannot be extended to the k>2k>2 case.

Also, Jack Jack LD has identified the urn function describing an interesting irreversible growth model introduced by Klymko, Garrahan and Whitelam KGW ; KGGW , and also this model exhibits a lock-in phase with a sub-linear entropy region, that is similar to the k>2k>2 case of the DEK model Jack LD . In this perspective, it would be quite interesting to investigate also the universal HLS scaling for symmetric urn functions recently proposed by Nakayama et al. (2021) Kazuaki . Notice that in Jack (2019) Jack LD a non-rigorous but powerful LDT is presented for a large class of models, whose predictive power should be comparable to the rigorous LD techniques used in Franchini (2017), see also the interesting review Jack 2020 Jack LD-1 .

Finally, the HLS framework allows to relate the DEK model with the very classic Random Walk Range problem Huges ; Franchini Range ; Franchini Range Urns ; Franchini Range Line , that studies the number of different sites visited by a random walk on the lattice d\mathbb{Z}^{d}. This problem is important to polymer physics as it exhibits a coil-globule transition at some critical range density, and is in the same universality class of the Self-Avoiding Walk above that value Franchini Range . In Ref. Franchini Range Urns is shown that the Range problem can be exactly embedded in the HLS model for some non-linear urn function at any dd. For d=2,3d=2,3 a strongly non-linear urn function is observed (by numerical analysis), but for d4d\geq 4 the urn function gets surprisingly close to some linear function in the self-avoiding walk-like region of large range values, that would then be related to the DEK model with k=1k=1. This model also shows a sub-linear entropy region below some critical range, as can be deduced also from a very detailed analysis of the “moderate deviations” of the Wiener Sausage in the collapsed phase by M. van den Berg, E. Bolthausen, F. Den Hollander (2001) van den Berg . Interestingly, they find cases of non-homogeneous optimal trajectories with sub-linear entropy cost: we conjecture that in this collapsed region the range undergoes a mechanism similar to that observed in the lock-in phase (actually, the non-homogeneous zero-cost trajectories that are described in Corollary 7 of Ref. Franchini ).

Refer to caption
Figure 3: Phase diagram xx vs pp for the entropy density ϕ(x)\phi\left(x\right) of the DEK k=1k=1, the diagram is shown for p>1/2p>1/2. For all p<1p<1 the point x0=1/2x_{0}=1/2 is the only point of convergence for the density of black balls, although there still is a critical pp^{*} where the derivative of π(x)\pi\left(x\right) in x0x_{0} crosses the value 1/21/2, and the convergence of xNx_{N} is slowed according to the Pemantle mechanism Jack Harris ; Pemantle Touch ; Franchini , see Section X. The line x0=0x_{0}=0 is always a stable attractor for xNx_{N}, and the entropy is convex and strictly negative in the whole region, except at the critical line x=0x=0, where is zero. According to Eq. (9), there is a critical value at p=3/4p^{*}=3/4 at which the derivative of the urn function gets above 1/21/2: for p>pp>p^{*} there is a shape change in ϕ(x)\phi\left(x\right) in the neighborhood of x=1/2x=1/2.
Refer to caption
Figure 4: Phase diagram xx vs pp for the entropy density ϕ(x)\phi\left(x\right) of the generalized DEK k=3k=3. Above the critical value pc=5/6p_{c}=5/6 the point x0=0x_{0}=0 becomes an unstable equilibrium, and two new symmetric attractors arise according to Eq. (11). In the white colored region we still find a convex and negative ϕ(x)\phi\left(x\right), except on the critical line, but notice that a new region appeared above pc=5/6p_{c}=5/6, highlighted in darker shade, where ϕ(x)=0\phi\left(x\right)=0, i.e. the entropy is sub linear in NN. The shape of ϕ(x)\phi\left(x\right) near the critical line is similar to the case k=1k=1 for p<pcp<p_{c}, except that here the point p=2/3p^{*}=2/3 at which the derivative of the urn function rise above 1/21/2. Below pcp_{c} the derivative of π\pi in x0x_{0} is increasing in pp, and pp^{*}is the value at which crosses the value 1/21/2 (from below). On the contrary, above pcp_{c} the derivative of π\pi in x±x_{\pm} decreases in pp, and crosses the value 1/21/2 (from above) at p=11/12p^{**}=11/12: when the derivative of the associated urn function in y±y_{\pm} goes back below 1/21/2 the shape of ϕ(x)\phi\left(x\right) changes in the right (left) neighborhood of xx_{-}(x+)x_{+}). See also Figure 2.

IV Zero-cost trajectories

We perform a Large Deviations analysis for the HLS model at the sample-path level, and adapt it to find the most probable trajectories taken by the DEK model. Let τ[0,1]\tau\in\left[0,1\right] be the level of market saturation (or the fraction of customers that already made their choice): the optimal trajectories, that we indicate with the symbol uu, are the scaling limit for n/Nτn/N\rightarrow\tau of the most likely trajectories followed by the share variable xN,nx_{N,n} of the first product to reach some given final share xx. These can be obtained by solving the variational problem that is presentend in Section VI (see also Theorem 4 of Ref. Franchini for a full mathematical derivation). Most interesting, we will show that for any initial condition with positive saturation τ0>0\tau_{0}>0 the scaling limit of the trajectory taken by the share variable

limNxN,τN=:u(τ)\lim_{N\rightarrow\infty}x_{N,\left\lfloor\tau N\right\rfloor}=:u\left(\tau\right) (14)

is non-degenerate for any starting share u0[0,1]u_{0}\in\left[0,1\right], and can be found by inverting the following integral:

τ(u)=τ0expu0udαπ(α)α.\tau\left(u\right)=\tau_{0}\,\exp\int_{u_{0}}^{u}\frac{d\alpha}{\pi\left(\alpha\right)-\alpha}. (15)

A crucial quantity of our analysis will be the scaling limit of the entropy (logarithm of the probability) of xN,Nx_{N,N} converging to some given xx. Define the asymptotic limit of the entropy per customer (hereafter we will call it entropy density)

ϕ(x):=limN1NlogP(xN,N=xN/N),\phi\left(x\right):=-\lim_{N\rightarrow\infty}\frac{1}{N}\log P\left(x_{N,N}=\left\lfloor xN\right\rfloor/N\right), (16)

informally, this is the scaling limit of the entropy respect to the total number of customers, i.e. for a large number of customers the probability of reaching a share xx is proportional to

P(xN,N=xN/N)exp(Nϕ(x)).P\left(x_{N,N}=\left\lfloor xN\right\rfloor/N\right)\sim\exp\left(-N\phi\left(x\right)\right). (17)

In the Section VI we will show that the shape of the limit entropy density ϕ\phi can be linked to the trajectories taken by the number of customers of the first product ΓN,n\Gamma_{N,n} to reach its final value ΓN,N=xN\Gamma_{N,N}=\left\lfloor xN\right\rfloor. These trajectories, that we indicate with the symbol φ\varphi, are the scaling limit n/Nτn/N\rightarrow\tau for the number of customers of the first product, rescaled with the total number of customers NN,

limNΓN,τN/N=:φ(τ)\lim_{N\rightarrow\infty}\Gamma_{N,\left\lfloor\tau N\right\rfloor}/N=:\varphi\left(\tau\right) (18)

this is related to the scaling limit of the share by the formula

u(τ)=φ(τ)/τ.u\left(\tau\right)=\varphi\left(\tau\right)/\tau. (19)

In Theorem 4 of Ref. Franchini (see Section VI of the present paper for an informal derivation) it is shown that the limit entropy density of any HLS urn model with α\alpha-Hölder urn function π\pi is obtained trough the following LD principle: let C1([0,1])C_{1}\left(\left[0,1\right]\right) be the set of absolutely continuous function on [0,1]\left[0,1\right] (essentially, such that the derivative exists almost everywhere) and let QC1([0,1])Q\subset C_{1}\left(\left[0,1\right]\right) the subset of those functions with initial value zero, and such that their derivative is positive but smaller than one (11-Lipschitz function),

Q:={φC1([0,1]):τφ(τ)[0,1],φ(0)=0},Q:=\{\varphi\in C_{1}\left(\left[0,1\right]\right):\,\partial_{\tau}\varphi\left(\tau\right)\in\left[0,1\right],\,\varphi\left(0\right)=0\}, (20)

also, let Q(x)Q\left(x\right) be the subset with final value xx,

Q(x):={φQ:φ(1)=x}.Q\left(x\right):=\{\varphi\in Q:\,\varphi\left(1\right)=x\}. (21)

Also, define the auxiliary function

L(α,β):=αlog(β/α)+(1α)log((1β)/(1α)),L\left(\alpha,\beta\right):=\alpha\log\left(\beta/\alpha\right)+\left(1-\alpha\right)\log\left(\left(1-\beta\right)/\left(1-\alpha\right)\right), (22)

then, the entropy density ϕ(x)\phi\left(x\right) can be computed by solving the following variational problem:

ϕ(x)=infφQ(x)I(φ),\phi\left(x\right)=\inf_{\varphi\in Q\left(x\right)}I\left(\varphi\right), (23)

with rate function defined as follows:

I(φ):=01𝑑τL(τφ(τ),π(φ(τ)/τ)).I\left(\varphi\right):=-\int_{0}^{1}d\tau\,L\left(\partial_{\tau}\varphi\left(\tau\right),\pi\left(\varphi\left(\tau\right)/\tau\right)\right). (24)

From this general result we where able to deduce a method to identify those trajectories followed by the process xN,nx_{N,n} that have a sub-linear entropy cost, i.e., such that the entropy cost of following the trajectory u(τ)u\left(\tau\right) is of the order o(N)o\left(N\right): it implies that the probability of following such a trajectory decays sub-exponentially in the number of customers (actually as a power law, see in Section VII), and not exponentially fast as for those with a cost linear in NN. Hereafter we will improperly call these the zero-cost trajectories, although their absolute entropy cost is not exactly zero. It is shown that these zero-cost trajectories can be deduced from the variational problem in Eq. (23), with the additional constraint that the Lagrangian of the Eq. (24) is exactly zero. In Corollary 6 of the Ref. Franchini explicit formulas are derived for those optimal trajectories ending in the region where ϕ(x)=0\phi\left(x\right)=0. Since for the HLS model the function LL is a negative concave function, the condition I(φ)=0I\left(\varphi\right)=0 implies that the trajectory φ\varphi satisfies the equation

L(τφ(τ),π(φ(τ)/τ))=0,L(\partial_{\tau}\varphi\left(\tau\right),\pi\left(\varphi\left(\tau\right)/\tau\right))=0, (25)

if this condition can be explicited in the variable τφ(τ)\partial_{\tau}\varphi\left(\tau\right), then it provides the differential equation for the zero-cost trajectories. Remarkably, since L(α,β)=0L(\alpha,\beta)=0 if and only if α=β\alpha=\beta, then the condition before reduces to the autonomous equation

τφ(τ)=π(φ(τ)/τ),\partial_{\tau}\varphi\left(\tau\right)=\pi\left(\varphi\left(\tau\right)/\tau\right), (26)

with final condition φ(1)=x\varphi\left(1\right)=x. Applying the substitution in Eq. (19) we obtain the equation for the scaling of the share,

τu(τ)π(u(τ))u(τ)=1τ\frac{\partial_{\tau}u\left(\tau\right)}{\pi\left(u\left(\tau\right)\right)-u\left(\tau\right)}=\frac{1}{\tau} (27)

with final condition u(1)=xu\left(1\right)=x. This equation can be integrated exactly, in the end one finds that the trajectories u(τ)u\left(\tau\right) can be computed in implicit form: the result is a simple formula,

Π(u)Π(x)=logτ(u),\Pi\left(u\right)-\Pi\left(x\right)=\log\tau\left(u\right), (28)

where Π\Pi is the primitive function (indefinite integral) of the reciprocal of π(α)α\pi\left(\alpha\right)-\alpha, that is

Π(α):=dαπ(α)α.\Pi\left(\alpha\right):=\int\frac{d\alpha}{\pi\left(\alpha\right)-\alpha}. (29)

We can formally invert the formula of τ(u)\tau\left(u\right) before, and write the equation for the zero-cost trajectories as follows: let Π1\Pi^{-1} the inverse function of Π\Pi, then

u(τ)=Π1(Π(x)+logτ).u\left(\tau\right)=\Pi^{-1}\left(\Pi\left(x\right)+\log\tau\right). (30)

The first important remark about this formula is that it allows to extend the convergence theory of HLS urns also in case of a late start in the market: let τ0\tau_{0} be the level of market saturation, and let u0u_{0} be the initial share (for a firm entering in the market at τ0\tau_{0} we would have u0=0u_{0}=0), then, it can be shown by inverting Eq. (30) that for any positive τ0\tau_{0} there is a unique point

limNxN,N=x(u0,τ0)a.s.\lim_{N\rightarrow\infty}x_{N,N}=x\left(u_{0},\tau_{0}\right)\ \ a.s. (31)

where the final share xN,Nx_{N,N} converges almost surely,

x(u0,τ0):=Π1(Π(u0)log(τ0)),x\left(u_{0},\tau_{0}\right):=\Pi^{-1}\left(\Pi\left(u_{0}\right)-\log\left(\tau_{0}\right)\right), (32)

it can be shown that the convergence points found before for the virgin market case are recovered in the limit τ00\tau_{0}\rightarrow 0.

In general, these results about the optimal trajectories could be useful to confront with (and then eventually fit) trajectories followed by real datasets in those cases where both the time series of the share and the saturation are known for the considered market Dosi last . In this respect, it is important to realize that the market saturation is not a time variable: the process describes the competition between firms, but does not need to specify the underlying market grow. Let n(t)n\left(t\right) be the total number of customers up to time tt, growing according to some law in such way that the limit market size is finite and equal to NN. Then, the share of the first product up to time tt would be x(t)=xN,n(t)x\left(t\right)=x_{N,n\left(t\right)}, that can be confronted with the predicted scaling limit

limNx(t(τ))=u(τ).\lim_{N\rightarrow\infty}x\left(t\left(\tau\right)\right)=u\left(\tau\right). (33)

by plotting x(t)x\left(t\right) in function of the saturation τ(t)=n(t)/N\tau\left(t\right)=n\left(t\right)/N. We also remark that the Eq. (28) holds for any α\alpha-Hölder urn function at least, and can be applied out of the box to more advanced IR models that can still be embedded in the HLS urn model, like those considered in Dosi Ermoliev ; Dosi Kaniovsky . For example, we could have considered a market model where multiple products are present, as far as we follow the market share of only one of them (say the first one). If the customers follow the majority of the polled sample with a probability pp, and buy at random one of the r>0r>0 available products with probability 1p1-p, the probability that the product is purchased would have been

πk(x):=pPk(x)+(1p)/r,\pi_{k}^{*}\left(x\right):=p\,P_{k}\left(x\right)+\left(1-p\right)/r, (34)

although there are differences in the convergence properties (for example this new function is not symmetric around 1/21/2 and the autocorrelation scaling of Ref. Kazuaki may not hold) it is still possible to repeat the same LD analysis, and find a similar phase structure - this model will be discussed in detail elsewhere. Moreover, the LDT techniques shown in Sections VI and IV goes beyond the HLS model, and could be adapted to find trajectories of processes that are not directly embedded in the HLS model, such as the one presented in the Ref. Dosi last , where also the possibility of losing customers is considered. It should be also possible to extend the LDT to time-dependent urn functions that varies on a time scale O(N)O\left(N\right), maybe by considering a partition of the range of τ\tau into small subintervals where the urn function can be approximated as a constant and then apply the equations given before. We expect that even some quenched disordered versions of these models may be studied, either by combining with the Replica Symmetry Breaking theory PMV RSB or the kernel methods of Ref. KERNEL THEO .

V Trajectories of the DEK model

We explicitly write trajectories in closed form for the cases k=1k=1 and k=3k=3, distinguishing between those trajectories for which u(0)[0,1]u\left(0\right)\in\left[0,1\right], the only possible for a virgin market start, from those crossing the boundary values at some positive saturation (ie, u(τ){0,1}u\left(\tau\right)\in\left\{0,1\right\} for some τ>0\tau>0), that can have zero cost only in the τ0>0\tau_{0}>0 case. For the DEK model with k=1k=1 we can evaluate the integral that defines Π1\Pi_{1}:

uydαπ1(α)α=11puxdα12α=12(1p)log(ux0xx0),\int_{u}^{y}\frac{d\alpha}{\pi_{1}\left(\alpha\right)-\alpha}=\frac{1}{1-p}\int_{u}^{x}\frac{d\alpha}{1-2\alpha}=\frac{1}{2\left(1-p\right)}\log\left(\frac{u-x_{0}}{x-x_{0}}\right), (35)

this formula can be easily inverted, then from Eq. (30) we find the equation for the trajectories

u(τ)=x0+(xx0)/τ 2(1p),u\left(\tau\right)=x_{0}+\left(x-x_{0}\right)/\tau^{\,2\left(1-p\right)}, (36)

for xx0x\neq x_{0} these trajectories diverge for any p>0p>0 when τ0\tau\rightarrow 0, while for x=x0x=x_{0} a unique non-divergent trajectory exists for all pp, and is u(τ)=x0u\left(\tau\right)=x_{0}. Notice that in the limit of perfect trust p=1p=1, that is equivalent to the classic Polya Urn Model, each u(τ)=xu\left(\tau\right)=x becomes a non divergent zero-cost trajectory for any share value xx. For both late and early starts, we find that the share xN,nx_{N,n} always follows a single zero-cost trajectory, that is therefore optimal. For a virgin market, this trajectory is u(τ)=x0u\left(\tau\right)=x_{0} and is independent from the initial share, for late start we find the general convergence point

x(u0,τ0)=x0+(u0x0)τ0 2(1p).x\left(u_{0},\tau_{0}\right)=x_{0}+\left(u_{0}-x_{0}\right)\tau_{0}^{\,2\left(1-p\right)}. (37)

This implies that for any initial condition, either early or late, the entropy density of the DEK model with k=1k=1 is zero only at the critical value x0=1/2x_{0}=1/2 (and strictly negative otherwise) for p<1p<1, while for p=1p=1 the entropy density is ϕ(x)=0\phi\left(x\right)=0 at any point x[0,1]x\in\left[0,1\right], as is expected for the Polya Model Mahmoud .

Most interesting to the IRT is the case k=3k=3 with p>pcp>p_{c}, where the lock-in phenomenon is possible: the general picture below pc=5/6p_{c}=5/6 is qualitatively the same that is found in the k=1k=1 case, but above pcp_{c} and for virgin market start we observe a whole region [x,x+][x_{-},x_{+}] where the trajectories have sub linear entropy cost, although only u(τ)=x±u\left(\tau\right)=x_{\pm} are really optimal. Let compute the trajectories for k=3k=3: also in this case the integral can be evaluated exactly, define the parameters

Δ:=6p5,Λ2:=Δ4(2p1),\Delta:=6p-5,\ \ \Lambda^{2}:=\frac{\Delta}{4\left(2p-1\right)}, (38)

the integral for Π3\Pi_{3} can be found via computer algebra,

uxdαπ3(α)α=1Δlog(1Λ2/(ux0)21Λ2/(xx0)2),\int_{u}^{x}\frac{d\alpha}{\pi_{3}\left(\alpha\right)-\alpha}=\frac{1}{\Delta}\log\left(\frac{1-\Lambda^{2}/\left(u-x_{0}\right)^{2}}{1-\Lambda^{2}/\left(x-x_{0}\right)^{2}}\right), (39)

let introduce the xx-dependent coefficient

ρ(x):=Λ2/(xx0)21,\rho\left(x\right):=\Lambda^{2}/\left(x-x_{0}\right)^{2}-1, (40)

we can invert the Eq. (39), and compute the equation for the zero-cost trajectories also in the case k=3k=3

u(τ)=x0±Λ21+ρ(x)/τΔ,u\left(\tau\right)=x_{0}\pm\sqrt{\frac{\Lambda^{2}}{1+\rho\left(x\right)/\tau^{\,\Delta}}}, (41)

where the plus and minus depends on weather the parameter xx lies above or below x0=1/2x_{0}=1/2. Also in this case, for x=x0x=x_{0} there is a zero-cost trajectory for any pp, in fact, for this value ρ\rho diverges, and the trajectory is u(τ)=1/2u\left(\tau\right)=1/2. For xx0x\neq x_{0} we have to look weather the sign of Λ2\Lambda^{2} is positive or not, and we can see from Eq. (38) that when p>pc=5/6p>p_{c}=5/6 both Δ\Delta and Λ2\Lambda^{2} are indeed positive quantities. This implies that τΔ\tau^{\Delta} converges to zero when also τ\tau does, then the u(τ)x0u\left(\tau\right)-x_{0} converges to zero at the admissible point u(0)=1/2u\left(0\right)=1/2. Finally, notice that ρ(x)\rho\left(x\right) also must be positive, otherwise the formula inside the radical would become negative for some τ>0\tau>0: then, any admissible trajectory requires the further condition xx0[Λ,Λ],x-x_{0}\in\left[-\Lambda,\,\Lambda\right], by confronting Eq. (38) with Eq. (11) we can readily see that, as expected, Λ\Lambda is equal to half distance between the convergence points |x±x0|\left|x_{\pm}-x_{0}\right|. Then, the condition reduces to x[x,x+]x\in\left[x_{-},x_{+}\right], implying that a non divergent zero-cost trajectory exists for any xx lying between the convergence points. On the other hand, in the case of a late start with an initial share u(τ0)=u0u\left(\tau_{0}\right)=u_{0} at some initial saturation τ0>0\tau_{0}>0 there is always a unique zero-cost trajectory emanating from u0u_{0} and ending in

x(u0,τ0)=x0±Λ21+ρ(u0)τ0Δ,x\left(u_{0},\tau_{0}\right)=x_{0}\pm\sqrt{\frac{\Lambda^{2}}{1+\rho\left(u_{0}\right)\tau_{0}^{\,\Delta}}}, (42)

that is also optimal. In Figures (5) and (6) a simulation of the DEK model in the lock-in phase is shown for different initial conditions, and confronted with its predicted trajectory. See also Figure 1 and 2a of G. Dosi et al. (2019) Dosi last with the Figure 2.1 of Franchini (2017) Franchini for the zero-cost trajectories of the seminal model with p=1p=1 by Arthur et al. with virgin market initial conditions.

Refer to caption
Figure 5: Zero-cost trajectories (the background lines in light gray) of the DEK model for parameters k=3k=3, p=21/24>pcp=21/24>p_{c}, size N=214N=2^{14}, early start at τ0=2/N\tau_{0}=2/N, and initial condition u0{0, 1/2}u_{0}\in\left\{0,\,1/2\right\}. For an early start in the market at some negligible saturation τ0=o(1)\tau_{0}=o\left(1\right) the trajectories of the simulated process are scattered around the equilibrium points x+x_{+} and xx_{-} at the very beginning of the process and then progressively stabilize on some zero-cost trajectory. The figure shows 2525 realizations with initial conditions XN,2=0X_{N,2}=0 red (gray) lines, XN,2=1X_{N,2}=1 black lines: notice that some processes starting from zero where able to reach the nearby of x+x_{+} anyway.
Refer to caption
Figure 6: Zero-cost trajectories (background lines in light gray, same of Figure 5) of the DEK model for k=3k=3, p=21/24>pcp=21/24>p_{c}, size N=214N=2^{14}, late start at τ0=1/13\tau_{0}=1/13, and initial condition u0{0, 1/2}u_{0}\in\left\{0,\,1/2\right\}. As one can see, in case of a late start at τ0=O(1)\tau_{0}=O\left(1\right) the initial conditions become relevant, and the process follows a single trajectory u(τ)u\left(\tau\right) emanating from the initial condition at (u0,τ0)(u_{0},\tau_{0}). The figure shows 2525 realizations with initial conditions XN,2=0X_{N,2}=0 red (gray) lines, XN,2=1X_{N,2}=1 black lines: the late start removes the degeneracy of the trajectories.

Part II Methods

VI Large deviations

The variational problem shown in Eq. (23) is deduced from two central results of LDT, the Varadhan Integral Lemma and the Mogulskii theorem (see the recent paper by Touchette Touchette for an introductory presentation, Pham for some applications to economy, or the very detailed book by A. Dembo and O. Zeitouni Dembo Zeitouni for a mathematical review). Now, instead of considering the event xN,N=xN/Nx_{N,N}=\left\lfloor xN\right\rfloor/N, let first study the simpler situation

Ω:={XN{0,1}N:xN,N[α,β]},\Omega:=\{X_{N}\in\left\{0,1\right\}^{N}:\,x_{N,N}\in\left[\alpha,\beta\right]\}, (43)

where the sample paths end in the interval [α,β]\left[\alpha,\beta\right] that contains xx. The limit entropy density of such event is

ϕ(α,β):=limN1NlogP(XΩ).\phi\left(\alpha,\beta\right):=-\lim_{N\rightarrow\infty}\frac{1}{N}\log P\left(X\in\Omega\right). (44)

The starting point is the formula for the probability mass of a sample trajectory. Let

YN={YN,1,YN,2,,YN,N}Y_{N}=\{Y_{N,1},\,Y_{N,2},\,...\,,\,Y_{N,N}\} (45)

with YN,n{0,1}Y_{N,n}\in\{0,1\} be a possible path, hereafter sample path, of the process XNX_{N}, then, its probability mass P(XN=YN)P\left(X_{N}=Y_{N}\right) according to the measure PP is given by the formula

P(XN=YN)=nNπ(yN,n)YN,n(1π(yN,n))1YN,n.P\left(X_{N}=Y_{N}\right)=\prod_{n\leq N}\pi\left({\textstyle y_{N,n}}\right)^{Y_{N,n}}\left(1-\pi\left(y_{N,n}\right)\right)^{1-Y_{N,n}}. (46)

From here we define the entropy density of the path:

S(YN):=1NlogP(XN=YN),S^{*}\left(Y_{N}\right):=-\frac{1}{N}\log P\left(X_{N}=Y_{N}\right), (47)

introducing the auxiliary function

H(α,β):=αlogβ+(1α)log(1β)H\left(\alpha,\beta\right):=\alpha\log\beta+\left(1-\alpha\right)\log\left(1-\beta\right) (48)

the entropy density before can be rewritten as

S(YN)=1NnNH(YN,n,π(yN,n)).S^{*}\left(Y_{N}\right)=-\frac{1}{N}\sum_{n\leq N}H\left(Y_{N,n},\pi\left({\textstyle y_{N,n}}\right)\right). (49)

It will be useful to introduce a notation for the average respect to the measure PP

Ef(XN):=YN{0,1}NP(XN=YN)f(YN),Ef\left(X_{N}\right):=\sum_{Y_{N}\in\left\{0,1\right\}^{N}}P\left(X_{N}=Y_{N}\right)f\left(Y_{N}\right), (50)

in this notation the probability mass of the event Ω\Omega is

P(XNΩ)=EI(XNΩ)P\left(X_{N}\in\Omega\right)=EI\left(X_{N}\in\Omega\right) (51)

We perform a change of measure

P(XNΩ)=YNΩP(XN=YN)=YN{0,1}NP(XN=YN)I(YNΩ)=YN{0,1}NeNS(YN)I(YNΩ).P\left(X_{N}\in\Omega\right)=\sum_{Y_{N}\in\Omega}P\left(X_{N}=Y_{N}\right)=\sum_{Y_{N}\in\left\{0,1\right\}^{N}}P\left(X_{N}=Y_{N}\right)I\left(Y_{N}\in\Omega\right)=\sum_{Y_{N}\in\left\{0,1\right\}^{N}}e^{-NS^{*}\left(Y_{N}\right)}I\left(Y_{N}\in\Omega\right). (52)

such that the probability of Ω\Omega can be represented as follows:

P(XNΩ)=2NE0eNS(YN)I(YNΩ),P\left(X_{N}\in\Omega\right)=2^{N}E_{0}\,e^{-NS^{*}\left(Y_{N}\right)}I\left(Y_{N}\in\Omega\right), (53)

where E0E_{0} is the average according to the uniform measure

E0f(XN):=12NXN{0,1}Nf(XN)E_{0}f\left(X_{N}\right):=\frac{1}{2^{N}}\sum_{X_{N}\in\left\{0,1\right\}^{N}}f\left(X_{N}\right) (54)

ie, the measure of a binary random walk.

The next step is to construct a continuous interpolation for the path YNY_{N}, we introduce the function

φ:={(τN/N)yN,τN+(ττN/N)YN,τN:τ[0,1]},\varphi:=\{\,\left(\left\lfloor\tau N\right\rfloor/N\right)\,y_{N,\left\lfloor\tau N\right\rfloor}+(\tau-\left\lfloor\tau N\right\rfloor/N)\,Y_{N,\left\lfloor\tau N\right\rfloor}:\,\tau\in\left[0,1\right]\}, (55)

so that the probability of the sample path can be represented in terms of φ\varphi. The interpolated trajectories are supported by

Q(Ω):={φQ:YNΩ}.Q\left(\Omega\right):=\{\,\varphi\in Q:\,Y_{N}\in\Omega\}. (56)

It can be shown that SS^{*} admits a continuous representation. This representation can be informally derived by changing the sum in Eq. (49) into an integral

1NnN01𝑑τ\frac{1}{N}\sum_{n\leq N}\rightarrow\int_{0}^{1}d\tau (57)

and apply the proper scaling to the arguments of HH, i.e.

YN,nτφ(τ),π(yN,n)π(φ(τ)/τ).Y_{N,n}\rightarrow\partial_{\tau}\varphi\left(\tau\right),\ \ \pi\left({\textstyle y_{N,n}}\right)\rightarrow\pi\left(\varphi\left(\tau\right)/\tau\right). (58)

Applying these substitutions we obtain the following entropy functional that approximate SS^{*}:

S(φ):=01𝑑τH(τφ(τ),π(φ(τ)/τ)).S\left(\varphi\right):=-\int_{0}^{1}d\tau\,H\left(\partial_{\tau}\varphi\left(\tau\right),\pi\left(\varphi\left(\tau\right)/\tau\right)\right). (59)

It can be shown that if π(0,1)\pi\in\left(0,1\right) this functional is continuous respect to the sup norm

φη:=supτ[0,1]|φ(τ)η(τ)|\left\|\varphi-\eta\right\|:=\sup_{\tau\in\left[0,1\right]}\left|\varphi\left(\tau\right)-\eta\left(\tau\right)\right| (60)

ie, is such that if φ\varphi converges to η\eta in sup norm then also

|S(φ)S(η)|0.\left|S\left(\varphi\right)-S\left(\eta\right)\right|\rightarrow 0. (61)

In Ref. Franchini it is actually shown that

limN|S(YN)S(φN)|=0,\lim_{N\rightarrow\infty}\left|S^{*}\left(Y_{N}\right)-S\left(\varphi_{N}\right)\right|=0, (62)

then, if SS is continuous in the large NN limit holds

log2ϕ(α,β)=limN1NlogE0eNS(YN)I(YNΩ)=limN1NlogE0eNS(φ)I(φQ(Ω)).\log 2-\phi\left(\alpha,\beta\right)=\lim_{N\rightarrow\infty}\frac{1}{N}\log E_{0}\,e^{-NS^{*}\left(Y_{N}\right)}I\left(Y_{N}\in\Omega\right)=\lim_{N\rightarrow\infty}\frac{1}{N}\log E_{0}\,e^{-NS\left(\varphi\right)}I\left(\varphi\in Q\left(\Omega\right)\right). (63)

This is enough to compute the rate function from Varadhan Integral Lemma Dembo Zeitouni . Informally, this theorem can be seen as a rigorous functional version of the well known saddle-point method. From Lemmas 4.3.2 and 4.3.4 of the book by Dembo and Zeitouni Dembo Zeitouni we obtain

limN1NlogE0eNS(φ)I(φQ(Ω))=infφQ(α,β){S(φ)S0(φ)}\lim_{N\rightarrow\infty}\frac{1}{N}\log E_{0}e^{-NS\left(\varphi\right)}I\left(\varphi\in Q\left(\Omega\right)\right)=-\inf_{\varphi\in Q\left(\alpha,\beta\right)}\left\{S\left(\varphi\right)-S_{0}\left(\varphi\right)\right\} (64)

where Q(α,β)Q\left(\alpha,\beta\right) is the limit of the set Q(Ω)Q\left(\Omega\right), i.e.

limNQ(Ω)=Q(α,β)=γ[α,β]Q(γ),\lim_{N\rightarrow\infty}Q\left(\Omega\right)=Q\left(\alpha,\beta\right)=\bigcup_{\gamma\in\left[\alpha,\beta\right]}Q\left(\gamma\right), (65)

and S0S_{0} the rate function of a simple random walk with binary steps, in our context would be the case p=1/2p=1/2.

The rate function S0S_{0} is provided by the Mogulskii Theorem Dembo Zeitouni : it states that the rate function of any process where the increments form an i.i.d. sequence is given by

S0(φ)=01𝑑τM(τφ(τ)).S_{0}\left(\varphi\right)=-\int_{0}^{1}d\tau\,M\left(\partial_{\tau}\varphi\left(\tau\right)\right). (66)

where MM is the Legendre transform

M(α):=infβ[0,){αβζ(β)},M\left(\alpha\right):=\inf_{\beta\in\left[0,\infty\right)}\left\{\alpha\beta-\zeta\left(\beta\right)\right\}, (67)

of the moment generating function of the increments

ζ(β):=E0exp(βYN,1),\zeta\left(\beta\right):=E_{0}\exp\left(\beta Y_{N,1}\right), (68)

in case of coin-flip distributed binary variables:

E0exp(βYN,1)=12YN,1{0,1}exp(βYN,1)=1+eβ2.E_{0}\exp\left(\beta Y_{N,1}\right)=\frac{1}{2}\sum_{Y_{N,1}\in\left\{0,1\right\}}\exp\left(\beta Y_{N,1}\right)=\frac{1+e^{\,\beta}}{2}. (69)

Applying the Legendre transform, and following the Mogulskii Theorem Franchini ; Dembo Zeitouni , we find:

S0(φ)=log2+J(φ),S_{0}\left(\varphi\right)=-\log 2+J\left(\varphi\right), (70)

where the functional JJ is defined

J(φ):=01𝑑τH(τφ(τ),τφ(τ)),J\left(\varphi\right):=-\int_{0}^{1}d\tau H\left(\partial_{\tau}\varphi\left(\tau\right),\partial_{\tau}\varphi\left(\tau\right)\right), (71)

for any absolutely continuous φQ\varphi\in Q, and is -\infty otherwise, i.e. those trajectories that are not absolutely continuous have zero probability mass (and can be ignored). In the end it is found Franchini that the rate function is equal to

I(φ)=J(φ)S(φ),I\left(\varphi\right)=J\left(\varphi\right)-S\left(\varphi\right), (72)

Noticing that

L(α,β)=H(α,β)H(α,α)L\left(\alpha,\beta\right)=H\left(\alpha,\beta\right)-H\left(\alpha,\alpha\right) (73)

we arrive to the rate function as presented in Eq. (24).

We remark that Eq. (23) cannot be deduced by contraction principle, because the internal part of Q(x)Q\left(x\right) (the set minus its boundary) is void, and then cannot be a continuity set for the rate function I(φ)I\left(\varphi\right). Some additional arguments would then be necessary to rigorously prove this result, where we apply the contraction principle to the mass of Q(α,β)Q\left(\alpha,\beta\right), and then show that is possible to take α,βx\alpha,\beta\rightarrow x.

This proof is rather technical and we do not need to discuss it here, the interested readers can find it in the proof section of the Ref. Franchini . Also, notice that the requirement that π(0,1)\pi\in\left(0,1\right) is not fulfilled if p=1p=1, in Ref. Franchini a special surgery on the set QQ is performed to a priori exclude the problematic trajectories and extend the result to the general case π[0,1]\pi\in\left[0,1\right].

VII Scaling of the entropy inside the sub-linear region

For a late market start at saturation τ0>0\tau_{0}>0 and initial share u0u_{0} we have shown that the process follows a well defined trajectory if NN is large enough, with a single convergence point x(u0,τ0)x\left(u_{0},\tau_{0}\right) where the ϕ\phi is zero. The scaling of the entropy can be deduced by noticing that this is the unique concentration point of the process, then the probability mass of its nearby should be O(1)O\left(1\right): since for finite NN and any finite nearby of x(u0,τ0)x\left(u_{0},\tau_{0}\right) the mass must be distributed between a number of possible share values that is of order O(N)O\left(N\right), we expect that at the concentration points the mass decays with some power of NN. This reasoning suggests that in the sub-linear region the entropy of the trajectory is logarithmic in the number of potential customers, let define the sub-linear scaling

ϕ(x):=limN1logNlogP(xN,N=xN/N)\phi^{*}\left(x\right):=-\lim_{N\rightarrow\infty}\,\frac{1}{\log N}\log P\left(x_{N,N}=\left\lfloor xN\right\rfloor/N\right) (74)

since there is a is unique concentration point, for any τ0>0\tau_{0}>0 we can expect a monovariate probability mass function, then the predicted sub-linear scaling would be divergent for any xx different from x(u0,τ0)x\left(u_{0},\tau_{0}\right) and equal to some positive constant otherwise. On the contrary, in the case of a virgin market start at τ0=0\tau_{0}=0 and for p>pcp>p_{c} the limit of the entropy density is found to be zero for any x[x,x+]x\in[x_{-},x_{+}], and therefore in the lock-in phase the entropy of any trajectory that ends between the points x±x_{\pm} has a cost that is sub-linear in the potential number of customers. In fact, is also possible to show Franchini that, for any continuous and invertible urn function, the limit ϕ(x)\phi\left(x\right) exists, it is strictly convex and negative from x=0x=0 up to the first point where the urn function crosses the diagonal, is zero from that point to the last crossing, and then is convex negative again. A numeric example of the scaling of ϕ\phi and ϕ\phi^{*} is in Figures (7) and (8).

Although the analysis of the zero-cost trajectories allows to establish that in the early entry case the entropy in the region between the convergence points is sub-linear, the exact scaling of the entropy is not captured by this analysis. By the way, we remark that any deviation from these trajectories on time scale O(N)O(N) implies exponential cost. Moreover, from the Corollary 6 of Ref. Franchini follows also the uniqueness of the solution for each x(x1,x2)x\in\left(x_{1},x_{2}\right). The probability mass current can flow along these trajectories only, therefore, the current flowing through (φ1,φ2)(\varphi_{1},\varphi_{2}) is a constant in τ\tau,

P(φ(1)(x1,x2))=P(φ(τ)(φ1(τ),φ2(τ))),P\left(\varphi\left(1\right)\in\left(x_{1},x_{2}\right)\right)=P\left(\varphi\left(\tau\right)\in\left(\varphi_{1}\left(\tau\right),\varphi_{2}\left(\tau\right)\right)\right), (75)

since can be also shown Franchini that zero-cost trajectories always emanate from the closest unstable equilibrium point, follows that the entropy of the event xN,N(x1,x2)x_{N,N}\in\left(x_{1},x_{2}\right) should scale like the entropy near that point, that in this case is x0=1/2x_{0}=1/2. It would be very interesting to have a general mathematical theory that allows to find the exact rate at which the point x0x_{0} expels its probability mass. An informal but general argument can be found in Section III.B.2 of Jack (2019) Jack LD .

Interestingly, also Nakayama and Mori (2021) find that for urn functions that are symmetric around x0x_{0} the autocorrelation function satisfy a universal logarithmic scaling for a suitable definition of the correlation length. See Ref. Kazuaki for further details.

Refer to caption
Figure 7: Sub-linear entropy scaling ϕ\phi^{*} of the DEK model for k=3k=3, p=21/24>pcp=21/24>p_{c}, size N=2kN=2^{k} with 10k1710\leq k\leq 17, early start τ0=2/N\tau_{0}=2/N, and initial condition u0=1/2u_{0}=1/2 (XN,2=1X_{N,2}=1). The scaling limit is reached very slowly in the sub-linear region, as the finite size corrections vanish only logarithmically in the number of customers.
Refer to caption
Figure 8: Linear entropy scaling ϕ\phi (entropy density) of the DEK model for k=3k=3, p=21/24>pcp=21/24>p_{c}, size N=2kN=2^{k} with 12k1712\leq k\leq 17, early start at τ0=2/N\tau_{0}=2/N, and initial condition u0=1/2u_{0}=1/2 (XN,2=1X_{N,2}=1). The scaling limit is reached much faster in the linear regions. The gray dot line is the prediction ϕ=0\phi=0 in the region between [x,x+]\left[x_{-},\,x_{+}\right].

VIII Cumulant generating function

We still didn’t found much about the region ϕ(x)<0\phi\left(x\right)<0: it would be very interesting to have a method to compute the optimal trajectories also in this region, perhaps this could be achieved by properly deforming the zero-cost trajectories, or applying techniques from Lagrange mechanics, or other optimal control methods. Although this has not yet been achieved, we can still compute the shape of ϕ\phi outside the sub linear-region by analyzing the cumulant generating function (CGF)

ξ(λ):=limN1NlogΓNeλΓP(xN,N=Γ/N),\xi\left(\lambda\right):=\lim_{N\rightarrow\infty}\frac{1}{N}\log\sum_{\Gamma\leq N}e^{-\lambda\Gamma}P\left(x_{N,N}=\Gamma/N\right), (76)

the right (left) behavior of ϕ(x)\phi\left(x\right) near the convergence points can be deduced from the left (right) limit λ0±\lambda\rightarrow 0^{\pm} of the CGF before. Since the convergence points are always symmetric around x0=1/2x_{0}=1/2 we only compute the limit from right. In Ref. Franchini is shown that, in general, the CGF satisfies the following nonlinear differential equation at any pp and kk (eventually any invertible π\pi)

λξ(λ)=π1(eξ(λ)1eλ1)\partial_{\lambda}\xi\left(\lambda\right)=\pi^{-1}\left(\frac{e^{\,\xi\left(\lambda\right)}-1}{e^{\,\lambda}-1}\right) (77)

with π1\pi^{-1} inverse urn function, and we can study the behavior at small lambda with a suitable perturbations theory (see next section). The shape of ϕ\phi near the convergence points is then computed via the Legendre transform

ϕ(x)=infλ[0,){λxξ(λ)}.\phi\left(x\right)=\inf_{\lambda\in\left[0,\infty\right)}\left\{\lambda x-\xi\left(\lambda\right)\right\}. (78)

A possible informal derivation is as follows: let consider the difference between the partition functions of the system at N+1N+1 and NN customers

Eexp(λNxN,N+λXN+1,N+1)Eexp(λNxN,N)=(eλ1)E(π(xN,N)exp(λNxN,N)),E\exp\,(\lambda Nx_{N,N}+\lambda X_{N+1,N+1})-E\exp\,(\lambda Nx_{N,N})={\textstyle\left(e^{\lambda}-1\right)}\,E(\pi\left(x_{N,N}\right)\exp\,(\lambda Nx_{N,N})), (79)

consider the following equivalent expression for the CGF

ξN(λ)=1NlogEexp(λNxN,N),\xi_{N}\left(\lambda\right)=\frac{1}{N}\log\,E\exp\,(\lambda Nx_{N,N}), (80)

define the auxiliary function

δN(λ):=(N+1)(ξN+1(λ)ξN(λ)),\delta_{N}\left(\lambda\right):=\left(N+1\right)\left(\xi_{N+1}\left(\lambda\right)-\xi_{N}\left(\lambda\right)\right), (81)

and the notation EλE_{\lambda} for the tilted average,

Eλf(XN):=Ef(XN)exp(λNxN,N)Eexp(λNxN,N).E_{\lambda}f\left(X_{N}\right):=\frac{Ef\left(X_{N}\right)\exp\left(\lambda Nx_{N,N}\right)}{E\exp\left(\lambda Nx_{N,N}\right)}. (82)

Using this notation and after some manipulations we arrive to the identity

δN(λ)+ξN(λ)=log(1+(eλ1)Eλπ(xN,N)),\delta_{N}\left(\lambda\right)+\xi_{N}\left(\lambda\right)=\log\left(1+{\textstyle\left(e^{\lambda}-1\right)}\,E_{\lambda}\pi\left(x_{N,N}\right)\right), (83)

now we take the limit NN\rightarrow\infty: from the existence of ϕ\phi follows that of ξ\xi, then the limit of ξN\xi_{N} exists and is

limNξN(λ)=ξ(λ),\lim_{N\rightarrow\infty}\xi_{N}\left(\lambda\right)=\xi\left(\lambda\right), (84)

and can be shown Franchini that, if the urn function π\pi is invertible, which is our case, then also its derivative exists, and converges to λξ\partial_{\lambda}\xi in the limit

limNEλxN,N=limNλξN(λ)=λξ(λ).\lim_{N\rightarrow\infty}E_{\lambda}x_{N,N}=\lim_{N\rightarrow\infty}\partial_{\lambda}\xi_{N}\left(\lambda\right)=\partial_{\lambda}\xi\left(\lambda\right). (85)

It is also possible to prove Franchini that xN,Nx_{N,N} weakly concentrates on its convergence point under the tilted average EλE_{\lambda}, notice that the tilted average of xN,Nx_{N,N} is

EλxN,N=λξN,N(λ),E_{\lambda}x_{N,N}=\partial_{\lambda}\xi_{N,N}\left(\lambda\right), (86)

therefore by weak convergence

limNEλπ(xN,N)=limNπ(EλxN,N)=π(λξ(λ)).\lim_{N\rightarrow\infty}E_{\lambda}\pi\left(x_{N,N}\right)=\lim_{N\rightarrow\infty}\pi\left(E_{\lambda}x_{N,N}\right)=\pi\left(\partial_{\lambda}\xi\left(\lambda\right)\right). (87)

Finally, with a slightly more technical argument (see the proof section of Ref. Franchini ) one can show that δN\delta_{N} converges to zero

limNδN(λ)=0,\lim_{N\rightarrow\infty}\delta_{N}\left(\lambda\right)=0, (88)

putting together we find

ξ(λ)=log(1+(eλ1)π(λξ(λ))),\xi\left(\lambda\right)=\log\left(1+{\textstyle\left(e^{\lambda}-1\right)}\,\pi\left(\partial_{\lambda}\xi\left(\lambda\right)\right)\right), (89)

that is equivalent to Eq. (77).

The DEK with k=1k=1 is fully equivalent to the ERW, whose LDT properties have been studied by Jack and Harris in both pp regimes Jack Harris : the urn function is

π1(x)=a+bx,\pi_{1}\left(x\right)=a+bx, (90)

with coefficients equal to

a=1p,b=2p1.a=1-p,\ \ b=2p-1. (91)

From Eq. (77), the CGF satisfies the differential equation

a+bλξ(λ)=eξ(λ)1eλ1,a+b\,\partial_{\lambda}\,\xi\left(\lambda\right)=\frac{e^{\,\xi\left(\lambda\right)}-1}{e^{\,\lambda}-1}, (92)

this equation can be integrated exactly by applying a proper substitution, and then the Laplace method (see Section 3.3.2 Ref. Franchini ): adapting the results from Corollary 10 Ref. Franchini (see also Jack and Harris Jack Harris ) we find that the CGF is

1eξ(λ)=abe(a/b)λ(1eλ)1/b1eλ1𝑑t(1t)(a/b)1t1/b1-e^{-\xi\left(\lambda\right)}=\frac{a}{b}\,e^{-\left(a/b\right)\lambda}\left({\textstyle 1-e^{-\lambda}}\right)^{1/b}\int_{1-e^{-\lambda}}^{1}dt\,\left(1-t\right)^{\left(a/b\right)-1}t^{-1/b} (93)

for p>1/2p>1/2 and λ>0\lambda>0. Interestingly for b>0b>0 (p>1/2p>1/2) the function is never analytic at λ=0\lambda=0, expanding for small λ\lambda we find a non vanishing term, of order λ1/blogλ\lambda^{1/b}\log\lambda when 1/b1/b is an integer number and λ1/b\lambda^{1/b} when is a real number: derivatives of order higher than 1/b1/b are singular at λ=0\lambda=0.

IX Scaling of the master equation

Numerically, we can study the shape of ϕ\phi by computing the master equation,

P(xN+1,N+1=Γ/N)=π(Γ/N1/N)P(xN,N=Γ/N1/N)+(1π(Γ/N))P(xN,N=Γ/N),P\left(x_{N+1,N+1}=\Gamma/N\right)=\pi\left(\Gamma/N-1/N\right)P\left(x_{N,N}=\Gamma/N-1/N\right)+\left(1-\pi\left(\Gamma/N\right)\right)P\left(x_{N,N}=\Gamma/N\right), (94)

that can be integrated iteratively starting from the distribution of the initial condition xN,Mx_{N,M}. Notice that in practical numerical tasks is not convenient to consider exponential quantities, and then in our numerical tests we will consider the entropy

Φ(Γ,N):=logP(xN,N=Γ/N),\Phi\left(\Gamma,N\right):=-\log P\left(x_{N,N}=\Gamma/N\right), (95)

in this form the master equation can be rewritten as follows

exp(Φ(Γ,N)Φ(Γ,N+1))=π(Γ/N1/N)exp(Φ(Γ,N)Φ(Γ1,N))+(1π(Γ/N)).\exp\left(\Phi\left(\Gamma,N\right)-\Phi\left(\Gamma,N+1\right)\right)=\pi\left(\Gamma/N-1/N\right)\exp\left(\Phi\left(\Gamma,N\right)-\Phi\left(\Gamma-1,N\right)\right)+\left(1-\pi\left(\Gamma/N\right)\right). (96)

The Eq.s (77) and (78) can be (informally) deduced also from the master equation: in fact, the existence of ϕ\phi suggests to try the following scaling

Φ(Γ,N)Nϕ(Γ/N),\Phi\left(\Gamma,N\right)\rightarrow N\phi\left(\Gamma/N\right), (97)

that holds for large NN. The left term of the master equation is

Φ(Γ,N)Φ(Γ,N+1)(N+1)ϕ(Γ/(N+1))+Nϕ(Γ/N)\Phi\left(\Gamma,N\right)-\Phi\left(\Gamma,N+1\right)\rightarrow-\left(N+1\right)\phi\left(\Gamma/\left(N+1\right)\right)+N\phi\left(\Gamma/N\right) (98)

while the right term is

Φ(Γ,N)Φ(Γ1,N)Nϕ(Γ/N)+Nϕ((Γ+1)/N),\Phi\left(\Gamma,N\right)-\Phi\left(\Gamma-1,N\right)\rightarrow-N\phi\left(\Gamma/N\right)+N\phi\left(\left(\Gamma+1\right)/N\right), (99)

putting back into the master equation we find

exp((N+1)ϕ(Γ/(N+1))Nϕ(Γ/N))==π(Γ/N1/N)exp(Nϕ(Γ/N1/N)Nϕ(Γ/N))+(1π(Γ/N)).\exp\left(\left(N+1\right)\phi\left(\Gamma/\left(N+1\right)\right)-N\phi\left(\Gamma/N\right)\right)=\\ =\pi\left(\Gamma/N-1/N\right)\exp\left(N\phi\left(\Gamma/N-1/N\right)-N\phi\left(\Gamma/N\right)\right)+\left(1-\pi\left(\Gamma/N\right)\right). (100)

Now, let apply the scaling Γ/Nx\Gamma/N\rightarrow x and 1/Ndx1/N\rightarrow dx, from this conditions we deduce that

(Γ+1)/Nx+dx,\left(\Gamma+1\right)/N\rightarrow x+dx, (101)
Γ/(N+1)=Γ/NΓ/(N(N1))xxdx,\Gamma/\left(N+1\right)=\Gamma/N-\Gamma/\left(N\left(N-1\right)\right)\rightarrow x-xdx, (102)

the scaling of the entropy density is

ϕ(Γ/(N+1))ϕ(x)\phi\left(\Gamma/\left(N+1\right)\right)\rightarrow\phi\left(x\right) (103)
ϕ(Γ/(N+1))ϕ(x)xxϕ(x)dx\phi\left(\Gamma/\left(N+1\right)\right)\rightarrow\phi\left(x\right)-x\,\partial_{x}\phi\left(x\right)dx (104)
ϕ((Γ1)/N)ϕ(x)xϕ(x)dx.\phi\left(\left(\Gamma-1\right)/N\right)\rightarrow\phi\left(x\right)-\partial_{x}\phi\left(x\right)dx. (105)

In the end one obtains a non-linear differential equation

π(x)=exp(xxϕ(x)ϕ(x))1exp(xϕ(x))1,\pi\left(x\right)=\frac{\exp\,(x\,\partial_{x}\phi\left(x\right)-\phi\left(x\right))-1}{\exp\,(\partial_{x}\phi\left(x\right))-1}, (106)

that reduces to Eq. (77) if one substitutes xϕ(x)λ\partial_{x}\phi\left(x\right)\rightarrow-\lambda and

xxϕ(x)ϕ(x)ξ(λ),xλξ(λ).x\,\partial_{x}\phi\left(x\right)-\phi\left(x\right)\rightarrow\xi\left(\lambda\right),\ \ \ x\rightarrow\partial_{\lambda}\xi\left(\lambda\right). (107)

It would be very interesting to have a general theory to solve these differential equations for any π\pi: at present, this can be done only for linear urn functions.

X Perturbations theory for k=1k=1

In these final sections we elaborate a first order perturbations theory for the shape of ϕ\phi outside the sublinear region. We find some more critical values of the trust parameter pp, that exist in both the k=1k=1 and k=3k=3 cases. For k=1k=1 only one pp^{*} exists, beyond which the peak of the share distribution is not Gaussian anymore (that is well known). Interestingly, in the case k=3k=3 there are two critical values: pp^{*}, that is analogue to the case k=1k=1, and a pp^{**}, beyond which the Gaussianity near the convergence point seems restored, see Figures 3 and 4.

To systematically understand the shape of ϕ\phi it will be more instructive to perform an approximate analysis. We put emphasis on perturbation theory because is a simple method and does not require special mathematical knowledge on ODE to be applied. We consider the following general scaling at small λ\lambda

ξ(λ)Aλ+Bλ2+Cλθ\xi\left(\lambda\right)\approx A\lambda+B\lambda^{2}+C\lambda^{\theta} (108)

where the approximate equality symbol \approx is intended in the sense that we are ignoring all terms of the kind λθ\lambda^{\theta} with θ>2\theta>2. This is because λ\lambda is assumed to be small, then the term λθ\lambda^{\theta} can rival with the regular terms only if θ2\theta\leq 2, i.e., for θ>2\theta>2 the regular terms dominate the first two moments of the distribution and λθ\lambda^{\theta} can be ignored. The derivative respect to λ\lambda is

λξ(λ)A+2Bλ+θCλθ1.\partial_{\lambda}\xi\left(\lambda\right)\approx A+2B\lambda+\theta C\lambda^{\theta-1}. (109)

Then, we approximate the right side of Eq. (92),

eξ(λ)1eλ1A+(A(1A)/2+B)λ+Cλθ1,\frac{e^{\,\xi\left(\lambda\right)}-1}{e^{\,\lambda}-1}\approx A+\left(A\left(1-A\right)/2+B\right)\lambda+C\lambda^{\theta-1}, (110)

equating the coefficients of the terms with equal power

((1b)Aa)+(A(1A)/2+(12b)B)λ+(1bθ)Cλθ10,\left(\left(1-b\right)A-a\right)+\left(A\left(1-A\right)/2+\left(1-2b\right)B\right)\lambda+\left(1-b\theta\right)C\lambda^{\theta-1}\approx 0, (111)

we find the following values for AA, BB and θ\theta:

A=a1b=12,A=\frac{a}{1-b}=\frac{1}{2}, (112)
2B=A(1A)12b=14(34p),2B=-\frac{A\left(1-A\right)}{1-2b}=-\frac{1}{4\left(3-4p\right)}, (113)
θ=1b=12p1,\theta=\frac{1}{b}=\frac{1}{2p-1}, (114)

the amplitude CC is not captured by this expansion, and must be determined in a different way, for example it could be obtained from the exact expression of the CGF that is given before, but we don’t need it.

We remark that, when p>p=3/4p>p^{*}=3/4, i.e., when the derivative of this urn function at the point of convergence x0x_{0} goes above 1/21/2, then even the second order cumulant is super-linear, and the shape ϕ(x)\phi\left(x\right) in the nearby of x0=1/2x_{0}=1/2 is not even Gaussian anymore for p(p,1)p\in\left(p^{*},1\right). This suggests some phase change in the convergence mechanism of xN,Nx_{N,N}: below pp^{*}, when the urn function derivative at the point x0x_{0} is less than 1/21/2, we expect that xN,Nx_{N,N} will cross the critical value infinitely many times in its evolution. But above the value pcp_{c} the convergence of xN,Nx_{N,N} has a slow down, according to an interesting mechanism first described by Pemantle Pemantle Touch , where xN,Nx_{N,N} approaches x0x_{0} so slowly that it will never cross this point (almost surely), and will accumulate in its neighborhood.

The effects of this transition can be observed in the shape of ϕ\phi. Let apply the Legendre transform to the expression of the CGF for small λ\lambda, first we have to solve the equation

xλξ(λ)=0,x-\partial_{\lambda}\xi\left(\lambda\right)=0, (115)

inserting the approximation before we have

xA2BλθCλθ10.x-A-2B\lambda-\theta C\lambda^{\theta-1}\approx 0. (116)

For θ2\theta\geq 2 the quadratic term is dominant at small λ\lambda, and the previous condition reduces to

xA2Bλ0x-A-2B\lambda\approx 0 (117)

solving the equation we find the λ\lambda that minimizes the Legendre functional of Eq. (78)

λxA2B,\lambda\approx\frac{x-A}{2B}, (118)

putting back in the expression for ϕ\phi we find

ϕ(x)B(xA2B)2.\phi\left(x\right)\approx B\left(\frac{x-A}{2B}\right)^{2}. (119)

If instead 1θ21\leq\theta\leq 2 the quadratic term can be ignored in favor of the non-linear term, the condition is

xAθCλθ10x-A-\theta C\lambda^{\theta-1}\approx 0 (120)

The new condition bring to a different minimizer

λ(xAθC)1θ1,\lambda\approx\left(\frac{x-A}{\theta C}\right)^{\frac{1}{\theta-1}}, (121)

then we can compute the approximate shape,

ϕ(x)(θC)b1b(θ11)(xA)11b.\phi\left(x\right)\approx\left(\theta C\right)^{-\frac{b}{1-b}}\left(\theta^{-1}-1\right)\left(x-A\right)^{\frac{1}{1-b}}. (122)

Summarizing, the shape of ϕ\phi nearby the convergence point y0y_{0} for the DEK k=1k=1 is approximately

ϕ(x){K0|xx0|2K1|xx0|1/(22p)0<p<pp<p<1,\phi\left(x\right)\approx\begin{cases}\begin{array}[]{l}K_{0}\left|x-x_{0}\right|^{2}\\ K_{1}\left|x-x_{0}\right|^{1/\left(2-2p\right)}\end{array}&\begin{array}[]{l}0<p<p^{*}\\ p^{*}<p<1\end{array}\end{cases}, (123)

where the first constant is

K0=1/2B=2(34p)K_{0}=1/2B=2\left(3-4p\right) (124)

and K1K_{1} must be determined from the exact form of ξ\xi. To keep the analysis simple we do not discuss the critical case p=pcp=p_{c}, altough this also can be inferred from the exact form of ξ\xi.

Refer to caption
Figure 9: Test of perturbation theory near x+x_{+} for pc<p<pp_{c}<p<p^{**}. The figure shows in double-logarithmic scale the density ϕ\phi of the DEK model near the convergence point (in log-log scale) and the predicted behavior (gray dot line) |xx+|4|x-x_{+}|^{4} from Eq. (136) obtained via perturbations analysis. The parameters are k=3k=3, p=21/24p=21/24, N=2kN=2^{k} with 10k1410\leq k\leq 14, τ0=2/N\tau_{0}=2/N and u0=1/2u_{0}=1/2 (XN,2=1X_{N,2}=1). We remark that in this range of pp the exact coefficient K2K^{\prime}_{2} cannot be determined from perturbations, the gray dot line in the figure has been settled to highlight the agreement of the predicted exponent with the simulation.

XI Perturbations theory for k=3k=3

Concerning the generalized DEK k=3k=3, its urn function is a third degree polynomial of the kind

π3(x)=a+cx2dx3\pi_{3}\left(x\right)=a+cx^{2}-dx^{3} (125)

with null linear term, the other coefficients are

a=1p,c=3(2p1),d=2(2p1).a=1-p,\ \ c=3\left(2p-1\right),\ \ d=2\left(2p-1\right). (126)

The implicit differential equation for the CGF is

a+c(λξ(λ))2d(λξ(λ))3=eξ(λ)1eλ1,a+c\,\left(\partial_{\lambda}\xi\left(\lambda\right)\right)^{2}-d\,\left(\partial_{\lambda}\xi\left(\lambda\right)\right)^{3}=\frac{e^{\,\xi\left(\lambda\right)}-1}{e^{\,\lambda}-1}, (127)

this equation cannot be solved (at best of our knowledge), but, by looking at the behavior for small λ\lambda, we expect that below pcp_{c} the same picture of the linear case (with k=1k=1) will arise, although at different critical value p=2/3p^{*}=2/3. This is because the urn function has a flex at the convergence point x0x_{0}, i.e. the urn function is locally linear.

Let expand the urn function near the convergence point, for example x0x_{0}, we can linearize it

π3(x)π3(x0)+xπ3(x0)(xx0),\pi_{3}\left(x\right)\approx\pi_{3}\left(x_{0}\right)+\partial_{x}\pi_{3}\left(x_{0}\right)\left(x-x_{0}\right), (128)

where the derivative of π3\pi_{3} is

xπ3(x)=x(2c3dx)=6(2p1)x(1x).\partial_{x}\pi_{3}\left(x\right)=x\left(2c-3dx\right)=6\left(2p-1\right)x\left(1-x\right). (129)

We can use the results obtained for the linear case before, in the region below pcp_{c} we can take

b0=xπ3(x0)=6(2p1)x0(1x0)=32(2p1)b_{0}=\partial_{x}\pi_{3}\left(x_{0}\right)=6\left(2p-1\right)x_{0}\left(1-x_{0}\right)=\frac{3}{2}\left(2p-1\right) (130)

while above pcp_{c}we have

b±=xπ3(x±)=6(2p1)x±(1x±)=6(2p1)(x0+Λ)(x0Λ)=b0(14Λ2).b_{\pm}=\partial_{x}\pi_{3}\left(x_{\pm}\right)=6\left(2p-1\right)x_{\pm}\left(1-x_{\pm}\right)=6\left(2p-1\right)\left(x_{0}+\Lambda\right)\left(x_{0}-\Lambda\right)=b_{0}\left(1-4\Lambda^{2}\right). (131)

Then, in the case p<pcp<p_{c} we have Λ=0\Lambda=0, recalling that θ=1/b\theta=1/b the sub-critical exponent is

θ0=23(2p1),\theta_{0}=\frac{2}{3\left(2p-1\right)}, (132)

solving the equation θ=2\theta=2 we find p=2/3p^{*}=2/3. For p>pcp>p_{c} one has a positive Λ\Lambda, substituting

4Λ2=6p52p14\Lambda^{2}=\frac{6p-5}{2p-1} (133)

into Eq. (131) we find that

θ±=16(1p)\theta_{\pm}=\frac{1}{6\left(1-p\right)} (134)

therefore, there is another critical point where the shape of ϕ\phi changes again, solving the equation we find it at

p=11/12.p^{**}=11/12. (135)

Then, above pcp_{c}, another special trust parameter pp^{**} can be identified, that corresponds to the value at which the derivative of the urn function near x±x_{\pm} (that above pcp_{c} is decreasing in pp) goes once again below 1/21/2. We predict that in this last region the convergence mechanism below pp^{*} is restored, although with a different convergence point.

Putting these considerations together, we find the following approximate shape of ϕ\phi for k=3k=3,

ϕ(x){K0|xx0|2K1|xx0|256pK2|xx±|16p5K3|xx±|20<p<pp<p<pcpc<p<pp<p<1\phi\left(x\right)\approx\begin{cases}\begin{array}[]{l}K^{\prime}_{0}\left|x-x_{0}\right|^{2}\\ K^{\prime}_{1}\left|x-x_{0}\right|^{\frac{2}{5-6p}}\\ K^{\prime}_{2}\left|x-x_{\pm}\right|^{\frac{1}{6p-5}}\\ K^{\prime}_{3}\left|x-x_{\pm}\right|^{2}\end{array}&\begin{array}[]{l}0<p<p^{*}\\ p^{*}<p<p_{c}\\ p_{c}<p<p^{**}\\ p^{**}<p<1\end{array}\end{cases} (136)

where we implicitly assumed that |x|>1/2+Λ\left|x\right|>1/2+\Lambda, since in the region between the convergence points we already know that ϕ=0\phi=0. The constants K0K^{\prime}_{0} and K3K^{\prime}_{3} are computed like in the k=1k=1 case, one finds

K0=2(12b0)=4(32p),K^{\prime}_{0}=2\left(1-2b_{0}\right)=4\left(3-2p\right), (137)

in the region below pp^{*} and

K3=2(12b±)14Λ2=1+12(12p)(1p)2(1p)K^{\prime}_{3}=\frac{2\left(1-2b_{\pm}\right)}{1-4\Lambda^{2}}=\frac{1+12\left(1-2p\right)\left(1-p\right)}{2\left(1-p\right)} (138)

in the region above p.p^{**}. On the contrary, the constants K1K^{\prime}_{1} and K2K^{\prime}_{2} cannot be determined using perturbations, and should be found by other methods. A numerical check of the exponent in the region pc<p<pp_{c}<p<p^{**} is in Figure (9).

Acknowledgments

We thank Giovanni Dosi (Scuola Superiore Sant’Anna) and two anonymous referees of Physical Review E for their useful comments. This research has received funding from European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No [694925]).

References

  • (1) W. B. Arthur, Nat. Rev. Phys. 3, 136–145 (2021).
  • (2) W. B. Arthur, Increasing Returns and Path Dependence in the Economy, Univ. Michigan Press, Ann Arbor (1994).
  • (3) W. B. Arthur, Econ. J. 99, 116–131 (1989).
  • (4) W. B. Arthur, Y. Ermoliev, M. Kaniovski, Kibernetika 1, 49–56 (1983).
  • (5) G. Dosi, Y. Ermoliev, Y. Kaniovsky, J. Math. Econom. 23, 1–19 (1994).
  • (6) G. Dosi, Y. Ermoliev, Y. Kaniovsky, J. Evol. Econ. 4, 93–123 (1994).
  • (7) G. Dosi, A. Moneta, E. Stepanova, Ind. Innov. 26, 461–478 (2019).
  • (8) G. Schütz, S. Trimper, Phys. Rev. E 70, 045101(R) (2004).
  • (9) E. Baur, J. Bertoin, Phys. Rev. E 94, 052134 (2016).
  • (10) R. Pemantle, Probab. Surveys 4, 1-79 (2007).
  • (11) H. Mahmoud, Polya Urn Models, Taylor & Francis Ltd, New York (2008).
  • (12) R. Jack, Phys. Rev. E 100, 012140 (2019).
  • (13) R. Jack, Eur. Phys. J. B 93, 74 (2020).
  • (14) R. Jack, R. Harris, Phys. Rev. E 102, 012154 (2020).
  • (15) B.M. Hill, D. Lane, W. Sudderth, Ann. Probab. 8, 214–226 (1980).
  • (16) B.M. Hill, D. Lane, W. Sudderth, Ann. Probab. 15, 1586–1592 (1987).
  • (17) R. Pemantle, Proc. Amer. Math. Soc. 113, 235–243 (1991).
  • (18) S. Franchini, Stoc. Proc. Appl. 127, 3372 (2017).
  • (19) K. Nakayama, S. Mori, Phys. Rev. E 104, 014109 (2021).
  • (20) H. Touchette, Physica A 504, 5–19 (2018).
  • (21) H. Pham, Some Applications and Methods of Large Deviations in Finance and Insurance, Lect. Notes Math. 1919, Springer, Berlin, Heidelberg (2007).
  • (22) A. Dembo, O. Zeitouni, Large Deviation Techniques and Applications, Springer, New York (1998).
  • (23) G. Parisi, M. Mezard, M. A. Virasoro, Spin Glass theory and Beyond, World Scientific, Singapore (1987).
  • (24) S. Franchini, Ann. Phys. 450, 169220 (2023).
  • (25) K. Klymko, J. P. Garrahan, S. Whitelam, Phys. Rev. E 96, 042126 (2017).
  • (26) K. Klymko, P. L. Geissler, J. P. Garrahan, S. Whitelam, Phys. Rev. E 97, 032123 (2018).
  • (27) B. D. Hughes, Random Walks and Random Environments Vol. 1, Clarendon Press, Oxford (1995).
  • (28) S. Franchini, Phys. Rev. E 84, 051104 (2011).
  • (29) S. Franchini, R. Balzan, Phys. Rev. E 98, 042502 (2018).
  • (30) S. Franchini, R. Balzan, Phys. Rev. E 102, 032143 (2020).
  • (31) M. van den Berg, E. Bolthausen, F. Den Hollander, Ann. Math. 153, 355–406 (2001)