This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

An explicit example for the high temperature convolution: crossover between the binomial law B(2,1/2)B(2,1/2) and the arcsine law

Pierre Mergny [email protected] LPTMS, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91405 Orsay, France Chair of Econophysics &\& Complex Systems, Ecole Polytechnique, 91128 Palaiseau Cedex, France
Abstract

In this note, we study the high-temperature convolution introduced in Ref.  [1], between two symmetric Bernoulli distributions. We give an analytical expression for both the Stieltjes transform and the density. This result provides the first non-trivial expression for the high-temperature convolution of two distributions and gives a new family of densities, interpolating between the centered binomial distribution with number of trials n=2n{=}2 and probability of success p=1/2p{=}1/2, and the centered and re-scaled arcsine law.

I Introduction

Understanding the spectrum of a large random matrix is of special interest in fields as diverse as high energy physics [2, 3], statistics [4], disordered systems [5, 6], finance [7], economy [8], ecology [9] and genetics [10, 11] to cite a few. A particular - yet fundamental - case of interest concerns the situation where one is dealing with the sum of random matrices.

Classical and free convolution - Important progress under this setting has been made in the 90s with the introduction of free probability [12, 13, 14]. To fix things, we consider 𝐚,𝐛N\mathbf{a},\mathbf{b}\in\mathbb{R}^{N} such that as NN goes to infinity, μ𝐚:=(1/N)i=1NδaiμA\mu_{\mathbf{a}}:=(1/N)\sum_{i=1}^{N}\delta_{a_{i}}\to\mu_{A} and similarly μ𝐛μB\mu_{\mathbf{b}}\to\mu_{B}, and a random rotation matrix 𝐎Haar(𝖮(N))\mathbf{O}\sim\mathrm{Haar}(\mathsf{O}(N)), where 𝖮(N)\mathsf{O}(N) is the group of (N×N)(N\times N) orthogonal matrices. As NN goes to infinity, the spectrum of Diag(𝐚)+𝐎Diag(𝐛)𝐎𝖳\mathrm{Diag}(\mathbf{a})+\mathbf{O}\mathrm{Diag}(\mathbf{b})\mathbf{O}^{\mathsf{T}} only depends on the limiting distributions μA\mu_{A} and μB\mu_{B}, and is known as the free convolution of the former distributions, denoted by μAμB\mu_{A}\boxplus\mu_{B}. In practice, one can compute the distribution μAμB\mu_{A}\boxplus\mu_{B} thanks to the so-called R-transform, and we refer to the textbook [15] for further details. The two symmetric matrices 𝐀:=Diag(𝐚)\mathbf{A}:=\mathrm{Diag}(\mathbf{a}) and 𝐁:=𝐎Diag(𝐛)𝐎𝖳\mathbf{B}:=\mathbf{O}\mathrm{Diag}(\mathbf{b})\mathbf{O}^{\mathsf{T}} are said to be (asymptotically) free or equivalently their eigenbasis are in “generic positions”. Another way to view free probability is to notice that 𝐀\mathbf{A} and 𝐁\mathbf{B} are “maximally non-commutative” self-adjoint objects, in the sense that if both μA\mu_{A} and μB\mu_{B} have zero mean and if we denote by τ(.):=Tr(.)/N\tau(.):=\mathrm{Tr}(.)/N, we have at large NN, τ(𝐀2𝐁2)=τ(𝐀2)τ(𝐁2)=m2[μA]m2[μB]\tau(\mathbf{A}^{2}\mathbf{B}^{2})=\tau(\mathbf{A}^{2})\tau(\mathbf{B}^{2})=m_{2}[\mu_{A}]m_{2}[\mu_{B}] with m2[μ]:=x2dμm_{2}[\mu]:=\int x^{2}\mathrm{d}\mu but τ(𝐀𝐁𝐀𝐁)=0τ(𝐀2)τ(𝐁2)\tau(\mathbf{A}\mathbf{B}\mathbf{A}\mathbf{B})=0\neq\tau(\mathbf{A}^{2})\tau(\mathbf{B}^{2}). From a purely combinatorial point of view [16], this has a simple interpretation: the former corresponds to a non-crossing partition, while the latter corresponds to a crossing one. As NN\to\infty, only non-crossing terms contribute to the computation of the moments of 𝐀+𝐁\mathbf{A}+\mathbf{B}, in the free probability setting.

On the contrary, if now one is looking at the spectrum of Diag(𝐚)+𝐏Diag(𝐛)𝐏𝖳\mathrm{Diag}(\mathbf{a})+\mathbf{P}\mathrm{Diag}(\mathbf{b})\mathbf{P}^{\mathsf{T}} where 𝐏Unif(𝖲(N))\mathbf{P}\sim\mathrm{Unif}(\mathsf{S}(N)) is a (uniform) random permutation matrix, one is summing diagonal matrices and the limiting spectral distribution for the sum is now simply given by the classical convolution μAμB\mu_{A}\ast\mu_{B}. Thus, classical convolution naturally appears in Random Matrix Theory (RMT) when the eigenbasis of both symmetric matrices are perfectly aligned, or said differently when one is looking at the spectrum of large commutative self-adjoint objects. In particular, if now 𝐁:=𝐏Diag(𝐛)𝐏𝖳\mathbf{B}:=\mathbf{P}\mathrm{Diag}(\mathbf{b})\mathbf{P}^{\mathsf{T}} and μA\mu_{A} and μB\mu_{B} have zero mean, we have τ(𝐀𝐁𝐀𝐁)=τ(𝐀2)τ(𝐁2)\tau(\mathbf{A}\mathbf{B}\mathbf{A}\mathbf{B})=\tau(\mathbf{A}^{2})\tau(\mathbf{B}^{2}) and more generally crossing and non-crossing partitions contribute equally in this setting.

The high-temperature convolution - A natural problem is to give a meaning for “intermediate cases”, that is to describe the spectrum of the sum of two large self-adjoint objects for cases where those objects are not commutative nor maximally non-commutative. Recently [1], promising progress has been made in this direction with the introduction of the so-called high-temperature convolution (or “cc-convolution”), which we briefly describe in this paragraph, see also [17]. The high-temperature convolution has been constructed by looking at spherical integrals [18, 19, 20, 21, 22, 23] in a double scaling regime where NN\to\infty but now the usual inverse temperature parameter β\beta of RMT scales as βN=2c/N\beta_{N}=2c/N, hence the name for the convolution.

The precise description of this convolution will be given later on, but for now one can think of it as an operation taking a parameter c(0,)c\in(0,\infty) and two distributions μA\mu_{A} and μB\mu_{B} as inputs and giving a distribution denoted by μAcμB\mu_{A}\oplus_{c}\mu_{B} as output. This high temperature convolution admits the usual and free convolution as limiting cases, since μAc0μBμAμB\mu_{A}\oplus_{c\to 0}\mu_{B}\equiv\mu_{A}\ast\mu_{B} and μAcμBμAμB\mu_{A}\oplus_{c\to\infty}\mu_{B}\equiv\mu_{A}\boxplus\mu_{B}. As a consequence, it forms a continuous family of convolutions interpolating between the two aforementioned ones as one varies the parameter cc.

Let us mention that this convolution is done directly at the level of the limiting distributions and finding the corresponding ’linear algebra operation’ - such that the eigenvalues of the sum of two symmetric matrices 𝐀\mathbf{A} and 𝐁\mathbf{B} with proper conditions between the two - is an open problem. Nevertheless, one can still develop a combinatorial formula where now the weight of a crossing partition depend on the parameter cc, see [17]. If one thinks of two abstract self-adjoint objects 𝐀,𝐁\mathbf{A},\mathbf{B} which are ’cc-free’, by which we mean that we think of μAcμB\mu_{A}\oplus_{c}\mu_{B} as the limiting spectral distribution of their sum, we have in particular (again for μA\mu_{A} and μB\mu_{B} with zero mean):

τ(𝐀𝐁𝐀𝐁)=τ(𝐀2𝐁2)c+1=m2[μA]m2[μB]c+1.\tau(\mathbf{A}\mathbf{B}\mathbf{A}\mathbf{B})=\frac{\tau(\mathbf{A}^{2}\mathbf{B}^{2})}{c+1}=\frac{m_{2}[\mu_{A}]m_{2}[\mu_{B}]}{c+1}\,. (1)

Thus, the high-temperature convolution corresponds in spirit to the spectrum of the sum of two large self-adjoint objects with degree of non-commutativity indexed by the parameter cc.

This high-temperature convolution admits an intriguing duality with the so-called finite free convolution [24, 25]. The latter can be understood [26] as the spectrum of the sum of two β\beta-ensembles in the low-temperature limit β\beta\to\infty, where the number of particles NN is fixed. This duality translates into a correspondence cNc\leftrightarrow N between the respective parameter of the two convolutions and can be seen as an extension for the sum of the high-low temperature duality developed in Refs. [27, 28, 29, 30].

In practice, the combinatorial formula developed in Ref. [17] is too cumbersome to compute the distribution μAcμB\mu_{A}\oplus_{c}\mu_{B}, and a simpler way is to follow the road developed in Ref. [1] which rely on the Markov-Krein relation [31, 32]:

Suppμdμ(x)(zx)c=exp[cSuppνdν(y)log(zy)],\int_{\mathrm{Supp}\,\mu}\frac{\mathrm{d}\mu(x)}{(z-x)^{c}}=\exp{\left[-c\int_{\mathrm{Supp}\,\nu}\mathrm{d}\nu(y)\log\left(z-y\right)\right]}\,, (2)

valid for any zz in the complex plane outside the supports of the two distributions. The distribution ν\nu is known as the Markov-Krein transform with index cc (MKTc) of the distribution μ\mu and conversely μ\mu is the inverse Markov-Krein transform (IMKT) of ν\nu. The main reason to introduce the Markov-Krein relation is that the high-temperature convolution μAcμB\mu_{A}\oplus_{c}\mu_{B} corresponds to the classical convolution of the MKTc of μA\mu_{A} and μB\mu_{B}. After a few simplifications developed in Ref. [1], this means that one can decompose the computation of the high-temperature convolutions into the following steps:

  1. 1.

    Compute the moment generating functions (MGF) MA,B(s):=𝔼YνA,νB[esY]M_{A,B}(s):=\mathbb{E}_{Y\sim\nu_{A},\nu_{B}}\left[\mathrm{e}^{sY}\right] of the MKTc the two distributions μA\mu_{A} and μB\mu_{B}. The MKTc νA\nu_{A} and νB\nu_{B} can be computed thanks to sophisticated integral representation developed in Ref. [1].

  2. 2.

    Compute the function

    U(c)(z):=1Γ(c)0dsezssc1MA(s)MB(s),U^{(c)}(z):=\frac{1}{\Gamma(c)}\int_{0}^{\infty}\mathrm{d}s\,\mathrm{e}^{-zs}s^{c-1}M_{A}(s)M_{B}(s)\,, (3)

    for zz high enough, that is higher than the K=max(SuppμA)+max(SuppμB)K=\mathrm{max}(\mathrm{Supp}\,\mu_{A})+\mathrm{max}(\mathrm{Supp}\,\mu_{B}) and then extend analytically this function to all z(,K)z\in\mathbb{C}\setminus(-\infty,K).

  3. 3.

    Compute the Stieltjes transform G(c)(z):=d(μAcμB)(x)(zx)1G^{(c)}(z):=\int\mathrm{d}(\mu_{A}\oplus_{c}\mu_{B})(x)\,(z-x)^{-1}, thanks to the formula:

    G(c)(z):=1cddzlogU(c)(z)=1c(U(c))(z)U(c)(z).G^{(c)}(z):=-\frac{1}{c}\frac{\mathrm{d}}{\mathrm{d}z}\log U^{(c)}(z)=-\frac{1}{c}\frac{(U^{(c)})^{\prime}(z)}{U^{(c)}(z)}\,. (4)
  4. 4.

    Compute the distribution μAcμB\mu_{A}\oplus_{c}\mu_{B} thanks to the Sokochi-Plemelj formula:

    (μAcμB)(x)=1π𝔪G(c)(xi0+).(\mu_{A}\oplus_{c}\mu_{B})(x)=\frac{1}{\pi}\mathfrak{Im}\,G^{(c)}(x-\mathrm{i}0^{+})\,. (5)

Each step can be easily approximated numerically such that one can really think of the entire process as an algorithm for computing the high temperature convolution of two distributions.

However, for a given choice of μA\mu_{A} and μB\mu_{B} and the parameter cc, finding an explicit expression for the density of their high-temperature convolution is a daunting task. In fact, the only known cases where one has an explicit expression for the high temperature convolution correspond to trivial fixed points (or infinitely divisible distributions) which, up to rescaling, are left unchanged by the high-temperature convolution. Note that even for the free convolution, one has an analytical expression for the density only for specific choices of the distribution μA\mu_{A} and μB\mu_{B} such that one should not expect to have a simple expression for the high-temperature convolution.

II Main result

The present note aims to answer this issue by providing a complete description of μAcμB\mu_{A}\oplus_{c}\mu_{B} for a specific choice of μA\mu_{A} and μB\mu_{B} and any value of the parameter cc. We consider the case where μA=μB=μ\mu_{A}=\mu_{B}=\mu, with,

μ:=12δ1/2+12δ1/2,\mu:=\frac{1}{2}\delta_{-1/2}+\frac{1}{2}\delta_{1/2}\,, (6)

since this is a famous case where the density of its free convolution with itself is known analytically and given by the (shifted and re-scaled) arscine law: for x[1,1]x\in[-1,1], (μμ)(x)=1π1x2(\mu\boxplus\mu)(x)=\frac{1}{\pi\sqrt{1-x^{2}}} . For the classical convolution, we have μμ=14δ1+12δ0+14δ1\mu\ast\mu=\frac{1}{4}\delta_{-1}+\frac{1}{2}\delta_{0}+\frac{1}{4}\delta_{1}, which is the re-centered binomial distribution with number of trials n=2n=2 and probability of success p=1/2p=1/2.

Expression for the Stieltjes transform - Our first result writes as follows: for any c>0c>0, the Stieltjes transform G(c)G^{(c)} of μcμ\mu\oplus_{c}\mu is given by:

G(c)(z)=1zF12(c2,1+c2;c;1z2)F12(c2,c2;c;1z2),G^{(c)}(z)=\frac{1}{z}\frac{{}_{2}F_{1}\left(\frac{c}{2},1+\frac{c}{2};c;\frac{1}{z^{2}}\right)}{{}_{2}F_{1}\left(\frac{c}{2},\frac{c}{2};c;\frac{1}{z^{2}}\right)}\,, (7)

where F12(a,b;c;u):=k=0(a)k(b)k(c)kk!uk{}_{2}F_{1}(a,b;c;u):=\sum_{k=0}^{\infty}\frac{(a)_{k}(b)_{k}}{(c)_{k}\,k!}u^{k} is the Gauss hypergeometric function and (a)k:=Γ(a+k)/Γ(a)(a)_{k}:=\Gamma(a+k)/\Gamma(a). The derivation of Eq. (7) is postponed to the next section, where we also discuss how one can recover the Stietljes transform of the classical convolution and the free convolution, corresponding respectively to c0+c\to 0^{+} and cc\to\infty. Using the power series of the hypergeometric functions, one gets the large zz behavior of the Stieltjes transform:

G(c)(z)=1z+12z3+4+3c8(c+1)z5+8+5c16(c+1)z7+o(z8),G^{(c)}(z)=\frac{1}{z}+\frac{1}{2z^{3}}+\frac{4+3c}{8(c+1)z^{5}}+\frac{8+5c}{16(c+1)z^{7}}+o(z^{-8})\,, (8)

from which we deduce that the first even moments of the symmetric distribution μcμ\mu\oplus_{c}\mu are given by m2=1/2m_{2}=1/2, m4=(4+3c)/(8c+8)m_{4}=(4+3c)/(8c+8) and m6=(8+5c)/(16c+16)m_{6}=(8+5c)/(16c+16), in accordance with the first moments one can obtain with the combinatorial formula developed in Ref. [17].

If one introduces a new distribution ρ~(x):=(μcμ)(x)x1/2\tilde{\rho}(x):=(\mu\oplus_{c}\mu)(\sqrt{x})\,x^{-1/2} defined for any x[0,1]x\in[0,1], then its Stieltjes transform G~(c)(z):=01dρ~(x)(zx)1\tilde{G}^{(c)}(z):=\int_{0}^{1}\mathrm{d}\tilde{\rho}(x)\,(z-x)^{-1} is related to the one of μcμ\mu\oplus_{c}\mu by G(c)(z)=zG~(c)(z2)G^{(c)}(z)=z\tilde{G}^{(c)}(z^{2}), that is:

G~(c)(z)=1zF12(c2,1+c2;c;1z)F12(c2,c2;c;1z).\tilde{G}^{(c)}(z)=\frac{1}{z}\frac{{}_{2}F_{1}\left(\frac{c}{2},1+\frac{c}{2};c;\frac{1}{z}\right)}{{}_{2}F_{1}\left(\frac{c}{2},\frac{c}{2};c;\frac{1}{z}\right)}\,. (9)

Note that the change of variable going from ρ~\tilde{\rho} to μcμ\mu\oplus_{c}\mu admits a natural interpretation in RMT: if one think of ρ~\tilde{\rho} as the limiting spectrum of some matrix 𝐌𝐌𝖳\mathbf{M}\mathbf{M}^{\mathsf{T}} then μcμ\mu\oplus_{c}\mu is the symmetrized singular value distribution of the square matrix 𝐌\mathbf{M}, see Ref. [33]. Eq. (9) expressed G~(c)(z)\tilde{G}^{(c)}(z) as a product of the inverse function by the ratio of two different hypergeometric functions, both evaluated at 1/z1/z. Such general form has already appeared before in RMT in the study of the high-temperature Jacobi ensemble, see Ref. [29]. Yet, the parameters of the hypergeometric functions here are different such that - to the best knowledge of the author - the family of distributions μcμ\mu\oplus_{c}\mu (and ρ~\tilde{\rho}) is a new one in RMT.

Expression for the density - Our second result is written as follows: if we define V1(x):=F12(1c2,c2,1;x)V_{1}(x):={}_{2}F_{1}\left(1-\frac{c}{2},\frac{c}{2},1;x\right) and V2(x):=F12(2c2,1+c2,2;x)V_{2}(x):={}_{2}F_{1}\left(2-\frac{c}{2},1+\frac{c}{2},2;x\right), then for any cc such that 2c2c\notin\mathbb{N}, the density μcμ\mu\oplus_{c}\mu is given for any x[1,1]{1,0,1}x\in[-1,1]\setminus\{-1,0,1\} by:

(μcμ)(x)=(2c)sin(cπ/2)2π×\displaystyle(\mu\oplus_{c}\mu)(x)=\frac{(2-c)\sin\left(c\pi/2\right)}{2\pi}\times (10)
|x|(V1(1x2)V2(x2)+V1(x2)V2(1x2))V1(x2)2+2cos(cπ2)V1(x2)V1(1x2)+V1(1x2)2.\displaystyle\frac{|x|(V_{1}(1{-}x^{2})V_{2}(x^{2})+V_{1}(x^{2})V_{2}(1{-}x^{2}))}{V_{1}(x^{2})^{2}+2\cos\left(\frac{c\pi}{2}\right)V_{1}(x^{2})V_{1}(1{-}x^{2})+V_{1}(1{-}x^{2})^{2}}\,.

Furthermore, one can obtain the cases where cc is a positive even integer by carefully taking the limit, such that one can understand Eq. (10) as being valid for any c>0c>0, after proper regularization. The distribution μcμ\mu\oplus_{c}\mu diverges at the points {1,0,1}\{-1,0,1\} and is otherwise absolutely continuous with no singular parts in [1,1][-1,1]. The derivation of Eq. (10) is given in the next section. A plot of the density of μcμ\mu\oplus_{c}\mu is given in FIG. 1.

Refer to caption
Figure 1: Plot of the density μcμ\mu\oplus_{c}\mu for x[1,1]x\in[-1,1] and different values of the parameter cc, in logarithmic scale.

Now, for special values of the parameter cc, this expression greatly simplifies. For example for c=1c{=}1, corresponding in a sense to the mid-point between the classical and the free convolution (see Eq. (1)), we have:

(μc=1μ)(x)=12|x|(1x2)1K(x2)2+K(1x2)2,(\mu\oplus_{c=1}\mu)(x)=\frac{1}{2|x|(1{-}x^{2})}\frac{1}{K(x^{2})^{2}+K(1{-}x^{2})^{2}}\,, (11)

where K(.)K(.) is the complete elliptic integral of the first kind, K(u):=0π/2dθ(1u2sin2θ)1/2K(u):=\int_{0}^{\pi/2}\mathrm{d}\theta\,(1-u^{2}\sin^{2}\theta)^{-1/2}. The expression for c=2c=2 is even simpler since we have:

(μc=2μ)(x)=1|x|(1x2)1(log(1x2x2))2+π2.\displaystyle(\mu\oplus_{c=2}\mu)(x)=\frac{1}{|x|(1-x^{2})}\frac{1}{\left(\log\left(\frac{1-x^{2}}{x^{2}}\right)\right)^{2}+\pi^{2}}\,. (12)

In practice when cc is an even positive integer it is easier to evaluate the Stieltjes transform thanks to Eq. (7) and then use the Sokochi-Plemelj formula of Eq. (5) rather than taking the limit in the generic expression of Eq. (10).

III Derivation of the result

In this section we follow the steps enumerated in the previous section in order to prove Eq. (7) and Eq. (10) giving the expression for the Stieltjes transform and the density, respectively.

Computing the MGF of the MKTc - For the symmetric Bernoulli distribution, it has been previously shown, see Refs.  [1, 34, 35], that the corresponding MKTc is the density of the random variable YBeta(c/2,c/2)Y^{\prime}\sim\mathrm{Beta}(c/2,c/2). The distribution μ\mu is the symmetric Bernoulli distribution, shifted by 1/21/2. From the Markov-Krein relation of Eq.  (2), one sees immediately that a shift in the distribution μ\mu introduces the same shift in the MKTc. Thus, the MKTc of μ\mu is simply the law of Y=Y1/2Y{=}Y^{\prime}{-}1/2. From well-known properties of the Beta distribution, this means that the MGF of the MKTc of μ\mu, M(.)MA(.)=MB(.)M(.)\equiv M_{A}(.)=M_{B}(.), is given by:

M(s)=es/2F11(c2;c;s),M(s)=\mathrm{e}^{s/2}\,{}_{1}F_{1}\left(\frac{c}{2};c;s\right)\,, (13)

where F11(a;b;u):=k=0(a)k(b)kuk{}_{1}F_{1}(a;b;u):=\sum_{k=0}^{\infty}\frac{(a)_{k}}{(b)_{k}}u^{k} is the confluent hypergeometric function. Using identities [36] for the confluent hypergeometric, this can also be expressed in terms of the modified Bessel function of the kind Iα(.)I_{\alpha}(.):

M(s)=C1s(1c)/2Ic12(s2),M(s)=C_{1}\,s^{(1-c)/2}I_{\frac{c-1}{2}}\left(\frac{s}{2}\right)\,, (14)

where C1:=2c1Γ(c+12)C_{1}:=2^{c-1}\Gamma\left(\frac{c+1}{2}\right).

Computing the function U(c)U^{(c)} - Injecting Eq. (14) into the definition of U(c)(z)U^{(c)}(z) given by Eq. (3), one obtains the integral representation:

U(c)(z)=C20dsezs(Ic12(s/2))2,U^{(c)}(z)=C_{2}\,\int_{0}^{\infty}\mathrm{d}s\,\mathrm{e}^{-zs}\left(I_{\frac{c-1}{2}}\left(s/2\right)\right)^{2}\,, (15)

where C2:=C12/Γ(c)C_{2}:=C_{1}^{2}/\Gamma(c) is a constant that will not contribute to the expression of the Stietljes transform (and hence the density). The square of the Bessel function can be expressed as an integrated Bessel function thanks to the formula [36]:

(Ic12(s/2))2=2π0π2dθIc1(scosθ).\left(I_{\frac{c-1}{2}}\left(s/2\right)\right)^{2}=\frac{2}{\pi}\int_{0}^{\frac{\pi}{2}}\mathrm{d}\theta\,I_{c-1}\left(s\cos\theta\right)\,. (16)

If we do the change of variable sscosθs\to s\cos\theta in Eq. (15) and then θarcos(coshθ)\theta\to\mathrm{arcos}\left(\mathrm{cosh}\,\theta\right) we have:

U(c)(z)=C30dsIc1(s)(0dθe(zs)coshθ),U^{(c)}(z)=C_{3}\int_{0}^{\infty}\mathrm{d}s\,I_{c-1}\left(s\right)\left(\int_{0}^{\infty}\mathrm{d}\theta\,\mathrm{e}^{-(zs)\,\mathrm{cosh}\,\theta}\right)\,, (17)

with C3=2C2/πC_{3}=2\,C_{2}/\pi. The integral with respect to the variable θ\theta is the integral representation [36] of the Bessel function of the second kind K0(.)K_{0}(.), such that we have:

U(c)(z)=C30dsIc1(s)K0(zs).U^{(c)}(z)=C_{3}\,\int_{0}^{\infty}\mathrm{d}s\,I_{c-1}\left(s\right)K_{0}(zs)\,. (18)

By identities for the integral of the product of two Bessel functions of different kinds, see [36], one can finally express U(c)U^{(c)} in terms of a hypergeometric function:

U(c)(z)=Γ(c)zcF12(c2,c2;c;1z2).U^{(c)}(z)=\Gamma(c)z^{-c}\,{}_{2}F_{1}\left(\frac{c}{2},\frac{c}{2};c;\frac{1}{z^{2}}\right)\,. (19)

Computing the Stieltjes transform - In order to compute G(c)G^{(c)} given by Eq. (4), we first need to compute the derivative of U(c)(z)U^{(c)}(z). Using the differentiation formula [36] for the hypergeometric function, (d/du)F12(a,b;c;u)=(ab/c)F12(a+1,b+1;c+1;u)(\mathrm{d}/\mathrm{d}u)\,\,{}_{2}F_{1}\left(a,b;c;u\right)=(ab/c)\,\,{}_{2}F_{1}\left(a+1,b+1;c+1;u\right), we get:

(U(c))(z)=Γ(c)(c)zc1[c2z2×\displaystyle(U^{(c)})^{\prime}(z)=\Gamma(c)(-c)z^{-c-1}\biggl{[}\frac{c}{2z^{2}}\times (20)
F12(1+c2,1+c2;1+c;1z2)+F12(c2,c2;c;1z2)].\displaystyle\,{}_{2}F_{1}\left(1+\frac{c}{2},1+\frac{c}{2};1+c;\frac{1}{z^{2}}\right)+{}_{2}F_{1}\left(\frac{c}{2},\frac{c}{2};c;\frac{1}{z^{2}}\right)\biggr{]}\,.

Next, using identities [36] between contiguous hypergeometric functions, the sum inside the brackets simplifies such that the derivative of U(c)(z)U^{(c)}(z) writes:

(U(c))(z)=Γ(c)(c)zcF12(c2,1+c2;c;1z2)z.(U^{(c)})^{\prime}(z)=\Gamma(c)(-c)z^{-c}\,\frac{{}_{2}F_{1}\left(\frac{c}{2},1+\frac{c}{2};c;\frac{1}{z^{2}}\right)}{z}\,. (21)

Injecting the expression of U(c)(z)U^{(c)}(z) and its derivative, given respectively by Eq. (19) and Eq. (21), in Eq. (4) gives the desired expression for the Stieltjes transform, see Eq. (7).

The limiting case c0+c\to 0^{+} given by the classical convolution can be recovered by using an expansion for small cc in the hypergeometric functions entering the expression of G(c)G^{(c)}. For the numerator, we get:

F12(c2,1+c2;c;1z2)=1+k=1(12+oc(1))1z2k,{}_{2}F_{1}\left(\frac{c}{2},1+\frac{c}{2};c;\frac{1}{z^{2}}\right)=1+\sum_{k=1}^{\infty}(\frac{1}{2}+o_{c}(1))\frac{1}{z^{2k}}\,, (22)

that is:

F12(c2,1+c2;c;1z2)=1+1(z21)+oc(1).{}_{2}F_{1}\left(\frac{c}{2},1+\frac{c}{2};c;\frac{1}{z^{2}}\right)=1+\frac{1}{(z^{2}-1)}+o_{c}(1)\,. (23)

Similarly, the hypergeometric function in the denominator is equal to 1+oc(1)1+o_{c}(1). Combining these two asymptotic behaviors, we get for the Stietljes:

G(c0+)=1z+12z(z21),G^{(c\to 0^{+})}=\frac{1}{z}+\frac{1}{2z(z^{2}-1)}\,, (24)

which decomposes into simple elements as:

G(c0+)=14(z+1)+12z+14(z1),G^{(c\to 0^{+})}=\frac{1}{4(z+1)}+\frac{1}{2z}+\frac{1}{4(z-1)}\,, (25)

as expected for the Stieltjes transform of the centered binomial distribution, μμ=14δ1+12δ0+14δ1\mu\ast\mu=\frac{1}{4}\delta_{-1}+\frac{1}{2}\delta_{0}+\frac{1}{4}\delta_{1}.

The limiting case cc\to\infty corresponding to the free convolution requires more work, and we only sketch the main ingredients to recover the Stieltjes transform of the arcsine law. The idea is to use the integral representation [36] of the hypergeometric function:

F12(a,b;c;u)=C3\displaystyle{}_{2}F_{1}\left(a,b;c;u\right)=C_{3} 01dttb1\displaystyle\int_{0}^{1}\mathrm{d}t\,t^{b-1} (26)
×(1t)cb1(1tu)a,\displaystyle\times(1-t)^{c-b-1}(1-tu)^{-a}\,,

with C4=Γ(c)Γ(cb)Γ(b)C_{4}=\frac{\Gamma(c)}{\Gamma(c-b)\Gamma(b)} and c>b>0c>b>0. If we denote by Fη(c,u):=F12(c/2,η+c/2;c;u)F_{\eta}(c,u):={}_{2}F_{1}\left(c/2,\eta+c/2;c;u\right) with η=1\eta=1 for the hypergeometric function in the numerator of Eq. (7) and η=0\eta=0 for the denominator, this means that we can write Fη(c,u)F_{\eta}(c,u) as:

Fη(c,u)01dtec2g(t,u)h(t),F_{\eta}(c,u)\propto\int_{0}^{1}\mathrm{d}t\,\mathrm{e}^{\frac{c}{2}g(t,u)}h(t)\,, (27)

with g(t,z):=log(t(1t))log(1tu)g(t,z):=\log(t(1-t))-\log(1-tu) and h(t):=tη1(1t)1ηh(t):=t^{\eta-1}(1-t)^{1-\eta}. As cc\to\infty, Eq. (27) can be approximated by Laplace method and the results writes:

Fη(c,u)cKc(u)(1u)η/2,F_{\eta}(c,u)\underset{c\to\infty}{\sim}K_{c}(u)(1-u)^{-\eta/2}\,, (28)

where Kc(u)K_{c}(u) is a function independent of the parameter η{\eta}. Thus, if we inject this asymptotic behavior in Eq. (7) we get:

G(c)=1z111z2,G^{(c\to\infty)}=\frac{1}{z}\frac{1}{\sqrt{1-\frac{1}{z^{2}}}}\,, (29)

which is indeed the Stietljes transform of μμ\mu\boxplus\mu, see [15].

Computing the density - The explicit expression for the density is obtained thanks to the Sokochi-Plemelj formula of Eq. (5) and the expression of Eq. (7) for the Stietljes transform, by looking carefully at the behavior of the hypergeometric functions near their branch cuts. As the situation is very similar for both the numerator and denominator, we only detail the complete computation for the latter case. The idea is to use both the integral representation of Eq. (26) for a=c/2a=c/2 and b=η+c/2b=\eta+c/2 and the behavior of the power function near its branch cut. For u>0u>0, we have (u+i0+)α=uα(u+\mathrm{i}0^{+})^{\alpha}=u^{\alpha} but otherwise:

(u+i0+)α=cos(πα)|u|α+isin(πα)|u|α.(-u+\mathrm{i}0^{+})^{\alpha}=\cos(\pi\alpha)|u|^{\alpha}+\mathrm{i}\sin(\pi\alpha)|u|^{\alpha}\,. (30)

As zxi0+z\to x-\mathrm{i}0^{+} with x[1,1]x\in[-1,1], we have

(1t/z2)c/2=(1t/x2+isign(x)0+)c/2,(1{-}t/z^{2})^{-c/2}{=}(1{-}t/x^{2}{+}\mathrm{i}\,\mathrm{sign}(x)\mathrm{0}^{+})^{-c/2}\,, (31)

such that we need to differentiate the cases t<x2t<x^{2} and t>x2t>x^{2} in Eq. (26). Since the distribution μcμ\mu\oplus_{c}\mu is symmetric, we also fix x>0x>0. Thus, if we introduce the two functions J1,2(x)J_{1,2}(x) corresponding respectively to the split of the integral of Eq. (26) for a=c/2a=c/2 and b=c/2b=c/2, into the segment [0,x2][0,x^{2}] and [x2,1][x^{2},1]:

J1(x):=C40x2dt(t(1t))c/2(1tx2)c/2,J_{1}(x):=C_{4}\int_{0}^{x^{2}}\mathrm{d}t\,(t(1-t))^{-c/2}\left(1-\frac{t}{x^{2}}\right)^{-c/2}\,, (32)

and

J2(x):=C4x21dt(t(1t))c/2(tx21)c/2,J_{2}(x):=C_{4}\int_{x^{2}}^{1}\mathrm{d}t\,(t(1-t))^{-c/2}\left(\frac{t}{x^{2}}-1\right)^{-c/2}\,, (33)

then the real and imaginary parts of the hypergeometric function in the denominator of Eq. (7) are given by:

𝔢F12(c2,c2;c;1(xi0+)2)=J1(x)+cos(πc2)J2(x),\mathfrak{Re}\,{}_{2}F_{1}\left(\frac{c}{2},\frac{c}{2};c;\frac{1}{(x{-}\mathrm{i}0^{+})^{2}}\right)=J_{1}(x)+\cos\left(\frac{\pi c}{2}\right)J_{2}(x)\,, (34)

and

𝔪F12(c2,c2;c;1(xi0+)2)=sin(πc2)J2(x).\mathfrak{Im}\,{}_{2}F_{1}\left(\frac{c}{2},\frac{c}{2};c;\frac{1}{(x{-}\mathrm{i}0^{+})^{2}}\right)=-\sin\left(\frac{\pi c}{2}\right)J_{2}(x)\,. (35)

If we now perform the change of variable tx2tt\to x^{2}t in Eq. (32), we can rewrite J1(x)J_{1}(x) as:

J1(x)=C4xc01dttc/21(1t)c/2(1x2t)c/21.J_{1}(x)=C_{4}\,x^{c}\int_{0}^{1}\mathrm{d}t\,t^{c/2-1}\left(1-t\right)^{-c/2}\left(1-x^{2}t\right)^{-c/2-1}\,. (36)

By Eq. (26) and up to a multiplicative constant we recognize the integral in Eq. (36) as the hypergeometric function V1(x):=F12(1c2,c2,1;x)V_{1}(x):={}_{2}F_{1}\left(1-\frac{c}{2},\frac{c}{2},1;x\right). The multiplicative constant can be simplified thanks to the complement formula of the Gamma function Γ(1z)Γ(z)=π/sin(πz)\Gamma(1-z)\Gamma(z)=\pi/\sin(\pi z) for zz\in\mathbb{C}\setminus\mathbb{Z}, and we finally obtain:

J1(x)=πΓ(c)Γ(c2)2sin(cπ2)xcV1(x2).J_{1}(x)=\frac{\pi\Gamma(c)}{\Gamma\left(\frac{c}{2}\right)^{2}\sin\left(\frac{c\pi}{2}\right)}x^{c}\,V_{1}(x^{2})\,. (37)

Note that the integral representation of Eq. (36) is actually only valid for c(0,2)c\in(0,2) but by analytic continuation of the hypergeometric function, the result holds for any c>0c>0 such that 2c2c\notin\mathbb{N}, due to the presence of the inverse of the sinus function in Eq. (37).

Similarly, by the change of variable tx2+(1x2)t2t\to x^{2}+(1-x^{2})t^{2} in Eq. (33), J2(x)J_{2}(x) can be expressed as:

J2(x)=πΓ(c)Γ(c2)2sin(cπ2)xcV1(1x2).J_{2}(x)=\frac{\pi\Gamma(c)}{\Gamma\left(\frac{c}{2}\right)^{2}\sin\left(\frac{c\pi}{2}\right)}x^{c}\,V_{1}(1-x^{2})\,. (38)

Thanks to Eq. (34) and Eq. (35), one has the complete behavior near the branch cut for the denominator of Eq. (7).

We then sketch the remaining steps to get the analytical expression for density: one can then repeat the exact same previous computation for the numerator of Eq. (7), where instead of the function V1(.)V_{1}(.), the function V2(.)V_{2}(.) will appear (with a different multiplicative constant) when splitting the integral into the segments [0,x2][0,x^{2}] and [x2,1][x^{2},1]. All in all, one gets the density by taking the imaginary part of the entire expression, divided by π\pi. After simplification with the trigonometric identity cos(cπ/2)2+sin(cπ/2)2=1\cos(c\pi/2)^{2}{+}\sin(c\pi/2)^{2}{=}1, appearing when one multiplies the denominator of Eq. (7) by its conjugate, one gets the desired expression of Eq. (10) for the density.

IV Conclusion

In this note, we studied the high-temperature convolution introduced in Ref [1], between two symmetric Bernoulli distributions. Our result provides a new family of distribution, indexed by the parameter cc of the high-temperature convolution, interpolating between the (shifted) binomial distribution with parameter (2,1/2)(2,1/2) and the (shifted and re-scaled) arcsine law. This family of distribution constitutes the first non-trivial case for the analytical expression of the high-temperature, and we believe that the ideas developed in this note can be used to obtain the density of the high-temperature convolution in other cases. The obtained distribution μcμ\mu\oplus_{c}\mu is absolutely continuous between each singular points (here being given by {1,0,1}\{-1,0,1\}) of the classical convolution μμ\mu\ast\mu, and we conjectured this phenomenon to be a specific feature of the high-temperature convolution. Our result paves the way for a better understanding of this new convolution and can serve as a benchmark for future construction of the underlying linear algebra operation.

V Acknowledgments

The author would like to warmly thank M. Potters and J-P. Bouchaud for fruitful discussions concerning the high-temperature convolution and precious comments regarding this note.

References