From law of the iterated logarithm to Zolotarev distance for supercritical branching processes in random environment
Abstract
Consider a supercritical branching process in an independent and identically distributed environment. Based on some recent development in martingale limit theory, we established law of the iterated logarithm, strong law of large numbers, invariance principle and optimal convergence rate in the central limit theorem under Zolotarev and Wasserstein distances of order for the process .
keywords:
Branching processes in random environment , Law of the iterated logarithm , Law of large numbers , Convergence rates in central limit theorem , Zolotarev distance , Wasserstein distanceMSC:
60J80; 60K37; 60F05; 62E201 Introduction
Branching process in random environment (BPRE), initially introduced by Smith and Wilkinson [19], is a generalization of Galton-Watson process. The asymptotic behaviours for BPRE have been studied widely and intensively since the 1970s. For instance, one may refer to Athreya and Karlin [2, 1], Tanny [21, 22], Guivarc’h and Liu [8] and Birkner et al. [3] for some classical results on branching processes in independent and identically distributed (i.i.d.) environment or stationary and ergodic environment; Le Page and Ye [14] and Grama et al. [5, 6] for branching processes in Markovian environment. Regarding to the recent advances especially for supercritical branching processes in i.i.d. environment, for instance, Huang and Liu [11, 12] and Li et al. [15] established the large and moderate deviation principles, convergence theorems in space and the central limit theorem (CLT); Grama et al. [7] studied asymptotic distribution and harmonic moments.
The main objective of this paper is to establish some additional limit theorems for supercritical BPRE, which have rarely appeared in the literature so far, including law of the iterated logarithm (LIL), strong law of large numbers (LLN), invariance principle and the rate of convergence in the CLT under Zolotarev and Wasserstein distances. The study of Wasserstein and Zolotarev distances are of independent interests, as they have been widely used in applications nowadays, such as machine learning ([9]), computer vision ([16], [20]) and microbiome studies ([23]). The methods used in this paper are mainly based on some recent development in martingale limit theory, especially on the invariance principles and LIL for martingales. One may refer to Hall and Heyde [10] and Shao [18] for the martingale limit theory and invariance principles; Wu et al. [24] for some recent advances on a Berry-Esseen bound for martingales.
Let’s firstly introduce the model of BPRE. Suppose is a sequence of i.i.d. random variables. Usually, is called environment process and represents the random environment in the -th generation. A discrete-time random process is called BPRE , if it satisfies the following recursive relation:
where represents the number of offspring produced by the -th particle in the -th generation. The distribution of depending on the environment is denoted by Suppose that given , is a sequence of i.i.d. random variables; moreover, is independent of . Let be the probability space under which the process is defined when the environment is given. The state space of the random environment is denoted by and the total probability space can be regarded as the product space where That is, for any measurable positive function defined on , we have
where represents the distribution law of the random environment . And can be regarded as the conditional probability of given the environment . The expectations with respect to (w.r.t) and are denoted by and , respectively. For any environment , integer and real number , define
Then
Consider the following random variables
It is easy to see that
Then by recursion, we get . Denote
The branching process is called supercritical, critical or subcritical according to , or , respectively. Over all the paper, assume that
(1.1) |
The condition (1.1) means that each particle has at least one offspring, which implies a.s. and hence a.s. for any .
Let’s introduce now Zolotarev distance and Wasserstein distance respectively between two probability laws. For any , denote by the largest integer which is strictly less than , i.e.
Set . Then can be decomposed uniquely as , with . In the sequel, we will use this notation for the unique decomposition of any . For a given , consider , the class of -times continuously differentiable real-valued functions, defined by
For -times continuously differentiable function , set
Then for any ,
(1.2) |
Consider a set of probability laws on with marginals and . Zolotarev distance of order between and is defined by
For any , the Wasserstein distance of order between and is defined by
Let be the collection of all Borel probability measures on with finite absolute moments of order ; then forms a metric space, which is closely related to the topology of weak convergence of probability distributions on .
According to Kantorovich and Rubinstein [13], for any ,
(1.3) |
From Rio [17], it is proved that for any ,
(1.4) |
where is a constant depending only on .
Throughout the paper, denotes a positive constant whose value may vary from line to line. If and are two random variables, such that they follow the probability laws and respectively, then (resp. ) is simply denoted by (resp. ). We will use to represent the -distributed random variable with variance ; in particular, represents the standard normal random variable. And , for any .
2 Main results
In the sequel, denote by Evidently, is a sequence of i.i.d. random variables depending only on the environment . Let be the random walk associated with the branching process, which is defined as follows:
Then we have the following decomposition of :
(2.5) |
where The normalized population size is a non-negative martingale under both and , w.r.t the natural filtration , defined by
By Doob’s martingale convergence theorem and Fatou’s lemma, we can obtain that converges a.s. to a finite limit and . We assume the following conditions throughout this paper:
(2.6) |
The first condition above together with (1.1) imply in particular that
The second condition in (2.6) implies that converges to in and
(See e.g. Tanny [22]). Therefore, it follows with the assumption (1.1) that and a.s.
In the sequel, we will need the following conditions.
Condition 1
There exists a constant , such that
Condition 2
There exist some constants and such that
We have the following LIL for the process .
Theorem 2.1
From the LIL theorem above, we can find immediately the following strong LLN for .
Corollary 2.1
Under the same condition of Theorem 2.1, then
We also have the following invariance principle for the process , which only requires the existence of the second order moment . And this has been insured in the assumption (2.6).
Theorem 2.2
Let be a standard Brownian motion. Then there exists a common probability space for and , such that the following holds:
as .
We have the following convergence rates of in the CLT under Zolotarev and Wasserstein distances respectively.
3 Proof of main results
Consider a sequence of martingale differences , defined on a probability space , w.r.t a filtration . Set
Then is a martingale. Set and . We have the following lemma (see also Corollary 4.2 and Theorem 4.8 in Hall and Heyde [10]), from which Theorem 2.1 can be obtained.
Lemma 3.1 (Hall and Heyde [10])
Suppose that is a zero-mean, square integrable martingale. If
(3.7) |
(3.8) |
and for some ,
(3.9) |
then
and
where for any .
Recall now that the random walk associated with the branching process
is the sum of i.i.d. random variables . In the sequel, we will use the following notations. Let , and
Then from (2.5), we have
(3.10) |
Since is a sequence of i.i.d. zero-mean random variables and , becomes a zero-mean and square integrable martingale w.r.t the filtration .
Proof of Theorem 2.1 1
Theorem 2.2 can be proved by using Theorem 2.1 in Shao [18], where the following invariance principle for the martingale with the martingale difference sequence is established.
Lemma 3.2 (Shao [18], Theorem 2.1)
Suppose that is a square integrable martingale and is a standard Brownian motion. Let , , . Assume that there exists a non-decreasing positive valued sequence with , such that, as ,
and
Then there exists a common probability space for and such that, as ,
Proof of Theorem 2.2 1
To prove Theorem 2.3, the following lemmas will be used.
Lemma 3.3
Proof 1
By Lyapunov’s inequality, it is enough to prove the assertion for . Let . Using truncation, we have
For the first term on the right-hand side above, we have
For the second term, let be the annealed Laplace transform of , defined by
By Markov’s inequality, we have for ,
Then
(3.12) |
From (3.20) of Grama et al. [7], it is proved that under Condition 2,
(3.13) |
for some constants , depending on and , and for all . Therefore, combining (1) with (3.13), we get
where is the Gamma function defined by , for . We hence obtain
(3.14) |
Lemma 3.4
Suppose that . If , then for any , there exists between and , such that
Proof 2
Proof of Theorem 2.3 1
For any , let and . Then becomes a sequence of zero-mean random variables. Moreover, by (3.10), we have the following decomposition, for ,
(3.17) |
By the triangle inequality and (3.17),
(3.18) |
where , and . We will estimate respectively , and .
For , recall that under Condition 1, is a centred martingale with i.i.d. martingale differences in and . According to Theorem 2.1 of Dedecker et al. [4], with , we can obtain for any ,
(3.19) |
Notice that for any pair of real-valued random variables and real constant ,
(3.20) |
Therefore, the inequality (3.19) implies that for any ,
(3.21) |
Let’s estimate now . Suppose that and . We will discuss in the following two cases: when or , respectively. If , then . By the definition of , we have for any ,
(3.22) |
Since and are both -measurable, by using (3.22), we have for any ,
Since for , the function is increasing on , and by using Lemma 3.3, we get
(3.23) |
If , then . By Lemma 3.4, we have there exists between and , such that
Hence,
(3.24) |
Now, let’s study , with . Since for , the function is increasing on , we have
(3.25) |
For , applying Lyapunov’s inequality to the two terms on the right-hand side, we obtain
(3.26) |
For the first term above, from the assumption (2.6), the LLN for and Lemma 3.3, we have
(3.27) |
With the same arguments, we can obtain
(3.28) |
Using Lemma 3.3 again, we also have
(3.29) |
Combining (1) for the case with (1) - (3.29) for the case , we can conclude that for any ,
(3.30) |
Now, let’s deal with . By Lemma 3.3, we have . Taking into account Remark 5.1 and applying Items 1 and 2 of Lemma 5.2 in Dedecker et al. [4] with therein, then we can obtain for any ,
(3.31) |
Now, combining (1), (3.21), (3.30) and (3.31), we have for any ,
Therefore, the theorem is derived by taking into account
References
- Athreya and Karlin [1971a] Athreya, K.B., Karlin, S., 1971a. Branching processes with random environments. \@slowromancapii@. Limit theorems. Ann. Math. Statist. 42, 1843–1858. https://doi.org/10.1214/aoms/1177693051
- Athreya and Karlin [1971b] Athreya, K.B., Karlin, S., 1971b. On branching processes with random environments. \@slowromancapi@. Extinction probabilities. Ann. Math. Statist., 42, 1499–1520. https://doi.org/10.1214/aoms/1177693150
- Birkner et al. [2005] Birkner, M., Geiger, J., Kersting, G., 2005. Branching Processes in Random Environment — A View on Critical and Subcritical Cases, in: Deuschel, J.D., Greven, A. (Eds.), Interacting Stochastic Systems. Springer, Berlin, Heidelberg.
- Dedecker et al. [2009] Dedecker, J., Merlevède, F., Rio, E., 2009. Rates of convergence for minimal distances in the central limit theorem under projective criteria. Electron. J. Probab., 14, 978–1011. https://doi.org/10.1214/EJP.v14-648
- Grama et al. [2019] Grama, I., Lauvergnat, R., Le Page, E., 2019. The survival probability of critical and subcritical branching processes in finite state space Markovian environment. Stochastic Process. Appl., 129(7), 2485–2527. https://doi.org/10.1016/j.spa.2018.07.016
- Grama et al. [2022] Grama, I., Lauvergnat, R., Le Page, E., 2022. Limit theorems for critical branching processes in a finite-state-space Markovian environment. Advances in Applied Probability, 54(1), 111-140. https://doi.org/10.1214/aoms/1177693150
- Grama et al. [2023] Grama, I., Liu, Q., Miqueu, E., 2023. Asymptotics of the distribution and harmonic moments for a supercritical branching process in a random environment. Ann. Inst. H. Poincaré Probab. Statist., 59(4), 1934–1950. DOI: 10.1214/22-AIHP1318
- Guivarc’h and Liu [2001] Guivarc’h, Y., Liu, Q., 2001. Propriété asymptotiques des processus de branchement en environnement aléatoire. C. R. Acad. Sci. Série I, 332(4), 339–344. https://doi.org/10.1016/S0764-4442(00)01783-3
- Gulrajani et al. [2017] Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A. C., 2017. Improved training of Wasserstein GANs, in: I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan and R. Garnett (Eds), Advances in Neural Information Processing Systems 30, Curran Associates, Inc., pp. 5769–5779.
- Hall and Heyde [1980] Hall, P., Heyde, C.C., 1980. Martingale limit theory and its Application, A volume in Probability and Mathematical Statistics. Academic Press.
- Huang and Liu [2012] Huang, C., Liu, Q., 2012. Moments, moderate and large deviations for a branching process in a random environment. Stochastic Process. Appl., 122, 522–545. https://doi.org/10.1016/j.spa.2011.09.001
- Huang and Liu [2014] Huang, C., Liu, Q., 2014. Convergence in and its exponential rate for a branching process in a random environment. Electron. J. Probab., 19(104), 1–22. https://doi.org/10.1214/EJP.v19-3388
- Kantorovich and Rubinstein [1958] Kantorovich, L. V., Rubinstein, G. Sh., 1958. Sur un espace de fonctions complètement additives. Vestnik Leningrad Univ., 13 (Ser. Mat. Astron. 2), 52–59.
- Le Page and Ye [2010] Le Page, E., Ye, Y., 2010. The survival probability of a critical branching process in a Markovian random environment. Comptes Rendus Mathématique, 348(5-6), 301–304. https://doi.org/10.1016/j.crma.2010.01.014
- Li et al. [2014] Li, Y., Liu, Q., Gao, Z., Wang, H., 2014. Asymptotic properties of supercritical branching processes in random environments. Front. Math. China, 9(4), 737–751. https://doi.org/10.1007/s11464-014-0397-z
- Ni et al. [2009] Ni, K., Bresson, X., Chan, T., Esedoglu, S., 2009. Local histogram based segmentation using the Wasserstein distance, International Journal of Computer Vision, 84, 97–111.
- Rio [1998] Rio, E., 1998. Distances minimales et distances idéales. Comptes Rendus de l’Académie des Sciences - Series \@slowromancapi@ - Mathematics, 326(9), 1127–1130. https://doi.org/10.1016/S0764-4442(98)80074-8
- Shao [1993] Shao, Q., 1993. Almost sure invariance principles for mixing sequences of random variables. Stochastic Process. Appl., 48(2), 319–334. https://doi.org/10.1016/0304-4149(93)90051-5
- Smith and Wilkinson [1969] Smith, W.L., Wilkinson, W.E., 1969. On branching processes in random environments. Ann. Math. Stat., 40(3), 814–827. https://doi.org/10.1214/aoms/1177697589
- Solomon et al. [2015] Solomon, J., De Goes, F., Peyré, G., Cuturi, M., Butscher, A., Nguyen, A., Du, T., and Guibas, L. 2015. Convolutional Wasserstein distances: efficient optimal transportation on geometric domains. ACM Transactions on Graphics (TOG), 34(4), 1–-11. https://doi.org/10.1145/2766963
- Tanny [1977] Tanny, D., 1977. Limit theorems for branching processes in a random environment. The Annals of Probability, 5(1), 100–116. https://doi.org/10.1214/aop/1176995894
- Tanny [1988] Tanny, D., 1988. A necessary and sufficient condition for a branching process in a random environment to grow like the product of its means. Stochastic Process. Appl., 28(1), 123–139. https://doi.org/10.1016/0304-4149(88)90070-1
- Wang et al. [2021] Wang, S., Cai, T.T., Li, H., 2021 Optimal estimation of Wasserstein distance on a tree with an application to microbiome studies. Journal of the American Statistical Association, 116(535), 1237–1253. https://doi.org/10.1080/01621459.2019.1699422
- Wu et al. [2020] Wu, S., Ma, X., Sang, H., Fan, X., 2020. A Berry-Esseen bound of order for martingales. Comptes Rendus Mathématique, 358(6): 701–712. https://doi.org/10.5802/crmath.81