This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Re-examining aggregation in the Tallis-Leyton model of parasite acquisition

R. McVinish School of Mathematics and Physics, University of Queensland [email protected]
Abstract.

The Tallis-Leyton model is a simple model of parasite acquisition where there is no interaction between the host and the acquired parasites. We examine the effect of model parameters on the distribution of the host’s parasite burden in the sense of the Lorenz order. This fits with an alternate view of parasite aggregation that has become widely used in empirical studies but is rarely used in the analysis of mathematical models of parasite acquisition.

1. Introduction

The distribution of parasites among its host population typically displays a high degree of inequality. In fact, this phenomenon, called aggregation, is almost a universally observed (Shaw and Dobson, 1995, Poulin, 2007). Despite its importance, aggregation lacks a universally accepted definition. Instead, the phenomenon is studied using a number of common measures of aggregation that summarise different properties of the parasite’s distribution (Pielou, 1977, McVinish and Lester, 2020, Morrill et al., 2023). The most commonly used measures of aggregation in theoretical models are the kk parameter of the negative binomial distribution (Anderson and May, 1978a, b, Rosa and Pugliese, 2002, Schreiber, 2006, McPherson et al., 2012) and the variance-to-mean ratio (Isham, 1995, Barbour and Pugliese, 2000, Herbert and Isham, 2000, Peacock et al., 2018). Both of these measures can be interpreted as quantifying how far the distribution of parasites deviates from a Poisson distribution.

An alternative view of aggregation was put forward by Poulin (1993), arguing that a measure of the discrepancy between the observed distribution of parasites in the hosts and the ideal distributions where all hosts are infected with the same number of parasites would be the best measure of aggregation. This view puts the Lorenz ordering of distributions (Lorenz, 1905, Arnold and Sarabia, 2018) central in the study of aggregation. As a measure of this discrepancy, Poulin proposed his index, D, which is essentially an estimator of the Gini index (Gini, 1924). This has since become become one of the standard measure of aggregation used in studies of wild parasite populations. Although the Gini index has been commonly used in empirical studies since Poulin’s proposal, we are unaware of any examination of its behaviour in theoretical models of parasite acquisition.

The aim of this paper is to apply Poulin’s view of aggregation to a simple stochastic model for the number of parasites in a definitive host proposed by Tallis and Leyton (1969). The Tallis-Leyton model assumes that the host makes infective contacts following a Poisson process and at each infective contact a random number of parasites enter the host. Once a parasite enters the host, it is alive for a random period of time. The lifetimes of parasites, numbers of parasites entering the host at infective contacts, and the Poisson process of infective contacts are all assumed to be independent. Furthermore, the parasites are assumed to have no effect on the host mortality.

In Section 2 we review some background on the Lorenz ordering and the closely related convex ordering of distributions. The Tallis-Leyton model is analysed in Section 3. We first show show how the host’s parasite burden can be represented as a compound Poisson distribution. This representation is then applied to determine how each of the model parameters affect the Lorenz ordering of the distribution of parasites in the host. The final part of the analysis shows that the host’s parasite burden is asymptotically normally distributed in the limit as the rate of infectious contacts goes to infinity. The paper concludes with a discussion of future challenges in analysing models of parasite aggregation.

2. Background

2.1. Tallis-Leyton model

Tallis and Leyton (1969) proposed the following model for the parasite burden M(a)M(a) of a definitive host at age aa, conditional on survival of the host to age aa. The host is assumed to be parasite free at birth so M(0)=0M(0)=0. The host makes infective contacts during its lifetime following a Poisson process with rate parameter λ\lambda. At each infective contact, the number of parasites that enter the host is a random variable NN and once a parasite enters the host it survives for a random period TT. The parasites have no effect on the host mortality. The lifetimes of parasites, numbers of parasites entering the host at infective contacts, and the Poisson process of infective contacts are all assumed to be independent. Although we won’t make use of this fact, we note that this process also describes an infinite server queue with bulk arrivals and general independent service times (Holman et al., 1983).

We let GXG_{X}, FXF_{X} and F¯X=1FX\bar{F}_{X}=1-F_{X} denote the probability generating function (PGF), distribution function, and survival function of a random variable XX. We write GM(;a)G_{M}(\cdot;a) for the PGF of M(a)M(a), conditional on survival of the host to age aa. Tallis and Leyton (1969) showed

GM(z;a)=exp(λ0a[GN(1+F¯T(as)(z1))1]𝑑s).G_{M}(z;a)=\exp\left(\lambda\int_{0}^{a}\left[G_{N}(1+\bar{F}_{T}(a-s)(z-1))-1\right]ds\right).

Standard arguments show that the mean and variance of M(a)M(a) are

μ(a)\displaystyle\mu(a) =λ𝔼N0aF¯T(s)𝑑s\displaystyle=\lambda\mathbb{E}N\int_{0}^{a}\bar{F}_{T}(s)\,ds
σ2(a)\displaystyle\sigma^{2}(a) =λ𝔼N(N1)0aF¯T2(s)𝑑s+λ𝔼N0aF¯T(s)𝑑s.\displaystyle=\lambda\mathbb{E}N(N-1)\int_{0}^{a}\bar{F}_{T}^{2}(s)\,ds+\lambda\mathbb{E}N\int_{0}^{a}\bar{F}_{T}(s)\,ds.

Hence, the variance-to-mean ratio is

VMR(M(a))=1+𝔼N(N1)𝔼N0aF¯T2(s)𝑑s0aF¯T(s)𝑑s.\text{VMR}(M(a))=1+\frac{\mathbb{E}N(N-1)}{\mathbb{E}N}\frac{\int^{a}_{0}\bar{F}^{2}_{T}(s)ds}{\int^{a}_{0}\bar{F}_{T}(s)ds}. (1)

Assuming 𝔼N<\mathbb{E}N<\infty and 𝔼T<\mathbb{E}T<\infty, the limiting distribution of parasite burden as aa\to\infty exists and has PGF

GM(z,)=exp(λ0[GN(1+F¯T(s)(z1))1]𝑑s).G_{M}(z,\infty)=\exp\left(\lambda\int_{0}^{\infty}\left[G_{N}(1+\bar{F}_{T}(s)(z-1))-1\right]ds\right).

Since scaling of the host age, the lifetime distribution of parasites, and the inverse of the rate of infective contacts by a common factor results in the same distribution for the host’s parasite burden, we assume that the expected lifetime of a parasite is 1.

2.2. Convex order and Lorenz order

Lorenz (1905) proposed the Lorenz curve as a graphical measure of inequality. The following general definition of the Lorenz curve was given by Gastwirth (1971).

Definition.

The Lorenz curve L:[0,1][0,1]L:[0,1]\to[0,1] for the distribution FF with finite mean μ\mu is given by

L(u)=0uF1(y)𝑑yμ,L(u)=\frac{\int_{0}^{u}F^{-1}(y)\,dy}{\mu},

where F1F^{-1} is the quantile function

F1(y)=sup{x:F(x)y}for y(0,1).F^{-1}(y)=\sup\{x:F(x)\leq y\}\quad\text{for }y\in(0,1).

Adapting the description in Arnold and Sarabia (2018, Section 3.1) to a parasitology context, the Lorenz curve L(u)L(u) represents the proportion of the parasite population infecting the least infected uu proportion of the host population. When all hosts are infected with the same number of parasite, the Lorenz curve is given by L(u)=uL(u)=u and is called the egalitarian line. Note that LX(u)uL_{X}(u)\geq u for all u[0,1]u\in[0,1].

The Lorenz curve defines a partial order on the class of all distributions on [0,)[0,\infty) with finite mean (Arnold and Sarabia, 2018, Definition 3.2.1).

Definition.

Let XX and YY be random variables with the respective Lorenz curves denoted LXL_{X} and LYL_{Y}. We say XX is smaller in the Lorenz order, denoted XLorenzYX\leq_{\rm{Lorenz}}Y if LX(u)LY(u)L_{X}(u)\geq L_{Y}(u) for every u[0,1]u\in[0,1].

The Lorenz curves of some standard distributions are given in Arnold and Sarabia (2018, Section 6.1). The negative binomial distribution is extensively used in parasitology. When negative binomial distributions are parameterised in terms of the mean mm and kk, they can be compared in the Lorenz order (McVinish and Lester, 2024). Specifically, let 𝖭𝖡(m,k)\mathsf{NB}(m,k) denote the negative binomial distribution with PGF

G(z)=(kk+mmz)k.G(z)=\left(\frac{k}{k+m-mz}\right)^{k}.

Then

  • (i)

    for any k>0k>0 and 0<m1<m20<m_{1}<m_{2}, 𝖭𝖡(k,m2)Lorenz𝖭𝖡(k,m1)\mathsf{NB}(k,m_{2})\leq_{\rm Lorenz}\mathsf{NB}(k,m_{1}), and

  • (ii)

    for any m>0m>0 and 0<k1<k20<k_{1}<k_{2}, 𝖭𝖡(k2,m)Lorenz𝖭𝖡(k1,m)\mathsf{NB}(k_{2},m)\leq_{\rm Lorenz}\mathsf{NB}(k_{1},m).

Closely related to the Lorenz order is the convex order of distributions.

Definition.

Let XX and YY be two random variables such that 𝔼X=𝔼Y\mathbb{E}X=\mathbb{E}Y. We say XX is small than YY in the convex order, denoted XcxYX\leq_{\rm cx}Y, if 𝔼ϕ(X)𝔼ϕ(Y)\mathbb{E}\phi(X)\leq\mathbb{E}\phi(Y) for all convex functions ϕ:\phi:\mathbb{R}\to\mathbb{R}, provided the expectations exist.

These two orderings are related since XLorenzYX\leq_{\rm Lorenz}Y if and only if

𝔼ϕ(X𝔼X)𝔼ϕ(Y𝔼Y)\mathbb{E}\,\phi\left(\frac{X}{\mathbb{E}X}\right)\leq\mathbb{E}\,\phi\left(\frac{Y}{\mathbb{E}Y}\right)

for every continuous convex function ϕ\phi (Arnold and Sarabia, 2018, Corollary 3.2.1). In other words,

X\displaystyle X LorenzY\displaystyle\leq_{\rm Lorenz}Y is equivalent to X𝔼X\displaystyle\frac{X}{\mathbb{E}X} cxY𝔼Y.\displaystyle\leq_{\rm cx}\frac{Y}{\mathbb{E}Y}.

Shaked and Shanthikumar (2007, Section 3.A) provide an extensive review of results on the convex order. We briefly mention some of the important results that are used in our analysis.

  • The convex order is closed under weak limits provided the expectations also converge (Shaked and Shanthikumar, 2007, Theorem 3.A.12 (c)).

  • The convex order is closed under mixtures (Shaked and Shanthikumar, 2007, Theorem 3.A.12 (b)). Let XX, YY, and Θ\Theta be random variables and write [XΘ=θ][X\mid\Theta=\theta] and [YΘ=θ][Y\mid\Theta=\theta] for the conditional distributions of XX and YY given Θ=θ\Theta=\theta. If [XΘ=θ]cx[YΘ=θ][X\mid\Theta=\theta]\leq_{\rm cx}[Y\mid\Theta=\theta] for all θ\theta in the support of Θ\Theta, then XcxYX\leq_{\rm cx}Y. As an application of this property we can say that if XcxYX\leq_{\rm cx}Y and ZZ is an independent non-negative random variable, then ZXcxZYZX\leq_{\rm cx}ZY.

  • The convex order is closed under convolutions (Shaked and Shanthikumar, 2007, Theorem 3.A.12 (d)). Let X1,X2,,XkX_{1},X_{2},\ldots,X_{k} and Y1,Y2,,YkY_{1},Y_{2},\ldots,Y_{k} be two sets of independent random variables . If XjcxYjX_{j}\leq_{\rm cx}Y_{j} for j=1,2,,kj=1,2,\ldots,k, then

    j=1kXjcxj=1kYj.\sum_{j=1}^{k}X_{j}\leq_{\rm cx}\sum_{j=1}^{k}Y_{j}.
  • Combining the properties of closure under mixtures and closure under convolutions, we see the convex order is closed under random sums so

    j=1KXjcxj=1KYj,\sum_{j=1}^{K}X_{j}\leq_{\rm cx}\sum_{j=1}^{K}Y_{j},

    for any non-negative integer random variable KK. As an application of the closure under random sums property of the convex order, consider two random variables KK and K~\tilde{K} that related by binomial thinning. That is, GK~(z)=GK(1p+pz)G_{\tilde{K}}(z)=G_{K}(1-p+pz) for some p(0,1)p\in(0,1). Then KLorenzK~K\leq_{\rm Lorenz}\tilde{K} (McVinish and Lester, 2020, Section 3)

  • The closure under random sums property can be adapted to the case where the X1,X2,X_{1},X_{2},\ldots and Y1,Y2,Y_{1},Y_{2},\ldots are two iid sequences with XcxYX\leq_{\rm cx}Y, and K1K_{1} and K2K_{2} are non-negative integer random variables such that K1cxK2K_{1}\leq_{\rm cx}K_{2}. In this case, Shaked and Shanthikumar (2007, Theorem 3.A.13) implies

    j=1K1Xjcxj=1K2Yj.\sum_{j=1}^{K_{1}}X_{j}\leq_{\rm cx}\sum_{j=1}^{K_{2}}Y_{j}.
  • The survival function can be used to establish if two random variables can be compared in the convex order. If XX and YY are two random variables with the same mean, then XcxYX\leq_{\rm cx}Y if F¯XF¯Y\bar{F}_{X}-\bar{F}_{Y} has a single sign change and the sign sequence is +,+,- (Shaked and Shanthikumar, 2007, Theorem 3.A.44(b)). This property can also be used to characterise the convex order (Shaked and Shanthikumar, 2007, Theorem 3.A.45).

2.3. Measures of aggregation

In practice, levels of aggregation are compared with numerical summaries rather than using the entire Lorenz curve. A useful measure of aggregation respects the Lorenz ordering. That is, if I()I(\cdot) is a measure of aggregation that respects the Lorenz ordering and XLorenzYX\leq_{\rm Lorenz}Y, then I(X)I(Y)I(X)\leq I(Y). Arnold and Sarabia (2018, Chapter 5) review several inequality measures and these can be applied as measures of aggregation. We restrict our attention in this paper to the Gini index, the Hoover index (also known as the Pietra index, or the Robin-Hood index), 1prevalence1-\text{prevalence}, and the coefficient of variation.

The Gini index (Gini, 1924) is given by twice the area between the egalitarian line and the Lorenz curve. For a random variable XX, the Gini index can be expressed as

G(X)=𝔼|XX~|2𝔼X,G(X)=\frac{\mathbb{E}\,|X-\tilde{X}|}{2\,\mathbb{E}X},

where X~\tilde{X} is an independent random variable with X~=dX\tilde{X}\stackrel{{\scriptstyle d}}{{=}}X. The Hoover index is given by the maximum vertical distance between the egalitarian line and the Lorenz curve. McVinish and Lester (2020) argue that this index could be useful due to its simple interpretation as the proportion of the parasite population that would need to be redistributed among the hosts in order for all hosts to have the same parasite burden. The Hoover index can be expressed as

H(X)=𝔼|X𝔼X|2𝔼XH(X)=\frac{\mathbb{E}\left|X-\mathbb{E}X\right|}{2\,\mathbb{E}X}

Prevalence, the probability that a host is infected by at least one parasite, is an important quantity in parasitology. Although prevalence is not usually thought of as a measure of aggregation, we may express 1prevalence1-\text{prevalence} in terms of the Lorenz curve as

1prevalence=max{u:L(u)=0}.1-\text{prevalence}=\max\{u:L(u)=0\}.

For the Tallis-Leyton model,

1prevalence=GM(0;a)=exp(λ0a[GN(1F¯T(as))1]𝑑s).1-\text{prevalence}=G_{M}(0;a)=\exp\left(\lambda\int_{0}^{a}\left[G_{N}(1-\bar{F}_{T}(a-s))-1\right]ds\right). (2)

Finally, the coefficient of variation is given by

CV(X)=Var(X)𝔼X.CV(X)=\frac{\sqrt{\text{Var}(X)}}{\mathbb{E}X}.

This measure is rarely used in parasitology, though it is mentioned in some reviews on parasite aggregation such as Wilson et al. (2001) and McVinish and Lester (2020). As means and variances are commonly reported in empirical studies and are often easily calculated for theoretical models, it may be useful in some contexts. For example, the squared coefficient of variation for the Tallis-Leyton model is

CV2(M(a))=1λ𝔼N(N1)(𝔼N)20aF¯T2(s)𝑑s(0aF¯T(s)𝑑s)2+1λ𝔼N0aF¯T(s)𝑑s.\text{CV}^{2}(M(a))=\frac{1}{\lambda}\frac{\mathbb{E}N(N-1)}{(\mathbb{E}N)^{2}}\frac{\int^{a}_{0}\bar{F}^{2}_{T}(s)ds}{\left(\int^{a}_{0}\bar{F}_{T}(s)ds\right)^{2}}+\frac{1}{\lambda\mathbb{E}N\int^{a}_{0}\bar{F}_{T}(s)ds}. (3)

These indices are related by the following inequality

1prevalenceH(X)G(X)H(X)(2H(X))CV(X)1-\text{prevalence}\leq H(X)\leq G(X)\leq H(X)(2-H(X))\leq CV(X)

(Taguchi, 1968, McVinish and Lester, 2020). In particular, if 𝔼X1\mathbb{E}X\leq 1, then we have the equality 1prevalence=H(X)1-\text{prevalence}=H(X). The Gini index and Hoover index can be further related to the coefficient of variation when the distribution of parasites is approximately normal. Suppose X1,X2,X_{1},X_{2},\ldots is a sequence of random variables such that

Xn𝔼XnVar(Xn)=:ZndZ,\frac{X_{n}-\mathbb{E}X_{n}}{\sqrt{\text{Var}(X_{n})}}=:Z_{n}\stackrel{{\scriptstyle d}}{{\to}}Z,

where Z𝖭(0,1)Z\sim\mathsf{N}(0,1). As Xn0X_{n}\geq 0 with probability one, the above limit is only possible if CV(Xn)0CV(X_{n})\to 0. Nevertheless, the ratio of the Hoover index to the coefficient of variation still has a well defined limit. The Hoover index of XnX_{n} can be expressed as

H(Xn)=𝔼|Xn𝔼Xn|2𝔼Xn=Var(Xn)2𝔼Xn𝔼|Zn|.H(X_{n})=\frac{\mathbb{E}\left|X_{n}-\mathbb{E}X_{n}\right|}{2\mathbb{E}X_{n}}=\frac{\sqrt{\text{Var}(X_{n})}}{2\mathbb{E}X_{n}}\mathbb{E}|Z_{n}|.

Since 𝔼Zn2=1\mathbb{E}Z_{n}^{2}=1, this collection of random variables is uniformly integrable and 𝔼|Zn|𝔼|Z|=2/π\mathbb{E}|Z_{n}|\to\mathbb{E}|Z|=\sqrt{2/\pi}. Hence,

H(Xn)CV(Xn)12π.\frac{H(X_{n})}{CV(X_{n})}\to\frac{1}{\sqrt{2\pi}}. (4)

Similarly, the Gini index of XnX_{n} can be expressed as

G(Xn)=𝔼|XnX~n|2𝔼Xn=Var(Xn)2𝔼Xn𝔼|ZnZ~n|,G(X_{n})=\frac{\mathbb{E}|X_{n}-\tilde{X}_{n}|}{2\mathbb{E}X_{n}}=\frac{\sqrt{\text{Var}(X_{n})}}{2\mathbb{E}X_{n}}\mathbb{E}|Z_{n}-\tilde{Z}_{n}|,

where Z~n\tilde{Z}_{n} is an independent random variable with Z~n=dZn\tilde{Z}_{n}\stackrel{{\scriptstyle d}}{{=}}Z_{n}. Applying the asymptotic normality and uniform integrability of the ZnZ_{n},

G(Xn)CV(Xn)1π.\frac{G(X_{n})}{CV(X_{n})}\to\frac{1}{\sqrt{\pi}}. (5)

2.4. Numerical evaluation

From equation (3), the coefficient of variation can be relatively easily evaluated for the Tallis-Leyton model. Numerical integration of F¯T(s)\bar{F}_{T}(s) and F¯T2(s)\bar{F}_{T}^{2}(s) may be required, but the dependence on age and λ\lambda is explicit. Similarly, 1prevalence1-\text{prevalence} could be evaluated with a single numerical integration using (2). On the other hand, evaluation of the Hoover and Gini indices require evaluation of the probability mass function. In the examples of the next section, we numerically evaluate the probability mass function of M(a)M(a) by inverting GM(z;a)G_{M}(z;a) using the Abate-Whitte algorithm (Abate and Whitt, 1992). The algorithm was implemented in MATLAB (The MathWorks Inc., 2022a) using the vpa function in the Symbolic Math Toolbox (The MathWorks Inc., 2022b) for high precision arithmetic.

3. Analysis of the Tallis-Leyton model

The analysis begins with a representation of the host’s parasite burden as a compound Poisson distribution. This representation is used extensively to understand how the rate of infective contacts, the distribution of the number of parasites that enter the host during an infective contact, the age of the host, and lifetime distribution of the parasites all affect the distribution of parasites in the host in terms of the Lorenz order. When comparing the host’s parasite burden in two systems, the parameters of the second parasite-host system is distinguished by a tilde.

3.1. Compound Poisson representation

Let U1,U2,U_{1},U_{2},\ldots be a sequence of independent standard uniform random variables and define X:0×[0,1]0X:\mathbb{Z}_{\geq 0}\times[0,1]\to\mathbb{Z}_{\geq 0} by

X(n,v):=k=1n𝕀(Ukv),X(n,v):=\sum_{k=1}^{n}\mathbb{I}(U_{k}\leq v),

with X(n,v)=0X(n,v)=0 when n=0n=0. For given nn and vv the distribution of X(n,v)X(n,v) is 𝖡𝗂𝗇(n,v)\mathsf{Bin}(n,v).

Theorem 1.

Assume TT has a continuous distribution. Define VV to be a random variable on [F¯T(a),1][\bar{F}_{T}(a),1] with distribution function

FV(v)=1a1F¯T1(v)F_{V}(v)=1-a^{-1}\bar{F}^{-1}_{T}(v) (6)

Let X1,X2,X_{1},X_{2},\ldots be a sequence of independent random variables with the same distribution as X(N,V)X(N,V), where NN and VV are independent. Let Λ(t)\Lambda(t) be a Poisson process with rate λ\lambda so Λ(t)𝖯𝗈𝗂(λt)\Lambda(t)\sim\mathsf{Poi}(\lambda t). Then

M(a)=dk=1Λ(a)Xk.M(a)\stackrel{{\scriptstyle d}}{{=}}\sum_{k=1}^{\Lambda(a)}X_{k}.
Proof.

Standard conditioning arguments show that the PGF of X(N,v)X(N,v) is

GX(N,v)(z)=GN(1+v(z1)).G_{X(N,v)}(z)=G_{N}\left(1+v(z-1)\right).

Hence,

GX(N,V)(z)\displaystyle G_{X(N,V)}(z) =F¯T(a)1GN(1+v(z1))d[1a1F¯T1(v)]\displaystyle=\int_{\bar{F}_{T}(a)}^{1}G_{N}\left(1+v(z-1)\right)d\left[1-a^{-1}\bar{F}^{-1}_{T}(v)\right]

Again applying standard conditioning arguments, we see the PGF of k=1Λ(a)Xk\sum_{k=1}^{\Lambda(a)}X_{k} is

Gk=1Λ(a)Xk(z)\displaystyle G_{\sum_{k=1}^{\Lambda(a)}X_{k}}(z) =exp{λa(F¯T(a)1GN(1+v(z1))d[1a1F¯T1(v)]1)}\displaystyle=\exp\left\{\lambda a\left(\int_{\bar{F}_{T}(a)}^{1}G_{N}\left(1+v(z-1)\right)d\left[1-a^{-1}\bar{F}^{-1}_{T}(v)\right]-1\right)\right\}
=exp{λF¯T(a)1[GN(1+v(z1))1]d[aF¯T1(v)]}.\displaystyle=\exp\left\{\lambda\int_{\bar{F}_{T}(a)}^{1}\left[G_{N}\left(1+v(z-1)\right)-1\right]d\left[a-\bar{F}^{-1}_{T}(v)\right]\right\}.

Upon making the substitution v=F¯T(as)v=\bar{F}_{T}(a-s), the PGF of k=1Λ(a)Xk\sum_{k=1}^{\Lambda(a)}X_{k} can be expressed as

Gk=1Λ(a)Xk(z)\displaystyle G_{\sum_{k=1}^{\Lambda(a)}X_{k}}(z) =exp(λ0a[GN(1+F¯T(as)(z1))1]𝑑s)=GM(a;z).\displaystyle=\exp\left(\lambda\int_{0}^{a}\left[G_{N}(1+\bar{F}_{T}(a-s)(z-1))-1\right]ds\right)=G_{M}(a;z).

3.2. Rate of infective contacts

Our first comparison result concerns the effect of the rate of infective contacts on the distribution of the host’s parasite burden. The rate of infective contacts has no effect on the variance-to-mean ratio (1), whereas the coefficient of variation is strictly decreasing as the rate of infective contacts increases (3). The following result shows that any index respecting the Lorenz order is decreasing as a function of the rate of infective contacts.

Theorem 2.

If λ~<λ\tilde{\lambda}<\lambda and all other model parameters are equal, then M(a)LorenzM~(a)M(a)\leq_{\rm Lorenz}\tilde{M}(a).

Proof.

Set κ=λ~/λ\kappa=\tilde{\lambda}/\lambda. Let X1,X2,X_{1},X_{2},\ldots be a sequence of independent random variables having the same distribution as X(N,V)X(N,V) and let B1,B2,B_{1},B_{2},\ldots be a sequence of independent 𝖡𝖾𝗋(κ)\mathsf{Ber}(\kappa) random variables that are also independent of the XkX_{k}. As κcxBk\kappa\leq_{\rm cx}B_{k} and the convex order is closed under mixtures, κXkcxBkXk\kappa X_{k}\leq_{\rm cx}B_{k}X_{k}. The PGF of BkXkB_{k}X_{k} is GBkXk(z)=κGX(N,V)(z)+1κG_{B_{k}X_{k}}(z)=\kappa\,G_{X(N,V)}(z)+1-\kappa. Let Λ(t)\Lambda(t) be a Poisson process with rate λ\lambda. As the convex order is closed under random sums,

κk=1Λ(a)Xkcxk=1Λ(a)BkXk.\kappa\sum_{k=1}^{\Lambda(a)}X_{k}\leq_{\rm cx}\sum_{k=1}^{\Lambda(a)}B_{k}X_{k}.

By Theorem 1, k=1Λ(a)Xk=dM(a)\sum_{k=1}^{\Lambda(a)}X_{k}\stackrel{{\scriptstyle d}}{{=}}M(a). To determine the distribution of k=1Λ(a)BkXk\sum_{k=1}^{\Lambda(a)}B_{k}X_{k}, we evaluate its PGF

Gk=1Λ(a)BkXk(z)\displaystyle G_{\sum_{k=1}^{\Lambda(a)}B_{k}X_{k}}(z) =exp{λa[(κGX(N,V)(z)+1κ)1]}\displaystyle=\exp\left\{\lambda a\left[(\kappa\,G_{X(N,V)}(z)+1-\kappa)-1\right]\right\}
=exp{λ~a[GX(N,V)(z)1]}=GM~(a;z).\displaystyle=\exp\left\{\tilde{\lambda}a\left[G_{X(N,V)}(z)-1\right]\right\}=G_{\tilde{M}}(a;z).

Hence, M(a)LorenzM~(a)M(a)\leq_{\rm Lorenz}\tilde{M}(a). ∎

Figure 1 shows the four indices (Gini, Hoover, 1prevalence1-\text{prevalence}, and coefficient of variation) for a host aged 3 where λ[0.25,128]\lambda\in[0.25,128], N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1), and T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1). Since the coefficient of variation (3) is proportional to λ1/2\lambda^{-1/2}, it is not displayed for small values of λ\lambda. We see that all four indices are strictly decreasing as a function of λ\lambda. For λ1\lambda\leq 1, μ(3)1\mu(3)\leq 1 so the Hoover index and 1prevalence1-\text{prevalence}. The Hoover index appears to display some discontinuity in the first derivative at points where the expectation is integer valued. This behaviour is less apparent at larger values of λ\lambda.

Refer to caption
Figure 1. Plot of Hoover index (yellow solid line), Gini index (purple dashed line), 1prevalence1-\text{prevalence} (blue dot-dashed line), and coefficient of variation (orange dotted line) for a host aged 3 in the Tallis-Leyton model with N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1), and T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1).

3.3. Distribution of NN

We now consider the role of the distribution of the number of parasites that enter the host during an infective contact. As a concrete example, suppose N𝖭𝖡(m,k)N\sim\mathsf{NB}(m,k). Then the variance-to-mean ratio is

VMR(M(a))=1+cm(1+1/k),\text{VMR}(M(a))=1+cm(1+1/k),

where c>0c>0 is a constant depending on FTF_{T}. From this expression we see that the variance-to-mean ratio is increasing in mm but decreasing in kk. In contrast, the coefficient of variation of M(a)M(a) is decreasing in both mm and kk. The next two results show that the distribution of the host’s parasite burden is decreasing in the Lorenz order as functions of both mm and kk. The first of these results requires the distributions being compared to have the same expectation.

Theorem 3.

Suppose NcxN~N\leq_{\rm cx}\tilde{N} and all other model parameters are equal. Then M(a)cxM~(a)M(a)\leq_{\rm cx}\tilde{M}(a).

Proof.

Using an extension of the closure under random sums property of the convex order Shaked and Shanthikumar (2007, Theorem 3.A.13),

X(N,v)cxX(N~,v).X(N,v)\leq_{\rm cx}X(\tilde{N},v).

As the convex order is closed under mixtures, X(N,V)cxX(N~,V)X(N,V)\leq_{\rm cx}X(\tilde{N},V). Let X1,X2,X_{1},X_{2},\ldots be a sequence of independent random variables having the same distribution as X(N,V)X(N,V) and let X~1,X~2,\tilde{X}_{1},\tilde{X}_{2},\ldots be a sequence of independent random variables having the same distribution as X(N~,V)X(\tilde{N},V). As the convex order is closed under random sums,

k=1Λ(a)Xkcxk=1Λ(a)X~k.\sum_{k=1}^{\Lambda(a)}X_{k}\leq_{\rm cx}\sum_{k=1}^{\Lambda(a)}\tilde{X}_{k}.

Theorem 1 shows M(a)cxM~(a)M(a)\leq_{\rm cx}\tilde{M}(a). ∎

For distributions with different means, we consider only the case where NN and N~\tilde{N} are related by binomial thinning. Recall that if GN~(z)=GN(1p+pz)G_{\tilde{N}}(z)=G_{N}(1-p+pz) for some p(0,1)p\in(0,1), then N~LorenzN\tilde{N}\leq_{\rm Lorenz}N.

Theorem 4.

Suppose that GN~(z)=GN(1p+pz)G_{\tilde{N}}(z)=G_{N}(1-p+pz) for some p(0,1)p\in(0,1) and all other model parameters are equal. Then M(a)LorenzM~(a)M(a)\leq_{\rm Lorenz}\tilde{M}(a).

Proof.

Let U1,U2,U_{1},U_{2},\dots and U1,U2,U^{\prime}_{1},U^{\prime}_{2},\ldots be independent standard uniform random variables. Then standard conditioning arguments show

X(N~,v)=dj=1N𝕀(Ujv)𝕀(Ujp).\displaystyle X(\tilde{N},v)\stackrel{{\scriptstyle d}}{{=}}\sum_{j=1}^{N}\mathbb{I}\left(U_{j}\leq v\right)\mathbb{I}\left(U^{\prime}_{j}\leq p\right).

As the convex order is closed under mixtures,

p𝕀(Ujv)cx𝕀(Ujv)𝕀(Ujp).p\,\mathbb{I}\left(U_{j}\leq v\right)\leq_{\rm cx}\mathbb{I}\left(U_{j}\leq v\right)\mathbb{I}\left(U^{\prime}_{j}\leq p\right).

As the convex order is closed under random sums, pX(N,v)cxX(N~,v)pX(N,v)\leq_{\rm cx}X(\tilde{N},v). Following the same arguments as in the proof of Theorem 3, we see pM(a)cxM~(a)pM(a)\leq_{\rm cx}\tilde{M}(a). Hence, M(a)LorenzM~(a)M(a)\leq_{\rm Lorenz}\tilde{M}(a). ∎

As noted previously, 𝖭𝖡(k,m)Lorenz𝖭𝖡(k~,m~)\mathsf{NB}(k,m)\leq_{\rm Lorenz}\mathsf{NB}(\tilde{k},\tilde{m}) if k~k\tilde{k}\leq k and m~m\tilde{m}\leq m. If N𝖭𝖡(k,m)N\sim\mathsf{NB}(k,m), N~𝖭𝖡(k~,m~)\tilde{N}\sim\mathsf{NB}(\tilde{k},\tilde{m}) and all other model parameters are equal, then Theorems 3 and 4 together imply that M(a)LorenzM~(a)M(a)\leq_{\rm Lorenz}\tilde{M}(a). Figure 2 shows the Hoover and Gini indices as functions of the negative binomial parameters mm and kk for a parasite host system with host aged 10, rate of infective contacts 5, and parasite lifetimes following an exponential distribution with mean 1. Both indices are decreasing in both mm and kk as we expect from the above results. The contours of the Hoover index display some discontinuity in the first derivative for m=1/5m=1/5 (log2(m)2.3\log_{2}(m)\approx-2.3), which corresponds to a mean of 1 for the host. The contours for both the Hoover and Gini indices tend to become parallel to the respective axes as mm\to\infty and kk\to\infty. This is a consequence of the limiting behaviour of the negative binomial distribution (Adell and Cal, 1994).

Refer to caption
Figure 2. Contour plots showing Hoover index (Left) and Gini index (Right) for a host aged 10 in the Tallis-Leyton model with λ=5\lambda=5, T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1) and N𝖭𝖡(m,k)N\sim\mathsf{NB}(m,k).

It is natural to consider which distribution for NN results in the least aggregated distribution for the host’s parasite burden. This requires determining the smallest distribution in the convex ordering. The convex order requires that the distributions compared have the same expected value so let n=𝔼Nn=\mathbb{E}N. Define the random variable NN^{\prime} such that

(N=n)\displaystyle\mathbb{P}(N^{\prime}=\lfloor n\rfloor) =n+1n\displaystyle=\lfloor n\rfloor+1-n (N=n+1)\displaystyle\mathbb{P}(N^{\prime}=\lfloor n\rfloor+1) =nn.\displaystyle=n-\lfloor n\rfloor.

In the supplementary material of McVinish and Lester (2020) it was shown that NcxNN^{\prime}\leq_{\rm cx}N so NN^{\prime} is smallest distribution in convex order with expectation nn. When n1n\leq 1, the smallest distribution in convex order for NN leads to M(a)M(a) having a Poisson distribution. There is no largest distribution in the convex order.

3.4. Host age

Differentiating (1) with respect to age shows the variance-to-mean ratio is a decreasing function of age. Since the expected parasite burden is increasing in age, the coefficient of variation is also decreasing in age. The following result shows the host’s parasite burden is decreasing in the Lorenz order as a function of age.

Theorem 5.

If a~<a\tilde{a}<a, then M(a)LorenzM(a~)M(a)\leq_{\rm Lorenz}M(\tilde{a}).

The proof is built from the following lemmas.

Lemma 6.

Let VV have the distribution (6) and let V~\tilde{V} have the distribution (6) with aa replaced by a~\tilde{a}. Let B𝖡𝖾𝗋(μ(a~)/μ(a))B\sim\mathsf{Ber}(\mu(\tilde{a})/\mu(a)) independent of VV, and let B~𝖡𝖾𝗋(a~/a)\tilde{B}\sim\mathsf{Ber}(\tilde{a}/a) independent of V~\tilde{V}. Then BVcxB~V~BV\leq_{\rm cx}\tilde{B}\tilde{V}.

Proof.

Note that

𝔼V=1a0aF¯T(s)𝑑s\mathbb{E}V=\frac{1}{a}\int_{0}^{a}\bar{F}_{T}(s)ds

so 𝔼BV=𝔼B~V~\mathbb{E}BV=\mathbb{E}\tilde{B}\tilde{V}. We show that BVcxB~V~BV\leq_{\rm cx}\tilde{B}\tilde{V} by examining the sign changes of F¯BVF¯B~V~\bar{F}_{BV}-\bar{F}_{\tilde{B}\tilde{V}}. The survival functions of BVBV and B~V~\tilde{B}\tilde{V} are

F¯BV(w)\displaystyle\bar{F}_{BV}(w) ={μ(a~)μ(a),w[0,F¯T(a))μ(a~)aμ(a)F¯T1(w),w[F¯T(a),1)0,w>1,\displaystyle=\left\{\begin{array}[]{ll}\frac{\mu(\tilde{a})}{\mu(a)},&w\in[0,\bar{F}_{T}(a))\\ \frac{\mu(\tilde{a})}{a\,\mu(a)}\bar{F}^{-1}_{T}(w),&w\in[\bar{F}_{T}(a),1)\\ 0,&w>1,\end{array}\right.

and

F¯B~V~(w)\displaystyle\bar{F}_{\tilde{B}\tilde{V}}(w) ={a~a,w[0,F¯T(a~))1aF¯T1(w),w[F¯T(a~),1)0,w>1.\displaystyle=\left\{\begin{array}[]{ll}\frac{\tilde{a}}{a},&w\in[0,\bar{F}_{T}(\tilde{a}))\\ \frac{1}{a}\bar{F}^{-1}_{T}(w),&w\in[\bar{F}_{T}(\tilde{a}),1)\\ 0,&w>1.\end{array}\right.

Since μ(a)\mu(a) is increasing in aa and μ(a)/a\mu(a)/a is decreasing in aa,

a~a<μ(a~)μ(a)<1.\frac{\tilde{a}}{a}<\frac{\mu(\tilde{a})}{\mu(a)}<1.

Hence, F¯BV(w)F¯B~V~(w)>0\bar{F}_{BV}(w)-\bar{F}_{\tilde{B}\tilde{V}}(w)>0 for all w[0,F¯T(a)]w\in[0,\bar{F}_{T}(a)]. On [F¯T(a),F¯T(a~)][\bar{F}_{T}(a),\bar{F}_{T}(\tilde{a})], F¯B~V~(w)=a~/a\bar{F}_{\tilde{B}\tilde{V}}(w)=\tilde{a}/a whereas F¯BV\bar{F}_{BV} decreases from μ(a~)/μ(a)\mu(\tilde{a})/\mu(a) to a~μ(a~)/aμ(a)<a~/a\tilde{a}\mu(\tilde{a})/a\mu(a)<\tilde{a}/a. For all wF¯T(a~)w\geq\bar{F}_{T}(\tilde{a}), F¯BV(w)F¯B~V~(w)<0\bar{F}_{BV}(w)-\bar{F}_{\tilde{B}\tilde{V}}(w)<0. Hence, F¯BVF¯B~V~\bar{F}_{BV}-\bar{F}_{\tilde{B}\tilde{V}} has a single sign change and the sign sequence is +,+,-. Hence, BVcxB~V~BV\leq_{\rm cx}\tilde{B}\tilde{V} (Shaked and Shanthikumar, 2007, Theorem 3.A.44). ∎

Lemma 7.

For any convex function ϕ\phi and any non-negative integer valued random variable NN that is independent of U1,U2,U_{1},U_{2},\ldots, 𝔼ϕ(X(N,v))\mathbb{E}\phi(X(N,v)) is a convex function in vv

Proof.

As the binomial distribution 𝖡𝗂𝗇(n,v)\mathsf{Bin}(n,v) is a regular exponential family of distribution with expectation linear in vv, Schweder (1982, Proposition 2) implies 𝔼ϕ(X(n,v)\mathbb{E}\phi(X(n,v) is convex in vv for any positive integer nn. As non-negative weighted sums of convex functions are also convex, it follows that 𝔼ϕ(X(N,v))\mathbb{E}\phi(X(N,v)) is a convex function in vv. ∎

Proof of Theorem 5.

By construction of X(n,v)X(n,v), if bb takes values in {0,1}\{0,1\}, then bX(n,v)=X(n,bv)bX(n,v)=X(n,bv). Applying Shaked and Shanthikumar (2007, Theorem 3.A.21) with Lemmas 6 and 7,

BX(N,V)=X(N,BV)cxX(N,B~V~)=B~X(N,V~).BX(N,V)=X(N,BV)\leq_{\rm cx}X(N,\tilde{B}\tilde{V})=\tilde{B}X(N,\tilde{V}).

Since the convex order is transitive and closed under mixtures,

μ(a~)μ(a)X(N,V)cxBX(N,V)cxB~X(N,V~).\frac{\mu(\tilde{a})}{\mu(a)}X(N,V)\leq_{\rm cx}BX(N,V)\leq_{\rm cx}\tilde{B}X(N,\tilde{V}).

In the notation of Theorem 1, M(a)=dk=1Λ(a)XkM(a)\stackrel{{\scriptstyle d}}{{=}}\sum_{k=1}^{\Lambda(a)}X_{k}, where X1,X2,X_{1},X_{2},\ldots is a sequence of independent random variables with Xk=dX(N,V)X_{k}\stackrel{{\scriptstyle d}}{{=}}X(N,V). From the thinning property of the Poisson process and Theorem 1, we can write M(a~)=dk=1Λ(a)B~kX~kM(\tilde{a})\stackrel{{\scriptstyle d}}{{=}}\sum_{k=1}^{\Lambda(a)}\tilde{B}_{k}\tilde{X}_{k}, where X~1,X~2,\tilde{X}_{1},\tilde{X}_{2},\ldots is a sequence of independent random variables with X~k=dX(N,V~)\tilde{X}_{k}\stackrel{{\scriptstyle d}}{{=}}X(N,\tilde{V}) and B~1,B~2,\tilde{B}_{1},\tilde{B}_{2},\ldots is a sequence of independent 𝖡𝖾𝗋(a~/a)\mathsf{Ber}(\tilde{a}/a) random variables that are also independent of the X~k\tilde{X}_{k}. As the convex order is closed under random sums,

μ(a~)μ(a)M(a)=dμ(a~)μ(a)k=1Λ(a)Xkcxk=1Λ(a)BkXkcxk=1Λ(a)B~kX~k=dM(a~)\frac{\mu(\tilde{a})}{\mu(a)}M(a)\stackrel{{\scriptstyle d}}{{=}}\frac{\mu(\tilde{a})}{\mu(a)}\sum_{k=1}^{\Lambda(a)}X_{k}\leq_{\rm cx}\sum_{k=1}^{\Lambda(a)}B_{k}X_{k}\leq_{\rm cx}\sum_{k=1}^{\Lambda(a)}\tilde{B}_{k}\tilde{X}_{k}\stackrel{{\scriptstyle d}}{{=}}M(\tilde{a})

Refer to caption
Figure 3. Plot of Hoover index (yellow solid line), Gini index (purple dashed line), 1prevalence1-\text{prevalence} (blue dot-dashed line), and coefficient of variation (orange dotted line) for a host in the Tallis-Leyton model with λ=5\lambda=5, N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1), and T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1).

Figure 3 shows the four indices (Gini, Hoover, 1prevalence1-\text{prevalence}, and coefficient of variation) for a host in the Tallis-Leyton model with λ=5\lambda=5, N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1), and T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1). All four indices are strictly decreasing in host age. As in Figure 1, the Hoover index coincides with 1prevalence1-\text{prevalence} for μ(a)1\mu(a)\leq 1 and displays some discontinuity in the first derivative at points where the expectation is integer valued.

3.5. Parasite lifetime distribution

Our final comparison result concerns the parasite lifetime distribution. The result shows that variability in the parasite lifetimes decreases the variability in the host’s parasite burden. In particular, the result implies that the host’s parasite burden is most aggregated when parasites have constant lifetimes.

Theorem 8.

Suppose 𝔼T=𝔼T~\mathbb{E}T=\mathbb{E}\tilde{T} and F¯TF¯T~\bar{F}_{T}-\bar{F}_{\tilde{T}} has a single sign change with sign sequence +,+,- so TcxT~T\leq_{\rm cx}\tilde{T}. Assume all other model parameters are equal, then M~()cxM()\tilde{M}(\infty)\leq_{\rm cx}M(\infty).

Proof.

Assume For any a>0a>0 set a~\tilde{a} such that

0aF¯T(t)𝑑t=0a~F¯T~(t)𝑑t.\int_{0}^{a}\bar{F}_{T}(t)dt=\int_{0}^{\tilde{a}}\bar{F}_{\tilde{T}}(t)dt.

As 𝔼T=𝔼T~\mathbb{E}T=\mathbb{E}\tilde{T}, it follows that a~>a\tilde{a}>a. Let B𝖡𝖾𝗋(a/a~)B\sim\mathsf{Ber}(a/\tilde{a}). Let VV have distribution (6) and let V~\tilde{V} have the distribution (6) with aa replaced by a~\tilde{a} and TT replaced by T~\tilde{T}. The survival functions of V~\tilde{V} and BVBV are

F¯V~(w)\displaystyle\bar{F}_{\tilde{V}}(w) ={1,w[0,F¯T~(a~))1a~F¯T~1(w),w[F¯T~(a~),1)0,w>1,\displaystyle=\left\{\begin{array}[]{ll}1,&w\in[0,\bar{F}_{\tilde{T}}(\tilde{a}))\\ \frac{1}{\tilde{a}}\bar{F}^{-1}_{\tilde{T}}(w),&w\in[\bar{F}_{\tilde{T}}(\tilde{a}),1)\\ 0,&w>1,\end{array}\right.

and

F¯B~V~(w)\displaystyle\bar{F}_{\tilde{B}\tilde{V}}(w) ={aa~,w[0,F¯T(a))1a~F¯T1(w),w[F¯T(a),1)0,w>1.\displaystyle=\left\{\begin{array}[]{ll}\frac{a}{\tilde{a}},&w\in[0,\bar{F}_{T}(a))\\ \frac{1}{\tilde{a}}\bar{F}^{-1}_{T}(w),&w\in[\bar{F}_{T}(a),1)\\ 0,&w>1.\end{array}\right.

Since F¯TF¯T~\bar{F}_{T}-\bar{F}_{\tilde{T}} has a single sign change with sign sequence is +,+,-, it follows that F¯V~F¯BV\bar{F}_{\tilde{V}}-\bar{F}_{BV} also has a single sign change with sign sequence +,+,-. Applying Lemma 7 and Shaked and Shanthikumar (2007, Theorem 3.A.21) together shows X(N,V~)cxX(N,BV)X(N,\tilde{V})\leq_{\rm cx}X(N,BV). From Theorem 1, M~(a~)=dk=1Λ(a~)X~k\tilde{M}(\tilde{a})\stackrel{{\scriptstyle d}}{{=}}\sum_{k=1}^{\Lambda(\tilde{a})}\tilde{X}_{k} and M(a)=dk=1Λ(a)XkM(a)\stackrel{{\scriptstyle d}}{{=}}\sum_{k=1}^{\Lambda(a)}X_{k}, where X~k=dX(N,V~)\tilde{X}_{k}\stackrel{{\scriptstyle d}}{{=}}X(N,\tilde{V}) and Xk=dX(N,V)X_{k}\stackrel{{\scriptstyle d}}{{=}}X(N,V). Let B1,B2,B_{1},B_{2},\ldots be a sequence of independent 𝖡𝖾𝗋(a/a~)\mathsf{Ber}(a/\tilde{a}) random variables that are also independent of X1,X2,X_{1},X_{2},\ldots By construction BX(N,V)=X(N,BV)BX(N,V)=X(N,BV). From the thinning property of the Poisson process, M(a)=dk=1Λ(a~)BkXkM(a)\stackrel{{\scriptstyle d}}{{=}}\sum_{k=1}^{\Lambda(\tilde{a})}B_{k}X_{k}. As the convex order is closed under random sums, we see M~(a~)cxM(a)\tilde{M}(\tilde{a})\leq_{\rm cx}M(a). Letting aa\to\infty and noting that the convex order is closed under weak limits, we see M~()cxM()\tilde{M}(\infty)\leq_{\rm cx}M(\infty). ∎

3.6. Asymptotic normality

As noted previously, for the host’s parasite burden to converge to a normal distribution, the coefficient of variation must tend to 0. In the Tallis-Leyton model, this is only possible when the rate of infective contacts tends to infinity.

Theorem 9.

Suppose there exists positive constants ϵ,δ\epsilon,\ \delta and CC such that

|GN(1+ω)(1+GN(1)ω+12GN′′(1)ω2)|C|ω|2+ϵ\left|G_{N}(1+\omega)-\left(1+G_{N}^{\prime}(1)\omega+\frac{1}{2}G_{N}^{\prime\prime}(1)\omega^{2}\right)\right|\leq C\left|\omega\right|^{2+\epsilon}

for all ω\omega\in\mathbb{C} such that |ω|<δ|\omega|<\delta. Then

limλM(a)μ(a)σ(a)=d𝖭(0,1).\lim_{\lambda\to\infty}\frac{M(a)-\mu(a)}{\sigma(a)}\stackrel{{\scriptstyle d}}{{=}}\mathsf{N}(0,1).
Proof.

The characteristic function of M(a)M(a) is GM(eiω;a)G_{M}(e^{i\omega};a). We aim to show that

limλeiωμ(a)σ(a)GM(eiωσ(a);a)=exp(12ω2).\lim_{\lambda\to\infty}e^{-i\tfrac{\omega\mu(a)}{\sigma(a)}}G_{M}(e^{i\tfrac{\omega}{\sigma(a)}};a)=\exp\left(-\tfrac{1}{2}\omega^{2}\right). (7)

The result then follows by Lévy’s convergence theorem. Define

RN(ω)=GN(1+ω)(1+GN(1)ω+12GN′′(1)w2).R_{N}(\omega)=G_{N}(1+\omega)-\left(1+G_{N}^{\prime}(1)\omega+\frac{1}{2}G_{N}^{\prime\prime}(1)w^{2}\right).

For non-negative integers nn and real xx define

Rn(x)=eixk=0n(ix)kk!.R_{n}(x)=e^{ix}-\sum_{k=0}^{n}\frac{(ix)^{k}}{k!}.

Then |R0(x)|min(2,|x|)|R_{0}(x)|\leq\min(2,|x|) and

|Rn(x)|min(2|x|nn!,|x|n+1(n+1)!).\left|R_{n}(x)\right|\leq\min\left(\frac{2|x|^{n}}{n!},\frac{|x|^{n+1}}{(n+1)!}\right). (8)

(Williams, 1991, pg 183). Note that

eiωμ(a)σ(a)GM(eiωσ(a);a)=exp(λ0a[GN(1+F¯T(as)(eiωσ(a)1))1]𝑑siωμ(a)σ(a)).e^{-i\tfrac{\omega\mu(a)}{\sigma(a)}}G_{M}(e^{i\tfrac{\omega}{\sigma(a)}};a)=\exp\left(\lambda\int^{a}_{0}\left[G_{N}(1+\bar{F}_{T}(a-s)(e^{i\tfrac{\omega}{\sigma(a)}}-1))-1\right]ds-i\tfrac{\omega\mu(a)}{\sigma(a)}\right).

From the expressions for RNR_{N} and μ(a)\mu(a),

λ0a[GN(1+F¯T(as)(eiωσ(a)1))1]𝑑siωμ(a)σ(a)=λGN(1)(0aF¯T(as)𝑑s)R1(ωσ(a))+λ2GN′′(1)(0aF¯T2(as)𝑑s)R0(ωσ(a))2+λ0aRN(F¯T(as)(eiωσ(a)1))𝑑s\lambda\int^{a}_{0}\left[G_{N}(1+\bar{F}_{T}(a-s)(e^{i\tfrac{\omega}{\sigma(a)}}-1))-1\right]ds-i\tfrac{\omega\mu(a)}{\sigma(a)}\\ =\lambda G_{N}^{\prime}(1)\left(\int^{a}_{0}\bar{F}_{T}(a-s)ds\right)R_{1}\left(\frac{\omega}{\sigma(a)}\right)+\frac{\lambda}{2}G_{N}^{\prime\prime}(1)\left(\int^{a}_{0}\bar{F}_{T}^{2}(a-s)ds\right)R_{0}\left(\frac{\omega}{\sigma(a)}\right)^{2}\\ +\lambda\int_{0}^{a}R_{N}\left(\bar{F}_{T}(a-s)(e^{i\tfrac{\omega}{\sigma(a)}}-1)\right)ds

From the expression for σ2(a)\sigma^{2}(a) and the fact that

R1(x)2+x2=R2(2x)2R2(x),\displaystyle R_{1}(x)^{2}+x^{2}=R_{2}(2x)-2R_{2}(x),

we obtain

λ0a[GN(1+F¯T(as)(eiωσ(a)1))1]𝑑siωμ(a)σ(a)=ω22+λGN(1)(0aF¯T(as)𝑑s)R2(ωσ(a))+λ2GN′′(1)(0aF¯T2(as)𝑑s)(R2(2ωσ(a))2R2(ωσ(a)))+λ0aRN(F¯T(as)(eiωσ(a)1))𝑑s\lambda\int^{a}_{0}\left[G_{N}(1+\bar{F}_{T}(a-s)(e^{i\tfrac{\omega}{\sigma(a)}}-1))-1\right]ds-i\tfrac{\omega\mu(a)}{\sigma(a)}\\ =-\frac{\omega^{2}}{2}+\lambda G_{N}^{\prime}(1)\left(\int^{a}_{0}\bar{F}_{T}(a-s)ds\right)R_{2}\left(\frac{\omega}{\sigma(a)}\right)\\ +\frac{\lambda}{2}G_{N}^{\prime\prime}(1)\left(\int^{a}_{0}\bar{F}_{T}^{2}(a-s)ds\right)\left(R_{2}\left(\frac{2\omega}{\sigma(a)}\right)-2R_{2}\left(\frac{\omega}{\sigma(a)}\right)\right)\\ +\lambda\int_{0}^{a}R_{N}\left(\bar{F}_{T}(a-s)(e^{i\tfrac{\omega}{\sigma(a)}}-1)\right)ds

Using the bound (8) and the fact that σ2(a)λ\sigma^{2}(a)\propto\lambda, we see

limλλGN(1)(0aF¯T(as)𝑑s)R2(ωσ(a))=0\lim_{\lambda\to\infty}\lambda G_{N}^{\prime}(1)\left(\int^{a}_{0}\bar{F}_{T}(a-s)ds\right)R_{2}\left(\frac{\omega}{\sigma(a)}\right)=0

and

limλλ2GN′′(1)(0aF¯T2(as)𝑑s)(R2(2ωσ(a))2R2(ωσ(a)))=0.\lim_{\lambda\to\infty}\frac{\lambda}{2}G_{N}^{\prime\prime}(1)\left(\int^{a}_{0}\bar{F}_{T}^{2}(a-s)ds\right)\left(R_{2}\left(\frac{2\omega}{\sigma(a)}\right)-2R_{2}\left(\frac{\omega}{\sigma(a)}\right)\right)=0.

Finally, using |RN(ω)|C|ω|2+ϵ\left|R_{N}(\omega)\right|\leq C|\omega|^{2+\epsilon} together with the bound (8) and the fact that σ2(a)λ\sigma^{2}(a)\propto\lambda, we see

limλλ0aRN(F¯T(as)(eiωσ(a)1))𝑑s=0.\lim_{\lambda\to\infty}\lambda\int_{0}^{a}R_{N}\left(\bar{F}_{T}(a-s)(e^{i\tfrac{\omega}{\sigma(a)}}-1)\right)ds=0.

Hence, the limit (7) holds. ∎

Figure 4 compares the probability mass function from the Tallis-Leyton model with the probability density function of the approximating normal distribution. The host was aged 3 with N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1) and T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1). When λ=8\lambda=8, the probability mass function still shows some right skewness. The normal approximation in this instance places a non-negligible probability on values less than zero. When λ=128\lambda=128, the probability mass function is very close to symmetric and the normal distribution provides a good approximation. Figure 5 shows the Hoover and Gini indices together with the approximations based on asymptotic normality (4) and (5). In this instance the approximations appear reasonably accurate even for λ\lambda as small as 2, where the normal approximation fails.

Refer to caption
Figure 4. Probability mass function (blue bars) and approximating normal probability density function (red line) for a host aged 3 in the Tallis-Leyton model with N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1), T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1), and λ=8\lambda=8 (left) and λ=128\lambda=128 (right).
Refer to caption
Figure 5. Hoover index (yellow line) and Gini index (purple dashed line) together with the asymptotic normal approximations (dotted lines) for a host aged 3 in the Tallis-Leyton model with N𝖭𝖡(1,1)N\sim\mathsf{NB}(1,1) and T𝖤𝗑𝗉(1)T\sim\mathsf{Exp}(1).

4. Discussion

In this paper, we examined the aggregation of a host’s parasite burden in the Tallis-Leyton model, interpreting aggregation in terms of the Lorenz order. Our analysis showed that increasing the host’s age or the rate of infective contacts decreases aggregation. Similarly, increasing the aggregation of the clumps of parasites that enter the host as infective contacts also increases the aggregation of the host’s parasite burden. On the other hand, less variability in the parasite’s lifetime distribution, as defined in the convex order, results in greater aggregation of the host’s parasite burden.

Unfortunately, the population dynamics of parasites are often more complicated than what is represented in the Tallis-Leyton model. Some parasites need multiple hosts to complete its life cycle. Once a parasite finds a host it may be subject to intraspecific and interspecific competition for resources. Furthermore, parasites often interact with the host either by stimulating an immune response from the host or by increasing the host’s mortality rate.

Isham (1995) proposed a simple stochastic model that incorporate parasite induced host mortality. In Isham’s model, the host acquires parasites following the same dynamics as the Tallis-Leyton model and parasite lifetimes are assumed exponentially distributed. The important difference in Isham’s model is that each parasite present in the host increases the host’s death rate by a fixed amount α\alpha. A complete analysis of Isham’s model in terms of the Lorenz order is beyond the scope of this paper. In a special case, however, we can see that parasite induced host mortality increases aggregation of the parasite distribution, as interpreted in the Lorenz order. When the number of parasites that enter the host at an infective contact follows a geometric distribution, an explicit expression for the limiting distribution is possible. Specifically, if N𝖭𝖡(m,1)N\sim\mathsf{NB}(m,1), then

M()𝖭𝖡(λm𝔼T+α+αm,λ𝔼T+α+αm).M(\infty)\sim\mathsf{NB}\left(\frac{\lambda m}{\mathbb{E}T+\alpha+\alpha m},\frac{\lambda}{\mathbb{E}T+\alpha+\alpha m}\right).

As the negative binomial distribution is decreasing in Lorenz order in both mean and kk, it follows that indices respecting the Lorenz order are increasing in the parasite induced host mortality rate. In contrast, the variance-to-mean ratio is 1+m1+m so it is not affected by the parasite induced mortality.

A complete examination Isham’s model in terms of the Lorenz order may prove challenging. Even computing the Gini and Hoover indices may present difficulties since they require absolute moments, which are often not easily evaluated. In that case, the coefficient of variation may prove useful since it respects the Lorenz order, is easily evaluated, and can be used to approximate the Gini and Hoover indices when the distribution is approximately normal.


Data availability: Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

References

  • Abate and Whitt (1992) J. Abate and W. Whitt. Numerical inversion of probability generating functions. Oper. Res. Lett., 12:245–251, 1992.
  • Adell and Cal (1994) J.A. Adell and J. De La Cal. Approximating gamma distributions by normalized negative binomial distributions. J. Appl. Probab., 31:391–400, 1994.
  • Anderson and May (1978a) R.M. Anderson and R.M. May. Regulation and stability of host-parasite population interactions: I. regulatory processes. J. Anim. Ecol., 47:219–247, 1978a.
  • Anderson and May (1978b) R.M. Anderson and R.M. May. Regulation and stability of host-parasite population interactions: II. destabilizing processes. J. Anim. Ecol., 47:249–267, 1978b.
  • Arnold and Sarabia (2018) B.C. Arnold and J.M. Sarabia. Majorization and the Lorenz order with applications in applied mathematics and economics. Springer, New York, 2018.
  • Barbour and Pugliese (2000) A.D. Barbour and A. Pugliese. On the variance-to-mean ratio in models of parasite distributions. Adv. Appl. Probab., 32:701–719, 2000.
  • Gastwirth (1971) J.L. Gastwirth. A general definition of the Lorenz curve. Econometrica, 39:1037–1039, 1971.
  • Gini (1924) C. Gini. Measurement of inequality of incomes. Econ. J., 31:124–126, 1924.
  • Herbert and Isham (2000) J. Herbert and V. Isham. Stochastic host-parasite interaction models. J. Math. Biol., 40:343–371, 2000.
  • Holman et al. (1983) D.F. Holman, M.L. Chaudhry, and B.R.K. Kashyap. On the service system MX/G/{M^{X}/G/\infty}. Eur. J. Oper. Res., 13:142–145, 1983.
  • Isham (1995) V. Isham. Stochastic models of host-macroparasite interaction. Ann. Appl. Probab., 5:720–740, 1995.
  • Lorenz (1905) M.O. Lorenz. Methods of measuring the concentration of wealth. Publication of the American Statistical Association, 9:209–219, 1905.
  • McPherson et al. (2012) N.J. McPherson, R.A Norman, A.S. Hoyle, J.E. Bron, and N.G.H. Taylor. Stocking methods and parasite-induced reductions in capture: Modelling Argulus foliaceus in trout fisheries. J. Theor. Biol., 312:22–33, 2012.
  • McVinish and Lester (2020) R. McVinish and R.J.G. Lester. Measuring aggregation in parasite populations. J. R. Soc. Interface, 17:20190886, 2020.
  • McVinish and Lester (2024) R. McVinish and R.J.G. Lester. A graphical exploration of the relationship between parasite aggregation indices. arXiv:2409.03186, 2024.
  • Morrill et al. (2023) A. Morrill, R. Poulin, and M.R. Forbes. Interrelationships and properties of parasite aggregation measures: a user’s guide. Int. J. Parasitol., 53:763–776, 2023.
  • Peacock et al. (2018) S.J. Peacock, J. Bouhours, M.A. Lewis, and P.K. Molnár. Macroparasite dynamics of migratory host populations. Theor. Popul. Biol., 120:29–41, 2018.
  • Pielou (1977) E.C. Pielou. Mathematical ecology. Wiley, New York, 1977.
  • Poulin (1993) R. Poulin. The disparity between observed and uniform distributions: a new look at parasite aggregation. Int. J. Parasitol., 23:931–944, 1993.
  • Poulin (2007) R. Poulin. Are there general laws in parasite ecology? Parasitol., 134:763–776, 2007.
  • Rosa and Pugliese (2002) R. Rosa and A. Pugliese. Aggregation, stability, and oscillations in different models for host-macroparasite interactions. Theor. Popul. Biol., 61(3):319–334, 2002.
  • Schreiber (2006) S.J. Schreiber. Host-parasitoid dynamics of a generalized thompson model. J. Math. Biol., 52:719–732, 2006.
  • Schweder (1982) T. Schweder. On the dispersion of mixtures. Scand. J. Stat., 9:165–169, 1982.
  • Shaked and Shanthikumar (2007) M. Shaked and J. G. Shanthikumar. Stochastic orders. Springer, New York, 2007.
  • Shaw and Dobson (1995) D.J. Shaw and A.P. Dobson. Patterns of macroparasite abundance and aggregation in wildlife populations: a quantitative review. Parasitol., 111:S111–S133, 1995.
  • Taguchi (1968) T. Taguchi. Concentration-curve methods and structures of skew populations. Ann. Inst. Stat. Math., 20:107–141, 1968.
  • Tallis and Leyton (1969) G.M. Tallis and M.K. Leyton. Stochastic models of populations of helminthic parasites in the definitive host. I. Math. Biosci., 4:39–48, 1969.
  • The MathWorks Inc. (2022a) The MathWorks Inc. Matlab version: 9.13.0.2080170 (r2022b) update 1, 2022a.
  • The MathWorks Inc. (2022b) The MathWorks Inc. Symbolic math toolbox (r2022b), 2022b.
  • Williams (1991) D. Williams. Probability with Martingales. Cambridge, 1991.
  • Wilson et al. (2001) K. Wilson, O. Bjørnstad, A. Dobson, S. Merler, G. Poglayen, S. Randolph, A. Read, and A. Skorping. Heterogeneities in macroparasite infections: Patterns and processes. In P. Hudson, A. Rizzoli, B. Grenfell, H. Heesterbeek, and A. Dobson, editors, The Ecology of Wildlife Diseases, pages 6–44. Oxford University Press, 2001.