1 Introduction
Analysing bivariate datasets are very challenging as it involves association between two variables. Data with ties are also common. The Marshall-Olkin copula defined by
|
|
|
(1.1) |
where and is apt in modelling association of these types (see Nelsen (2007)). Many authors have proposed very general classes of bivariate distributions and studied their properties. In (1.1), when is exponential() for and and , the well studied Marshall-Olkin bivariate exponential distribution with a singular component is obtained as
|
|
|
(1.2) |
where .
Here is a bivariate random vector with with support . The distribution in (1.2) is characterised by the functional equation popularly known as bivariate lack of memory property (BLMP) given by
|
|
|
for all . Kolev and Pinto (2018b) worked on the Marshall-Olkin’s bivariate exponential distribution thus providing a weak version of the BLMP which can be used to construct bivariate distributions having a singularity component along arbitrary line through the origin. Lin et al. (2019) studied the moment generating function, product moments and dependence structure of the bivariate distributions satisfying BLMP. Many authors have studied the BLMP under different perspectives (Pinto and Kolev (2015) and Kolev and Pinto (2018a)). In fact the BLMP has been translated as a property of arbitrary bivariate continuous distributions (Galambos and Kotz (2006)).
If is Weibull() for , the Marshall-Olkin copula in (1.1) gives a bivariate distribution with a singular component whose survival function is
|
|
|
(1.3) |
where , characterised by the functional equation,
|
|
|
for all .
If is Pareto() for , the Marshall-Olkin copula in (1.1) gives a bivariate distribution with a singular component whose survival function is
|
|
|
(1.4) |
where , characterised by the functional equation,
|
|
|
for all .
If the marginals belong to the proportional hazard (PH) class given by for , then (1.1) becomes the copula associated with the bivariate proportional hazard class (BPHC) mentioned in Kundu (2022)
whose bivariate survival function is
|
|
|
(1.5) |
where is the baseline distribution with for without loss of generality and . In this work, we study the functional equation characterised by singular family of bivariate distributions in (1.5).
The rest of the paper is organised as below.
In Section 2, we propose a class of distributions as a general solution to the functional equation characterising (1.5). This functional equation characterises a very general class of distributions which includes (1.2), (1.3), (1.4) and (1.5). It is observed through a counter example that the general solutions of this functional equation need not necessarily generate a bivariate distribution function. But conditions on the marginals ensures that the solution is a bivariate distribution. In Section 3, necessary and sufficient conditions are derived for a univariate distribution to be a marginal for the bivariate distribution belonging to this class. In Section 4, we translate these in terms of failure rates for the ease of constructing bivariate distributions satisfying the functional equation.
3 Necessary and sufficient conditions for generating bivariate distributions
In this section, we discuss the conditions to be satisfied by the univariate distributions for them to qualify as a marginal for the bivariate survival function as in (2.2). Let , where is the vector of parameters involved in baseline and is the vector of parameters involved in the marginal .
Theorem 3.2.
Let be a distribution function with absolutely continuous density for which . The necessary and sufficient conditions for in (2.2) be a bivariate distribution is that
-
(i)
-
(ii)
,
where .
Proof.
will be a bivariate survival function if and only if it can be written as a convex mixture of and where is the absolutely continuous part and is the singular part and and are survival functions.
Observe, for ,
|
|
|
|
|
|
|
|
(3.1) |
Also,
|
|
|
|
and
|
|
|
Writing,
|
|
|
we have,
|
|
|
|
|
|
|
|
|
|
so that
|
|
|
(3.2) |
Hence the absolutely continuous part has density given by (3.1) and (3.2). Also, we have
|
|
|
But , so that
. Therefore, is a valid survival function if
-
(i)
is a convex mixture of and . From (3.2), since , we get
|
|
|
-
(ii)
is a valid survival function. From (3.1), since , we get for ,
|
|
|
This completes the proof.
∎
Counter Example 2.1 (contd.) LFR-exponential: Consider again the bivariate function given in (2.6). Condition (ii) of Theorem 3.2 becomes
|
|
|
which is not satisfied for the choice of and . This again shows that a bivariate distribution cannot be formed with marginal being the linear failure rate model and baseline as exponential distribution in (2.2). However any proportional hazard distribution qualify as marginals in (2.2). This is discussed in detail in the next theorem.
Theorem 3.3.
The survival function with the marginals belonging to the PH class satisfies the functional equation (2.1) if and only if it is of the form
|
|
|
|
(3.3) |
for some with .
Proof.
Suppose (2.1) is satisfied. Then, the general solution of is given by (2.2). Let the marginals belong to the PH class so that . Then, the bivariate function in (2.2) reduces to (3.3). Now, it remains to show that is a valid bivariate distribution. Proceeding as in Theorem 3.2,
|
|
|
|
|
|
|
|
(3.4) |
Then,
|
|
|
|
|
|
|
|
so that .
Hence, the absolute continuous part has density given by,
|
|
|
(3.5) |
We have, . But, so that . From (i) and (ii) as in Theorem 3.2, we have which is true since . Also, is a valid survival function implying which is true for any choice of the baseline distribution . The converse is direct.
∎
4 Construction of bivariate distributions satisfying functional equation (2.1)
The functional equation (2.1) can be equivalently represented in terms of the hazard gradient. The discussions above translated in terms of the hazard gradient would make it computationally easier to check if marginals qualify for the function (2.2) to be a bivariate survival function. Accordingly in this section we visit the functional equation in terms of its hazard gradient. The hazard gradient vector (Johnson and Kotz (1975)) is defined as
|
|
|
|
|
|
|
|
|
|
|
|
where .
Note that the functional equation (2.1) can be equivalently expressed as
|
|
|
Differentiating with respect to , we get,
|
|
|
which gives a condition in terms of the hazard gradient to satisfy the functional equation (2.1) as
|
|
|
(4.1) |
where is the baseline hazard function. On integrating (4.1), we can retrieve the functional equation (2.1).
Theorem 4.4.
A bivariate random vector satisfy (2.1) if and only if
|
|
|
(4.2) |
where is the marginal hazard rate of .
Proof.
Suppose satisfy,
|
|
|
|
|
|
For we have for ,
|
|
|
|
|
|
|
|
Taking negative of the logarithms on both sides and differentiating w.r.t we get,
|
|
|
and
|
|
|
In a similar manner we can prove the result for to obtain (4.2). The converse is straight forward through the representation
|
|
|
or
|
|
|
(Johnson and Kotz (1975)).
∎
If we closely investigate equation (4.2), we see that the marginal hazard function should satisfy the condition
|
|
|
(4.3) |
since . Based on these observations necessary and sufficient conditions for differentiable to qualify as marginal failure rates for to be a bivariate survival function is discussed in the next theorem.
Theorem 4.5.
The necessary and sufficient conditions for differentiable functions to qualify as marginal failure rates of in (2.2) is that for ,
-
(i)
-
(ii)
-
(iii)
-
(iv)
where . The bivariate function given by equation (2.2) is a survival function satisfying the functional equation in (2.1) where
|
|
|
(4.4) |
Proof.
Condition (i) follows from equation (4.3) and condition (ii) is necessary for to be a hazard rate function. Conditions (iii) and (iv) implies conditions (i) and (ii) of Theorem 3.2.
Conversely the hazard gradient associated with is given by (4.2) and the condition (i) follows from the non-negativity of . Since are univariate failure rates, condition (ii) follows. Condition (iii) follows from the non-negativity of the density function associated with and condition (iv) follows from Theorem 3.2.
∎
Counter Example 2.1 (contd.) LFR-exponential: The marginal failure rates are given by
|
|
|
Then the condition (i) in Theorem 4.5 becomes which is not satisfied for the choice of and . Hence, we cannot construct a bivariate survival function with the marginals as linear failure rate and baseline distribution as exponential.
Example 4.1.
BPHC-Weibull: For a proportional hazard rate,
|
|
|
with and , it is easily seen that conditions of Theorem 4.5 is satisfied. For
-
(i)
-
(ii)
-
(iii)
-
(iv)
.
for . From (4.4), , where . Substituting in (2.2), we get
|
|
|
|
Hence a class of distributions which include the bivariate semi-parametric singular family of distributions given by Kundu (2022) has been proposed as a general solution to the functional equation in (2.1). The estimation of popular distributions belonging to this class have been already discussed in the literature (see Proschan and Sullo (1976), Pena and Gupta (1990), Kundu (2022)).
Conflict of Interest Statement
The authors have no conflicts of interest to declare.
Acknowledgments
This work was financially supported by the Council of Scientific & Industrial Research (CSIR), Government of India through the Senior Research Fellowship scheme vide No 09/239(0551) /2019-EMR-I.