A Tractable Probability Distribution with Applications in Three-Dimensional Statistics
Seyed Mohammad Azimi-Abarghouyi
S. M. Azimi-Abarghouyi is with the School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden (Email: [email protected]).
Abstract
This paper introduces and characterizes a new family of continuous probability distributions applicable to norm distributions in three-dimensional random spaces, specifically for the Euclidean norm of three random Gaussian variables with non-zero means. The distribution is specified over the semi-infinite range and is notable for its computational tractability. Building on this foundation, we also introduce a separate family of continuous probability distributions suitable for power distributions in three-dimensional random spaces. Despite being previously unknown, these distributions are attractive for numerous applications, some of which are discussed in this work.
Index Terms:
Probability theory, continuous probability distribution, semi-infinite density function, Euclidean norm, 3D statistics.
I Introduction
To date, numerous probability distributions have been introduced and utilized in various branches of science, including the Rayleigh, Rice, and Nakagami distributions, among others [4, 5, 6, 3, 2, 1]. For example, Rice can model the distribution of Euclidean norms in two-dimensional random spaces.
In this work, we introduce, define, and characterize a new family of probability distributions, including its special cases, which we name Abar distribution111Abar means superior in Persian language, and refers to applications of this distribution for higher dimensions than two.. Also, we obtain its statistical measures in closed-forms. This distribution is recently discovered in our work [7] for a research in biological communication networks. However, to the best of our knowledge, this distribution has not been previously discovered. For example, a similar distribution cannot be found in websites and books on "probability and statistics" [4, 5, 6]. We investigate use-case applications for the Abar distribution that are applicable to Euclidean norm or distance statistics in three-dimensional random spaces. Our focus is on revealing that
this distribution admits a remarkably simple closed form, which is not even the case in the simpler two-dimensional statistics. The proposed distribution has a unique and completely different function structure that is impossible to be extrapolated from its lower or higher dimensional cases, which all include integrals in their functions. Using the Abar distribution, we also find another new family of probability distributions with its special cases, which we name Abar+ distribution. It is applicable to power statistics within three-dimensional random spaces. Due to their remarkable tractability and general applicability, the Abar and Abar+ distributions hold promise for a wide range of applications, particularly in areas such as communication networks, localization and signal processing, stochastic geometry, and machine learning. By highlighting these distributions and the potential applications, researchers will become aware of their existence and have the opportunity to utilize them effectively.
II Distribution Characterization
Definition 1
Let and two non-negative parameters. We define the Abar distribution with the following probability density function (PDF)
which is equal to the PDF of a Gaussian distribution with mean and variance , denoted as .
In Figs 1 and 2, the defined PDF in (1) is shown for different parameters and .
Remark 1
The structure of the Abar distribution’s PDF is fundamentally different from that of the Generalized Folded Normal (GFN) distribution [8]. This is primarily because the GFN is characterized by the positive addition of two exponential functions, whose scaling factors are not dependent on the variable, contrasting with the structure of the Abar distribution.
From (1), the cumulative distribution function (CDF) of the Abar distribution is obtained as
(4)
where . It can be observed in (II) that determines the ratio of the first part with exponential functions to the second part with erfc functions. Also, and , which justifies the validity of the PDF in (1).
where comes from the L’Hopital’s rule. This result denotes the fact that the terms and
in (II) contribute the same mean when . Also, when , we have .
For the variance, first
(7)
which is a very simple linear function of the parameters of the PDF, and , and then
(8)
When , we have . Also, when , .
Finally, the moment generating function (MGF) of this distribution is obtained as
(9)
It is notable that all the results in (1)-(9) are in tractable closed-forms. This fact renders the Abar distribution extremely attractive for use in various applications.
III Applications
We have discovered two applications that exhibit the Abar distribution, specifically relating to the norm of three Gaussian random variables. In contrast, applications involving the norm of two Gaussian random variables result in the Rice distribution [2, 9]. Unlike the Abar distribution, the Rice distribution cannot be expressed in a closed form and requires numerical computations involving Bessel functions. Additionally, it possesses a distinct structure compared to the Abar distribution. The simplicity found in the structure of the Abar distribution is fundamentally dependent on the three-dimensional assumption and cannot be inferred from cases with lower or higher dimensions. The unique and independent nature of the Abar distribution, even in comparison with the Rice distribution, raises an intriguing question: how can a simpler, closed-form function for the three-dimensional case emerge from the more complex, integral-based function characteristic of the two-dimensional case. This question is addressed in the following subsection. In the third application, we employ the Abar distribution to derive yet another new tractable distribution relevant to the squared norm of three Gaussian random variables.
Figure 1: The Abar PDF when .Figure 2: The Abar PDF when .
III-AApplication 1
It is presented in the following theorem.
Theorem 1
Assume the norm , where , , and are independent Gaussian random variables with the same variance and means , , and , respectively. Also, let . Then, has Abar distribution with parameters and .
Proof:
We can write where are independent identical Gaussian random variables with mean zero and variance . Then, we have
(10)
where the last equality is because Gaussian random variables are rotationally invariant. Then, by taking derivative of (III-A), the PDF of is obtained with the help of the Leibniz integral rule and then
(11)
where
(12)
The constant equality in (12) plays the crucial role for providing a simple closed-form PDF structure, which is not the possible case in lower or higher dimensions (since integrals persist in the final results).
Finally, we obtain
(13)
∎
III-BApplication 2
It is shown in our work [7, Th.1] that the distance distributions in the three-dimensional Thomas cluster process (TCP) have the Abar distribution.444Application 1 is the abstract form of Application 2. This pertains to the distance from each cluster center to any point in other clusters. TCP is a widely-used class of Poisson cluster processes in the field of stochastic geometry and has numerous applications in various domains such as biology and communication networks, serving as a valuable tool for modeling and analysis purposes [9, 7, 10, 11]. Given that the spatial domain in these fields is typically represented by the three dimensions , the PDF introduced and the findings presented in this work are anticipated to be of significant value for evaluating spatial patterns and conducting performance analyses. For a practical and real-time numerical example, please see [7, Sections IV, V, and VI] for the utilization and crucial application of Abar distribution in deriving the expected interference value and the error probability in communication nanonetworks, where nanomachine sensors are located based on three-dimensional TCP. An alternative application worth considering is the network of clustered unmanned aerial vehicles (UAVs), where three-dimensional TCP can be utilized for modeling the locations of UAVs in the sky.
In addition, the distance distribution is a widely used metric in signal processing and communication systems. For example, it can be used in target localization, such as satellite localization or range detection in radar systems, and also satellite communication systems. In these applications, there are three location dimensions and two-dimensional modeling is not accurate. Also, it has applications in signal (modulation) classification and outlier detection, where distances of signals from each other and the noise are important features.
Figure 3: The Abar+ PDF when .Figure 4: The Abar+ PDF when .
III-CApplication 3
The following theorem presents it.
Theorem 2
Assume , where , , and are independent Gaussian random variables with the same variance and means , , and , respectively. Also, let . Then, has the following PDF with parameters and
(14)
Proof:
The CDF of can be obtained from the CDF of Abar as
(15)
where is given in (II). Then, by taking derivative of (III-C) and using , the proof is complete. ∎
As the PDF in (2) has not been previously discovered [4, 5, 6], we define its distribution with the name Abar+.555This distribution, not present in [7], is discovered in the current study. The parameters and are non-negative. In Figs 3 and 4, the defined PDF in (2) is shown for different parameters and . Using the L’Hopital’s rule, the special case of this distribution for is given for as
(16)
which is equal to the Gamma distribution with the shape and scale , denoted as . In addition, for , (2) goes to the all-zero function.
The Abar+ distribution has the following statistical measures.666The proof has not been included.
Mean:
(17)
Variance:
(18)
MGF:
(19)
Remark 2
The Gamma distribution is associated with the sum of squared independent Gaussian random variables that have a zero mean. In contrast, the Abar+ distribution incorporates Gaussian variables with distinct, non-zero means.
As signal samples can be well approximately modeled by Gaussian random variables, where the mean represents the bias of signal, Application 3 possesses exceptional worth in the modeling and analysis of the power of additive signals, as well as in their subsequent processing. In wireless multipath environments, for instance, there exist three main signal paths between a transmitter and a receiver, which can be represented as Gaussian random variables with different means: 1) the direct link, 2) the reflection from ground nodes and surfaces, and 3) the reflection from sky nodes, such as UAVs and satellites.
It is our belief that there exists a wealth of additional and potentially unknown applications within the fields of pure and applied mathematics and other scientific domains, specially communications and signal processing. Furthermore, due to the remarkable tractability and closed-form structures, the outcomes of this work hold significant potential for applications in the areas of statistics and machine learning, particularly with regards to classification and clustering, where norm distributions can be leveraged [13, 12]. To be more specific, clustering is a technique that groups similar data points together based on their distance from each other. By modeling the distances between data points, clustering algorithms can be optimized to better identify and separate distinct clusters within the data. It is worth mentioning that higher dimensional spaces can be visualized in three dimensions. This is achieved through a process called dimensionality reduction [13]. Another approach is to divide a higher dimensional space into multiple three-dimensional spaces.
IV conclusions
In this work, we introduced a new probability distribution with a remarkably simple but diverse form of PDF. Then, we derived its statistical measures, all of which are in closed forms. Additionally, we provided asymptotic cases, one of which approaches Gaussian distribution. Furthermore, we made another new discovery by finding another probability distribution, which includes the Gamma distribution as a special case. We highlighted diverse applications of the proposed probability distributions in the fields of pure mathematics, statistics and learning, stochastic geometry, signal processing, and communication systems and networks.
References
[1]
L. Rayleigh, "On the problem of random vibrations," Philosophical Magazine, vol. 37, no. 220, pp. 321-347, 1919.
[2]
S. O. Rice, "Mathematical analysis of random noise," The Bell System Technical Journal, vol. 23, no. 3, pp. 282-332, 1944.
[3]
M. Nakagami, "The -distribution: A general formula of intensity distribution of rapid fading," Statistical Methods in Radio Wave Propagation, pp. 3-35, 1960.
[5]
A. Papoulis and U. Pillai, Probability, Random Variables and Stochastic Processes. McGraw-Hill Europe, 2002.
[6]
N. L. Johnson, S. Kotz, and N. Balakrishnan. Continuous Univariate Distributions. Vols. 1-2, John Wiley & Sons, 1994.
[7]
S. M. Azimi-Abarghouyi, H. S. Dhillon, and L. Tassiulas, "Fundamentals of clustered molecular nanonetworks," IEEE Transactions on Molecular, Biological and Multi-Scale Communications, vol. 9, no. 2, pp. 135-145, June 2023.
[8]
P. C. Lin, "Application of the generalized folded-normal distribution to the process capability measures," Int. J. Adv. Manuf. Technol., vol. 26, pp. 825-830, 2005.
[9]
M. Afshang, H. S. Dhillon, and P. H. J. Chong, "Modeling and performance analysis of clustered device-to-device networks," IEEE Trans. Wireless Commun., vol. 15, no. 7, pp. 4957-4972, July 2016.
[10]
M. Haenggi, Stochastic Geometry for Wireless Networks. Cambridge University Press, 2012.
[11]
S. M. Azimi-Abarghouyi, B. Makki, M. Haenggi, M. Nasiri-Kenari, and
T. Svensson, "Stochastic geometry modeling and analysis of single- and
multi-cluster wireless networks," IEEE Trans. Commun., vol. 66, no. 10, pp. 4981-4996, Oct. 2018.
[12]
K. Murphy, Machine Learning: A Probabilistic Perspective. MIT, 2012.
[13]
M. Wainwright, High-Dimensional Statistics: A Non-Asymptotic Viewpoint. Cambridge University Press, 2019.