
Moments of the multivariate Beta distribution

Feng Zhao
Abstract

In this paper, we extend the Beta distribution to $2\times 2$ matrices and give analytical formulas for its moments. These formulas can be used to analyze the asymptotic behavior of the Beta distribution for $2\times 2$ matrices.

keywords:
multivariate Beta distribution, higher moments

1 Introduction

Moments of a probability distribution are an important topic in statistics. Given its moment sequence, a probability distribution is unique under some mild conditions. Hence, to prove the convergence of a sequence of random variables, we can instead prove the convergence of their moment sequences. To accomplish this, an analytical form of the moments is a prerequisite, and the techniques used to compute moments differ from distribution to distribution. In this article, we focus on the Beta distribution for $2\times 2$ matrices.

Dawid introduces a multivariate extension of the Beta distribution, denoted by $\mathbf{B}(\alpha,\beta;I_{p})$ (see [1]). It is a random $p\times p$ symmetric matrix $W$ whose density function is given by

\[
p(w)=\frac{1}{B_{p}(\alpha,\beta)}\,\lvert w\rvert^{\alpha-\frac{p+1}{2}}\,\lvert I-w\rvert^{\beta-\frac{p+1}{2}}\qquad\textrm{where }w,\ I-w\in S_{p,p}^{++}
\tag{1}
\]
\[
B_{p}(\alpha,\beta)=\int_{w,\,I-w\in S_{p,p}^{++}}\lvert w\rvert^{\alpha-\frac{p+1}{2}}\,\lvert I-w\rvert^{\beta-\frac{p+1}{2}}\,dw\qquad\textrm{where }\alpha,\beta>\frac{p-1}{2}
\tag{2}
\]

$B_{p}(\alpha,\beta)$ is called the multivariate Beta function (see [5]); $\lvert W\rvert$ denotes the determinant of the matrix $W$ and $S_{p,p}^{++}$ is the set of $p\times p$ positive definite matrices. When $p=1$, the distribution reduces to the ordinary Beta distribution on $0<x<1$.
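As a quick numerical companion to this definition, the following Python sketch draws samples from $\mathbf{B}(\alpha,\beta;I_{2})$. It relies on the standard Wishart-quotient construction $W=(A+B)^{-1/2}A(A+B)^{-1/2}$ with independent $A\sim W_{2}(2\alpha,I)$ and $B\sim W_{2}(2\beta,I)$, which is not discussed in this paper and is assumed here to match density (1); the helper name `sample_beta2` and the test values are ours.

```python
# Monte Carlo sketch (not part of the paper): sample W ~ B(alpha, beta; I_2)
# via the assumed Wishart-quotient construction
#   W = (A + B)^{-1/2} A (A + B)^{-1/2},  A ~ W_2(2*alpha, I),  B ~ W_2(2*beta, I).
import numpy as np
from scipy.stats import wishart


def inv_sqrt_spd(M):
    """Inverse symmetric square root of a symmetric positive-definite matrix."""
    vals, vecs = np.linalg.eigh(M)
    return vecs @ np.diag(1.0 / np.sqrt(vals)) @ vecs.T


def sample_beta2(alpha, beta, n_samples, seed=0):
    """Draw n_samples matrices from B(alpha, beta; I_2); alpha, beta > 1/2 assumed."""
    rng = np.random.default_rng(seed)
    A = wishart.rvs(df=2 * alpha, scale=np.eye(2), size=n_samples, random_state=rng)
    B = wishart.rvs(df=2 * beta, scale=np.eye(2), size=n_samples, random_state=rng)
    W = np.empty_like(A)
    for i in range(n_samples):
        R = inv_sqrt_spd(A[i] + B[i])
        W[i] = R @ A[i] @ R
    return W  # shape (n_samples, 2, 2)


if __name__ == "__main__":
    W = sample_beta2(2.0, 3.0, 100_000)
    # sanity check: E[X] should be close to alpha / (alpha + beta) = 0.4 (cf. Theorem 1)
    print(W[:, 0, 0].mean())
```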

This extension may have useful applications in multivariate statistical problems, but little is known about its analytical properties. For example, it is unknown whether the moments $\mathbb{E}[f(W)]$ can be written in a concise form when $f$ is a monomial in the entries of the positive definite matrix $W$.

Konno has derived formulas for the moments up to second order (see [4]). In this paper, we focus on the case $p=2$ and deduce an analytical form for the moments of $\mathbf{B}(\alpha,\beta;I_{2})$. This includes the expectation and variance, which come from the first- and second-order moments respectively. As far as we know, our moment formula is novel, and it can be used directly in computations involving multivariate Beta models in place of approximate numerical integration.

In this article, the following notational conventions are adopted: $W=\begin{pmatrix}X&Z\\ Z&Y\end{pmatrix}$ is the symmetric random matrix under consideration. Its density function is given by Equation (1), which can also be treated as the joint density function of $X,Y,Z$; in particular $\lvert W\rvert=XY-Z^{2}$. Let $\mathbb{E}_{\alpha,\beta}[f(X,Y,Z)]=\int f(x,y,z)\,p(w)\,dw$ denote the expectation with respect to $\mathbf{B}(\alpha,\beta;I_{2})$, where $f(\cdot,\cdot,\cdot)$ is an arbitrary function of three variables. We will compute $\mathbb{E}_{\alpha,\beta}[f(X,Y,Z)]$ when $f$ takes the monomial form $f(X,Y,Z)=X^{m}Y^{r}Z^{2t}$.

2 Marginal Distribution

In this section we compute $\mathbb{E}_{\alpha,\beta}[f(X,Y,Z)]$ for $f(X,Y,Z)=X^{m}$ and show that $X$ follows a one-dimensional Beta distribution. To accomplish this, we need the following lemma:

Lemma 1.

Let $A=XY-Z^{2}$ and $B=1-X-Y+A$; then we have

\[
\mathbb{E}_{\alpha,\beta}[A\,f(X,Y,Z)]=\frac{\alpha(\alpha-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}\,\mathbb{E}_{\alpha+1,\beta}[f(X,Y,Z)]
\tag{3}
\]
\[
\mathbb{E}_{\alpha,\beta}[B\,f(X,Y,Z)]=\frac{\beta(\beta-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}\,\mathbb{E}_{\alpha,\beta+1}[f(X,Y,Z)]
\tag{4}
\]
Proof.

For the multivariate Beta function we have $B_{p}(a,b)=\frac{\Gamma_{p}(a)\Gamma_{p}(b)}{\Gamma_{p}(a+b)}$, where $\Gamma_{p}$ is the multivariate Gamma function (see [3]). For $p=2$ we have $\Gamma_{2}(a)=\sqrt{\pi}\,\Gamma(a)\Gamma(a-1/2)$. Since multiplying the density of $\mathbf{B}(\alpha,\beta;I_{2})$ by $A=\lvert w\rvert$ yields $\frac{B_{2}(\alpha+1,\beta)}{B_{2}(\alpha,\beta)}$ times the density of $\mathbf{B}(\alpha+1,\beta;I_{2})$, we obtain

\begin{align*}
\frac{\mathbb{E}_{\alpha,\beta}[A\,f(X,Y,Z)]}{\mathbb{E}_{\alpha+1,\beta}[f(X,Y,Z)]}
&=\frac{B_{2}(\alpha+1,\beta)}{B_{2}(\alpha,\beta)}\\
&=\frac{\Gamma_{2}(\alpha+1)}{\Gamma_{2}(\alpha)}\,\frac{\Gamma_{2}(\alpha+\beta)}{\Gamma_{2}(\alpha+\beta+1)}\\
&=\frac{\Gamma(\alpha+1)}{\Gamma(\alpha)}\,\frac{\Gamma(\alpha+1/2)}{\Gamma(\alpha-1/2)}\,\frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha+\beta+1)}\,\frac{\Gamma(\alpha+\beta-1/2)}{\Gamma(\alpha+\beta+1/2)}\\
&=\frac{\alpha(\alpha-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}
\end{align*}

Thus Equation (3) is proved and Equation (4) follows similarly. ∎
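As a sanity check, Equation (3) can be verified by simulation. The sketch below is a minimal illustration, not part of the paper; it reuses the hypothetical `sample_beta2` helper from the sketch in Section 1 and takes $f(X,Y,Z)=X$ with the arbitrary test values $\alpha=2$, $\beta=3$.

```python
# Numerical check of Equation (3) with f(X, Y, Z) = X (illustrative only).
import numpy as np

# sample_beta2 is the assumed helper from the sketch in Section 1
alpha, beta, N = 2.0, 3.0, 200_000

W = sample_beta2(alpha, beta, N)                     # draws from B(alpha, beta; I_2)
W_shift = sample_beta2(alpha + 1, beta, N, seed=1)   # draws from B(alpha+1, beta; I_2)

X, Y, Z = W[:, 0, 0], W[:, 1, 1], W[:, 0, 1]
A = X * Y - Z**2                                     # A = |W|

lhs = np.mean(A * X)                                 # E_{alpha,beta}[A X]
ratio = alpha * (alpha - 0.5) / ((alpha + beta) * (alpha + beta - 0.5))
rhs = ratio * np.mean(W_shift[:, 0, 0])              # ratio * E_{alpha+1,beta}[X]
print(lhs, rhs)  # the two values should agree up to Monte Carlo error
```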

Using the above Lemma, we give the main conclusion of this section:

Theorem 1.

$\mathbb{E}_{\alpha,\beta}[X^{m}]=\prod_{i=0}^{m-1}\frac{\alpha+i}{\alpha+\beta+i}$, and $X$ follows the Beta distribution $\mathrm{Beta}(\alpha,\beta)$.

Proof.

Since the positions of $X$ and $Y$ are symmetric, $\mathbb{E}_{\alpha,\beta}[X]=\mathbb{E}_{\alpha,\beta}[Y]$. Taking the expectation with respect to $\mathbf{B}(\alpha,\beta;I_{2})$ on both sides of $B=1-X-Y+A$ and using Lemma 1, we have

\[
\frac{\beta(\beta-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}=1-2\,\mathbb{E}_{\alpha,\beta}[X]+\frac{\alpha(\alpha-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}
\]

Solving the above equation, we get $\mathbb{E}_{\alpha,\beta}[X]=\frac{\alpha}{\alpha+\beta}$. Using Equation (3) recursively with $f(X,Y,Z)=X$, we obtain $\mathbb{E}_{\alpha,\beta}[X^{m}]=\prod_{i=0}^{m-1}\frac{\alpha+i}{\alpha+\beta+i}$. This expression coincides with the moment sequence of the Beta distribution on the bounded interval $[0,1]$; since a distribution supported on a bounded interval is determined by its moments, we conclude that $X$ indeed follows the Beta distribution $\mathrm{Beta}(\alpha,\beta)$. ∎
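The moment formula and the Beta marginal can likewise be checked numerically. The sketch below is illustrative only; it reuses the hypothetical `sample_beta2` helper from Section 1 and compares the empirical moments of $X$ with the product formula and with the moments of $\mathrm{Beta}(\alpha,\beta)$ computed by SciPy.

```python
# Numerical check of Theorem 1 (illustrative only; sample_beta2 from Section 1).
import numpy as np
from scipy.stats import beta as beta_dist

a, b, N = 2.0, 3.0, 200_000
X = sample_beta2(a, b, N)[:, 0, 0]

for m in range(1, 5):
    empirical = np.mean(X**m)
    formula = np.prod([(a + i) / (a + b + i) for i in range(m)])
    print(m, empirical, formula, beta_dist.moment(m, a, b))  # all three should agree
```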

3 Mixed Moments

In this section, we further compute $\mathbb{E}_{\alpha,\beta}[X^{m}Y^{r}Z^{t}]$. Since the density (1) is invariant under $Z\mapsto-Z$, we have $\mathbb{E}_{\alpha,\beta}[X^{m}Y^{r}Z^{2t+1}]=0$, so we only need to consider even powers of $Z$. First we consider the case $r=0$:

Theorem 2.
\[
\mathbb{E}_{\alpha,\beta}[X^{m}Z^{2t}]=\frac{(2t-1)!!}{2^{t}}\,\prod_{i=0}^{t-1}\frac{\beta+i}{\alpha+\beta+i-1/2}\;\frac{\prod_{i=0}^{t+m-1}(\alpha+i)}{\prod_{i=0}^{2t+m-1}(\alpha+\beta+i)}
\tag{5}
\]
Proof.

We prove Equation (5) by induction on $t$. First, Equation (5) holds for $t=0$ by Theorem 1. Let $A,B$ be as in Lemma 1. Suppose Equation (5) holds for $\mathbb{E}[Z^{2t-2}X^{m}]$ (for all admissible $\alpha,\beta$ and all $m$). Using $Z^{2}=XY-A=X(1-X+A-B)-A$, we have

\begin{align*}
\mathbb{E}_{\alpha,\beta}[Z^{2t}X^{m}]
&=\mathbb{E}_{\alpha,\beta}[Z^{2t-2}X^{m}(X-X^{2}+AX-BX-A)]\\
&=\mathbb{E}_{\alpha,\beta}[Z^{2t-2}(X^{m+1}-X^{m+2})]+\mathbb{E}_{\alpha,\beta}[AZ^{2t-2}(X^{m+1}-X^{m})]-\mathbb{E}_{\alpha,\beta}[BZ^{2t-2}X^{m+1}]\\
&=\left(1-\frac{\alpha+t+m}{\alpha+\beta+2t+m-1}\right)\mathbb{E}_{\alpha,\beta}[Z^{2t-2}X^{m+1}]\\
&\quad+\frac{\alpha(\alpha-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}\,\mathbb{E}_{\alpha+1,\beta}[Z^{2t-2}(X^{m+1}-X^{m})]\\
&\quad-\frac{\beta(\beta-1/2)}{(\alpha+\beta)(\alpha+\beta-1/2)}\,\mathbb{E}_{\alpha,\beta+1}[Z^{2t-2}X^{m+1}]\\
&=\frac{\beta+t-1}{\alpha+\beta+2t+m-1}\,\mathbb{E}_{\alpha,\beta}[Z^{2t-2}X^{m+1}]\\
&\quad+\frac{(\alpha+t+m)(\alpha-1/2)}{(\alpha+\beta+2t+m-1)(\alpha+\beta+t-3/2)}\left(1-\frac{\alpha+\beta+2(t-1)+m+1}{\alpha+t+m}\right)\mathbb{E}_{\alpha,\beta}[Z^{2t-2}X^{m+1}]\\
&\quad-\frac{(\beta+t-1)(\beta-1/2)}{(\alpha+\beta+2t+m-1)(\alpha+\beta+t-3/2)}\,\mathbb{E}_{\alpha,\beta}[Z^{2t-2}X^{m+1}]\\
&=\frac{(t-1/2)(\beta+t-1)}{(\alpha+\beta+t-3/2)(\alpha+\beta+2t+m-1)}\,\mathbb{E}_{\alpha,\beta}[Z^{2t-2}X^{m+1}]
\end{align*}

Applying Equation (5) at $t-1$ (with exponent $m+1$) to the right-hand side yields the claimed expression for $t$. ∎
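For instance, taking $m=0$, $t=1$ in Equation (5) gives $\mathbb{E}_{\alpha,\beta}[Z^{2}]=\frac{\alpha\beta}{2(\alpha+\beta-1/2)(\alpha+\beta)(\alpha+\beta+1)}$. The sketch below is a numerical illustration of Equation (5), again reusing the hypothetical `sample_beta2` helper from Section 1 with arbitrary test parameters.

```python
# Numerical check of Equation (5) (illustrative only; sample_beta2 from Section 1).
import numpy as np


def moment_xz(alpha, beta, m, t):
    """Right-hand side of Equation (5), i.e. E[X^m Z^{2t}] under B(alpha, beta; I_2)."""
    val = float(np.prod(np.arange(1, 2 * t, 2)))   # (2t - 1)!!
    val /= 2.0**t
    for i in range(t):
        val *= (beta + i) / (alpha + beta + i - 0.5)
    for i in range(t + m):
        val *= alpha + i
    for i in range(2 * t + m):
        val /= alpha + beta + i
    return val


a, b, N = 2.0, 3.0, 500_000
W = sample_beta2(a, b, N)
X, Z = W[:, 0, 0], W[:, 0, 1]
for m, t in [(0, 1), (1, 1), (2, 2)]:
    print((m, t), np.mean(X**m * Z**(2 * t)), moment_xz(a, b, m, t))
```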

From Theorem 2, we can obtain the general formula for the mixed moments when $m\geq r$:

Corollary 1.
\begin{align}
\mathbb{E}_{\alpha,\beta}[X^{m}Y^{r}Z^{2t}]
&=\frac{(2t-1)!!}{2^{t}}\,\frac{\prod_{j=0}^{t-1}(\beta+j)}{\prod_{j=0}^{t+r-1}(\alpha+\beta-1/2+j)}\,\frac{\prod_{j=0}^{t+m-1}(\alpha+j)}{\prod_{j=0}^{2t+m-1}(\alpha+\beta+j)}\notag\\
&\quad\cdot\sum_{i=0}^{r}\frac{1}{2^{i}}\binom{r}{i}\prod_{j=1}^{i}(2t-1+2j)\,\frac{\prod_{j=0}^{r-i-1}(\alpha-1/2+j)\prod_{j=0}^{i-1}(\beta+t+j)}{\prod_{j=0}^{i-1}(\alpha+\beta+2t+m+j)}
\tag{6}
\end{align}

Since $\mathbb{E}_{\alpha,\beta}[X^{m}Y^{r}Z^{2t}]=\mathbb{E}_{\alpha,\beta}[X^{r}Y^{m}Z^{2t}]$, when $m<r$ we can exchange $m$ with $r$ and then apply Corollary 1.

Proof.

By the definition of $A$ in Lemma 1, $XY=A+Z^{2}$, so $X^{m}Y^{r}Z^{2t}=X^{m-r}(A+Z^{2})^{r}Z^{2t}$. Using the binomial theorem we have $\mathbb{E}_{\alpha,\beta}[X^{m}Y^{r}Z^{2t}]=\sum_{i=0}^{r}\binom{r}{i}\mathbb{E}_{\alpha,\beta}[A^{r-i}X^{m-r}Z^{2(t+i)}]$. Then using Equation (3) recursively we have

\[
\mathbb{E}_{\alpha,\beta}[A^{r-i}X^{m-r}Z^{2(t+i)}]=\prod_{j=0}^{r-i-1}\frac{(\alpha+j)(\alpha+j-1/2)}{(\alpha+\beta+j)(\alpha+\beta+j-1/2)}\;\mathbb{E}_{\alpha+r-i,\beta}[X^{m-r}Z^{2(t+i)}]
\]

Using Theorem 2, we finally obtain the expression in Equation (6). ∎
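Equation (6) can be checked in the same way. The sketch below is illustrative only: it implements the right-hand side of Equation (6) directly and compares it with Monte Carlo estimates, again reusing the hypothetical `sample_beta2` helper from Section 1.

```python
# Numerical check of Equation (6) for m >= r (illustrative only; sample_beta2 from Section 1).
import numpy as np
from math import comb


def mixed_moment(alpha, beta, m, r, t):
    """Right-hand side of Equation (6), i.e. E[X^m Y^r Z^{2t}] for m >= r."""
    val = float(np.prod(np.arange(1, 2 * t, 2))) / 2.0**t   # (2t-1)!! / 2^t
    val *= np.prod([beta + j for j in range(t)])
    val /= np.prod([alpha + beta - 0.5 + j for j in range(t + r)])
    val *= np.prod([alpha + j for j in range(t + m)])
    val /= np.prod([alpha + beta + j for j in range(2 * t + m)])
    total = 0.0
    for i in range(r + 1):
        term = comb(r, i) / 2.0**i
        term *= np.prod([2 * t - 1 + 2 * j for j in range(1, i + 1)])
        term *= np.prod([alpha - 0.5 + j for j in range(r - i)])
        term *= np.prod([beta + t + j for j in range(i)])
        term /= np.prod([alpha + beta + 2 * t + m + j for j in range(i)])
        total += term
    return val * total


a, b, N = 2.0, 3.0, 500_000
W = sample_beta2(a, b, N)
X, Y, Z = W[:, 0, 0], W[:, 1, 1], W[:, 0, 1]
for m, r, t in [(1, 1, 0), (2, 1, 1), (3, 2, 1)]:
    print((m, r, t), np.mean(X**m * Y**r * Z**(2 * t)), mixed_moment(a, b, m, r, t))
```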

4 Case Study

In this section, we give a natural example that illustrates how our result can be used. Consider the random matrix $S=QQ^{T}$, where $Q$ is a uniformly distributed $n\times k$ matrix with orthonormal columns. We are interested in how $\mathbb{E}[S_{11}^{m}S_{12}^{2t}]$ behaves as $k\to\infty$ while the ratio $r=\frac{k}{n}$ is kept fixed (so that $n\to\infty$ as well).

From Proposition 7.2 of [2], $\begin{pmatrix}S_{11}&S_{12}\\ S_{21}&S_{22}\end{pmatrix}$ is exactly a $2\times 2$ matrix Beta random matrix with distribution $\mathbf{B}(\frac{k}{2},\frac{n-k}{2};I_{2})$. Using Theorem 2, we get $\mathbb{E}[S_{11}^{m}S_{12}^{2t}]\sim(2t-1)!!\,\frac{r^{t+m}(1-r)^{t}}{n^{t}}=O(n^{-t})$. That is, $\mathbb{E}[S_{11}^{m}S_{12}^{2t}]$ decays at the rate $n^{-t}$.
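The scaling can be illustrated numerically. The following sketch (illustrative only; the helper name is ours) draws $Q$ as the orthonormal factor of an $n\times k$ Gaussian matrix, which yields the same $S=QQ^{T}$ (column sign choices in the QR factorization do not affect $QQ^{T}$), and tracks $n^{t}\,\mathbb{E}[S_{11}^{m}S_{12}^{2t}]$ as $n$ grows with $r$ fixed; this product should approach the constant $(2t-1)!!\,r^{t+m}(1-r)^{t}$ stated above.

```python
# Numerical illustration of the n^{-t} scaling (illustrative only).
import numpy as np


def sample_s11_s12(n, k, n_samples, seed=0):
    """Draw S11, S12 for S = Q Q^T, Q the orthonormal factor of an n x k Gaussian matrix."""
    rng = np.random.default_rng(seed)
    s11 = np.empty(n_samples)
    s12 = np.empty(n_samples)
    for i in range(n_samples):
        G = rng.standard_normal((n, k))
        Q, _ = np.linalg.qr(G)      # n x k with orthonormal columns
        s11[i] = Q[0] @ Q[0]        # S_{11} = first row dotted with itself
        s12[i] = Q[0] @ Q[1]        # S_{12} = first row dotted with second row
    return s11, s12


m, t, r = 1, 1, 0.5
limit = float(np.prod(np.arange(1, 2 * t, 2))) * r**(t + m) * (1 - r)**t
for n in [20, 40, 80, 160]:
    k = int(r * n)
    s11, s12 = sample_s11_s12(n, k, 20_000)
    est = np.mean(s11**m * s12**(2 * t))
    print(n, est, n**t * est, "limit:", limit)  # n^t * est should approach the limit
```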

5 Conclusion

We have derived analytical formulas for the moments of the multivariate Beta distribution of a $2\times 2$ matrix. This result is helpful for analyzing other statistical properties of the multivariate Beta distribution.

References

  • [1] A. P. Dawid. Some matrix-variate distribution theory: Notational considerations and a Bayesian application. Biometrika, 68(1):265–274, 1981.
  • [2] Morris L. Eaton. Chapter 7: Random orthogonal matrices, volume 1 of Regional Conference Series in Probability and Statistics, pages 100–107. Institute of Mathematical Statistics and American Statistical Association, Hayward, CA and Alexandria, VA, 1989.
  • [3] A. E. Ingham. An integral which occurs in statistics. Mathematical Proceedings of the Cambridge Philosophical Society, 29(2):271–276, 1933.
  • [4] Yoshihiko Konno. Exact moments of the multivariate F and Beta distributions. Journal of the Japan Statistical Society, 18(2):123–130, 1988.
  • [5] Carl Ludwig Siegel. Über die analytische Theorie der quadratischen Formen. Annals of Mathematics, 36(3):527–606, 1935.