A Note On Convexity Inequalities Of Weighted Matrix Geometric Means

Victoria M. Chayes
Department of Mathematics
Rutgers University
Piscataway, NJ 08854

Abstract

We offer a new proof of uniform convexity inequalities for the Finsler manifold of nonpositive curvature taken on the space of positive-semidefinite matrices with the weighted matrix geometric mean defining the geodesic between two points. Using the technique of log majorization, we are able to characterize the equality cases of said inequalities: they occur if and only if the matrices commute, and hence are the same as in $\ell^{p}$ .

Keywords Matrix Geometric Mean $\cdot$ Uniform Convexity $\cdot$ $p$ -Schatten Norms $\cdot$ Log Majorization

1 Introduction

We can consider the space of positive $n\times n$ matrices as a Riemannian manifold with geodesic distance between $A,B\in\mathbf{P}_{n\times n}(\mathbb{C})$ of

\inf L(\gamma);\;\;\;L(\gamma):=\int_{a}^{b}||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||_{2}dt,

(1.1)

with $\gamma:[a,b]\rightarrow\mathbf{P}_{n\times n}(\mathbb{C})$ being a smooth curve between $A$ and $B$ and $L(\gamma)$ its arc length.

This space was first introduced in Skovgaard [15] for its applications in statistics, but is of general interest as it is a quintessential example of a metric space with non-positive curvature. The corresponding unit time geodesic distance

\delta_{2}(A,B)=\inf\left\{\int_{0}^{1}||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||_{2}\;dt,\;\;\;\gamma(0)=A,\;\gamma(1)=B\right\}

(1.2)

is reached uniquely by the path

\gamma(t)=A^{\frac{1}{2}}\left(A^{-\frac{1}{2}}BA^{-\frac{1}{2}}\right)^{t}A^{\frac{1}{2}}

(1.3)

with midpoint

A\#B=A^{\frac{1}{2}}\left(A^{-\frac{1}{2}}BA^{-\frac{1}{2}}\right)^{\frac{1}{2}}A^{\frac{1}{2}}.

(1.4)

This midpoint is the ‘geometric mean’ of matrices $A$ and $B$ , introduced by Pusz and Woronowicz [14] as a way of generalizing $\sqrt{xy}$ to sesquilinear forms, and the location along the geodesic for $t\in[0,1]$ is the ‘weighted geometric mean’, which has been studied in great detail with respect to its relation to the Golden-Thompson inequality, quantum entropy, relative quantum entropy, and Rényi divergences [1] [2] [6] [12] [13] .

Distance in this metric is invariant under conjugation, as

|||(X\gamma(t)X^{\ast})^{-1/2}(X\gamma(t)X^{\ast})^{\prime}(X\gamma(t)X^{\ast})^{-1/2}|||=|||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}|||

(1.5)

for any unitarily invariant norm $|||\cdot|||$ , including $||\cdot||_{2}$ . When $A$ and $B$ commute, we have

\gamma(t)=A^{1-t}B^{t}

(1.6)

and

\delta_{2}(A,B)=||\log(A^{-\frac{1}{2}}BA^{-\frac{1}{2}})||_{2}=||\log(A)-\log(B)||_{2};

(1.7)

and in general,

\delta_{2}(A,B)=||\log(A^{-\frac{1}{2}}BA^{-\frac{1}{2}})||_{2}\geq||\log(A)-\log(B)||_{2}.

(1.8)
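As a small sanity check of the commuting case, with matrices chosen here purely for illustration, take the diagonal matrices $A=\operatorname{diag}(1,e)$ and $B=\operatorname{diag}(e^{2},e)$ . Then Equations (1.6) and (1.7) give

\gamma(t)=A^{1-t}B^{t}=\operatorname{diag}(e^{2t},e),\qquad A\#B=\gamma(1/2)=\operatorname{diag}(e,e),

\delta_{2}(A,B)=||\log(A)-\log(B)||_{2}=||\operatorname{diag}(-2,0)||_{2}=2,

and the midpoint satisfies $\delta_{2}(A,A\#B)=\delta_{2}(A\#B,B)=1$ , as one expects of a geodesic midpoint.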

However, instead of taking the Hilbert-Schmidt norm, arc length and, correspondingly, geodesic distance can be defined with the $p$ -Schatten norm for $1<p<\infty$ (the $p=1$ case can be defined in the same manner, but the weighted geometric mean is no longer the unique geodesic) as

L(\gamma):=\int_{a}^{b}||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||_{p}\;dt,

(1.9)

now producing a Finsler manifold of non-positive curvature. Equation (1.3) is still the unique geodesic between two matrices $A$ and $B$ [4], but now with distance

\delta_{p}(A,B)=||\log(A^{-\frac{1}{2}}BA^{-\frac{1}{2}})||_{p}.

(1.10)

Note distance is still invariant under conjugation, and a number of properties of the geodesic are preserved. Full properties of this Finsler manifold are explored in [4], both using techniques involving derivatives of the exponential map, and using log majorization.

Two significant advances have been made recently: in [9], Conde looks at the generalized Finsler manifold (i.e. $1<p<\infty$ ) and uses the Clarkson–McCarthy inequalities

2\left(||A||_{p}^{p}+||B||_{p}^{p}\right)\leq||A+B||_{p}^{p}+||A-B||_{p}^{p}\leq 2^{p-1}\left(||A||_{p}^{p}+||B||_{p}^{p}\right)

(1.11)

for $2\leq p<\infty$ and reversing for $1\leq p\leq 2$ to derive $r$ -uniform convexity inequalities as defined in [3]; most significantly, Conde establishes

\delta_{p}(A\#B,C)^{2}\leq\frac{1}{2}\delta_{p}(A,C)^{2}+\frac{1}{2}\delta_{p}(B,C)^{2}-\frac{p-1}{4}\delta_{p}(A,B)^{2}\qquad\qquad 1<p\leq 2

(1.12)

\delta_{p}(A\#B,C)^{p}\leq\frac{1}{2}\delta_{p}(A,C)^{p}+\frac{1}{2}\delta_{p}(B,C)^{p}-\frac{1}{2^{p}}\delta_{p}(A,B)^{p}\qquad\qquad 2\leq p

(1.13)

Then in [5], Bhatia and Holbrook look to define a geometric mean for three positive matrices and analyze this in terms of convex hulls for the Riemannian metric ( $p=2$ ), in particular using a different method of proof relying on log majorization and conjugation invariance of the distance.

This paper adds a further note to these results: using the methodology of Bhatia and Holbrook, we can show that all inequalities must be strict unless the matrices involved $\Gamma$ -commute, reducing the equality cases of these inequalities to those in $\ell^{p}$ for both two-matrix and three-matrix inequalities. This comes from a fairly simple but vital theorem: the inequality of Equation (1.8) is an equality if and only if $A$ and $B$ commute. In Section 2 we introduce the technique of log majorization to prove this theorem, and in Section 3 we use it to prove the equality cases of Equations (1.12)-(1.13).

We note that the equality case is of particular interest, because Equations (1.12)-(1.13) are used to establish coefficients such as the modulus of convexity of the space being considered. The coefficients in $\ell^{p}$ are known to be the best coefficients. Showing that the inequalities are strict except in commuting cases shows that for these matrix spaces, non-commutativity always makes things ‘worse’. This is notable, given that there are some inequalities (e.g. the singular value rearrangement inequalities in [8]) where inequalities between positive-semidefinite matrices $A$ and $\{UBU^{\ast}\colon U\text{ unitary}\}$ are not minimized when $A$ and $UBU^{\ast}$ commute.

2 Log Majorization And Equality Cases

Let $\mathbf{a},\mathbf{b}\in\mathbb{R}^{n}$ with components labeled in descending order $a_{1}\geq\dots\geq a_{n}$ and $b_{1}\geq\dots\geq b_{n}$ . Then $\mathbf{b}$ weakly majorizes $\mathbf{a}$ , written $\mathbf{a}\prec_{w}\mathbf{b}$ , when

\sum_{i=1}^{k}a_{i}\leq\sum_{i=1}^{k}b_{i},\qquad 1\leq k\leq n

(2.1)

and majorizes $\mathbf{a}\prec\mathbf{b}$ when the final inequality is an equality. Weak log majorization $\mathbf{a}\prec_{w(\log)}\mathbf{b}$ is similarly defined for non-negative vectors as

\prod_{i=1}^{k}a_{i}\leq\prod_{i=1}^{k}b_{i},\qquad 1\leq k\leq n

(2.2)

with log majorization $\mathbf{a}\prec_{(\log)}\mathbf{b}$ when the final inequality is an equality. Majorization applied to matrices $A$ and $B$ is understood to be applied to the vectors of their singular values respectively, and is a vital tool for proving inequalities with respect to any unitarily invariant norms.
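To illustrate these definitions with vectors chosen only as an example, take $\mathbf{a}=(3,2,1)$ and $\mathbf{b}=(4,2,0)$ . The partial sums in (2.1) satisfy

3\leq 4,\qquad 3+2\leq 4+2,\qquad 3+2+1=4+2+0,

so $\mathbf{a}\prec\mathbf{b}$ . Similarly, for the non-negative vectors $\mathbf{a}=(2,2)$ and $\mathbf{c}=(4,1)$ the partial products in (2.2) satisfy

2\leq 4,\qquad 2\cdot 2=4\cdot 1,

so $\mathbf{a}\prec_{(\log)}\mathbf{c}$ .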

An important relationship between majorization and log majorization was established in [11] (Lemma 2.2):

Lemma 2.1.

Let $\mathbf{a},\mathbf{b}\in\mathbb{R}_{+}^{n}$ with $\mathbf{a}\prec_{(\log)}\mathbf{b}$ . Suppose there exists a strictly convex function $\Phi:\mathbb{R}\rightarrow\mathbb{R}$ such that $\sum_{i=1}^{n}\Phi(a_{i})=\sum_{i=1}^{n}\Phi(b_{i})$ . Then $\mathbf{a}=\Theta\mathbf{b}$ for some permutation matrix $\Theta$ .

This was extended in [7] (Theorem 2.3) to general strictly convex functions:

Lemma 2.2.

Let $\mathbf{a},\mathbf{b}\in\mathbb{R}^{n}$ with $\mathbf{a}\prec\mathbf{b}$ . Suppose there exists a strictly convex function $\Phi:\mathbb{R}\rightarrow\mathbb{R}$ such that $\sum_{i=1}^{n}\Phi(a_{i})=\sum_{i=1}^{n}\Phi(b_{i})$ . Then $\mathbf{a}=\Theta\mathbf{b}$ for some permutation matrix $\Theta$ .
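A worked instance may make the role of strict convexity in Lemma 2.2 concrete; the vectors below are an illustrative assumption and are not taken from [7]. Let $\mathbf{a}=(2,2)$ and $\mathbf{b}=(3,1)$ , so that $\mathbf{a}\prec\mathbf{b}$ (since $2\leq 3$ and $2+2=3+1$ ) while $\mathbf{a}$ is not a permutation of $\mathbf{b}$ . For $\Phi(x)=|x|^{p}$ with $p>1$ ,

\sum_{i}\Phi(a_{i})=2\cdot 2^{p}=2\left(\frac{3+1}{2}\right)^{p}<3^{p}+1^{p}=\sum_{i}\Phi(b_{i}),

by strict convexity, as the lemma requires. For $p=2$ this reads $8<10$ , whereas for $p=1$ , where $\Phi$ is convex but not strictly convex, both sides equal $4$ .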

This, along with some known majorization results on matrices with equality cases from [11], gives us the tools we need to prove the equality conditions:

Theorem 2.3.

Let $A,B\in\mathbf{P}_{n}$ , and suppose $[A,B]\neq 0$ and $p>1$ . Then

\delta_{p}(A,B)>||\log(A)-\log(B)||_{p}

(2.3)

Proof.

By [11] (Theorem 3.1), for any two Hermitian matrices $H,K$ , the majorization inequality

\lambda(H+K)\prec\lambda\left(\log\left(e^{K/2}e^{H}e^{K/2}\right)\right)

(2.4)

holds. Writing $H=\log(A)$ , $K=-\log(B)$ for positive matrices $A,B$ , we rewrite Equation (2.4) as

\lambda(\log(A)-\log(B))\prec\lambda\left(\log\left(B^{-1/2}AB^{-1/2}\right)\right)

(2.5)

and as $x\mapsto|x|^{p}$ is strictly convex for $p>1$ , then by Lemma 2.2 the equality case of

\delta_{p}(A,B)=||\log(A)-\log(B)||_{p}

(2.6)

implies that $\lambda(\log(A)-\log(B))=\lambda\left(\log\left(B^{-1/2}AB^{-1/2}\right)\right)$ . Then by [11] (Corollary 2.4), $H$ and $K$ and hence $A$ and $B$ must commute. ∎
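One step is left implicit in the proof above: Equation (2.5) involves $B^{-1/2}AB^{-1/2}$ , whereas the distance $\delta_{p}(A,B)$ is written in terms of $A^{-1/2}BA^{-1/2}$ . A short way to see that the two give the same norm, sketched here using only the positivity of $A$ and $B$ , is via similarity:

B^{-1/2}AB^{-1/2}=B^{-1/2}\left(AB^{-1}\right)B^{1/2},\qquad A^{-1/2}BA^{-1/2}=A^{-1/2}\left(AB^{-1}\right)^{-1}A^{1/2}.

Hence the eigenvalues of these two positive matrices are reciprocals of one another, the eigenvalues of their logarithms differ only by sign, and

||\log(B^{-1/2}AB^{-1/2})||_{p}=||\log(A^{-1/2}BA^{-1/2})||_{p}=\delta_{p}(A,B).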

3 Convexity Implications

We will make use of the concept of $\Gamma$ -commuting for equality cases as established in [5]:

Definition 3.1.

Let $\Gamma_{X}(A)=XAX^{\ast}$ for any invertible $X\in M_{n\times n}(\mathbb{C})$ . Then $A,B,C\in\mathbf{P}_{n\times n}(\mathbb{C})$ $\Gamma$ -commute if there exists some invertible $X\in M_{n\times n}(\mathbb{C})$ such that $\Gamma_{X}(A),\Gamma_{X}(B)$ , and $\Gamma_{X}(C)$ all commute.

with the following equivalent conditions:

Lemma 3.2.

$A,B,C\in\mathbf{P}_{n\times n}(\mathbb{C})$ $\Gamma$ -commute if and only if $AB^{-1}C=CB^{-1}A$ if and only if $[A^{-\frac{1}{2}}BA^{-\frac{1}{2}},A^{-\frac{1}{2}}CA^{-\frac{1}{2}}]=0$ .

Note that these conditions clearly imply the reduction to standard commuting when $C=I$ ; i.e. $A,B,I$ $\Gamma$ -commute if and only if $[A,B]=0$ .
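Spelled out, this reduction is a one-line computation from the second condition of Lemma 3.2 with $C=I$ , obtained by multiplying on the left and on the right by $B$ :

AB^{-1}=B^{-1}A\iff B\left(AB^{-1}\right)B=B\left(B^{-1}A\right)B\iff BA=AB.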

We also introduce notation for the ‘exponential unit sphere’

E_{p}=\left\{U\colon\;\;U\in\mathbf{P}_{n},\;\delta_{p}(U,I)=1\right\},

(3.1)

as elements of the unit sphere appear in the traditional convexity inequalities we want to explore; we also state the inequalities in their more traditional form in $\ell^{p}$ . We now have all the tools we need to address the strictness of Inequalities (1.12)-(1.13).

Theorem 3.3.

Let $A,B\in\mathbf{P}_{n}$ , and $1<p\leq 2$ . Then for any $C\in\mathbf{P}_{n}$ we have

\frac{\delta_{p}(A,C)^{2}+\delta_{p}(B,C)^{2}}{2}\geq\delta_{p}(A\#B,C)^{2}+\frac{p-1}{4}\delta_{p}(A,B)^{2}.

(3.2)

Letting $C=I$ , then

\frac{\delta_{p}(A,I)^{2}+\delta_{p}(B,I)^{2}}{2}\geq\delta_{p}(A\#B,I)^{2}+\frac{p-1}{4}\delta_{p}(A,B)^{2}.

(3.3)

In particular, for $A,B\in E_{p}$ , we have

1-\delta_{p}(A\#B,I)\geq\frac{p-1}{8}\delta_{p}(A,B)^{2}.

(3.4)

The constant $\frac{p-1}{4}$ of Equation (3.3) (and consequently $\frac{p-1}{8}$ of Equation (3.4)) is ideal, but there is only equality when $p=2$ with $[A,B]=0$ . In general, the inequalities are strict unless $A,B$ , and $C$ $\Gamma$ -commute or $A$ and $B$ commute respectively.

Proof.

We use a very similar method to [5]. Let $A,B,C\in\mathbf{P}_{n}$ . As geodesic distance is invariant to conjugation, letting $M=A\#B$ , we can define $\tilde{A}=M^{-1/2}AM^{-1/2}$ , $\tilde{B}=M^{-1/2}BM^{-1/2}$ , $\tilde{C}=M^{-1/2}CM^{-1/2}$ , and then noting that $\tilde{A}\#\tilde{B}=I$ and hence $\log(\tilde{B})=-\log(\tilde{A})$ , we can write all of the following geodesic distance relationships

\delta_{p}(M,C)=\delta_{p}(\tilde{C},I)=||\log(\tilde{C})||_{p}

(3.5)

\delta_{p}(A,B)=\delta_{p}(\tilde{A},\tilde{B})=||2\log(\tilde{A})||_{p}

(3.6)

\delta_{p}(A,C)=\delta_{p}(\tilde{A},\tilde{C})\geq||\log(\tilde{A})-\log(\tilde{C})||_{p}

(3.7)

\delta_{p}(B,C)=\delta_{p}(\tilde{B},\tilde{C})\geq||\log(\tilde{B})-\log(\tilde{C})||_{p}=||\log(\tilde{A})+\log(\tilde{C})||_{p}

(3.8)
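For completeness, the two facts $\tilde{A}\#\tilde{B}=I$ and $\log(\tilde{B})=-\log(\tilde{A})$ can be verified as follows. This is a sketch relying on the standard congruence property $(XAX^{\ast})\#(XBX^{\ast})=X(A\#B)X^{\ast}$ for invertible $X$ , which we assume here without proof. Taking $X=M^{-1/2}$ ,

\tilde{A}\#\tilde{B}=M^{-1/2}(A\#B)M^{-1/2}=I.

Writing out the midpoint as $I=\tilde{A}^{1/2}\left(\tilde{A}^{-1/2}\tilde{B}\tilde{A}^{-1/2}\right)^{1/2}\tilde{A}^{1/2}$ then gives

\left(\tilde{A}^{-1/2}\tilde{B}\tilde{A}^{-1/2}\right)^{1/2}=\tilde{A}^{-1}\implies\tilde{A}^{-1/2}\tilde{B}\tilde{A}^{-1/2}=\tilde{A}^{-2}\implies\tilde{B}=\tilde{A}^{-1},

so that $\log(\tilde{B})=-\log(\tilde{A})$ and $\delta_{p}(\tilde{A},\tilde{B})=||\log(\tilde{A}^{-2})||_{p}=||2\log(\tilde{A})||_{p}$ .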

Applying the known 2-uniform convexity inequality for matrices [3]

\frac{||X+Y||_{p}^{2}+||X-Y||_{p}^{2}}{2}\geq||X||_{p}^{2}+(p-1)||Y||_{p}^{2}\qquad\qquad 1\leq p\leq 2

(3.9)

we have

\frac{||\log(\tilde{A})+\log(\tilde{C})||_{p}^{2}+||\log(\tilde{A})-\log(\tilde{C})||_{p}^{2}}{2}\geq||\log(\tilde{C})||_{p}^{2}+(p-1)||\log(\tilde{A})||_{p}^{2}

(3.10)

and hence

\frac{\delta_{p}(A,C)^{2}+\delta_{p}(B,C)^{2}}{2}-\frac{p-1}{4}\delta_{p}(A,B)^{2}\geq\delta_{p}(A\#B,C)^{2}

(3.11)

Choosing $C=I$ and letting $A,B\in E_{p}$ , we see

1-\delta_{p}(A\#B,I)\geq\frac{p-1}{8}\delta_{p}(A,B)^{2}

(3.12)
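The passage from the squared inequality to (3.12) uses one elementary step that may be worth recording. Writing $d=\delta_{p}(A\#B,I)$ (a shorthand introduced only for this remark) and using $\delta_{p}(A,I)=\delta_{p}(B,I)=1$ in (3.11) with $C=I$ ,

1-d^{2}\geq\frac{p-1}{4}\delta_{p}(A,B)^{2}\geq 0,

so in particular $0\leq d\leq 1$ . Then

1-d=\frac{1-d^{2}}{1+d}\geq\frac{1-d^{2}}{2}\geq\frac{p-1}{8}\delta_{p}(A,B)^{2}.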

To consider the sharpness of constants and the equality case, we note that by Theorem 2.3, Inequalities (3.7) and (3.8) are strict unless $\tilde{A}$ , $\tilde{B}$ , and $\tilde{C}$ commute. This gives our $\Gamma$ -commuting and commuting requirements for equality. Therefore the question is reduced to that of 2-uniform convexity in $\ell^{p}$ , where the constant $(p-1)$ is ideal. As the 2-uniform convexity constant can be seen as a second order expansion of $p$ -uniform convexity and the ideal constant from Hanner’s inequality, there is no equality until $p=2$ , when the inequality agrees with Hanner’s inequality; then the only inequalities involved in the expression are those of Lines (3.7) and (3.8), which by Theorem 2.3 are equalities if and only if $[A,B]=0$ . ∎

Theorem 3.4.

Let $A,B\in\mathbf{P}_{n}$ , and $p\geq 2$ . Then for any $C\in\mathbf{P}_{n}$ we have

\frac{\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}}{2}\geq 2^{-p}\delta_{p}(A,B)^{p}+\delta_{p}(A\#B,C)^{p}

(3.13)

Letting $C=I$ , then

\frac{\delta_{p}(A,I)^{p}+\delta_{p}(B,I)^{p}}{2}\geq 2^{-p}\delta_{p}(A,B)^{p}+\delta_{p}(A\#B,I)^{p}

(3.14)

In particular, for $A,B\in E_{p}$ , we have

1-\delta_{p}(A\#B,I)^{p}\geq 2^{-p}\delta_{p}(A,B)^{p}

(3.15)

All the inequalities are strict unless $A,B$ , and $C$ $\Gamma$ -commute or $A$ and $B$ commute respectively.

Proof.

Using the distance formulations and conjugation of Equations (3.5)-(3.8) and now the Clarkson–McCarthy inequalities, we have

\frac{\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}}{2}\geq\frac{||\log(\tilde{A})+\log(\tilde{C})||_{p}^{p}+||\log(\tilde{A})-\log(\tilde{C})||_{p}^{p}}{2}

(3.16)

\geq||\log(\tilde{A})||_{p}^{p}+||\log(\tilde{C})||_{p}^{p}

(3.17)

=2^{-p}\delta_{p}(A,B)^{p}+\delta_{p}(M,C)^{p}

(3.18)

Once more, the inequalities of Equations (3.7) and (3.8) are strict unless $A,B$ , and $C$ $\Gamma$ -commute or $A$ and $B$ commute in the choice of $C=I$ . ∎

For completeness, we also add the following theorem addressing $p$ -uniform convexity for $1<p\leq 2$ , which does not appear in other literature:

Theorem 3.5.

Let $A,B\in\mathbf{P}_{n}$ , and $1<p\leq 2$ . Then for any $C\in\mathbf{P}_{n}$ we have

\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}\geq\frac{\delta_{p}(A,B)^{p}+2^{p}\delta_{p}(A\#B,C)^{p}}{2}

(3.19)

Letting $C=I$ , then

\delta_{p}(A,I)^{p}+\delta_{p}(B,I)^{p}\geq\frac{\delta_{p}(A,B)^{p}+2^{p}\delta_{p}(A\#B,I)^{p}}{2}

(3.20)

In particular, for $A,B\in E_{p}$ , we have

1-2^{p-2}\delta_{p}(A\#B,I)^{p}\geq\frac{\delta_{p}(A,B)^{p}}{4}

(3.21)

All the inequalities are strict unless $A,B$ , and $C$ $\Gamma$ -commute or $A$ and $B$ commute respectively.

Proof.

We once again use the distance formulations and conjugation of Equations (3.5)-(3.8) and the Clarkson–McCarthy inequalities to generate the inequality

\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}\geq||\log(\tilde{A})+\log(\tilde{C})||_{p}^{p}+||\log(\tilde{A})-\log(\tilde{C})||_{p}^{p}

(3.22)

\geq 2^{p-1}\left(||\log(\tilde{A})||_{p}^{p}+||\log(\tilde{C})||_{p}^{p}\right)

(3.23)

=\frac{\delta_{p}(A,B)^{p}}{2}+2^{p-1}\delta_{p}(M,C)^{p}

(3.24)

Once more, Theorem 2.3 tells us the $\Gamma$ -commuting and commuting requirements.
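As a consistency check on the constants (a remark added for illustration), set $C=I$ and take $A,B\in E_{p}$ in (3.24), so that the left-hand side of (3.22) equals $2$ :

2\geq\frac{\delta_{p}(A,B)^{p}}{2}+2^{p-1}\delta_{p}(A\#B,I)^{p}.

Dividing by $2$ and rearranging gives (3.21). At $p=2$ this reads

1-\delta_{2}(A\#B,I)^{2}\geq\frac{\delta_{2}(A,B)^{2}}{4},

which coincides with the $p=2$ case of (3.3) for $A,B\in E_{2}$ and with the $p=2$ case of (3.15).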

We note that the constants are not ideal, as for $1\leq p\leq\frac{4}{3}$ , in Line (3.23) the stronger inequality to use would be

\text{[Line (3.22)]}\geq\left(||\log(\tilde{A})||_{p}+||\log(\tilde{C})||_{p}\right)^{p}+\left|\,||\log(\tilde{A})||_{p}-||\log(\tilde{C})||_{p}\right|^{p}

(3.25)

=\left(\frac{1}{2}\delta_{p}(A,B)+\delta_{p}(A\#B,C)\right)^{p}+\left|\frac{1}{2}\delta_{p}(A,B)-\delta_{p}(A\#B,C)\right|^{p}

(3.26)

This is the matrix form of Hanner’s inequality, and is conjectured to hold for the full regime of $1\leq p\leq 2$ , but has not yet been proven outside of the given range except in the $p=\frac{3}{2}$ case [10]. ∎

References

[1] Ando, T., Hiai, F.: Log majorization and complementary Golden-Thompson type inequalities. Linear Algebra and its Applications 197-198, 113 – 131 (1994). DOI https://doi.org/10.1016/0024-3795(94)90484-7. URL http://www.sciencedirect.com/science/article/pii/0024379594904847
[2] Araki, H.: On an inequality of Lieb and Thirring. Letters in Mathematical Physics 19(2), 167–170 (1990). DOI 10.1007/BF01045887. URL https://doi.org/10.1007/BF01045887
[3] Ball, K., Carlen, E.A., Lieb, E.H.: Sharp uniform convexity and smoothness inequalities for trace norms. Inventiones mathematicae 115(1), 463–482 (1994). DOI 10.1007/BF01231769. URL https://doi.org/10.1007/BF01231769
[4] Bhatia, R.: On the exponential metric increasing property. Linear Algebra and its Applications 375, 211–220 (2003). DOI https://doi.org/10.1016/S0024-3795(03)00647-5. URL https://www.sciencedirect.com/science/article/pii/S0024379503006475
[5] Bhatia, R., Holbrook, J.: Riemannian geometry and matrix geometric means. Linear Algebra and its Applications 413(2), 594–618 (2006). DOI https://doi.org/10.1016/j.laa.2005.08.025. URL https://www.sciencedirect.com/science/article/pii/S0024379505004350. Special Issue on the 11th Conference of the International Linear Algebra Society, Coimbra, 2004
[6] Chayes, V.: Interpolation of operators with trace inequalities related to the positive weighted geometric mean. arXiv preprint arXiv:1906.07833 (2019)
[7] Chayes, V.: Reverse Hölder, Minkowski, and Hanner inequalities for matrices. arXiv e-prints pp. arXiv–2103 (2021)
[8] Chayes, V.M.: Matrix rearrangement inequalities revisited. Mathematical Inequalities and Applications 24(2), 431–444 (2021). DOI dx.doi.org/10.7153/mia-2021-24-30
[9] Conde, C.: Nonpositive curvature in p-Schatten class. Journal of Mathematical Analysis and Applications 356(2), 664–673 (2009). DOI https://doi.org/10.1016/j.jmaa.2009.03.036. URL https://www.sciencedirect.com/science/article/pii/S0022247X09002315
[10] Heinävaara, O.: Planes in Schatten-3. arXiv preprint arXiv:2207.12812 (2022)
[11] Hiai, F.: Equality cases in matrix norm inequalities of Golden-Thompson type. Linear and Multilinear Algebra 36(4), 239–249 (1994). DOI 10.1080/03081089408818297. URL https://doi.org/10.1080/03081089408818297
[12] Hiai, F.: Log-majorization related to Rényi divergences. Linear Algebra and its Applications 563, 255 – 276 (2019). DOI https://doi.org/10.1016/j.laa.2018.11.004. URL http://www.sciencedirect.com/science/article/pii/S0024379518305275
[13] Hiai, F., Petz, D.: The Golden-Thompson trace inequality is complemented. Linear Algebra and its Applications 181, 153–185 (1993). DOI 10.1016/0024-3795(93)90029-N
[14] Pusz, W., Woronowicz, S.: Functional calculus for sesquilinear forms and the purification map. Reports on Mathematical Physics 8(2), 159 – 170 (1975). DOI https://doi.org/10.1016/0034-4877(75)90061-0. URL http://www.sciencedirect.com/science/article/pii/0034487775900610
[15] Skovgaard, L.T.: A Riemannian geometry of the multivariate normal model. Scandinavian Journal of Statistics 11(4), 211–223 (1984)