This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Note On Convexity Inequalities Of Weighted Matrix Geometric Means

Victoria M. Chayes
Department of Mathematics
Rutgers University
Piscataway, NJ 08854
[email protected]
Abstract

We offer a new proof of uniform convexity inequalities for the Finsler manifold of nonpositive curvature taken on the space of positive-semidefinite matrices with the weighted matrix geometric mean defining the geodesic between two points. Using the technique of log majorization, we are able to characterize that the equality cases of said equalities occur if and only if the matrices commute, and hence are the same as in p\ell^{p}.

Keywords Matrix Geometric Mean  \cdot Uniform Convexity  \cdot pp-Schatten Norms  \cdot Log Majorization

1 Introduction

…….We can consider the space of positive n×nn\times n matrix as a Reimannian metric with geodesic distance between A,B𝐏n×n()A,B\in\mathbf{P}_{n\times n}(\mathbb{C}) of

infL(γ);L(γ):=abγ(t)1/2γ(t)γ(t)1/22𝑑t,\inf L(\gamma);\;\;\;L(\gamma):=\int_{a}^{b}||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||_{2}dt, (1.1)

with γ:[a,b]𝐏n×n()\gamma:[a,b]\rightarrow\mathbf{P}_{n\times n}(\mathbb{C}) being a smooth curve between AA and BB and L(γ)L(\gamma) its arc length.

…….This space was first introduced in Skovgaard [15] for its applications in statistics, but is of general interest as it is a quintessential example of a metric space with non-positive curvature. The corresponding unit time geodesic distance

δ2(A,B)=inf{01||γ(t)1/2γ(t)γ(t)1/2||2dt,γ(0)=A,γ(1)=B}\delta_{2}(A,B)=\inf\left\{\int_{0}^{1}||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||_{2}\;dt,\;\;\;\gamma(0)=A,\;\gamma(1)=B\right\} (1.2)

is reached uniquely by the path

γ(t)=A12(A12B12A12)tA12\gamma(t)=A^{\frac{1}{2}}\left(A^{-\frac{1}{2}}B^{\frac{1}{2}}A^{-\frac{1}{2}}\right)^{t}A^{\frac{1}{2}} (1.3)

with midpoint

A#B=A12(A12B12A12.)12A12.A\#B=A^{\frac{1}{2}}\left(A^{-\frac{1}{2}}B^{\frac{1}{2}}A^{-\frac{1}{2}}.\right)^{\frac{1}{2}}A^{\frac{1}{2}}. (1.4)

…….This midpoint is the ‘geometric mean’ of matrices AA and BB, introduced by Pusz and Woronowicz [14] as a way of generalizing xy\sqrt{xy} to sesquilinear forms, and the location along the geodesic for t[0,1]t\in[0,1] is the ‘weighted geometric mean’, which has been studied in great detail with respect to its relation to the Golden-Thompson inequality, quantum entropy, relative quantum entropy, and Rényi divergences [1] [2] [6] [12] [13] .

…….Distance in this metric is invariant under conjugation, as

|(Xγ(t)X)1/2(Xγ(t)X)(Xγ(t)X)1/2|=|γ(t)1/2γ(t)γ(t)1/2||||(X\gamma(t)X^{\ast})^{-1/2}(X\gamma(t)X^{\ast})^{\prime}(X\gamma(t)X^{\ast})^{-1/2}|||=|||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||| (1.5)

for any untitarily invariant norm |||||||||\cdot|||, including ||||2||\cdot||_{2}. When AA and BB commute, we have

γ(t)=A1tBt\gamma(t)=A^{1-t}B^{t} (1.6)

and

δ2(A,B)=log(A12B12A12)2=log(A)log(B)2;\delta_{2}(A,B)=||\log(A^{-\frac{1}{2}}B^{\frac{1}{2}}A^{-\frac{1}{2}})||_{2}=||\log(A)-\log(B)||_{2}; (1.7)

and in general,

δ2(A,B)=log(A12B12A12)2log(A)log(B)2.\delta_{2}(A,B)=||\log(A^{-\frac{1}{2}}B^{\frac{1}{2}}A^{-\frac{1}{2}})||_{2}\geq||\log(A)-\log(B)||_{2}. (1.8)

…….However, instead of taking the Hilbert-Schmidt norm, arc length and respectively geodesic distance can be defined with the pp-Schatten norm with 1<p<1<p<\infty111The p=1p=1 case can be defined in the same manner, but the weighted geometric mean is no longer the unique geodesic. of

L(γ):=abγ(t)1/2γ(t)γ(t)1/2p𝑑t,L(\gamma):=\int_{a}^{b}||\gamma(t)^{-1/2}\gamma^{\prime}(t)\gamma(t)^{-1/2}||_{p}\;dt, (1.9)

now producing a Finsler manifold of non-positive curvature. Equation 1.3 is still the unique geodesic between two matrices AA and BB [4], but now with distance

δp(A,B)=log(A12B12A12)p.\delta_{p}(A,B)=||\log(A^{-\frac{1}{2}}B^{\frac{1}{2}}A^{-\frac{1}{2}})||_{p}. (1.10)

Note distance is still invariant under conjugation, and a number of properties of the geodesic are preserved. Full properties of this Finsler manifold are explored in [4], both using techniques involving derivatives of the exponential map, and using log majorization.

…….Two significant advances in work have been done recently: in [9], Conde looks at the generalized Finsler manifold (ie 1<p<1<p<\infty) and uses Clarkson–McCarthy inequalities

2(App+Bpp)A+Bpp+ABpp2p1(App+Bpp)2\left(||A||_{p}^{p}+||B||_{p}^{p}\right)\leq||A+B||_{p}^{p}+||A-B||_{p}^{p}\leq 2^{p-1}\left(||A||_{p}^{p}+||B||_{p}^{p}\right) (1.11)

for 2p<2\leq p<\infty and reversing for 1p21\leq p\leq 2 to derive rr-uniform convexity inequalities as defined in [3]; most significantly, Conde establishes

δp(A#B,C)2\displaystyle\delta_{p}(A\#B,C)^{2} 12δp(A,C)2+12δp(B,C)2p14δp(A,B)21<p2\displaystyle\leq\frac{1}{2}\delta_{p}(A,C)^{2}+\frac{1}{2}\delta_{p}(B,C)^{2}-\frac{p-1}{4}\delta_{p}(A,B)^{2}\qquad\qquad 1<p\leq 2 (1.12)
δp(A#B,C)p\displaystyle\delta_{p}(A\#B,C)^{p} 12δp(A,C)p+12δp(B,C)p12pδp(A,B)p     2p\displaystyle\leq\frac{1}{2}\delta_{p}(A,C)^{p}+\frac{1}{2}\delta_{p}(B,C)^{p}-\frac{1}{2^{p}}\delta_{p}(A,B)^{p}\qquad\qquad\;\;\;\;\;2\leq p (1.13)

Then in [5], Bhatia and Holbrook look to define a geometric mean for three positive matrices and analyze this in terms of convex hulls for the Reimannian metric (p=2p=2), in particular using a different method of proof relying on log majorization and conjugation invariance of the distance.

…….This paper applies an additional note to these results: using a the methodology of Bhatia and Holbrook, we can show that all inequalities must be strict unless the matrices involve Γ\Gamma-commute, reducing the equality cases of these inequalities to those in p\ell^{p} for both two-matrix and three-matrix inequalities. This comes from a fairly simple but vital theorem: the inequality of Equation (1.8) is an equality if and only if AA and BB commute. In Section 2 we introduce the technique log majorization to prove this theorem, and in Section 3 we use it prove the equality cases of Equations (1.12)-(1.13).

…….We note that the equality case is of particular interest, because Equations (1.12)-(1.13) are used to establish coefficients such as the modulus of convexity of the space being considered. The coefficients in p\ell^{p} are known to be the best coefficients. Showing that inequalities are sharp except in commuting cases shows that for these matrix spaces, non-commutativity always makes things ‘worse’. This is notable, given that there are some inequalities (ie singular value rearrangement inequalities in [8]) where inequalities between positive-semidefinite matrices AA and {UBU:U unitary}\{UBU^{\ast}\colon U\text{ unitary}\} are not minimized when AA and UBUUBU^{\ast} commute.

2 Log Majorization And Equality Cases

…….Let 𝐚,𝐛n\mathbf{a},\mathbf{b}\in\mathbb{R}^{n} with components labeled in descending order a1ana_{1}\geq\dots\geq a_{n} and b1bnb_{1}\geq\dots\geq b_{n}. Then 𝐛\mathbf{b} weakly majorizes 𝐚\mathbf{a}, written 𝐚w𝐛\mathbf{a}\prec_{w}\bf{b}, when

i=1kaii=1kbi,1kn\sum_{i=1}^{k}a_{i}\leq\sum_{i=1}^{k}b_{i},\qquad 1\leq k\leq n (2.1)

and majorizes 𝐚𝐛\mathbf{a}\prec\mathbf{b} when the final inequality is an equality. Weak log majorization 𝐚w(log)𝐛\mathbf{a}\prec_{w(\log)}\mathbf{b} is similarly defined for non-negative vectors as

i=1kaii=1kbi,1kn\prod_{i=1}^{k}a_{i}\leq\prod_{i=1}^{k}b_{i},\qquad 1\leq k\leq n (2.2)

with log majorization 𝐚(log)𝐛\mathbf{a}\prec_{(\log)}\mathbf{b} when the final inequality is an equality. Majorization applied to matrices AA and BB is understood to be applied to the vectors of their singular values respectively, and is a vital tool for proving inequalities with respect to any unitarily invariant norms.

…….An important relationship between majorization and log majorization was established in [11] (Lemma 2.2):

Lemma 2.1.

Let 𝐚,𝐛+n\mathbf{a},\mathbf{b}\in\mathbb{R}_{+}^{n} with 𝐚(log)𝐛\mathbf{a}_{(\log)}\mathbf{b}. Suppose there exists a stricly convex function Φ:\Phi:\mathbb{R}\rightarrow\mathbb{R} such that i=1nΦ(ai)=i=1nΦ(bi)\sum_{i=1}^{n}\Phi(a_{i})=\sum_{i=1}^{n}\Phi(b_{i}). Then 𝐚=Θ𝐛\mathbf{a}=\Theta\mathbf{b} for some permutation matrix Θ\Theta.

…….This was extended in [7](Theorem 2.3) to general strictly convex functions:

Lemma 2.2.

Let 𝐚,𝐛n\mathbf{a},\mathbf{b}\in\mathbb{R}^{n} with 𝐚𝐛\mathbf{a}\prec\mathbf{b}. Suppose there exists a stricly convex function Φ:\Phi:\mathbb{R}\rightarrow\mathbb{R} such that i=1nΦ(ai)=i=1nΦ(bi)\sum_{i=1}^{n}\Phi(a_{i})=\sum_{i=1}^{n}\Phi(b_{i}). Then 𝐚=Θ𝐛\mathbf{a}=\Theta\mathbf{b} for some permutation matrix Θ\Theta.

…….This, along with some known majorization results on matrices with equality cases from [11], gives us the tools we need to prove the equality conditions:

Theorem 2.3.

Let A,B𝐏nA,B\in\mathbf{P}_{n}, and suppose [A,B]0[A,B]\neq 0 and p>1p>1. Then

δp(A,B)>log(A)log(B)p\delta_{p}(A,B)>||\log(A)-\log(B)||_{p} (2.3)
Proof.

By [11] (Theorem 3.1), for any two Hermitian matrices H,KH,K, the majorization inequality

λ(H+K)λ(log(eK/2eHeK/2))\lambda(H+K)\prec\lambda\left(\log\left(e^{K/2}e^{H}e^{K/2}\right)\right) (2.4)

holds. Writing H=log(A)H=\log(A), K=log(B)K=-\log(B) for positive matrices A,BA,B, we rewrite Equation 2.4 as

λ(log(A)log(B))λ(log(B1/2AB1/2))\lambda(\log(A)-\log(B))\prec\lambda\left(\log\left(B^{-1/2}AB^{-1/2}\right)\right) (2.5)

and as x|x|px\mapsto|x|^{p} is strictly convex for p>1p>1, then by Lemma 2.2 the equality case of

δp(A,B)=log(A)log(B)p\delta_{p}(A,B)=||\log(A)-\log(B)||_{p} (2.6)

implies that λ(log(A)log(B))=λ(log(B1/2AB1/2))\lambda(\log(A)-\log(B))=\lambda\left(\log\left(B^{-1/2}AB^{-1/2}\right)\right). Then by [11] (Corollary 2.4), HH and KK and hence AA and BB must commute. ∎

3 Convexity Implications

…….We will make use of the concept of Γ\Gamma-commuting for equality cases as established in [5]:

Definition 3.1.

Let ΓX(A)=XAX\Gamma_{X}(A)=XAX^{\ast} for any XMn×n()X\in M_{n\times n}(\mathbb{C}). Then A,B,C𝐏n×n()A,B,C\in\mathbf{P}_{n\times n}(\mathbb{C}) Γ\Gamma-commute if there exists some XMn×n()X\in M_{n\times n}(\mathbb{C}) such that ΓX(A),ΓX(B)\Gamma_{X}(A),\Gamma_{X}(B), and ΓX(C)\Gamma_{X}(C) all commute.

with the following equivalent conditions:

Lemma 3.2.

A,B,C𝐏n×n()A,B,C\in\mathbf{P}_{n\times n}(\mathbb{C}) Γ\Gamma-commute if and only if AB1C=CB1AAB^{-1}C=CB^{-1}A if and only if [A12BA12,A12CA12]=0[A^{-\frac{1}{2}}BA^{-\frac{1}{2}},A^{-\frac{1}{2}}CA^{-\frac{1}{2}}]=0.

Note that these conditions clearly imply the reduction to standard commuting when C=IC=I; ie A,B,IA,B,I Γ\Gamma-commute if and only if [A,B]=0[A,B]=0.

…….We also introduce notation for the ‘exponential unit sphere’

Ep={U:U𝐏n,δp(U,I)=1},E_{p}=\left\{U\colon\;\;U\in\mathbf{P}_{n},\;\delta_{p}(U,I)=1\right\}, (3.1)

as elements of the unit sphere appear in the traditional convexity inequalities we want to explore; we also state the inequalities in their more traditional form in p\ell^{p}. We now have all the tools we need to address the strictness of Inequalities (1.12)-(1.13).

Theorem 3.3.

Let A,B𝐏nA,B\in\mathbf{P}_{n}, and 1<p21<p\leq 2. Then for any C𝐏nC\in\mathbf{P}_{n} we have

δp(A,C)2+δp(B,C)22δp(A#B,C)2+p14δp(A,B)2.\frac{\delta_{p}(A,C)^{2}+\delta_{p}(B,C)^{2}}{2}\geq\delta_{p}(A\#B,C)^{2}+\frac{p-1}{4}\delta_{p}(A,B)^{2}. (3.2)

Letting C=IC=I, then

δp(A,I)2+δp(B,I)22δp(A#B,I)2+p14δp(A,B)2.\frac{\delta_{p}(A,I)^{2}+\delta_{p}(B,I)^{2}}{2}\geq\delta_{p}(A\#B,I)^{2}+\frac{p-1}{4}\delta_{p}(A,B)^{2}. (3.3)

In particular, for A,BEpA,B\in E_{p}, we have

1δp(A#B,I)p18δp(A,B)2.1-\delta_{p}(A\#B,I)\geq\frac{p-1}{8}\delta_{p}(A,B)^{2}. (3.4)

The constant p14\frac{p-1}{4} of Equation (3.3) (and consequently p18\frac{p-1}{8} of Equation (3.4)) is ideal, but there is only equality when p=2p=2 with [A,B]=0[A,B]=0. In general, the inequalities are strict unless A,BA,B, and CC Γ\Gamma-commute or AA and BB commute respectively.

Proof.

We use a very similar method to [5]. Let A,B,C𝐏nA,B,C\in\mathbf{P}_{n}. As geodesic distance is invariant to conjugation, letting M=A#BM=A\#B, we can define A~=M1/2AM1/2\tilde{A}=M^{-1/2}AM^{-1/2}, B~=M1/2BM1/2\tilde{B}=M^{-1/2}BM^{-1/2}, C~=M1/2CM1/2\tilde{C}=M^{-1/2}CM^{-1/2}, and then noting that A~#B~=I\tilde{A}\#\tilde{B}=I and hence log(B~)=log(A~)\log(\tilde{B})=-\log(\tilde{A}), we can write all of the following geodesic distance relationships

δp(M,C)\displaystyle\delta_{p}(M,C) =δp(C~,I)=log(C~)p\displaystyle=\delta_{p}(\tilde{C},I)=||\log(\tilde{C})||_{p} (3.5)
δp(A,B)\displaystyle\delta_{p}(A,B) =δp(A~,B~)=2log(A~)p\displaystyle=\delta_{p}(\tilde{A},\tilde{B})=||2\log(\tilde{A})||_{p} (3.6)
δp(A,C)\displaystyle\delta_{p}(A,C) =δp(A~,C~)log(A~)log(C~)p\displaystyle=\delta_{p}(\tilde{A},\tilde{C})\geq||\log(\tilde{A})-\log(\tilde{C})||_{p} (3.7)
δp(B,C)\displaystyle\delta_{p}(B,C) =δp(B~,C~)log(B~)log(C~)p=log(A~)+log(C~)p\displaystyle=\delta_{p}(\tilde{B},\tilde{C})\geq||\log(\tilde{B})-\log(\tilde{C})||_{p}=||\log(\tilde{A})+\log(\tilde{C})||_{p} (3.8)

Applying the known 2-uniform convexity inequality for matrices [3]

X+Yp2+XYp22Xp2+(p1)Yp21p2\frac{||X+Y||_{p}^{2}+||X-Y||_{p}^{2}}{2}\geq||X||_{p}^{2}+(p-1)||Y||_{p}^{2}\qquad\qquad 1\leq p\leq 2 (3.9)

we have

log(A~)+log(C~)p2+log(A~)log(C~)p22log(C~)p2+(p1)log(A~)p2\frac{||\log(\tilde{A})+\log(\tilde{C})||_{p}^{2}+||\log(\tilde{A})-\log(\tilde{C})||_{p}^{2}}{2}\geq||\log(\tilde{C})||_{p}^{2}+(p-1)||\log(\tilde{A})||_{p}^{2} (3.10)

and hence

δp(A,C)2+δp(B,C)22p14δp(A,B)2δp(A#B,C)2\frac{\delta_{p}(A,C)^{2}+\delta_{p}(B,C)^{2}}{2}-\frac{p-1}{4}\delta_{p}(A,B)^{2}\geq\delta_{p}(A\#B,C)^{2} (3.11)

Choosing C=IC=I and letting A,BEpA,B\in E_{p}, we see

1δp(A#B,I)p18δp(A,B)21-\delta_{p}(A\#B,I)\geq\frac{p-1}{8}\delta_{p}(A,B)^{2} (3.12)

…….To consider the sharpness of constancts and equality case, we note by Lemma 2.3, the Inequality (3.7) is strict unless A~\tilde{A}, B~\tilde{B}, and C~\tilde{C} commute. This gives our Γ\Gamma-commuting and commuting requirements for equality. Therefore the question is reduced to that of 2-uniform convexity in p\ell^{p}, where the constant (p1)(p-1) is ideal. As the 2-uniform convexity constant can be seen as a second order expansion of p-uniform convexity and the ideal constant from Hanner’s inequality, there is no equality until p=2p=2, when the inequality agrees with Hanner’s inequality; then the only inequalities inolved in the expression are of Lines (3.7) and (3.8), which by Lemma 2.3 are strict if and only if [A,B]=0[A,B]=0. ∎

Theorem 3.4.

Let A,B𝐏nA,B\in\mathbf{P}_{n}, and p2p\geq 2. Then for any C𝐏nC\in\mathbf{P}_{n} we have

δp(A,C)p+δp(B,C)p22pδp(A,B)p+δp(A#B,C)p\frac{\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}}{2}\geq 2^{-p}\delta_{p}(A,B)^{p}+\delta_{p}(A\#B,C)^{p} (3.13)

Letting C=IC=I, then

δp(A,I)p+δp(B,I)p22pδp(A,B)p+δp(A#B,I)p\frac{\delta_{p}(A,I)^{p}+\delta_{p}(B,I)^{p}}{2}\geq 2^{-p}\delta_{p}(A,B)^{p}+\delta_{p}(A\#B,I)^{p} (3.14)

In particular, for A,BEpA,B\in E_{p}, we have

1δp(A#B,I)p2pδp(A,B)p1-\delta_{p}(A\#B,I)^{p}\geq 2^{-p}\delta_{p}(A,B)^{p} (3.15)

All the inequalities are strict unless A,BA,B, and CC Γ\Gamma-commute or AA and BB commute respectively.

Proof.

Using the distance formulations and conjugation of Equations (3.5)-(3.8) and now the Clarkson–McCarthy inequalities, we have

δp(A,C)p+δp(B,C)p2\displaystyle\frac{\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}}{2} log(A~)+log(C~)pp+log(A~)log(C~)pp2\displaystyle\geq\frac{||\log(\tilde{A})+\log(\tilde{C})||_{p}^{p}+||\log(\tilde{A})-\log(\tilde{C})||_{p}^{p}}{2} (3.16)
log(A~)pp+log(C~)pp\displaystyle\geq||\log(\tilde{A})||_{p}^{p}+||\log(\tilde{C})||_{p}^{p} (3.17)
=2pδp(A,B)p+δp(M,C)p\displaystyle=2^{-p}\delta_{p}(A,B)^{p}+\delta_{p}(M,C)^{p} (3.18)

Once more, the inequalities of Equations (3.7) and (3.8) are strict unless A,BA,B, and CC Γ\Gamma-commute or AA and BB commute in the choice of C=IC=I. ∎

…….For completeness, we also add the following theorem addressing pp-uniform convexity for 1<p21<p\leq 2, which does not appear in other literature:

Theorem 3.5.

Let A,B𝐏nA,B\in\mathbf{P}_{n}, and 1<p21<p\leq 2. Then for any C𝐏nC\in\mathbf{P}_{n} we have

δp(A,C)p+δp(B,C)pδp(A,B)p+2pδp(A#B,C)p2\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p}\geq\frac{\delta_{p}(A,B)^{p}+2^{p}\delta_{p}(A\#B,C)^{p}}{2} (3.19)

Letting C=IC=I, then

δp(A,I)p+δp(B,I)pδp(A,B)p+2pδp(A#B,I)p2\delta_{p}(A,I)^{p}+\delta_{p}(B,I)^{p}\geq\frac{\delta_{p}(A,B)^{p}+2^{p}\delta_{p}(A\#B,I)^{p}}{2} (3.20)

In particular, for A,BEpA,B\in E_{p}, we have

12p2δp(A#B,I)pδp(A,B)p41-2^{p-2}\delta_{p}(A\#B,I)^{p}\geq\frac{\delta_{p}(A,B)^{p}}{4} (3.21)

All the inequalities are strict unless A,BA,B, and CC Γ\Gamma-commute or commute respectively.

Proof.

We once again use the distance formulations and conjugation of Equations (3.5)-(3.8) and the Clarkson–McCarthy inequalities to generate the inequality

δp(A,C)p+δp(B,C)p\displaystyle\delta_{p}(A,C)^{p}+\delta_{p}(B,C)^{p} log(A~)+log(C~)pp+log(A~)log(C~)pp\displaystyle\geq||\log(\tilde{A})+\log(\tilde{C})||_{p}^{p}+||\log(\tilde{A})-\log(\tilde{C})||_{p}^{p} (3.22)
2p1(log(A~)pp+log(C~)pp)\displaystyle\geq 2^{p-1}\left(||\log(\tilde{A})||_{p}^{p}+||\log(\tilde{C})||_{p}^{p}\right) (3.23)
=δp(A,B)p2+2p1δp(M,C)p\displaystyle=\frac{\delta_{p}(A,B)^{p}}{2}+2^{p-1}\delta_{p}(M,C)^{p} (3.24)

Once more, Lemma 2.3 tells us the Γ\Gamma-commuting and commuting requirements.

…….We note that the constants are not ideal, as for 1p431\leq p\leq\frac{4}{3}, in Line (3.23) the stronger inequality to use would be

[Line (3.22)] (log(A~)p+log(C~)p)p+|log(A~)plog(C~)p|p\displaystyle\geq\left(||\log(\tilde{A})||_{p}+||\log(\tilde{C})||_{p}\right)^{p}+\left|||\log(\tilde{A})||_{p}-||\log(\tilde{C})||_{p}\right|^{p} (3.25)
=(12δp(A,B)+δp(A#B,C))p+|12δp(A,B)δp(A#B,C)|p\displaystyle=\left(\frac{1}{2}\delta_{p}(A,B)+\delta_{p}(A\#B,C)\right)^{p}+\left|\frac{1}{2}\delta_{p}(A,B)-\delta_{p}(A\#B,C)\right|^{p} (3.26)

This is the matrix form of Hanner’s inequality, and is conjectured to hold for the full regime of 1p21\leq p\leq 2, but has not yet been proven outside of the given range except in the p=32p=\frac{3}{2} case [10]. ∎

References

  • [1] Ando, T., Hiai, F.: Log majorization and complementary Golden-Thompson type inequalities. Linear Algebra and its Applications 197-198, 113 – 131 (1994). DOI https://doi.org/10.1016/0024-3795(94)90484-7. URL http://www.sciencedirect.com/science/article/pii/0024379594904847
  • [2] Araki, H.: On an inequality of Lieb and Thirring. Letters in Mathematical Physics 19(2), 167–170 (1990). DOI 10.1007/BF01045887. URL https://doi.org/10.1007/BF01045887
  • [3] Ball, K., Carlen, E.A., Lieb, E.H.: Sharp uniform convexity and smoothness inequalities for trace norms. Inventiones mathematicae 115(1), 463–482 (1994). DOI 10.1007/BF01231769. URL https://doi.org/10.1007/BF01231769
  • [4] Bhatia, R.: On the exponential metric increasing property. Linear Algebra and its Applications 375, 211–220 (2003). DOI https://doi.org/10.1016/S0024-3795(03)00647-5. URL https://www.sciencedirect.com/science/article/pii/S0024379503006475
  • [5] Bhatia, R., Holbrook, J.: Riemannian geometry and matrix geometric means. Linear Algebra and its Applications 413(2), 594–618 (2006). DOI https://doi.org/10.1016/j.laa.2005.08.025. URL https://www.sciencedirect.com/science/article/pii/S0024379505004350. Special Issue on the 11th Conference of the International Linear Algebra Society, Coimbra, 2004
  • [6] Chayes, V.: Interpolation of operators with trace inequalities related to the positive weighted geometric mean. arXiv preprint arXiv:1906.07833 (2019)
  • [7] Chayes, V.: Reverse holder, minkowski, and hanner inequalities for matrices. arXiv e-prints pp. arXiv–2103 (2021)
  • [8] Chayes, V.M.: Matrix rearrangement inequalities revisited. Mathematical Inequalities and Applications 24(2), 431–444 (2021). DOI dx.doi.org/10.7153/mia-2021-24-30
  • [9] Conde, C.: Nonpositive curvature in p-schatten class. Journal of Mathematical Analysis and Applications 356(2), 664–673 (2009). DOI https://doi.org/10.1016/j.jmaa.2009.03.036. URL https://www.sciencedirect.com/science/article/pii/S0022247X09002315
  • [10] Heinävaara, O.: Planes in schatten-33. arXiv preprint arXiv:2207.12812 (2022)
  • [11] Hiai, F.: Equality cases in matrix norm inequalities of golden-thompson type. Linear and Multilinear Algebra 36(4), 239–249 (1994). DOI 10.1080/03081089408818297. URL https://doi.org/10.1080/03081089408818297
  • [12] Hiai, F.: Log-majorization related to Rényi divergences. Linear Algebra and its Applications 563, 255 – 276 (2019). DOI https://doi.org/10.1016/j.laa.2018.11.004. URL http://www.sciencedirect.com/science/article/pii/S0024379518305275
  • [13] Hiai, F., Petz, D.: The Golden-Thompson trace inequality is complemented. Linear Algebra and its Applications 181, 153–185 (1993). DOI 10.1016/0024-3795(93)90029-N
  • [14] Pusz, W., Woronowicz, S.: Functional calculus for sesquilinear forms and the purification map. Reports on Mathematical Physics 8(2), 159 – 170 (1975). DOI https://doi.org/10.1016/0034-4877(75)90061-0. URL http://www.sciencedirect.com/science/article/pii/0034487775900610
  • [15] Skovgaard, L.T.: A Riemannian geometry of the multivariate normal model. Scandinavian Journal of Statistics 11(4), 211–223 (1984)