This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A remark on the geometric interpretation of the A3w condition from optimal transport

Cale Rankin The Fields Institute for Research in Mathematical Sciences [email protected]
[email protected]
Abstract.

We provide a geometric interpretation of the well known A3w condition for regularity of optimal transport maps.

This research is supported by ARC DP 200101084 and the Fields Institute for Research in Mathematical Sciences.

1. Introduction

In optimal transport a condition known as A3w is necessary for regularity of the optimal transport map. Here we provide a geometric interpretation of A3w. We’ll use freely the notation from [4]. Let cC2(𝐑n×𝐑n)c\in C^{2}(\mathbf{R}^{n}\times\mathbf{R}^{n}) satisfy A1 and A2 (see §2). Keeping in mind the prototypical case c(x,y)=|xy|2c(x,y)=|x-y|^{2}, we fix x0,y0𝐑nx_{0},y_{0}\in\mathbf{R}^{n} and perform a linear transformation so cxy(x0,y0)=Ic_{xy}(x_{0},y_{0})=-I. Define coordinates

(1) q(x)\displaystyle q(x) :=cy(x,y0),\displaystyle:=-c_{y}(x,y_{0}),
(2) p(y)\displaystyle p(y) :=cx(x0,y),\displaystyle:=-c_{x}(x_{0},y),

and the inverse transformations by x(q),y(p)x(q),y(p). Write c(q,p)=c(x(q),y(p))c(q,p)=c(x(q),y(p)) and let q0=q(x0)q_{0}=q(x_{0}) and p0=p(y0)p_{0}=p(y_{0}). We prove A3w is satisfied if and only if whenever these transformations are performed

(qq0)(pp0)0\displaystyle(q-q_{0})\cdot(p-p_{0})\geq 0\implies c(q,p)+c(q0,p0)c(q,p0)+c(q0,p).\displaystyle c(q,p)+c(q_{0},p_{0})\leq c(q,p_{0})+c(q_{0},p).

Heuristically, A3w implies when qq0q-q_{0} “points in the same direction” as pp0p-p_{0} it is cheaper to transport qq to pp and q0q_{0} to p0p_{0} than the alternative qq to p0p_{0} and q0q_{0} to pp. Thus, A3w implies compatibility between directions in the cost-convex geometry and the cost of transport.

A3w first appeared (in a stronger form) in [4]. It was weakened in [6] and a new interpretation given in [2]. The impetus for the above interpretation is Lemma 2.1 in [1]. Our result can also be realised by a particular choice of c-convex function in the unpublished preprint [5].

Acknowledgements. My thanks to Jiakun Liu and Robert McCann for helpful comments and discussion.

2. Proof of result

Let cC2(𝐑n×𝐑n)c\in C^{2}(\mathbf{R}^{n}\times\mathbf{R}^{n}) satisfy following the well known conditions
A1. For each x0,y0𝐑nx_{0},y_{0}\in\mathbf{R}^{n} the following mappings are injective

xcy(x,y0),andycx(x0,y).\displaystyle x\mapsto c_{y}(x,y_{0}),\quad\text{and}\quad y\mapsto c_{x}(x_{0},y).

A2. For each x0,y0𝐑nx_{0},y_{0}\in\mathbf{R}^{n} we have detci,j(x0,y0)0\det c_{i,j}(x_{0},y_{0})\neq 0.

Here, and throughout, subscripts before a comma denote differentiation with respect to the first variable, subscripts after a comma denote differentiation with respect to the second variable.

By A1 we define on 𝒰:={(x,cx(x,y));x,y𝐑n}\mathcal{U}:=\{(x,c_{x}(x,y));x,y\in\mathbf{R}^{n}\} a mapping Y:𝒰𝐑nY:\mathcal{U}\rightarrow\mathbf{R}^{n} by

cx(x,Y(x,p))=p.c_{x}(x,Y(x,p))=p.

The A3w condition, usually expressed with fourth derivatives but written here as in [3], is the following.
A3w. Fix xx. The function

pcij(x,Y(x,p))ξiξj,p\mapsto c_{ij}(x,Y(x,p))\xi_{i}\xi_{j},

is concave along line segments orthogonal to ξ\xi.

To verify A3w it suffices to verify midpoint concavity, that is whenever ξη=0\xi\cdot\eta=0 there holds

(3) 0[cij(x,Y(x,p+η))2cij(x,Y(x,p))+cij(x,Y(x,pη))]ξiξj.0\geq[c_{ij}(x,Y(x,p+\eta))-2c_{ij}(x,Y(x,p))+c_{ij}(x,Y(x,p-\eta))]\xi_{i}\xi_{j}.

Finally, we recall A𝐑nA\subset\mathbf{R}^{n} is called cc-convex with respect to y0y_{0} provided cy(A,y0)c_{y}(A,y_{0}) is convex. When A3w is satisfied and y,y0𝐑ny,y_{0}\in\mathbf{R}^{n} are given the section {x𝐑n;c(x,y)>c(x,y0)}\{x\in\mathbf{R}^{n};c(x,y)>c(x,y_{0})\} is cc-convex with respect to y0y_{0} [3].

Now fix (x0,p0)𝒰(x_{0},p_{0})\in\mathcal{U} and y0=Y(x0,p0)y_{0}=Y(x_{0},p_{0}). To simplify the proof we assume x0,y0,q0,p0=0x_{0},y_{0},q_{0},p_{0}=0. Up to an affine transformation (replace yy with y~:=cxy(0,0)y\tilde{y}:=-c_{xy}(0,0)y) we assume cxy(0,0)=Ic_{xy}(0,0)=-I. Note with q,pq,p as defined in (1), (2), this implies qx(0)=I\frac{\partial q}{\partial x}(0)=I. Put

c~(x,y)\displaystyle\tilde{c}(x,y) :=c(x,y)c(x,0)c(0,y)+c(0,0),\displaystyle:=c(x,y)-c(x,0)-c(0,y)+c(0,0),
c¯(q,p)\displaystyle\overline{c}(q,p) :=c~(x(q),y(p)).\displaystyle:=\tilde{c}(x(q),y(p)).
Theorem 1.

The A3w condition is satisfied if and only if whenever the above transformations are applied the following implication holds

(4) qp0c¯(q,p)0.q\cdot p\geq 0\implies\overline{c}(q,p)\leq 0.
Proof.

Observe by a Taylor series

(5) c¯(q,p)=(qp)+c¯ij(τq,p)qiqj,\overline{c}(q,p)=-(q\cdot p)+\overline{c}_{ij}(\tau q,p)q_{i}q_{j},

for some τ(0,1)\tau\in(0,1). First, assume A3w and let qp>0q\cdot p>0. By (5) we have c¯(tq,p)>0>c¯(tq,p)\overline{c}(-tq,p)>0>\overline{c}(tq,p) for t>0t>0 sufficiently small. If c¯(q,p)>0\overline{c}(q,p)>0 then the cc-convexity (in our coordinates, convexity) of the section

{q;c¯(q,p)>c¯(q,0)=0},\{q\ ;\ \overline{c}(q,p)>\overline{c}(q,0)=0\},

is violated. By continuity c¯(q,p)0\overline{c}(q,p)\leq 0 whenever qp0q\cdot p\geq 0.

In the other direction, take nonzero qq with qp=0q\cdot p=0 and small tt. By (4) and (5)

0c¯(tq,p)/t2=c¯ij(tτq,p)qiqj.0\geq\overline{c}(tq,p)/t^{2}=\overline{c}_{ij}(t\tau q,p)q_{i}q_{j}.

This inequality also holds with p-p. Moreover c¯ij(tτq,0)=0\overline{c}_{ij}(t\tau q,0)=0. Thus

0[c¯ij(tτq,p)2c¯ij(tτq,0)+c¯ij(tτq,p)]qiqj.0\geq[\overline{c}_{ij}(t\tau q,p)-2\overline{c}_{ij}(t\tau q,0)+\overline{c}_{ij}(t\tau q,-p)]q_{i}q_{j}.

Sending t0t\rightarrow 0 and returning to our original coordinates we obtain (3). ∎

Remarks. (1) On a Riemannian manifold with c(x,y)=d(x,y)2c(x,y)=d(x,y)^{2}, for dd the distance function, Loeper [2] proved A3w implies nonnegative sectional curvature. Our result expedites his proof. Let x0=y0Mx_{0}=y_{0}\in M and u,vTx0Mu,v\in T_{x_{0}}M satisfy uv=0u\cdot v=0 with x=expx0(tu),y=expx0(tv)x=\exp_{x_{0}}(tu),y=\exp_{x_{0}}(tv). Working in a sufficiently small local coordinate chart our previous proof implies if A3w is satisfied

(6) d(x,y)2d(x0,y)2+d(x0,x)2=2t.d(x,y)^{2}\leq d(x_{0},y)^{2}+d(x_{0},x)^{2}=2t.

The sectional curvature in the plane generated by u,vu,v is the κ\kappa satisfying

(7) d(expx0(tu),expx0(tv))=2t(1κ12t2+O(t3)) as t0,d(\exp_{x_{0}}(tu),\exp_{x_{0}}(tv))=\sqrt{2}t\big{(}1-\frac{\kappa}{12}t^{2}+O(t^{3})\big{)}\text{ as }t\rightarrow 0,

whereby comparison with (6) proves the result (see [7, eq. 1] for (7)). We note Loeper proved his result using an infinitesimal version of (6).

References

  • [1] Shibing Chen and Xu-Jia Wang. Strict convexity and C1,αC^{1,\alpha} regularity of potential functions in optimal transportation under condition A3w. J. Differential Equations, 260(2):1954–1974, 2016.
  • [2] Grégoire Loeper. On the regularity of solutions of optimal transportation problems. Acta Math., 202(2):241–283, 2009.
  • [3] Grégoire Loeper and Neil S. Trudinger. Weak formulation of the MTW condition and convexity properties of potentials. Methods Appl. Anal., 28(1):53–60, 2021.
  • [4] Xi-Nan Ma, Neil S. Trudinger, and Xu-Jia Wang. Regularity of potential functions of the optimal transportation problem. Arch. Ration. Mech. Anal., 177(2):151–183, 2005.
  • [5] Neil S. Trudinger and Xu-Jia Wang. On convexity notions in optimal transportation. preprint, 2008.
  • [6] Neil S. Trudinger and Xu-Jia Wang. On the second boundary value problem for Monge-Ampère type equations and optimal transportation. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5), 8(1):143–174, 2009.
  • [7] Cédric Villani. Synthetic theory of Ricci curvature bounds. Jpn. J. Math., 11(2):219–263, 2016.