This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Vectorial Approach to Unbalanced Optimal Mass Transport

Jiening Zhu, Rena Elkin, Jung Hun Oh, Joseph O. Deasy, Allen Tannenbaum J. Zhu is with the Department of Applied Mathematics & Statistics, Stony Brook University, NY; email: [email protected]. Elkin, J-H Oh, and J. Deasy are with the Department of Medical Physics, Memorial Sloan Kettering Cancer Center, NY; email: [email protected], [email protected], [email protected]. Tannenbaum is with the Departments of Computer Science and Applied Mathematics & Statistics, Stony Brook University, NY; email: [email protected]
Abstract

Unbalanced optimal mass transport (OMT) seeks to remove the conservation of mass constraint by adding a source term to the standard continuity equation in the Benamou-Brenier formulation of OMT. In this note, we show how the addition of the source fits into the vector-valued OMT framework.

1 Introduction

Optimal mass transport (OMT) is a very important subject in mathematics, originating with the French civil engineer and mathematician Gaspard Monge in 1781 [15, 16]. Recently, the theory has undergone a massive growth, with numerous applications to various areas including signal processing, machine learning, computer vision, meteorology, statistical physics, quantum mechanics, and network theory [1, 11, 14, 3, 12]. Several works deal with extensions of the theory to the unbalanced and vector-valued cases; see [9, 6] and the references therein. In standard treatments, vector-valued and unbalanced extensions are treated separately. In the present work, we show that unbalanced OMT can fit into the model of vector OMT by taking a special set of weight parameters, which gives us fresh way to treat the problem.

In the L2L^{2} setting, both extensions arise from the computational fluid dynamics (CFD) approach to OMT by Benamou and Brenier [2], which was a major development in optimal mass transport theory. For “square of distance” based cost functionals, the Benamou and Brenier framework is equivalent to the original one while giving an explicit interpolation of two mass distributions regarded as a geodesic on the space of probability distributions [13]. The CFD method views the optimal mass transport as the minimizer of a kinetic energy functional subject to a continuity constraint. By modifying the energy functional and the constraint, unbalanced OMT and vector OMT versions were developed [8, 9, 6]. More specifically, by adding a source term, unbalanced OMT can handle transport problems when the mass is not conserved. That is, mass can be created or destroyed during the process. By introducing the divergence of the flow between channels, vector OMT extends the density from scalar to vector-valued and even matrix-valued [5, 4].

In this note, we indicate how to reformulate unbalanced OMT into a vector-valued OMT framework, and thus may be implemented through any existing code for vector OMT. All that needs to be done is adding a new weight matrix without changing the main structure of the code.

In this note, we focus on the connection between these two extensions. We will first give the background on the two extensions of OMT. After that we will show how to reformulate unbalanced OMT to a vector OMT problem. We will also look into reformulating unbalanced vector OMT to a common vector OMT problem. We conclude with some illustrative numerical results.

2 Background

In this section, we briefly introduce the Benamou and Brenier approach of OMT [2] as well as the two extensions mentioned above.

2.1 Benamou and Brenier

The original formulation of OMT due to Monge [15, 16] may be expressed as follows:

infT{Ec(x,T(x))ρ0(x)𝑑x|T#ρ0=ρ1},\inf_{T}\{\int_{E}c(x,T(x))\rho_{0}(x)dx\ |\ T_{\#}\rho_{0}=\rho_{1}\}, (1)

where c(x,y)c(x,y) is the cost of moving unit mass from xx to yy, TT is the transport map, and ρ0,ρ1\rho_{0},\rho_{1} are two probability distributions defined on EE a subdomain of n\mathbb{R}^{n}. T#T_{\#} denotes the push forward of TT.

As pioneered by Leonid Kantorovich [15, 16], the Monge formulation of OMT may be relaxed replacing transport maps TT by couplings π\pi:

infπΠ(ρ0,ρ1)Ec(x,y)π(dx,dy),\inf_{\pi\in\Pi(\rho_{0},\rho_{1})}\int_{E}c(x,y)\pi(dx,dy), (2)

where Π(ρ0,ρ1)\Pi(\rho_{0},\rho_{1}) denotes the set of all the couplings between ρ0\rho_{0} and ρ1\rho_{1} (joint distributions whose two marginal distributions are ρ0\rho_{0} and ρ1\rho_{1}). One may show that for c(x,y)=xy2c(x,y)=\|x-y\|^{2} (square of distance function), the Kantorovich and Monge formulations are equivalent; see [15, 16] and the references therein.

Moreover for c(x,y)=xy2c(x,y)=||x-y||^{2}, the specific infimum is called Wasserstein 2 distance (𝒲2\mathcal{W}_{2}). Benamou and Brenier pointed out that this can be written in an equivalent computational fluid dynamic formulation:

𝒲2(ρ0,ρ1)2=\displaystyle\mathcal{W}_{2}(\rho_{0},\rho_{1})^{2}= infρ,v01Eρ(t,x)v(t,x)2𝑑x𝑑t\displaystyle\inf_{\rho,v}\int_{0}^{1}\int_{E}\rho(t,x)||v(t,x)||^{2}dxdt (3a)
ρt+(ρv)=0\displaystyle\frac{\partial\rho}{\partial t}+\nabla\cdot(\rho v)=0 (3b)
ρ(0,)=ρ0,ρ(1,)=ρ1.\displaystyle\rho(0,\cdot)=\rho_{0},\rho(1,\cdot)=\rho_{1}. (3c)

Thus, the latter reformulates OMT as one of minimizing a kinetic energy functional subject to a continuity constraint. The continuity constraint states that the change of density at each point over time is due to the flux of its neighborhood.

The Benamou-Brenier optimal solution may be related to the the original one of Monge. Indeed, if we have the optimal transport plan TT in (1), the interpolation can be expressed as ρ(t,)=((tT+(1t)id)#ρ0\rho(t,\cdot)=((t\cdot T+(1-t)\cdot id)_{\#}\rho_{0}. For computational purposes, the Benamou-Brenier model is usually written as a convex optimization problem in momentum form (p=ρvp=\rho v):

𝒲2(ρ0,ρ1)2=\displaystyle\mathcal{W}_{2}(\rho_{0},\rho_{1})^{2}= infρ,p01Ep(t,x)2ρ(t,x)𝑑x𝑑t\displaystyle\inf_{\rho,p}\int_{0}^{1}\int_{E}\frac{p(t,x)^{2}}{\rho(t,x)}dxdt (4a)
ρt+p=0\displaystyle\frac{\partial\rho}{\partial t}+\nabla\cdot p=0 (4b)
ρ(0,)=ρ0,ρ(1,)=ρ1.\displaystyle\rho(0,\cdot)=\rho_{0},\rho(1,\cdot)=\rho_{1}. (4c)

2.2 Unbalanced OMT

In the standard formulation, ρ0,ρ1\rho_{0},\rho_{1} need to be balanced, i.e., have the same total mass. We enforced this by simply taking them to be probability distributions (Eρ0(x)𝑑x=Eρ1(x)𝑑x=1\int_{E}\rho_{0}(x)dx=\int_{E}\rho_{1}(x)dx=1). In the unbalanced case, total mass is not required to be preserved. This is very important for a number of applications in which mass may be created or destroyed.

There are many ways to extend the original setting to the unbalanced case [10]. In the present work, we will just treat the case in which a source term is added under the Fisher-Rao smooth formulation [9]. Namely,

𝒲FR(ρ0,ρ1)2=\displaystyle\mathcal{W}_{FR}(\rho_{0},\rho_{1})^{2}= infρ,p01Ep(t,x)2ρ(t,x)+γs(t,x)2ρ(t,x)dxdt\displaystyle\inf_{\rho,p}\int_{0}^{1}\int_{E}\frac{p(t,x)^{2}}{\rho(t,x)}+\gamma\frac{s(t,x)^{2}}{\rho(t,x)}dxdt (5a)
ρt+p=s\displaystyle\frac{\partial\rho}{\partial t}+\nabla\cdot p=s (5b)
ρ(0,)=ρ0,ρ(1,)=ρ1.\displaystyle\rho(0,\cdot)=\rho_{0},\rho(1,\cdot)=\rho_{1}. (5c)

There is an extra source term ss in the continuity equation. The change of density at each point over time now is not only due to the flux of its neighborhood, but also a source. Mass can be created or destroyed at any point and time. The parameter γ\gamma is a weight to control how much source one wants to use.

2.3 Vector-valued OMT

Vector-valued OMT considers vector-valued distributions instead of only the classical scalar ones [6]. With more than one channel, the change of mass is not only due to the flux of its neighborhood but also the redistribution within the given channels. Namely, we have

𝒲V(ρ0,ρ1)2=\displaystyle\mathcal{W}_{V}(\rho_{0},\rho_{1})^{2}= infρ,p,u01EpTdiag(ρ)1p+γuT[diag(𝔽2Tρ)1+diag(𝔽1Tρ)1]udxdt\displaystyle\inf_{\rho,p,u}\int_{0}^{1}\int_{E}p^{T}{\rm diag}(\rho)^{-1}p+\gamma u^{T}[{\rm diag}(\mathbb{F}^{T}_{2}\rho)^{-1}+{\rm diag}(\mathbb{F}^{T}_{1}\rho)^{-1}]u\ dxdt (6a)
ρt+xpu=0\displaystyle\frac{\partial\rho}{\partial t}+\nabla_{x}\cdot p-\nabla_{\mathcal{F}}^{*}u=0 (6b)
ρ(0,)=ρ0,ρ(1,)=ρ1.\displaystyle\rho(0,\cdot)=\rho_{0},\quad\rho(1,\cdot)=\rho_{1}. (6c)

As mentioned above, there are two kinds of flows for the vector-valued density ρ\rho. The first is the spatial flux pp and its corresponding divergence is denoted as x\nabla_{x}\cdot, and the second is the flux uu between channels, whose discrete divergence is denoted by \nabla_{\mathcal{F}}^{*}. pp and uu are both vectors. We note that 𝔽1\mathbb{F}_{1} is the all-1 part of the incident matrix 𝔽\mathbb{F} (sources of all the edges) and 𝔽2=𝔽1𝔽\mathbb{F}_{2}=\mathbb{F}_{1}-\mathbb{F} (sinks of all the edges). As uu is defined on every edge and the density is only defined on nodes, diag(𝔽2Tρ)1+diag(𝔽1Tρ)1{\rm diag}(\mathbb{F}^{T}_{2}\rho)^{-1}+{\rm diag}(\mathbb{F}^{T}_{1}\rho)^{-1} assigns a density to each edge using the density of two end points so that we can compute the kinetic energy for the flows on edges.

Vector-valued OMT can be considered as a general OMT problem on E×FE\times F, where EE is the space on which the vector-valued density is defined and FF is a connected graph. With this understanding, we can rewrite the energy functional in the following equivalent form:

𝒲V(ρ0,ρ1)2=infρ,p,u01EcNodes(F)p(t,x,c)2ρ(t,x,c)dxdt+γ01EeEdges(F)u(t,x,e)2ρ~(t,x,e)dxdt,\mathcal{W}_{V}(\rho_{0},\rho_{1})^{2}=\inf_{\rho,p,u}\int_{0}^{1}\int_{E}\sum_{c\in Nodes(F)}\frac{p(t,x,c)^{2}}{\rho(t,x,c)}dxdt+\gamma\int_{0}^{1}\int_{E}\sum_{e\in Edges(F)}\frac{u(t,x,e)^{2}}{\tilde{\rho}(t,x,e)}dxdt, (7)

where ρ~\tilde{\rho} is density of edge.

2.4 Unbalanced vector-valued OMT

In this section, we write down the unbalanced vector-valued version of OMT. Namely,

𝒲VS(ρ0,ρ1)2=\displaystyle\mathcal{W}_{VS}(\rho_{0},\rho_{1})^{2}= infρ,p,u01EpTdiag(ρ)1p+γuT[diag(𝔽2Tρ)1+diag(𝔽1Tρ)1]u+ηsTdiag(ρ)1sdxdt\displaystyle\inf_{\rho,p,u}\int_{0}^{1}\int_{E}p^{T}{\rm diag}(\rho)^{-1}p+\gamma u^{T}[{\rm diag}(\mathbb{F}^{T}_{2}\rho)^{-1}+{\rm diag}(\mathbb{F}^{T}_{1}\rho)^{-1}]u+\eta s^{T}{\rm diag}(\rho)^{-1}s\ dxdt (8a)
ρt+xpu=s\displaystyle\frac{\partial\rho}{\partial t}+\nabla_{x}\cdot p-\nabla_{\mathcal{F}}^{*}u=s (8b)
ρ(0,)=ρ0,ρ(1,)=ρ1.\displaystyle\rho(0,\cdot)=\rho_{0},\quad\rho(1,\cdot)=\rho_{1}. (8c)

Note that the source term is also a vector. So we need to write the energy functional for this vector term. Regarding the continuity equation, there are three possibilities for the change of mass over time: the flux of its neighborhood, flows between channels and source.

3 Reformulation unbalanced scalar OMT to vector-valued OMT

When rewriting vector OMT in (6), we should note the similarity with unbalanced OMT. The flow between channels may be utilized as the source. The source term describes the creation or vanishing of mass. This can be modeled via an extra layer: the mass created originates from that layer and the vanishing mass just goes to that layer.

3.1 Source layer: scalar case

Here the input source and target are regarded as two mass distributions on the subdomain EE. Total mass may or may not be preserved. We add an extra source layer which is parallel to original space. For each point in EE, there is an edge connecting with source layer. As now we have an extra layer, we can put the difference of mass into the new layer so that the total mass of the 2-vector new structure is preserved. The difference of mass can just be distributed uniformly to source layer or put to some area of interest for further specific applications.

Now we view the flow between channels as source term. Consider the continuity equations of (5b) and (6b):

ρt+p=\displaystyle\frac{\partial\rho}{\partial t}+\nabla\cdot p= s\displaystyle s
ρt+xp=\displaystyle\frac{\partial\rho}{\partial t}+\nabla_{x}\cdot p= u.\displaystyle\nabla_{\mathcal{F}}^{*}u.

We can use u\nabla_{\mathcal{F}}^{*}u as the source term ss. Because of the special graph structure (there is only one edge), we have:

u=u=s.\nabla_{\mathcal{F}}^{*}u=u=s. (9)

Now consider the second term in (7). Since there is only one edge, we can just omit the summation:

γ01EeEdges(F)u(t,x,e)2ρ~(t,x,e)dxdt=γ01Eu(t,x)2ρ~(t,x)𝑑x𝑑t.\gamma\int_{0}^{1}\int_{E}\sum_{e\in Edges(F)}\frac{u(t,x,e)^{2}}{\tilde{\rho}(t,x,e)}dxdt=\gamma\int_{0}^{1}\int_{E}\frac{u(t,x)^{2}}{\tilde{\rho}(t,x)}dxdt. (10)

If we further take the density of each edge ρ~(t,x)=ρ(t,x)\tilde{\rho}(t,x)=\rho(t,x), this term is exactly the same as the second term in the energy functional in unbalanced OMT setting.

Now we consider the first term of the energy functional:

01EcNodes(F)p(t,x,c)2ρ(t,x,c)dxdt=01Ep(t,x,c1)2ρ(t,x,c1)+p(t,x,c2)2ρ(t,x,c2)dxdt.\int_{0}^{1}\int_{E}\sum_{c\in Nodes(F)}\frac{p(t,x,c)^{2}}{\rho(t,x,c)}dxdt=\int_{0}^{1}\int_{E}\frac{p(t,x,c_{1})^{2}}{\rho(t,x,c_{1})}+\frac{p(t,x,c_{2})^{2}}{\rho(t,x,c_{2})}dxdt. (11)

There are only two channels, i.e., c1c_{1} denotes the original channel and c2c_{2} denotes the source layer. Notice that the integral with respect to c1c_{1} term is exactly the first term of (5). We want to get rid of the c2c_{2} integral. This can be done by introducing a small weight parameter for that layer.

3.2 A naive generalization of vector-valued OMT: addition of weight

In the original setting, one treats the kinetic energy of different layers in an identical manner. But by simply introducing a weight, we can treat each layer differently:

𝒲V(ρ0,ρ1)2=infρ,p,u01EcNodes(F)w(c)p(t,x,c)2ρ(t,x,c)dxdt+γ01EeEdges(F)u(t,x,e)2ρ~(t,x,e)dxdt,\mathcal{W}_{V^{\prime}}(\rho_{0},\rho_{1})^{2}=\inf_{\rho,p,u}\int_{0}^{1}\int_{E}\sum_{c\in Nodes(F)}w(c)\frac{p(t,x,c)^{2}}{\rho(t,x,c)}dxdt+\gamma\int_{0}^{1}\int_{E}\sum_{e\in Edges(F)}\frac{u(t,x,e)^{2}}{\tilde{\rho}(t,x,e)}dxdt, (12)

where w(c)w(c) is the weighting parameters. We can choose different values for different channels. For another form of vector OMT:

𝒲V(ρ0,ρ1)2=infρ,p,u01EpTdiag(w)diag(ρ)1p+γuT[diag(𝔽2Tρ)1+diag(𝔽1Tρ)1]udxdt.\mathcal{W}_{V^{\prime}}(\rho_{0},\rho_{1})^{2}=\inf_{\rho,p,u}\int_{0}^{1}\int_{E}p^{T}{\rm diag}(w){\rm diag}(\rho)^{-1}p+\gamma u^{T}[{\rm diag}(\mathbb{F}^{T}_{2}\rho)^{-1}+{\rm diag}(\mathbb{F}^{T}_{1}\rho)^{-1}]u\ dxdt. (13)

Under this setting, if we take w(c1)=1w(c_{1})=1 and w(c2)w(c_{2}) very small, then the integral in the source layer is very small in that energy functional. Clearly, unbalanced OMT is almost equivalent to the extra source layer vector OMT with the latter set of weight parameters.

3.3 Implementation: scalar case

By introducing the source layer, there is an edge between two channels for which we use the flow on that edge as source, and thus there is an extra integral on that new layer. We add very small weight parameters so that two energy functionals are almost identical. See Figure 1.

Refer to caption
Figure 1: The bottom layer (gray) is the source layer, the weight for the flow within it is very small. The flow on the edge between two layers is our source.

It is quite straightforward to implement unbalanced OMT from vector-valued OMT code. We need to only alter a few lines of code to make it work for unbalanced case:

  1. 1.

    Initialization: From the input, first construct two extended structures for vector OMT. The first layer is the original input and the second layer contains the mass difference. If the starting density distribution has more total mass, then the mass difference is added to the second layer of target density distribution. If the starting density distribution has less total mass, then the mass difference is added to the second layer of starting density distribution itself.

  2. 2.

    Set weighting parameters: Employing the code for the energy functional, just add weighting parameters to the corresponding layers.

4 Reformulation of unbalanced vector OMT to vector-valued OMT

The reformulation is very similar to the scalar case. The same simple idea is to add a new source layer which connects to each of the original layers. The flows on those added edges are our sources.

4.1 Source layer: vector-valued case

In the vector-valued case, the input source and target are two nn-vector valued distributions. Total mass may or may not be preserved. We add an extra source layer which is connected to each of the existing layers. As before, we can put the difference of mass into the new layer so that the total mass of the (n+1)(n+1)-vector new structures is preserved.

We call the incident matrix for those newly added edges 𝒢\mathcal{G}, which is a submatrix of the new large incident matrix ~\tilde{\mathcal{F}}. We just split ~\tilde{\mathcal{F}} for illustration purpose. They can be easily pieced together as in original vector setting:

~[uv]=[𝒢][uv]=u+𝒢v,\nabla_{\tilde{\mathcal{F}}}^{*}\left[\begin{array}[]{c}u\\ v\end{array}\right]=\left[\nabla_{\mathcal{F}}^{*}\ \nabla_{\mathcal{G}}^{*}\right]\left[\begin{array}[]{c}u\\ v\end{array}\right]=\nabla_{\mathcal{F}}^{*}u+\nabla_{\mathcal{G}}^{*}v, (14)

where uu is the flow within the existing edges and vv is the flow within the newly added edges.

Now we consider the continuity equations of unbalanced vector OMT and vector OMT with a new layer:

ρt+pu=\displaystyle\frac{\partial\rho}{\partial t}+\nabla\cdot p-\nabla_{\mathcal{F}}^{*}u= s\displaystyle s
ρt+xpu=\displaystyle\frac{\partial\rho}{\partial t}+\nabla_{x}\cdot p-\nabla_{\mathcal{F}}^{*}u= 𝒢v.\displaystyle\nabla_{\mathcal{G}}^{*}v.

We can use 𝒢v\nabla_{\mathcal{G}}^{*}v as the source ss. Now similar to the scalar case:

𝒢v=v=s.\nabla_{\mathcal{G}}^{*}v=v=s. (15)

The energy functional now becomes:

𝒲V′′(ρ0,ρ1)2=infρ,p,u,v01EcNodes(~)p(t,x,c)2ρ(t,x,c)dxdt+\displaystyle\mathcal{W}_{V^{\prime\prime}}(\rho_{0},\rho_{1})^{2}=\inf_{\rho,p,u,v}\int_{0}^{1}\int_{E}\sum_{c\in Nodes(\tilde{\mathcal{F}})}\frac{p(t,x,c)^{2}}{\rho(t,x,c)}dxdt+ γ01EeEdges()u(t,x,e)2ρ~(t,x,e)dxdt\displaystyle\gamma\int_{0}^{1}\int_{E}\sum_{e\in Edges(\mathcal{F})}\frac{u(t,x,e)^{2}}{\tilde{\rho}(t,x,e)}dxdt (16)
+\displaystyle+ η01EeEdges(𝒢)v(t,x,e)2ρ~(t,x,e)dxdt.\displaystyle\eta\int_{0}^{1}\int_{E}\sum_{e\in Edges(\mathcal{G})}\frac{v(t,x,e)^{2}}{\tilde{\rho}(t,x,e)}dxdt.

With the above relationship (15), it is easy to see that the last term exactly fits the original source energy term (8a).

Again there is a new part of kinetic energy due to the newly added source layer. We can introduce weight parameters to make the extra term almost disappear. Same as adding weight technique for different layers, we can put γ\gamma and η\eta together as a weight matrix for different edges so that the final form is more concise:

𝒲V′′(ρ0,ρ1)2=\displaystyle\mathcal{W}_{V^{\prime\prime}}(\rho_{0},\rho_{1})^{2}= infρ,p,u~01EpTdiag(w1)diag(ρ)1p+u~Tdiag(w2)diag(ρ~)1u~dxdt\displaystyle\inf_{\rho,p,\tilde{u}}\int_{0}^{1}\int_{E}p^{T}{\rm diag}(w_{1}){\rm diag}(\rho)^{-1}p+\tilde{u}^{T}{\rm diag}(w_{2}){\rm diag}(\tilde{\rho})^{-1}\tilde{u}\ dxdt (17a)
ρt+xp~u~=0\displaystyle\frac{\partial\rho}{\partial t}+\nabla_{x}\cdot p-\nabla_{\tilde{\mathcal{F}}}^{*}\tilde{u}=0 (17b)
ρ(0,)=ρ0,ρ(1,)=ρ1,\displaystyle\rho(0,\cdot)=\rho_{0},\quad\rho(1,\cdot)=\rho_{1}, (17c)

where diag(w1){\rm diag}(w_{1}) denotes the weighting matrix for different layers and diag(w2){\rm diag}(w_{2}) denotes the weighting matrix for the edges we add (weight =η=\eta) and existing edges (weight=γ=\gamma). u~=[uv]\tilde{u}=\left[\begin{array}[]{c}u\\ v\end{array}\right] is all the flows that between channels within the new large graph ~\tilde{\mathcal{F}} and ρ~\tilde{\rho} is the density assigned to each edge.

4.2 Implementation: vector-valued case

By introducing a source layer, there is an edge between the source layer and each existing channel. We use the flows on those edges as sources. As before, we add very small weight parameters for the source layer so that two energy functionals are almost identical. See Figure 2.

Refer to caption
Figure 2: The bottom layer (gray) is the source layer, the weight for the flow within it is very small. The flow on the edge between the source layer and other existing layers are our sources.

As before, it is also very easy to implement unbalanced vector OMT from vector OMT code.

  1. 1.

    Initialization: From the input, first construct the extended structures for vector OMT. Add an extra layer. Connect that new layer with each of other layers. Put the difference of total mass into that newly added layer.

  2. 2.

    Set weight parameters: Employing the code for the energy functional, add weighting parameters to corresponding layer and corresponding edges (existing edges and newly added edges).

With the simple change of the original vector-valued OMT, we can use the same code for unbalanced vector OMT case without even touching the main structure of the original code.

5 Numerical results

We tested our new formulation on several images using the numerical algorithm from [7]. Gray scale images are general mass distributions on a rectangular area while color images are vector valued distributions.

5.1 Unbalanced OMT

We tested an example of moving two Gaussian distributions. Though this example has preserved total mass, the optimal solution of OMT still uses the source term. We can look at the effect of tuning the weight parameter γ\gamma. The source image and target images are:

Refer to caption
Refer to caption
Figure 3: Source and target distributions

With large γ\gamma, it is expensive to use the source term, so the source layer is almost zero at all times.

\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 4: Density over time
\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 5: Source over time

With small γ\gamma, it is cheap to use the source term, thus much of the mass goes through the source layer.

\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 6: Density over time
\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 7: Source over time

We also tested two images with very large total mass difference:

Refer to caption
Refer to caption
Figure 8: Source and target distributions
\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 9: Density over time
\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 10: Source over time

5.2 Unbalanced vector OMT

We tested our approach on color image data. While the interpolations of density are color, the source layer is still gray scale.

Refer to caption
Refer to caption
Figure 11: Source and target distributions
\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 12: Density over time
\foreach

i͡n 1,2,3,4,5 Refer to caption

\foreachi͡n 6,7,8,9,10 Refer to caption

Figure 13: Source over time

6 Conclusion

Vector-valued OMT is a very powerful model. In this note, we reformulated unbalanced OMT and unbalanced vector-valued OMT by adding the source as a new layer. We gave a new way to include a source term and we proposed a very simple way to implement unbalanced OMT from general vector-valued OMT code. In the present work, we only considered one kind of unbalanced formulation. Other flow-based unbalanced OMT settings fit into our model as well. We believe our generalization of vector-valued OMT has not reached its full potential. Indeed, we only give different weight parameters to different layers. But what if we give different weight parameters to different areas, different nodes or different edges? We may use different parameters according to specific applications. This will be the subject of future research.

7 Acknowledgments

This study was supported by AFOSR grants (FA9550-17-1-0435, FA9550-20-1-0029), NIH grant (R01-AG048769), MSK Cancer Center Support Grant/Core Grant (P30 CA008748), and a grant from Breast Cancer Research Foundation (grant BCRF-17-193).

References

  • [1] Martin Arjovsky, Soumith Chintala, and Léon Bottou. Wasserstein GAN. 2017.
  • [2] Jean-David Benamou and Yann Brenier. A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem. Numerische Mathematik, 84(3):375–393, 2000.
  • [3] Yongxin Chen, Tryphon Georgiou, and Allen Tannenbaum. Wasserstein Geometry of Quantum States and Optimal Transport of Matrix-Valued Measures, pages 139–150. 01 2018.
  • [4] Yongxin Chen, Tryphon T Georgiou, and Allen Tannenbaum. Interpolation of density matrices and matrix-valued measures: The unbalanced case. Euro. Jnl of Applied Mathematics, 30(3):458–480, 2018.
  • [5] Yongxin Chen, Tryphon T Georgiou, and Allen Tannenbaum. Matrix optimal mass transport: a quantum mechanical approach. IEEE Trans. Automatic Control, 63(8):2612 – 2619, 2018.
  • [6] Yongxin Chen, Tryphon T Georgiou, and Allen Tannenbaum. Vector-valued optimal mass transport. SIAM Journal Applied Mathematics, 78(3):1682–1696, 2018.
  • [7] Yongxin Chen, Kaoru Yamamoto, Eldad Haber, Tryphon T. Georgiou, and Allen Tannenbaum. An efficient algorithm for matrix-valued and vector-valued optimal mass transport. in preparation, 2017.
  • [8] Lenaic Chizat, Gabriel Peyré, Bernhard Schmitzer, and François-Xavier Vialard. Unbalanced Optimal Transport: Geometry and Kantorovich Formulation. working paper or preprint, August 2015.
  • [9] Lenaic Chizat, Gabriel Peyré, Bernhard Schmitzer, and Francois-Xavier Vialard. An interpolating distance between optimal transport and Fisher-Rao metrics. Foundations of Computational Mathematics, 10:1–44, 2016.
  • [10] Wilfrid Gangbo, Wuchen Li, Stanley Osher, and Michael Puthawala. Unnormalized optimal transport. Journal of Computational Physics, 399:1–19, 2019.
  • [11] Steven Haker, Lei Zhu, Allen Tannenbaum, and Sigurd Angenent. Optimal mass transport for registration and warping. International Journal of Computer Vision, 60(3):225–240, 2004.
  • [12] James C. Mathews, Saad Nadeem, Maryam Pouryahya, Zehor Belkhatir, Joseph O. Deasy, Arnold J. Levine, and Allen R. Tannenbaum. Functional network analysis reveals an immune tolerance mechanism in cancer. Proceedings of the National Academy of Sciences, 117(28):16339–16345, jul 2020.
  • [13] Felix Otto. The geometry of dissipative evolution equations: the porous medium equation. Communications in Partial Differential Equations, 2001.
  • [14] Svetlozar T Rachev and Ludger Rüschendorf. Mass Transportation Problems: Volumes I and II. Springer Science & Business Media, 1998.
  • [15] Cédric Villani. Topics in Optimal Transportation. American Mathematical Soc., 2003.
  • [16] Cédric Villani. Optimal Transport: Old and New, volume 338. Springer Science & Business Media, 2008.