This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Geometry of the matching distance for 22D filtering functions

Marc Ethier [email protected] Patrizio Frosini [email protected] Nicola Quercioli [email protected]  and  Francesca Tombari [email protected]
Abstract.

In this paper we exploit the concept of extended Pareto grid to study the geometric properties of the matching distance for 2\mathbb{R}^{2}-valued regular functions defined on a Riemannian closed manifold. In particular, we prove that in this case the matching distance is realised either at special values or at values corresponding to vertical, horizontal or slope 11 lines.

2010 Mathematics Subject Classification:
Primary 55N31, Secondary 57R19

1. Introduction

Feature extraction and comparison are the main tasks of data analysis. In topological data analysis this translates into the problem of comparing persistence modules, which encode the homological features extracted from geometric objects. In order to be able to compare persistence modules a distance is needed. There is a wide variety of distances in the space of 11-parameter persistence modules, such as the bottleneck and Wasserstein distances. However, such distances do not directly generalise to the multiparameter setting. Thus, different ones have been proposed over the past years, turning into a substantial catalogue, see for example [4, 5, 10]. One of those is the matching distance, which can be defined in particular for 2-parameter persistence modules. This pseudometric, introduced in [4], is a generalisation of the classical bottleneck distance for 11-parameter persistence modules and measures the difference between the 22-dimensional Betti numbers functions (also known as rank invariants) of persistence modules. The definition of matching distance is based on a foliation method, consisting of “slicing” the 22-parameter persistence module into infinitely many 11-dimensional components by means of lines of positive slope, that we refer to as filtering lines. The matching distance is then obtained by taking the supremum over all such lines of the bottleneck distances between the resulting persistence diagrams after a suitable normalisation.

According to the definition, in order to compute the matching distance between persistence modules, one should take into account infinitely many bottleneck distance computations. Many efforts have been devoted to make this computation efficient, for example, by identifying a finite number of filtering lines contributing to the actual computation [1, 3, 8] or approximation [6, 2, 9] of the distance. However, what many of these works have in common is that their starting point is a pair of 22-parameter persistence modules. Our approach is similar in scope, but different in nature. We consider regular filtering functions on a smooth manifold with values in 2\mathbb{R}^{2}. Their sublevel-set filtrations still return 22-parameter persistence modules, for which we can compute the 22-dimensional persistent Betti numbers functions and, hence, the matching distance between them. Our approach allows us to observe phenomena and exploit structures that are not visible when directly considering persistence modules. For example, it is possible to exploit the differentiable structure of the filtering functions to identify points in the persistence diagrams associated to each filtering line. This structure made of arcs and half-lines is known as extended Pareto grid [5] (see also [11]). The convenience of such an approach relies on the fact that, by using this approach, the changes in homology that occur when the filtering line changes are easy to follow and control.

In this context of 22-parameter persistence modules derived from regular filtering functions on smooth manifolds, we show that filtering lines of slope 11 play a special role in the computation of the matching distance. Our main result shows that the matching distance between the 22-dimensional persistent Betti numbers functions of two filtering functions is indeed realised either on values corresponding to vertical, horizontal or slope 11 lines, or on special values associated with the two functions. The authors of [1] recently obtained an analogous result in the discrete setting. They show that the matching distance is realised either on values corresponding to diagonal lines or on what they call switch values. One main difference is that the collection of special values that we encounter, called special set, is strongly related with the differentiable structure of our input. In particular, it relies on the structure of the extended Pareto grid associated with a function and on the Position Theorem, proved in [5], which relates points of a persistence diagram to points in the extended Pareto grid.

In this paper, we aim to prove the following:

Theorem.

The matching distance between ff and gg is realised either on a value associated with a line of slope 11, a vertical or horizontal line, or on a special value of (f,g)(f,g).

2. Matching distance

Let MM be a closed CC^{\infty}-manifold with a Riemannian metric defined on it. Let f:Mf\colon M\to\mathbb{R} be a smooth function. The filtered homology of the sublevel sets of ff is known as persistent homology. This information can be encoded as a multiset of points in {(x,y)2xy}\{(x,y)\in\mathbb{R}^{2}\mid x\leq y\}, known as the persistence diagram of ff and denoted by Dgm(f)\mathrm{Dgm}(f). The subset Δ={(x,y)2x=y}\Delta=\{(x,y)\in\mathbb{R}^{2}\mid x=y\} is always considered to be in the persistence diagram of a function and, by convention, we treat it as a unique point with infinite multiplicity. See [7] for more details about 11-parameter persistent homology for sublevel set filtrations.

Let f=(f1,f2):M2f=(f_{1},f_{2})\colon M\to\mathbb{R}^{2} and g=(g1,g2):M2g=(g_{1},g_{2})\colon M\to\mathbb{R}^{2} be smooth functions. Consider the set of pairs (a,b)(a,b) in ]0,1[×]0,1[\times\mathbb{R} with the uniform metric d((a,b),(a,b))=max{|aa|,|bb|}d_{\infty}((a,b),(a^{\prime},b^{\prime}))=\max\{\lvert a-a^{\prime}\rvert,\lvert b-b^{\prime}\rvert\}. It parameterises all the lines of 2\mathbb{R}^{2} with positive slope in the following way: r(a,b)r_{(a,b)} is the line containing points of the form t(a,1a)+(b,b)t(a,1-a)+(b,-b) with tt in \mathbb{R}. Each point (u(t),v(t))(u(t),v(t)) of r(a,b)r_{(a,b)} can be associated with the set Mta,b=M(u(t),v(t))={xMf1(x)u(t) and f2(x)v(t)}M_{t}^{a,b}=M_{(u(t),v(t))}=\allowbreak\left\{x\in M\mid f_{1}(x)\leq u(t)\text{ and }f_{2}(x)\leq v(t)\right\}. This defines a 11-dimensional filtration, depending on the line r(a,b)r_{(a,b)}, which can be associated with a persistence diagram. Letting (a,b)(a,b) vary, one obtains a collection of persistence diagrams described by the 22D persistent Betti numbers function of ff. As observed in [4], Mt(a,b)M_{t}^{(a,b)} is also equal to {xMf(a,b)(x)t}\left\{x\in M\mid f_{(a,b)}(x)\leq t\right\}, where f(a,b)(x)=max{f1(x)ba,f2(x)+b1a}f_{(a,b)}(x)=\max\left\{\frac{f_{1}(x)-b}{a},\frac{f_{2}(x)+b}{1-a}\right\} (see Figure 1). However, more commonly, f(a,b)f_{(a,b)} is normalised, without changing the nature of the filtration, and f(a,b)=min{a,1a}f(a,b)f^{*}_{(a,b)}=\min\{a,1-a\}f_{(a,b)} is instead considered.

Given two functions ff and gg, the matching distance [4] is defined by

Dmatch(f,g)=sup(a,b)]0,1[×dB(Dgm(f(a,b)),Dgm(g(a,b))).D_{\text{match}}(f,g)=\sup_{(a,b)\in]0,1[\times\mathbb{R}}d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right).

Here, dB(Dgm(f(a,b)),Dgm(g(a,b)))d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right) is the bottleneck distance between the persistence diagrams of f(a,b)f^{*}_{(a,b)} and g(a,b)g^{*}_{(a,b)}, i.e.,

dB(Dgm(f(a,b)),Dgm(g(a,b)))=infσcost(σ),d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right)=\inf_{\sigma}\text{cost}(\sigma),

where cost(σ)=maxXDgm(f(a,b))d(X,σ(X))\text{cost}(\sigma)=\max_{X\in\mathrm{Dgm}\left(f^{*}_{(a,b)}\right)}d\left(X,\sigma(X)\right), σ\sigma runs over all bijections, called matchings, between Dgm(f(a,b))\mathrm{Dgm}\left(f^{*}_{(a,b)}\right) and Dgm(g(a,b))\mathrm{Dgm}\left(g^{*}_{(a,b)}\right) and, if X=(x1,x2)X=(x_{1},x_{2}) and Y=(y1,y2)Y=(y_{1},y_{2}), then

d(X,Y)={κif X=(x1,x2),Y=(y1,y2)Δ+,|x1y1|if X=(x1,),Y=(y1,),x2x12if X=(x1,x2)Δ+,Y=Δ,y2y12if Y=(y1,y2)Δ+,X=Δ,0if X=Y=Δ,otherwise.d(X,Y)=\begin{cases}\kappa&\textup{if }X=(x_{1},x_{2}),\;Y=(y_{1},y_{2})\in\Delta^{+},\\ |x_{1}-y_{1}|&\textup{if }X=(x_{1},\infty),\;Y=(y_{1},\infty),\\ \frac{x_{2}-x_{1}}{2}&\textup{if }X=(x_{1},x_{2})\in\Delta^{+},\;Y=\Delta,\\ \frac{y_{2}-y_{1}}{2}&\textup{if }Y=(y_{1},y_{2})\in\Delta^{+},\;X=\Delta,\\ 0&\textup{if }X=Y=\Delta,\\ \infty&\textup{otherwise.}\end{cases}

where κ=min{max{|x1y1|,|x2y2|},max{x2x12,y2y12}}\kappa=\min\{\max\{|x_{1}-y_{1}|,|x_{2}-y_{2}|\},\max\{\frac{x_{2}-x_{1}}{2},\frac{y_{2}-y_{1}}{2}\}\}. A matching realising the matching distance, whenever it exists, is called an optimal matching. Note that the matching distance Dmatch(f,g)D_{\text{match}}(f,g) can be seen both as a pseudo-metric between the persistent Betti numbers functions of ff and gg, and between the filtering functions ff and gg. For the sake of simplicity, we keep the notation Dmatch(f,g)D_{\text{match}}(f,g) for both cases. For more details about this definition and the foliation method we refer to [4].

Refer to caption
Figure 1. On the left the space of parameters ]0,1[×]0,1[\times\mathbb{R} is represented. On the right we show the filtering lines corresponding to the two parameters selected and the projection of a torus on the plane. Each of the lines gives a sublevel set filtration of the torus in the direction of the line. The area down and left of the red point (a1t¯+b1,(1a1)t¯b1)(a_{1}\bar{t}+b_{1},(1-a_{1})\bar{t}-b_{1}) on the line r(a1,b1)r_{(a_{1},b_{1})} is the sublevel set Mt¯a1,b1M_{\bar{t}}^{a_{1},b_{1}} associated to such a point.

3. Extended Pareto grid

In this section we recall the relation between a differential construction associated with a smooth function f:M2f\colon M\to\mathbb{R}^{2}, called the extended Pareto grid, and the points of the persistence diagrams Dgm(f(a,b))\mathrm{Dgm}\left(f^{*}_{(a,b)}\right). This connection is established in the Position Theorem proved in [5].

Recall that the Jacobi set of ff is the collection

𝕁(f)={pMf1=λf2 or f2=λf1, for some λ}.\mathbb{J}(f)=\{p\in M\mid\nabla f_{1}=\lambda\nabla f_{2}\text{ or }\nabla f_{2}=\lambda\nabla f_{1},\text{ for some }\lambda\in\mathbb{R}\}.

The Pareto critical set of ff is the subset of 𝕁(f)\mathbb{J}(f) given by

𝕁P(f)={p𝕁(f)f1=λf2 or f2=λf1, for some λ0}.\mathbb{J}_{P}(f)=\left\{p\in\mathbb{J}(f)\mid\nabla f_{1}=\lambda\nabla f_{2}\text{ or }\nabla f_{2}=\lambda\nabla f_{1},\text{ for some }\lambda\leq 0\right\}.

Assume now that ff is not only smooth, but it also satisfies the following properties:

  1. (i)

    No point pp exists in MM at which both f1\nabla f_{1} and f2\nabla f_{2} vanish.

  2. (ii)

    𝕁(f)\mathbb{J}(f) is a 11-manifold smoothly embedded in MM consisting of finitely many components, each one diffeomorphic to a circle.

  3. (iii)

    𝕁P(f)\mathbb{J}_{P}(f) is a 11-dimensional closed submanifold of M, with boundary in 𝕁(f)\mathbb{J}(f).

  4. (iv)

    If we denote by 𝕁C(f)\mathbb{J}_{C}(f) the subset of 𝕁(f)\mathbb{J}(f) where f1\nabla f_{1} and f2\nabla f_{2} are orthogonal to 𝕁(f)\mathbb{J}(f), then the connected components of 𝕁P(f)𝕁C(f)\mathbb{J}_{P}(f)\setminus\mathbb{J}_{C}(f) are finite in number, each one being diffeomorphic to an interval. With respect to any parameterisation of each component, one of f1f_{1} and f2f_{2} is strictly increasing and the other is strictly decreasing. Each component can meet critical points for f1f_{1}, f2f_{2} only at its endpoints.

Denote by {p1,,ph}\{p_{1},\dots,p_{h}\} and {q1,,qk}\{q_{1},\dots,q_{k}\}, respectively, the critical points of f1f_{1} and f2f_{2}. Since the function ff satisfies (i), then {p1,,ph}{q1,,qk}=\{p_{1},\dots,p_{h}\}\cap\{q_{1},\dots,q_{k}\}=\emptyset. The extended Pareto grid of ff is defined as the union

Γ(f)=f(𝕁P(f))(ivi)(jhj)\Gamma(f)=f\left(\mathbb{J}_{P}(f)\right)\cup\left(\bigcup_{i}v_{i}\right)\cup\left(\bigcup_{j}h_{j}\right)

where viv_{i} is the vertical half-line {(x,y)2x=f1(pi),yf2(pi)}\{(x,y)\in\mathbb{R}^{2}\mid x=f_{1}(p_{i}),y\geq f_{2}(p_{i})\} and hjh_{j} is the horizontal half-line {(x,y)2xf1(qj),y=f2(qj)}\{(x,y)\in\mathbb{R}^{2}\mid x\geq f_{1}(q_{j}),y=f_{2}(q_{j})\}. We refer to these half-lines as improper contours and to the closure of the image of the connected components of 𝕁P(f)𝕁C(f)\mathbb{J}_{P}(f)\setminus\mathbb{J}_{C}(f) as proper contours of Γ(f)\Gamma(f). Figure 2 shows an example of extended Pareto grid for the projection of a sphere in 3\mathbb{R}^{3} on the plane y=0y=0. The violet horizontal half-lines originate at critical values of f2f_{2}, while the vertical ones originate at critical values of f1f_{1}. The red arcs are the images of those arcs on the sphere in which the gradients f1\nabla f_{1} and f2\nabla f_{2} have the same direction but opposite orientation. Observe that, because of property (ii), the number of contours in Γ(f)\Gamma(f) is finite. Moreover, property (iv) ensures that every contour can be parameterised as a curve whose two coordinates are respectively strictly decreasing and strictly increasing. For more details about properties (i)-(iv) we refer the interested reader to [5, 11].

Refer to caption
Figure 2. The extended Pareto grid of the function f(x,y,z)=(x,z)f(x,y,z)=(x,z) on S2={(x,y,z)3x2+y2+z2=1}S^{2}=\{(x,y,z)\in\mathbb{R}^{3}\mid x^{2}+y^{2}+z^{2}=1\}.On the left, the red arcs are the proper contours and the violet half-lines are the improper contours. Any point in this extended Pareto grid corresponds to a birth or death of a homological class. For example, on the right side of this figure, the red points correspond to the birth of homology classes in degree 0, while the green points correspond to the birth of homology classes in degree 22.

One may observe that the portions of contours delimited by points of intersection between different contours correspond to births and deaths of homology classes. For example, the red union of contours corresponds to the birth of a homology class in degree 0 and the green portions of contour to the birth of a homology class in degree 2. For a richer example we refer the reader to [5, Figure 8].

The Position Theorem (Theorem 2 in [5]) allows us to obtain the coordinates of the points in the persistence diagram of f(a,b)f^{*}_{(a,b)} just by looking at the extended Pareto grid of the function and the filtering line r(a,b)r_{(a,b)}. It reads as follows:

Theorem 3.1.

Let (a,b)(a,b) be in ]0,1[×]0,1[\times\mathbb{R} and XX in Dgm(f(a,b)){Δ}\mathrm{Dgm}(f^{*}_{(a,b)})\setminus\{\Delta\}. Then, for each finite coordinate ww of XX, a point (p1,p2)(p_{1},p_{2}) in r(a,b)Γ(f)r_{(a,b)}\cap\Gamma(f) exists such that w=min{a,1a}a(p1b)=min{a,1a}1a(p2+b)w=\frac{\min\{a,1-a\}}{a}(p_{1}-b)=\frac{\min\{a,1-a\}}{1-a}(p_{2}+b).

In [5] the set of filtering functions considered is the set of normal functions. However, the reader can observe that the proof of this specific theorem is actually independent from this assumption and it is valid also in our current setting.

4. Extension of persistence diagrams

In this section we show that it is possible to extend each 22D persistent Betti numbers function from the open set ]0,1[×]0,1[\times\mathbb{R}, where it is defined, to the closed set [0,1]×[0,1]\times\mathbb{R}. Moreover, we prove that the matching distance between ff and gg can be realised on the compact set [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}], with C¯=max{f,g}\overline{C}=\max\{\lVert f\rVert_{\infty},\lVert g\rVert_{\infty}\}.

Proposition 4.1.

Let CC be a positive real number. If 0<a,a120<a,a^{\prime}\leq\frac{1}{2} and |b|C\lvert b\rvert\leq C, then, for every bb^{\prime},

f(a,b)f(a,b)4|aa|(f2+C)+3|bb|.\left\lVert f^{*}_{(a,b)}-f^{*}_{(a^{\prime},b^{\prime})}\right\rVert_{\infty}\leq 4\lvert a-a^{\prime}\rvert\left(\lVert f_{2}\rVert_{\infty}+C\right)+3\lvert b-b^{\prime}\rvert.
Proof.

Since a,a12a,a^{\prime}\leq\frac{1}{2}, then min{a,1a}=a\min\left\{a,1-a\right\}=a and min{a,1a}=a\min\left\{a^{\prime},1-a^{\prime}\right\}=a^{\prime}. Therefore, recalling that |max{α,β}max{γ,δ}|max{|αγ|,|βδ|}\lvert\max\left\{\alpha,\beta\right\}-\max\{\gamma,\delta\}\rvert\leq\max\{\lvert\alpha-\gamma\rvert,\lvert\beta-\delta\rvert\} and observing that (1a)(1a)14(1-a)(1-a^{\prime})\geq\frac{1}{4},

f(a,b)f(a,b)=amax{f1ba,f2+b1a}amax{f1ba,f2+b1a}\displaystyle\left\lVert f^{*}_{(a,b)}-f^{*}_{(a^{\prime},b^{\prime})}\right\rVert_{\infty}=\left\lVert a\max\left\{\frac{f_{1}-b}{a},\frac{f_{2}+b}{1-a}\right\}-a^{\prime}\max\left\{\frac{f_{1}-b^{\prime}}{a^{\prime}},\frac{f_{2}+b^{\prime}}{1-a^{\prime}}\right\}\right\rVert_{\infty}
=supxM|max{f1(x)b,a1a(f2(x)+b)}max{f1(x)b,a1a(f2(x)+b)}|\displaystyle=\sup_{x\in M}\left\lvert\max\left\{f_{1}(x)-b,\frac{a}{1-a}(f_{2}(x)+b)\right\}-\max\left\{f_{1}(x)-b^{\prime},\frac{a^{\prime}}{1-a^{\prime}}(f_{2}(x)+b^{\prime})\right\}\right\rvert
supxMmax{|bb|,|a1a(f2(x)+b)a1a(f2(x)+b)|}\displaystyle\leq\sup_{x\in M}\max\left\{\lvert b-b^{\prime}\rvert,\left\lvert\frac{a}{1-a}(f_{2}(x)+b)-\frac{a^{\prime}}{1-a^{\prime}}(f_{2}(x)+b^{\prime})\right\rvert\right\}
=supxMmax{|bb|,|af2(x)+abaabaf2(x)ab+aab|(1a)(1a)}\displaystyle=\sup_{x\in M}\max\left\{\lvert b-b^{\prime}\rvert,\frac{\left\lvert af_{2}(x)+ab-aa^{\prime}b-a^{\prime}f_{2}(x)-a^{\prime}b^{\prime}+aa^{\prime}b^{\prime}\right\rvert}{(1-a)(1-a^{\prime})}\right\}
max{|bb|,|aa|f2+|abab|+|bb|aa(1a)(1a)}\displaystyle\leq\max\left\{\lvert b-b^{\prime}\rvert,\frac{\lvert a-a^{\prime}\rvert\lVert f_{2}\rVert_{\infty}+\lvert ab-a^{\prime}b^{\prime}\rvert+\lvert b-b^{\prime}\rvert aa^{\prime}}{(1-a)(1-a^{\prime})}\right\}
max{|bb|,4|aa|f2+4|abab|+4|bb|aa}\displaystyle\leq\max\left\{\lvert b-b^{\prime}\rvert,4\lvert a-a^{\prime}\rvert\lVert f_{2}\rVert_{\infty}+4\lvert ab-a^{\prime}b^{\prime}\rvert+4\lvert b-b^{\prime}\rvert aa^{\prime}\right\}
max{|bb|,4|aa|f2+4|abab|+|bb|}\displaystyle\leq\max\{\lvert b-b^{\prime}\rvert,4\lvert a-a^{\prime}\rvert\lVert f_{2}\rVert_{\infty}+4\lvert ab-a^{\prime}b^{\prime}\rvert+\lvert b-b^{\prime}\rvert\}
=4|aa|f2+4|abab|+|bb|\displaystyle=4\lvert a-a^{\prime}\rvert\lVert f_{2}\rVert_{\infty}+4\lvert ab-a^{\prime}b^{\prime}\rvert+\lvert b-b^{\prime}\rvert
4|aa|f2+4|aa||b|+4|bb|a+|bb|\displaystyle\leq 4\lvert a-a^{\prime}\rvert\lVert f_{2}\rVert_{\infty}+4\lvert a-a^{\prime}\rvert\lvert b\rvert+4\lvert b-b^{\prime}\rvert a^{\prime}+\lvert b-b^{\prime}\rvert
4|aa|f2+4|aa||b|+3|bb|\displaystyle\leq 4\lvert a-a^{\prime}\rvert\lVert f_{2}\rVert_{\infty}+4\lvert a-a^{\prime}\rvert\lvert b\rvert+3\lvert b-b^{\prime}\rvert
4|aa|(f2+C)+3|bb|.\displaystyle\leq 4\lvert a-a^{\prime}\rvert(\lVert f_{2}\rVert_{\infty}+C)+3\lvert b-b^{\prime}\rvert.

By observing that, if f=(f1,f2)f=(f_{1},f_{2}) and h=(f2,f1)h=(f_{2},f_{1}), then f(a,b)=h(1a,b)f^{*}_{(a,b)}=h^{*}_{(1-a,-b)}, we obtain an analogous result to Proposition 4.1 for a,a12a,a^{\prime}\geq\frac{1}{2}.

Proposition 4.2.

Let CC be a positive real number. If 12a,a<1\frac{1}{2}\leq a,a^{\prime}<1 and |b|C\lvert b\rvert\leq C, then, for every bb^{\prime},

f(a,b)f(a,b)4|aa|(f1+C)+3|bb|.\left\lVert f^{*}_{(a,b)}-f^{*}_{(a^{\prime},b^{\prime})}\right\rVert_{\infty}\leq 4\lvert a-a^{\prime}\rvert(\lVert f_{1}\rVert_{\infty}+C)+3\lvert b-b^{\prime}\rvert.

As a consequence, the function

f(,):]0,1[×C(M,),(a,b)f(a,b)f^{*}_{(\cdot,\cdot)}\colon]0,1[\times\mathbb{R}\to C(M,\mathbb{R}),\quad(a,b)\mapsto f^{*}_{(a,b)}

is locally Lipschitz. This is the content of the following result:

Theorem 4.3.

If |b|C\lvert b\rvert\leq C, then for every 0<a,a<10<a,a^{\prime}<1 and every bb^{\prime},

f(a,b)f(a,b)4|aa|(f+C)+3|bb|.\left\lVert f^{*}_{(a,b)}-f^{*}_{(a^{\prime},b^{\prime})}\right\rVert_{\infty}\leq 4\lvert a-a^{\prime}\rvert(\lVert f\rVert_{\infty}+C)+3\lvert b-b^{\prime}\rvert.
Proof.

If a,a12a,a^{\prime}\leq\frac{1}{2} or a,a12a,a^{\prime}\geq\frac{1}{2}, the statement follows directly from Propositions 4.1 and 4.2. Without loss of generality, we can assume that a12a\leq\frac{1}{2} and a12a^{\prime}\geq\frac{1}{2}. Moreover, consider (12,b),(12,b)\left(\frac{1}{2},b\right),\left(\frac{1}{2},b^{\prime}\right). We have that

f(a,b)f(a,b)\displaystyle\left\lVert f^{*}_{(a,b)}-f^{*}_{(a^{\prime},b^{\prime})}\right\rVert_{\infty} f(a,b)f(12,b)+f(12,b)f(12,b)+\displaystyle\leq\left\lVert f^{*}_{(a,b)}-f^{*}_{(\frac{1}{2},b)}\right\rVert_{\infty}+\left\lVert f^{*}_{(\frac{1}{2},b)}-f^{*}_{(\frac{1}{2},b^{\prime})}\right\rVert_{\infty}+
+f(12,b)f(a,b)\displaystyle+\left\lVert f^{*}_{(\frac{1}{2},b^{\prime})}-f^{*}_{(a^{\prime},b^{\prime})}\right\rVert_{\infty}
4|a12|(f+C)+3|bb|+4|1212|(f+C)+\displaystyle\leq 4\left\lvert a-\frac{1}{2}\right\rvert\left(\lVert f\rVert_{\infty}+C\right)+3\lvert b-b\rvert+4\left\lvert\frac{1}{2}-\frac{1}{2}\right\rvert(\lVert f\rVert_{\infty}+C)+
+3|bb|+4|12a|(f+C)+3|bb|\displaystyle+3\lvert b-b^{\prime}\rvert+4\left\lvert\frac{1}{2}-a^{\prime}\right\rvert(\lVert f\rVert_{\infty}+C)+3\lvert b^{\prime}-b^{\prime}\rvert
=4(|a12|+|12a|)(f+C)+3|bb|\displaystyle=4\left(\left\lvert a-\frac{1}{2}\right\rvert+\left\lvert\frac{1}{2}-a^{\prime}\right\rvert\right)(\lVert f\rVert_{\infty}+C)+3\lvert b-b^{\prime}\rvert
=4|aa|(f+C)+3|bb|.\displaystyle=4\lvert a-a^{\prime}\rvert(\lVert f\rVert_{\infty}+C)+3\lvert b-b^{\prime}\rvert.

In Theorem 4.3 we showed that the function f(,)f^{*}_{(\cdot,\cdot)} is locally Lipschitz. As such, it can be extended to the parameter values (0,b)(0,b) (resp. (1,b)(1,b)) as the limit f(0,b):=lim(a,b)(0,b)f(a,b)f^{*}_{(0,b)}:=\lim_{(a^{\prime},b^{\prime})\to(0,b)}f^{*}_{(a^{\prime},b^{\prime})} (resp., f(1,b):=lim(a,b)(1,b)f(a,b)f^{*}_{(1,b)}:=\lim_{(a^{\prime},b^{\prime})\to(1,b)}f^{*}_{(a^{\prime},b^{\prime})}), for every bb in \mathbb{R}. Such a function is continuous, and the stability of persistence diagrams with respect to the uniform norm implies that the limit lim(a,b)(0,b)Dgm(f(a,b))\lim_{(a^{\prime},b^{\prime})\to(0,b)}\mathrm{Dgm}\left(f^{*}_{(a^{\prime},b^{\prime})}\right) (resp. lim(a,b)(1,b)Dgm(f(a,b))\lim_{(a^{\prime},b^{\prime})\to(1,b)}\mathrm{Dgm}\left(f^{*}_{(a^{\prime},b^{\prime})}\right)) also exists and is equal to Dgm(f(0,b))\mathrm{Dgm}\left(f^{*}_{(0,b)}\right) (resp. Dgm(f(1,b))\mathrm{Dgm}\left(f^{*}_{(1,b)}\right)). In other words, f(,)f^{*}_{(\cdot,\cdot)} can be uniquely extended to [0,1]×[0,1]\times\mathbb{R} and this extension is also a locally Lipschitz function. Therefore, in the rest of this paper, we will be allowed to consider the functions f(a,b)f^{*}_{(a,b)} for any (a,b)(a,b) in [0,1]×[0,1]\times\mathbb{R}. The limit functions f(0,b)f_{(0,b)}^{*} and f(1,b)f^{*}_{(1,b)} can be computed explicitly for any bb in \mathbb{R}:

f(0,b)(x)\displaystyle f^{*}_{(0,b)}(x) =lim(a,b)(0,b)f(a,b)(x)\displaystyle=\lim_{(a^{\prime},b^{\prime})\to(0,b)}f^{*}_{(a^{\prime},b^{\prime})}(x)
=lima0amax{f1(x)ba,f2(x)+b1a}\displaystyle=\lim_{a^{\prime}\to 0}a^{\prime}\max\left\{\frac{f_{1}(x)-b}{a^{\prime}},\frac{f_{2}(x)+b}{1-a^{\prime}}\right\}
=max{f1(x)b,0},\displaystyle=\max\left\{f_{1}(x)-b,0\right\},
f(1,b)(x)\displaystyle f^{*}_{(1,b)}(x) =lim(a,b)(1,b)f(a,b)(x)\displaystyle=\lim_{(a^{\prime},b^{\prime})\to(1,b)}f^{*}_{(a^{\prime},b^{\prime})}(x)
=lima1(1a)max{f1(x)ba,f2(x)+b1a}\displaystyle=\lim_{a^{\prime}\to 1}(1-a^{\prime})\max\left\{\frac{f_{1}(x)-b}{a^{\prime}},\frac{f_{2}(x)+b}{1-a^{\prime}}\right\}
=max{0,f2(x)+b}.\displaystyle=\max\left\{0,f_{2}(x)+b\right\}.

Since Theorem 4.3 enables us to extend the functions f(,)f^{*}_{(\cdot,\cdot)} and g(,)g^{*}_{(\cdot,\cdot)} to [0,1]×[0,1]\times\mathbb{R}, the function

(a,b)dB(Dgm(f(a,b)),Dgm(g(a,b)))(a,b)\mapsto d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right)

can be extended to [0,1]×[0,1]\times\mathbb{R}, too. Furthermore, it is continuous because of the stability of persistence diagrams and, hence, it admits a maximum in its compact domain.

Next, we show that it is not restrictive to compute the matching distance for parameters in [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}], where C¯=max{f,g}\overline{C}=\max\{\lVert f\rVert_{\infty},\lVert g\rVert_{\infty}\}.

Proposition 4.4.

There exists (a¯,b¯)(\bar{a},\bar{b}) in [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}], with C¯=max{f,g}\overline{C}=\max\{\lVert f\rVert_{\infty},\lVert g\rVert_{\infty}\}, such that

Dmatch(f,g)\displaystyle D_{\mathrm{match}}(f,g) =max(a,b)[0,1]×[C¯,C¯]dB(Dgm(f(a,b)),Dgm(g(a,b)))\displaystyle=\max_{(a,b)\in[0,1]\times[-\overline{C},\overline{C}]}d_{B}\left(\mathrm{Dgm}\left(f_{(a,b)}^{*}\right),\mathrm{Dgm}\left(g_{(a,b)}^{*}\right)\right)
=dB(Dgm(f(a¯,b¯)),Dgm(g(a¯,b¯))).\displaystyle=d_{B}\left(\mathrm{Dgm}\left(f_{(\bar{a},\bar{b})}^{*}\right),\mathrm{Dgm}\left(g_{(\bar{a},\bar{b})}^{*}\right)\right).
Proof.

Our strategy is to check what happens when |b|C¯\lvert b\rvert\geq\overline{C}. There are four possible cases given by the combinations of a12a\leq\frac{1}{2} or a12a\geq\frac{1}{2} and bC¯b\leq-\overline{C} or bC¯b\geq\overline{C}. Consider the case a12a\leq\frac{1}{2} and bC¯b\leq-\overline{C}. We have f(a,b)=amax{1a(f1b),11a(f2+b)}f^{*}_{(a,b)}=a\max\left\{\frac{1}{a}(f_{1}-b),\frac{1}{1-a}(f_{2}+b)\right\}. However, 1a(f1b)1a(f1+C¯)0\frac{1}{a}(f_{1}-b)\geq\frac{1}{a}(f_{1}+\overline{C})\geq 0 and 11a(f2+b)11a(f2C¯)0\frac{1}{1-a}(f_{2}+b)\leq\frac{1}{1-a}(f_{2}-\overline{C})\leq 0. Thus, f(a,b)=f1bf^{*}_{(a,b)}=f_{1}-b and, similarly, g(a,b)=g1bg^{*}_{(a,b)}=g_{1}-b. The bottleneck distance between their persistence diagrams will thus be dB(Dgm(f(a,b)),Dgm(g(a,b)))=dB(Dgm(f1b),Dgm(g1b))=dB(Dgm(f1),Dgm(g1))d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right)=\allowbreak d_{B}\left(\mathrm{Dgm}\left(f_{1}-b\right),\mathrm{Dgm}\left(g_{1}-b\right)\right)=\allowbreak d_{B}\left(\mathrm{Dgm}\left(f_{1}\right),\mathrm{Dgm}\left(g_{1}\right)\right). Therefore, dB(Dgm(f(a,b)),Dgm(g(a,b)))d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right) is constant for a12a\leq\frac{1}{2} and bC¯b\leq-\overline{C}. Hence we can limit ourselves to computing its value for a=12a=\frac{1}{2} and b=C¯b=-\overline{C}.

Consider now a12a\geq\frac{1}{2} and bC¯b\leq-\overline{C}. We have f(a,b)=1aa(f1b)f^{*}_{(a,b)}=\frac{1-a}{a}(f_{1}-b) and, similarly, g(a,b)=1aa(g1b)g^{*}_{(a,b)}=\frac{1-a}{a}(g_{1}-b). Fixing aa, we observe that in this case dB(Dgm(f(a,b)),Dgm(g(a,b)))=1aadB(Dgm(f1b),Dgm(g1b))=1aadB(Dgm(f1),Dgm(g1))d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right)=\frac{1-a}{a}d_{B}\left(\mathrm{Dgm}(f_{1}-b),\mathrm{Dgm}(g_{1}-b)\right)=\frac{1-a}{a}d_{B}(\mathrm{Dgm}(f_{1}),\mathrm{Dgm}(g_{1})) is constant with respect to bb. Since a12a\geq\frac{1}{2} was chosen arbitrarily, and there is no dependence on bb, we can choose them to be a=12a=\frac{1}{2} and b=C¯b=-\overline{C} and conclude.

The other two cases follow the same strategy. ∎

Refer to caption
Figure 3. The bottleneck distance is constant on the coloured regions and half-lines, whereas it is non-increasing with respect to aa on b=C¯b=-\overline{C}, if a12a\geq\frac{1}{2}, and non-decreasing with respect to aa on b=C¯b=\overline{C}, if a12a\leq\frac{1}{2}.

The above proof also shows that the continuous function dB(Dgm(f(a,b)),Dgm(g(a,b)))d_{B}\left(\mathrm{Dgm}\left(f^{*}_{(a,b)}\right),\mathrm{Dgm}\left(g^{*}_{(a,b)}\right)\right) is constant on the segments {(a,b)0a12,b=C¯}\{(a,b)\mid 0\leq a\leq\frac{1}{2},b=-\overline{C}\} and {(a,b)12a1,b=C¯}\{(a,b)\mid\frac{1}{2}\leq a\leq 1,b=\overline{C}\}, non-increasing on the segment {(a,b)1a12,b=C¯}\{(a,b)\mid 1\geq a\geq\frac{1}{2},b=-\overline{C}\} and non-decreasing on the segment {(a,b)0a12,b=C¯}\{(a,b)\mid 0\leq a\leq\frac{1}{2},b=\overline{C}\}. Moreover, it is 0 on (0,C¯)(0,\overline{C}) and (1,C¯)(1,-\overline{C}) (see Figure 3). Furthermore, we would like to point out that Proposition 4.4 gives us a new formulation for the definition of the matching distance DmatchD_{\mathrm{match}} as follows:

Dmatch(f,g)\displaystyle D_{\mathrm{match}}(f,g) =max(a,b)[0,1]×dB(Dgm(f(a,b)),Dgm(g(a,b))).\displaystyle=\max_{(a,b)\in[0,1]\times\mathbb{R}}d_{B}\left(\mathrm{Dgm}\left(f_{(a,b)}^{*}\right),\mathrm{Dgm}\left(g_{(a,b)}^{*}\right)\right).

5. Special set and matching distance

In this section we introduce the special set associated with a pair of functions (f,g)(f,g). We prove that the matching distance between two functions is realised either on values associated with vertical, horizontal or slope 11 lines, or on this special set.

Definition 5.1.

Let Ctr(f,g)\text{Ctr}(f,g) be the set of all curves that are contours of ff or gg. The special set of (f,g)(f,g), denoted by Sp(f,g)\text{Sp}(f,g), is the collection of all (a,b)(a,b) in ]0,1[×[C¯,C¯]]0,1[\times[-\overline{C},\overline{C}] for which two distinct pairs {αp,αq}\{\alpha_{p},\alpha_{q}\}, {αs,αt}\{\alpha_{s},\alpha_{t}\} of contours in Ctr(f,g)\text{Ctr}(f,g) intersecting r(a,b)r_{(a,b)} exist, such that {αp,αq}{αs,αt}\{\alpha_{p},\alpha_{q}\}\neq\{\alpha_{s},\alpha_{t}\} and

  • c1|xPxQ|=c2|xSxT|c_{1}\lvert x_{P}-x_{Q}\rvert=c_{2}\lvert x_{S}-x_{T}\rvert, with c1,c2{1,2}c_{1},c_{2}\in\{1,2\}, if a12a\leq\frac{1}{2},

  • c1|yPyQ|=c2|ySyT|c_{1}\lvert y_{P}-y_{Q}\rvert=c_{2}\lvert y_{S}-y_{T}\rvert, with c1,c2{1,2}c_{1},c_{2}\in\{1,2\}, if a12a\geq\frac{1}{2},

where P=P(a,b)=r(a,b)αpP=P_{(a,b)}=r_{(a,b)}\cap\alpha_{p}, Q=Q(a,b)=r(a,b)αqQ=Q_{(a,b)}=r_{(a,b)}\cap\alpha_{q}, S=S(a,b)=r(a,b)αsS=S_{(a,b)}=r_{(a,b)}\cap\alpha_{s} and T=T(a,b)=r(a,b)αtT=T_{(a,b)}=r_{(a,b)}\cap\alpha_{t}, and xx_{*}, yy_{*} denote abscissas and ordinates of these points. An element of the special set Sp(f,g)\text{Sp}(f,g) is called a special value of the pair (f,g)(f,g).

Special values are values of ]0,1[×[C¯,C¯]]0,1[\times[-\overline{C},\overline{C}] in which the optimal matching may abruptly change because of the presence of more than one pair of points with the same distance between abscissas (for a12a\leq\frac{1}{2}) or same distance between ordinates (for a12a\geq\frac{1}{2}). This discontinuity behaviour gives an obstruction to proving that the matching distance is realised only on vertical, horizontal and slope 1 lines. Indeed, the key for proving Theorem 5.4 is being able to continuously move in the space of parameters and not losing track of the points realising the optimal matching. When encountering a special value this continuity may be missing.

Figure 4 shows two examples of lines associated with special values of (f,g)(f,g), with f,g:S22f,g\colon S^{2}\to\mathbb{R}^{2}, f(x,y,z)=(x,z)f(x,y,z)=(x,z) and g(x,y,z)=(2.1x+2,0.6z+1.8)g(x,y,z)=(2.1x+2,0.6z+1.8). The green and light blue lines correspond respectively to the parameter values (0.6,0)(0.6,0) and (0.491,0.451)(0.491,0.451). The intersection points A1,A2A_{1},A_{2} and B1,B2B_{1},B_{2}, between the green line and the extended Pareto grid have equal difference between abscissas, thus (0.6,0)(0.6,0) is a special value. On the other hand, the intersection points C1,C2C_{1},C_{2} and D1,D2D_{1},D_{2}, between the light blue line and the extended Pareto grid have equal difference between ordinates. In particular, (0.491,0.451)(0.491,0.451) approximates a special value up to a 5×1075\times 10^{-7} error.

Refer to caption
Figure 4. The light blue line corresponds to the pair (0.6,0)(0.6,0) and the green approximately to (0.491,0.451)(0.491,0.451). They are both special, since |xA1xA2|=|xB1xB2|\lvert x_{A_{1}}-x_{A_{2}}\rvert=\lvert x_{B_{1}}-x_{B_{2}}\rvert and |yC1yC2|=|yD1yD2|\lvert y_{C_{1}}-y_{C_{2}}\rvert=\lvert y_{D_{1}}-y_{D_{2}}\rvert.
Proposition 5.2.

Sp(f,g)\mathrm{Sp}(f,g) is closed in ]0,1[×[C¯,C¯]]0,1[\times[-\overline{C},\overline{C}].

Proof.

First, we show that Sp(f,g)(]0,12]×[C¯,C¯])\text{Sp}(f,g)\cap\left(]0,\frac{1}{2}]\times[-\overline{C},\overline{C}]\right) is closed. Consider a sequence {(an,bn)}\{(a_{n},b_{n})\} in Sp(f,g)(]0,12]×[C¯,C¯])\text{Sp}(f,g)\cap\left(]0,\frac{1}{2}]\times[-\overline{C},\overline{C}]\right) that converges to (a¯,b¯)(\overline{a},\overline{b}) in ]0,12]×[C¯,C¯]]0,\frac{1}{2}]\times[-\overline{C},\overline{C}]. Since such a sequence consists of special values of (f,g)(f,g), there exist two distinct sets {αpn,αqn}\{\alpha_{p}^{n},\alpha_{q}^{n}\} and {αsn,αtn}\{\alpha_{s}^{n},\alpha_{t}^{n}\} in Ctr(f,g)\text{Ctr}(f,g) such that c1n|xPnxQn|=c2n|xSnxTn|c_{1}^{n}\lvert x_{P_{n}}-x_{Q_{n}}\rvert=c_{2}^{n}\lvert x_{S_{n}}-x_{T_{n}}\rvert, where Pn=r(an,bn)αpnP_{n}=r_{(a_{n},b_{n})}\cap\alpha_{p}^{n}, Qn=r(an,bn)αqnQ_{n}=r_{(a_{n},b_{n})}\cap\alpha_{q}^{n}, Sn=r(an,bn)αsnS_{n}=r_{(a_{n},b_{n})}\cap\alpha_{s}^{n} and Tn=r(an,bn)αtnT_{n}=r_{(a_{n},b_{n})}\cap\alpha_{t}^{n} and c1n,c2n{1,2}c_{1}^{n},c_{2}^{n}\in\{1,2\}, for every nn. Since Ctr(f,g)\text{Ctr}(f,g) has finitely many contours, we can assume, up to subsequences, that the sequences {Pn}\{P_{n}\}, {Qn}\{Q_{n}\}, {Sn}\{S_{n}\} and {Tn}\{T_{n}\} lie respectively in the contours αpn=αp\alpha_{p}^{n}=\alpha_{p}, αqn=αq\alpha_{q}^{n}=\alpha_{q}, αsn=αs\alpha_{s}^{n}=\alpha_{s} and αtn=αt\alpha_{t}^{n}=\alpha_{t}, for every nn. For the same reason, we can assume that c1n=c1c_{1}^{n}=c_{1} and c2n=c2c_{2}^{n}=c_{2}, for every nn. Since {(an,bn)}\{(a_{n},b_{n})\} is convergent, it is also bounded. In particular, besides C¯bnC¯-\overline{C}\leq b_{n}\leq\overline{C}, there is D¯\overline{D} such that 0<D¯an120<\overline{D}\leq a_{n}\leq\frac{1}{2}. Then (nr(an,bn))(Γ(f)Γ(g))\left(\bigcup_{n}r_{(a_{n},b_{n})}\right)\cap\left(\Gamma(f)\cup\Gamma(g)\right) is bounded below by the line r(12,C¯)r_{(\frac{1}{2},\overline{C})} and above by the line r(D¯,C¯)r_{(\overline{D},-\overline{C})}. Thus, {Pn}\{P_{n}\}, {Qn}\{Q_{n}\}, {Sn}\{S_{n}\} and {Tn}\{T_{n}\} converge, respectively, to P¯\overline{P}, Q¯\overline{Q}, S¯\overline{S} and T¯\overline{T}, up to restriction to subsequences. Since c1|xPnxQn|=c2|xSnxTn|c_{1}\lvert x_{P_{n}}-x_{Q_{n}}\rvert=c_{2}\lvert x_{S_{n}}-x_{T_{n}}\rvert, their limits are also equal, so we have c1|xP¯xQ¯|=c2|xS¯xT¯|c_{1}\lvert x_{\overline{P}}-x_{\overline{Q}}\rvert=c_{2}\lvert x_{\overline{S}}-x_{\overline{T}}\rvert. Since P¯\overline{P}, Q¯\overline{Q}, S¯\overline{S} and T¯\overline{T} all lie in r(a¯,b¯)r_{(\overline{a},\overline{b})}, (a¯,b¯)(\overline{a},\overline{b}) is also a special value of (f,g)(f,g), concluding that Sp(f,g)(]0,12]×[C¯,C¯])\text{Sp}(f,g)\cap\left(]0,\frac{1}{2}]\times[-\overline{C},\overline{C}]\right) is closed.

Analogously, one can see that Sp(f,g)([12,1[×[C¯,C¯])\text{Sp}(f,g)\cap\left([\frac{1}{2},1[\times[-\overline{C},\overline{C}]\right) is closed. The set Sp(f,g)\text{Sp}(f,g) is then a union of two closed sets, hence it is closed itself. ∎

Let 𝒮\mathcal{S} be the set of all pairs (a¯,b¯)(\bar{a},\bar{b}) in [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}] realising the matching distance between ff and gg, i.e., such that

Dmatch(f,g)=dB(Dgm(f(a¯,b¯)),Dgm(g(a¯,b¯))).D_{\mathrm{match}}(f,g)=d_{B}\left(\mathrm{Dgm}\left(f_{(\bar{a},\bar{b})}^{*}\right),\mathrm{Dgm}\left(g_{(\bar{a},\bar{b})}^{*}\right)\right).

As observed about (4), dB(Dgm(f(a,b)),Dgm(g(a,b)))d_{B}\left(\mathrm{Dgm}\left(f_{(a,b)}^{*}\right),\mathrm{Dgm}\left(g_{(a,b)}^{*}\right)\right) is a continuous function on [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}], thus it admits a maximum in its domain and 𝒮\mathcal{S} is not empty. Moreover, 𝒮\mathcal{S} is compact because it is the preimage of a point in \mathbb{R} via a continuous function defined on a compact set.

Note that for any (a,b)(a,b) in [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}], we have

dB(Dgm(f(a,b)),Dgm(g(a,b)))=cost(σ(a,b)),d_{B}\left(\mathrm{Dgm}\left(f_{(a,b)}^{*}\right),\mathrm{Dgm}\left(g_{(a,b)}^{*}\right)\right)=\text{cost}(\sigma_{(a,b)}),

where σ(a,b)\sigma_{(a,b)} is an optimal matching. By applying a straightforward generalisation of Theorem 28 in [opt-matching] for arbitrary nthn^{th} persistence diagrams, one can see that such a matching always exists. Theorem 4.3 and the stability of the bottleneck distance with respect to the uniform norm imply that cost(σ(a,b))\text{cost}(\sigma_{(a,b)}) can be seen as a continuous function in the variable (a,b)(a,b) in [0,1]×[C¯,C¯][0,1]\times[-\overline{C},\overline{C}].

Definition 5.3.

Let σ:Dgm1Dgm2\sigma:\mathrm{Dgm}_{1}\to\mathrm{Dgm}_{2} be a matching between two persistence diagrams and let XX in Dgm1\mathrm{Dgm}_{1} be such that cost(σ)=d(X,σ(X))\mathrm{cost}(\sigma)=d(X,\sigma(X)). The matching σ\sigma is of type (1)(1) if Δ{X,σ(X)}\Delta\notin\{X,\sigma(X)\}, and of type (2)(2) if Δ{X,σ(X)}\Delta\in\{X,\sigma(X)\}.

Observe that a matching can be both of type (1)(1) and type (2)(2). We use this terminology in the proof of the following theorem.

Theorem 5.4.
𝒮(Sp(f,g)({0,12,1}×[C¯,C¯])).\mathcal{S}\cap\left(\mathrm{Sp}(f,g)\cup\left(\left\{0,\frac{1}{2},1\right\}\times[-\overline{C},\overline{C}]\right)\right)\neq\emptyset.
Proof.

Assume by contradiction that every (a,b)(a,b) in 𝒮\mathcal{S} is not in Sp(f,g)\text{Sp}(f,g) and that a0,12,1a\neq 0,\frac{1}{2},1. Since 𝒮\mathcal{S} is compact, it is possible to take (a,b)(a,b) in 𝒮\mathcal{S} minimising the distance from the line a=12a=\frac{1}{2}. Among these, consider (a^,b^)(\hat{a},\hat{b}) and a corresponding matching σ^\widehat{\sigma} of minimum cost between Dgm(f(a^,b^))\mathrm{Dgm}\left(f^{*}_{(\hat{a},\hat{b})}\right) and Dgm(g(a^,b^))\mathrm{Dgm}\left(g^{*}_{(\hat{a},\hat{b})}\right). If 0<a^<120<\hat{a}<\frac{1}{2}, the Position Theorem 3.1 implies that there exist α^\widehat{\alpha} and β^\widehat{\beta} in Ctr(f,g)\text{Ctr}(f,g) intersecting r(a^,b^)r_{(\hat{a},\hat{b})}, such that P^=r(a^,b^)α^\widehat{P}=r_{(\hat{a},\hat{b})}\cap\widehat{\alpha} and Q^=r(a^,b^)β^\widehat{Q}=r_{(\hat{a},\hat{b})}\cap\widehat{\beta} realise at least one of these properties:

  1. (1)

    P^Γ(f)\widehat{P}\in\Gamma(f), Q^Γ(g)\widehat{Q}\in\Gamma(g), and Dmatch(f,g)=cost(σ^)=|xP^xQ^|D_{\mathrm{match}}(f,g)=\mathrm{cost}(\widehat{\sigma})=\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert;

  2. (2)

    P^,Q^Γ(f)\widehat{P},\widehat{Q}\in\Gamma(f) or P^,Q^Γ(g)\widehat{P},\widehat{Q}\in\Gamma(g), and Dmatch(f,g)=cost(σ^)=12|xP^xQ^|D_{\mathrm{match}}(f,g)=\mathrm{cost}(\widehat{\sigma})=\frac{1}{2}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert.

Observe that the former matching is of type (1)(1) and the latter of type (2)(2). Note also that xP^xQ^x_{\widehat{P}}\neq x_{\widehat{Q}}, and hence P^Q^\widehat{P}\neq\widehat{Q}. If not, then Dmatch(f,g)=0D_{\text{match}}(f,g)=0, implying that any (a,b)(a,b) belongs to 𝒮\mathcal{S}, including (12,b)\left(\frac{1}{2},b\right), which is a contradiction.

Consider a sequence {(an,bn)}\{(a_{n},b_{n})\} in ]0,1[×[C¯,C¯]]0,1[\times[-\overline{C},\overline{C}] such that these (an,bn)(a_{n},b_{n}) are chosen to identify lines obtained by rotating r(a,b)r_{(a,b)} around P^\widehat{P} clockwise in such a way that (an,bn)(a^,b^)(a_{n},b_{n})\to(\hat{a},\hat{b}), where {an}\{a_{n}\} is a decreasing sequence. Furthermore, given a sequence {σn}\{\sigma_{n}\} of optimal matchings between Dgm(f(an,bn))\mathrm{Dgm}\left(f^{*}_{(a_{n},b_{n})}\right) and Dgm(g(an,bn))\mathrm{Dgm}\left(g^{*}_{(a_{n},b_{n})}\right) we have that cost(σn)cost(σ^)\text{cost}(\sigma_{n})\to\text{cost}(\widehat{\sigma}) (see (5)). Since Sp(f,g)({0,12,1}×[C¯,C¯])\text{Sp}(f,g)\cup\left(\left\{0,\frac{1}{2},1\right\}\times[-\overline{C},\overline{C}]\right) is closed, by Proposition 5.2, and (a^,b^)(\hat{a},\hat{b}) does not belong to this set, we can assume that the sequence {(an,bn)}\{(a_{n},b_{n})\} also has no points in this set. Hence, for any nn in \mathbb{N} there exists a pair {Pn,Qn}\{P_{n},Q_{n}\} in r(an,bn)(Γ(f)Γ(g))r_{(a_{n},b_{n})}\cap(\Gamma(f)\cup\Gamma(g)) for which at least one of the following properties holds:

  1. (A)

    PnΓ(f)P_{n}\in\Gamma(f), QnΓ(g)Q_{n}\in\Gamma(g) and cost(σn)=|xPnxQn|\mathrm{cost}(\sigma_{n})=\lvert x_{P_{n}}-x_{Q_{n}}\rvert;

  2. (B)

    Pn,QnΓ(f)P_{n},Q_{n}\in\Gamma(f) or Pn,QnΓ(g)P_{n},Q_{n}\in\Gamma(g), and cost(σn)=12|xPnxQn|\mathrm{cost}(\sigma_{n})=\frac{1}{2}\lvert x_{P_{n}}-x_{Q_{n}}\rvert.

Up to subsequences, we can assume that the matchings σn\sigma_{n} are either all of type (1)(1) or all of type (2)(2). We now show that {Pn}\{P_{n}\} and P^\widehat{P} belong to the same contour in Ctr(f,g)\text{Ctr}(f,g), and {Qn}\{Q_{n}\} and Q^\widehat{Q} also belong to the same contour in Ctr(f,g)\text{Ctr}(f,g). Analogously to the proof of Proposition 5.2 we may observe that the set (nr(an,bn))(Γ(f)Γ(g))\left(\bigcup_{n}r_{(a_{n},b_{n})}\right)\cap(\Gamma(f)\cup\Gamma(g)) is a bounded subset of Γ(f)Γ(g)\Gamma(f)\cup\Gamma(g). Thus, {Pn}\{P_{n}\} and {Qn}\{Q_{n}\} are convergent up to subsequences in the closed set Γ(f)Γ(g)\Gamma(f)\cup\Gamma(g), respectively, to P¯\overline{P} and Q¯\overline{Q}. By assumption, there are only a finite number of contours, thus there exists at least a contour in Ctr(f,g)\text{Ctr}(f,g) for each sequence, {Pn}\{P_{n}\} and {Qn}\{Q_{n}\}, containing infinitely many points of the sequence. Hence, we can assume that each sequence, up to subsequences, lies entirely on a single contour in Ctr(f,g)\text{Ctr}(f,g), i.e., we can suppose that for every nn in \mathbb{N}, PnP_{n} is in α¯\overline{\alpha} and QnQ_{n} is in β¯\overline{\beta}, with α¯\overline{\alpha} and β¯\overline{\beta} in Ctr(f,g)\text{Ctr}(f,g). Since contours are closed, P¯\overline{P} belongs to α¯\overline{\alpha} and Q¯\overline{Q} belongs to β¯\overline{\beta}. We observe that {P¯,Q¯}r(a^,b^)\{\overline{P},\overline{Q}\}\subseteq r_{(\hat{a},\hat{b})}. Furthermore, we have that

c|xP^xQ^|=cost(σ^)=limncost(σn)=limnc′′|xPnxQn|=c′′|xP¯xQ¯|c^{\prime}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert=\text{cost}(\widehat{\sigma})=\lim_{n\to\infty}\text{cost}(\sigma_{n})=\lim_{n\to\infty}c^{\prime\prime}\lvert x_{P_{n}}-x_{Q_{n}}\rvert=c^{\prime\prime}\lvert x_{\overline{P}}-x_{\overline{Q}}\rvert

where c,c′′c^{\prime},c^{\prime\prime} in {12,1}\{\frac{1}{2},1\}. If {α^,β^}{α¯,β¯}\{\widehat{\alpha},\widehat{\beta}\}\neq\{\overline{\alpha},\overline{\beta}\}, then (a^,b^)(\hat{a},\hat{b}) is a special value, contradicting the initial assumption. Thus, {α^,β^}={α¯,β¯}\{\widehat{\alpha},\widehat{\beta}\}=\{\overline{\alpha},\overline{\beta}\}. Without loss of generality, by possibly exchanging the roles of the contours α^\widehat{\alpha} and β^\widehat{\beta}, and of the points P^\widehat{P} and Q^\widehat{Q}, we can assume that α^=α¯\widehat{\alpha}=\overline{\alpha}, β^=β¯\widehat{\beta}=\overline{\beta}, P^=P¯\widehat{P}=\overline{P} and Q^=Q¯\widehat{Q}=\overline{Q}. Consequently, by the fact that {Pn}\{P_{n}\} and P^\widehat{P} are contained in the same line r(an,bn)r_{(a_{n},b_{n})} and the same contour α^\widehat{\alpha}, Pn=P^P_{n}=\widehat{P} for every nn, since a contour and a positive slope line can meet in at most one point.

Case 1. Assume that σ^\widehat{\sigma} and σn\sigma_{n} are both of the same type for every nn. Since QnQ_{n} belongs to β¯\overline{\beta} in Ctr(f,g)\text{Ctr}(f,g) for any nn, one can easily check that |xP^xQn||xP^xQ^|\lvert x_{\widehat{P}}-x_{Q_{n}}\rvert\geq\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert (see Figure 5), and hence cost(σn)cost(σ^)\text{cost}(\sigma_{n})\geq\text{cost}(\widehat{\sigma}). If the equality holds there is a contradiction with the assumption of (a^,b^)(\hat{a},\hat{b}) minimising the distance from the line a=12a=\frac{1}{2}, since |an12|<|a^12|\left\lvert a_{n}-\frac{1}{2}\right\rvert<\left\lvert\hat{a}-\frac{1}{2}\right\rvert. If the strict inequality holds, there is a contradiction with the assumption of σ^\widehat{\sigma} being in 𝒮\mathcal{S}.

Case 2. Assume that all σn\sigma_{n} and σ^\widehat{\sigma} are of different types. This means that cost(σn)=c|xP^xQn|\text{cost}(\sigma_{n})=c^{\prime}\lvert x_{\widehat{P}}-x_{Q_{n}}\rvert, cost(σ^)=c′′|xP^xQ^|\text{cost}(\widehat{\sigma})=c^{\prime\prime}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert, with cc′′c^{\prime}\neq c^{\prime\prime} and c,c′′c^{\prime},c^{\prime\prime} in {12,1}\{\frac{1}{2},1\}, and c|xP^xQn|c′′|xP^xQ^|c^{\prime}\lvert x_{\widehat{P}}-x_{Q_{n}}\rvert\to c^{\prime\prime}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert. However, since QnQ^Q_{n}\to\widehat{Q}, c|xP^xQn|c|xP^xQ^|c^{\prime}\lvert x_{\widehat{P}}-x_{Q_{n}}\rvert\to c^{\prime}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert. Thus, c′′|xP^xQ^|=c|xP^xQ^|c^{\prime\prime}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert=c^{\prime}\lvert x_{\widehat{P}}-x_{\widehat{Q}}\rvert, which is a contradiction since Dmatch(f,g)0D_{\text{match}}(f,g)\neq 0 and, hence, xP^xQ^x_{\widehat{P}}\neq x_{\widehat{Q}}.

Inverting the role of abscissas and ordinates as described by the Position Theorem 3.1 and rotating the lines counterclockwise, one can see that an analogous procedure holds for 12<a^<1\frac{1}{2}<\hat{a}<1. ∎

Refer to caption
Figure 5. The clockwise rotation around P^\widehat{P} increases the distance between the abscissas of the intersection points. This fact is used in the proof of Theorem 5.4 (case 1).

6. Conclusions

In this article we took advantage of the differential structure associated with smooth functions from a Riemannian manifold MM to 2\mathbb{R}^{2} to characterise some geometric properties of the matching distance. We proved that the filtering lines that actually contribute to the computation of the matching distance are horizontal, vertical, of slope 1, or they are associated with parameter values in the special set. This new approach to the computation of the matching distance could lead to new effective algorithms. In this direction, we would like to highlight an open question that arose during our work. We have not yet provided a characterisation of the special set. However, we conjecture that the special set consists of a collection of curves, up to a small perturbation of the filtering functions.

Figure 6 shows a selection of points in the special set for the functions f,g:S22f,g\colon S^{2}\to\mathbb{R}^{2}, where S2={(x,y,z)x2+y2+z2=1}S^{2}=\{(x,y,z)\mid x^{2}+y^{2}+z^{2}=1\}, f(x,y,z)=(x+1,z1)f(x,y,z)=(x+1,z-1) and g(x,y,z)=(0.75x2,0.75z+2)g(x,y,z)=(0.75x-2,0.75z+2). One may notice clear segments, two of which, on the left, correspond to values identifying lines through intersections of contours. Such lines are in fact always associated with special values.

Refer to caption
Figure 6. Approximation of a special set.

References

  • [1] Asilata Bapat, Robyn Brooks, Celia Hacker, Claudia Landi, Barbara I. Mahler, and Elizabeth R. Stephenson. Computing the matching distance of 2-parameter persistence. arXiv:2210.12868.
  • [2] Silvia Biasotti, Andrea Cerri, Patrizio Frosini, and Daniela Giorgi. A new algorithm for computing the 2-dimensional matching distance between size functions. Pattern Recognition Letters, 32(14):1735–1746, 2011.
  • [3] Havard Bjerkevik and Michael Kerber. Asymptotic improvements on the exact matching distance for 2-parameter persistence. arXiv: 2111.10303.
  • [4] Andrea Cerri, Barbara Di Fabio, Massimo Ferri, Patrizio Frosini, and Claudia Landi. Betti numbers in multidimensional persistent homology are stable functions. Math. Methods Appl. Sci., 36(12):1543–1557, 2013.
  • [5] Andrea Cerri, Marc Ethier, and Patrizio Frosini. On the geometrical properties of the coherent matching distance in 2D persistent homology. J. Appl. Comput. Topol., 3(4):381–422, 2019.
  • [6] Andrea Cerri and Patrizio Frosini. A new approximation algorithm for the matching distance in multidimensional persistence. Journal of Computational Mathematics, 38(2):291–309, 2020.
  • [7] Herbert Edelsbrunner and Dmitriy Morozov. Persistent homology: theory and practice. In European Congress of Mathematics, pages 31–50. Eur. Math. Soc., Zürich, 2013.
  • [8] Michael Kerber, Michael Lesnick, and Steve Oudot. Exact computation of the matching distance on 2-parameter persistence modules. In 35th International Symposium on Computational Geometry, volume 129 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 46, 15. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2019.
  • [9] Michael Kerber and Arnur Nigmetov. Efficient approximation of the matching distance for 2-parameter persistence. In 36th International Symposium on Computational Geometry, volume 164 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 53, 16. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2020.
  • [10] Michael Lesnick. The theory of the interleaving distance on multidimensional persistence modules. Found. Comput. Math., 15(3):613–650, 2015.
  • [11] Y. H. Wan. Morse theory for two functions. Topology, 14(3):217–228, 1975.