This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Structural Validation Of Synthetic Power Distribution Networks Using The Multiscale Flat Norm

Kostiantyn Lyman Washington State University Rounak Meyur Pacific Northwest National Laboratory Bala Krishnamoorthy Washington State University Mahantesh Halappanavar Pacific Northwest National Laboratory
Abstract

We study the problem of comparing a pair of geometric networks that may not be similarly defined, i.e., when they do not have one-to-one correspondences between their nodes and edges. Our motivating application is to compare power distribution networks of a region. Due to the lack of openly available power network datasets, researchers synthesize realistic networks resembling their actual counterparts. But the synthetic digital twins may vary significantly from one another and from actual networks due to varying underlying assumptions and approaches. Hence the user wants to evaluate the quality of networks in terms of their structural similarity to actual power networks. But the lack of correspondence between the networks renders most standard approaches, e.g., subgraph isomorphism and edit distance, unsuitable.

We propose an approach based on the multiscale flat norm, a notion of distance between objects defined in the field of geometric measure theory, to compute the distance between a pair of planar geometric networks. Using a triangulation of the domain containing the input networks, the flat norm distance between two networks at a given scale can be computed by solving a linear program. In addition, this computation automatically identifies the 2D regions (patches) that capture where the two networks are different. We demonstrate through 2D examples that the flat norm distance can capture the variations of inputs more accurately than the commonly used Hausdorff distance. As a notion of stability, we also derive upper bounds on the flat norm distance between a simple 1D curve and its perturbed version as a function of the radius of perturbation for a restricted class of perturbations. We demonstrate our approach on a set of actual power networks from a county in the USA. Our approach can be extended to validate synthetic networks created for multiple infrastructures such as transportation, communication, water, and gas networks.

1 Introduction

The power grid is the most vital infrastructure that provides crucial support for the delivery of basic services to most segments of society. Once considered a passive entity in power grid planning and operation, the power distribution system poses significant challenges in the present day. The increased adoption of rooftop solar photovoltaics (PVs) and electric vehicles (EVs) augmented with residential charging units has altered the energy consumption profile of an average consumer. Access to extensive datasets pertaining to power distribution networks and residential consumer demand is vital for public policy researchers and power system engineers alike. However, the proprietary nature of power distribution system data hinders their public availability. This has led researchers to develop frameworks that synthesize realistic datasets pertaining to the power distribution system [4, 13, 16, 17, 28, 29]. These frameworks create digital replicates similar to the actual power distribution networks in terms of their structure and function. Hence the created networks can be used as digital duplicates in simulation studies of policies and methods before implementation in real systems.

The algorithms associated with these frameworks vary widely—ranging from first principles based approaches [17, 28] to learning statistical distributions of network attributes [29] to using deep learning models such as generative adversarial neural networks [14]. Validating the synthetic power distribution networks with respect to their physical counterpart is vital for assessing the suitability of their use as effective digital duplicates. Since the underlying assumptions and algorithms of each framework are distinct from each other, some of them may excel compared to others in reproducing digital replicates with better precision for selective regions. To this end, we require well-defined metrics to rank the frameworks and judge their strengths and weaknesses in generating digital duplicates of power distribution networks for a particular geographic region.

The literature pertaining to frameworks for synthetic distribution network creation include certain validation results that compare the generated networks to the actual counterpart [4, 12, 29]. But the validation results are mostly limited to comparing the statistical network attributes such as degree and hop distributions and power engineering operational attributes such as node voltages and edge power flows. Since power distribution networks represent real physical systems, the created digital replicates have associated geographic embedding. Therefore, a structural comparison of synthetic network graphs to their actual counterpart becomes pertinent for power distribution networks with geographic embedding. Consider an example where a digital twin is used to analyze impact of a weather event [27]. Severe weather events such as hurricanes, earthquakes and wild fires occur in specific geographic trajectories, affecting only portions of societal infrastructures. In order to correctly identify them during simulations, the digital twin should structurally resemble the actual infrastructure.

Problem Statement: In recent years, the problem of evaluating the quality of reconstructed networks has been studied for street maps. Certain metrics were defined to compare outputs of frameworks that use GPS trajectory data to reconstruct street map graphs [1, 2]. The abstract problem can be stated as follows: compute the similarity between a given pair of embedded planar graphs. This is similar to the well known subgraph isomorphism problem [7] wherein we look for isomorphic subgraphs in a pair of given graphs. A major precursor to this problem is that we require a one-to-one mapping between nodes and edges of the two graphs. While such mappings are well-defined for street networks, the same cannot be inferred for power distribution networks. Since power network datasets are proprietary, the node and edge labels are redacted from the network before it is shared. The actual network is obtained as a set of “drawings” with associated geographic embeddings. Each drawing can be considered as a collection of line segments termed a geometry. Hence the problem of comparing a set of power distribution networks with geographic embedding can be stated as the following: compute the similarity between a given pair of geometries lying on a geographic plane.

Our Contributions: We propose a new distance measure to compare a pair of geometries using the flat norm, a notion of distance between generalized objects studied in geometric measure theory [9, 21]. This distance combines the difference in length of the geometries with the area of the patches contained between them. The area of patches in between the pair of geometries accounts for the lateral displacement between them. We employ a multiscale version of the flat norm [22] that uses a scale parameter λ0\lambda\geq 0 to combine the length and area components (for the sake of brevity, we refer to the multiscale flat norm simply as the flat norm). Intuitively, a smaller value of λ\lambda captures larger patches of area between the geometries while a large value of λ\lambda captures more of the (differences in) lengths of the geometries. Computing the flat norm over a range of values of λ\lambda allows us to compare the geometries at multiple scales. For computation, we use a discretized version of the flat norm defined on simplicial complexes [10], which are triangulations in our case. It may not be possible to derive standard stability results for this distance measure that imply only small changes in the flat norm metric when the inputs change by a small amount—there is no alternative metric to measure the small change in the input. We demonstrate through 2D examples (in Fig. 8), for instance, that the commonly used Hausdorff metric cannot be used for this purpose. Instead, we have derived upper bounds on the flat norm distance between a piecewise linear 1-current and its perturbed version as a function of the radius of perturbation under certain assumptions provided the perturbations are performed carefully (see Section 4.3).

A lack of one-to-one correspondence between edges and nodes in the pair of networks prevents us from performing one-to-one comparison of edges. Instead we can sample random regions in the area of interest and compare the pair of geometries within each region. For performing such local comparisons, we define a normalized flat norm where we normalize the flat norm distance between the parts of the two geometries by the sum of the lengths of the two parts in the region. Such comparison enables us to characterize the quality of the digital duplicate for the sampled region. Further, such comparisons over a sequence of sampled regions allows us to characterize the suitability of using the entire synthetic network as a duplicate of the actual network.

Our main contributions are the following: (i) we propose a distance measure for comparing a pair of geometries embedded in the same plane using the flat norm that accounts for deviation in length and lateral displacement between the geometries; and (ii) we perform a region-based characterization of synthetic networks by sampling random regions and comparing the pair of geometries contained within the sampled region. The proposed distance allows us to perform global as well as local comparisons between a pair of network geometries.

1.1 Related Work

Several well defined graph structure comparison metrics such as subgraph isomorphism and edit distance have been proposed in the literature along with algorithms to compute them efficiently. Tantardini et al. [31] compare graph network structures for the entire graph (global comparison) as well as for small portions of the graph known as motifs (local comparison). Other researchers have proposed methodologies to identify structural similarities in embedded graphs [3, 23]. However, all these methods depend on one-to-one correspondence of graph nodes and edges rather than considering the node and edge geometries of the graphs. The edit distance, i.e., the minimum number of edit operations to transform one network to the other, has been widely used to compare networks having structural properties [25, 26, 32]. Riba et al. [26] used the Hausdorff distance between nodes in the network to compare network geometries. Majhi et al. [15] modified the traditional definition of graph edit distance to be applicable in the context of “geometric graphs” embedded in a Euclidean space. Along with the usual insertion and deletion operations, the authors have proposed a cost for translation in computing the geometric edit distance between the graphs. However, the authors also show that the problem of computing this metric is NP-hard.

Meyur et al. [18] compared network geometries using the Hausdorff distance after partitioning the geographic region into small rectangular grids and comparing the geometries for each grid. However, the Hausdorff metric is sensitive to outliers as it focuses only on the maximum possible distance between the pair of geometries. When the geometries coincide almost entirely except in a few small portions, the Hausdorff metric still records the discrepancy in those small portions without accounting for the similarity over the majority of portions. The similar approach used by Brovelli et al. [5] to compare a pair of road networks in a geographic region suffers from the same drawback. This necessitates a well-defined distance metric between networks with geographic embedding [2].

Several comparison methods have been proposed in the context of planar graphs embedded in a Euclidean space [6, 19]. They include local and global metrics to compare road networks. The local metrics characterize the networks based on cliques and motifs, while the global metrics involve computing the efficiency of constructing the infrastructure network. The most efficient network is assumed to be the one with only straight line geometries connecting node pairs. Albeit useful to characterize network structures, these methods are not suitable for a numeric comparison of network geometries.

2 Preliminaries

Definition 2.1 (Geometric graph).

A graph 𝒢(𝒱,)\mathscr{G}\left(\mathscr{V},\mathscr{E}\right) with node set 𝒱\mathscr{V} and edge set \mathscr{E} is said to be a geometric graph of d\,\mathbb{R}^{d} if the set of nodes 𝒱d\mathscr{V}\subset\mathbb{R}^{d} and the edges are Euclidean straight line segments {uv¯|e:=(u,v)}\left\{\overline{uv}~{}|~{}e:=\left(u,v\right)\in\mathscr{E}\right\} which intersect (possibly) at their endpoints.

Definition 2.2 (Structurally similar geometric graphs).

Two geometric graphs 𝒢0(𝒱0,0)\mathscr{G}_{0}\left(\mathscr{V}_{0},\mathscr{E}_{0}\right) and 𝒢1(𝒱1,1)\mathscr{G}_{1}\left(\mathscr{V}_{1},\mathscr{E}_{1}\right) are said to be structurally similar at the level of γ0\gamma\geq 0, termed γ\gamma-similar, if dist(𝒢0,𝒢1)γ\,\operatorname{dist}\left(\mathscr{G}_{0},\mathscr{G}_{1}\right)\leq\gamma for the distance function dist\operatorname{dist} between the two graphs.

We could consider a given network as a set of edge geometries. Hence we could consider the problem of comparing geometric graphs 𝒢0\mathscr{G}_{0} and 𝒢1\mathscr{G}_{1} as that of comparing the set of edge geometries 0\mathscr{E}_{0} and 1\mathscr{E}_{1}. In this paper, we propose a suitable distance that allows us to compare between a pair of geometric graphs or a pair of geometries. We use the multiscale flat norm, which has been well explored in the field of geometric measure theory, to define such a distance between the geometries.

2.1 Multiscale Flat Norm

We use the multiscale simplicial flat norm proposed by Ibrahim et al. [10] to compute the distance between two networks. We now introduce some background for this computation. A dd-dimensional current TT (referred to as a dd-current) is a generalized dd-dimensional geometric object with orientations (or direction) and multiplicities (or magnitude). An example of a 22-current is a surface with finite area (multiplicity) and a specific orientation (clockwise or counterclockwise). We use 𝒞d\mathcal{C}_{d} to denote the set of all dd-currents, and 𝒞d(p)\mathcal{C}_{d}(^{p}) to denote the set of dd-currents embedded in p. 𝐕d(T)\mathbf{V}_{d}(T) or |T|\left|{T}\right| denotes the dd-dimensional volume of TT, e.g., length in 1D or area in 2D. The boundary of TT, denoted by T\partial T, is a (d1)(d-1)-current. The multiscale flat norm of a dd-current T𝒞dT\in\mathcal{C}_{d}, at scale λ0\lambda\geq 0 is defined as

𝔽λ(T)=minS𝒞d+1{𝐕d(TS)+λ𝐕d+1(S)},\mathbb{F}_{\lambda}\left(T\right)=\min_{S\in\mathcal{C}_{d+1}}\left\{\mathbf{V}_{d}\left(T-\partial S\right)+\lambda\mathbf{V}_{d+1}\left(S\right)\right\}, (1)

where the minimum is taken over all (d+1)(d+1)-currents SS.Computing the flat norm of a 1-current (curve) TT identifies the optimal 2-current (area patches) SS that minimizes the sum of the length of current TST-\partial S and the area of patch(es) SS. Fig. 1 shows the flat norm computation for a generic 1D current TT (blue). The 2D area patches SS (magenta) are computed such that the expression in Eq. (1) is minimized for the chosen value of λ\lambda that ends up using most of the patch under the sharper spike on the left but only a small portion of the patch under the wider bump to the right.

Refer to caption
Figure 1: Multiscale flat norm of a 1D current TT (blue). The flat norm is the sum of length of the resulting 1D current TST-\partial S (green) and the area of 2D patches SS (magenta). We show TST-\partial S slightly separated for easy visualization.

The scale parameter λ\lambda can be intuitively understood as follows. Rolling a ball of radius 1/λ1/\lambda on the 11-current TT traces the output current TST-\partial S and the untraced regions constitute the patches SS. Hence we observe that for a large λ\lambda, the radius of the ball is very small and hence it traces major features while smoothing out (i.e., missing) only minor features (wiggles) of the input current. But for a small λ\lambda, the ball with a large radius smoothes out larger scale features (bumps) in the current. Note that for smaller λ\lambda, the cost of area patches is smaller in the minimization function and hence more patches are used for computing the flat norm. We can use the flat norm to define a natural distance between a pair of 1-currents T1T_{1} and T2T_{2} as follows [10].

𝔽λ(T1,T2)=𝔽λ(T1T2)\mathbb{F}_{\lambda}\left(T_{1},T_{2}\right)=\mathbb{F}_{\lambda}\left(T_{1}-T_{2}\right) (2)

We compute the flat norm distance between a pair of input geometries (synthetic and actual) as the flat norm of the current T=T1T2T=T_{1}-T_{2} where T1T_{1} and T2T_{2} are the currents corresponding to individual geometries. Let Σ\Sigma denote the set of all line segments in the input current TT. We perform a constrained triangulation of Σ\Sigma to obtain a 22-dimensional finite oriented simplicial complex KK. A constrained triangulation ensures that each line segment σiΣ\sigma_{i}\in\Sigma is an edge in KK, and that TT is an oriented 11-dimensional subcomplex of KK.

Let mm and nn denote the numbers of edges and triangles in KK. We can denote the input current TT as a 11-chain i=1mtiσi\sum_{i=1}^{m}t_{i}\sigma_{i} where σi\sigma_{i} denotes an edge in KK and tit_{i} is the corresponding multiplicity. Note that ti=1t_{i}=-1 indicates that orientation of σi\sigma_{i} and TT are opposite, ti=0t_{i}=0 denotes that σi\sigma_{i} is not contained in TT, and ti=1t_{i}=1 implies that σi\sigma_{i} is oriented the same way as TT. Similarly, we define the set SS to be the 22-chain of KK and denote it by i=1msiωi\sum_{i=1}^{m}s_{i}\omega_{i} where ωi\omega_{i} denotes a 22-simplex in KK and sis_{i} is the corresponding multiplicity.

The boundary matrix []m×n\left[\partial\right]\in\mathbb{Z}^{m\times n} captures the intersection of the 11 and 22-simplices of KK. The entries of the boundary matrix []ij{1,0,1}\left[\partial\right]_{ij}\in\{-1,0,1\}. If edge σi\sigma_{i} is a face of triangle ωj\omega_{j}, then []ij\left[\partial\right]_{ij} is nonzero and it is zero otherwise. The entry is 1-1 if the orientations of σi\sigma_{i} and ωj\omega_{j} are opposite and it is +1+1 if the orientations agree.

We can respectively stack the tit_{i}’s and sis_{i}’s in mm and nn-length vectors 𝐭m\mathbf{t}\in\mathbb{Z}^{m} and 𝐬n\mathbf{s}\in\mathbb{Z}^{n}. The 11-chain representing TST-\partial S is denoted by 𝐱m\mathbf{x}\in\mathbb{Z}^{m} and is given as 𝐱=𝐭[]𝐬\mathbf{x}=\mathbf{t}-\left[\partial\right]\mathbf{s}. The multiscale flat norm defined in Eq. (1) can be computed by solving the following optimization problem:

𝔽λ(T)=min𝐬n\displaystyle\mathbb{F}_{\lambda}\left(T\right)=\min_{\mathbf{s}\in\mathbb{Z}^{n}} i=1mwi|xi|+λ(j=1nvj|sj|)\displaystyle\sum_{i=1}^{m}w_{i}\left|x_{i}\right|+\lambda\left(\sum_{j=1}^{n}v_{j}\left|s_{j}\right|\right) (3)
s.t. 𝐱=𝐭[]𝐬,𝐱m,\displaystyle\quad\mathbf{x}=\mathbf{t}-\left[\partial\right]\mathbf{s},\quad\mathbf{x}\in\mathbb{Z}^{m},

where 𝐕d(τ)\mathbf{V}_{d}\left(\tau\right) in Eq. (1) denotes the volume of the dd-dimensional simplex τ\tau. We denote volume of the edge σi\sigma_{i} as 𝐕1(σi)=wi\mathbf{V}_{1}(\sigma_{i})=w_{i} and set it to be the Euclidean length, and volume of a triangle τj\tau_{j} as 𝐕2(τj)=vj\mathbf{V}_{2}(\tau_{j})=v_{j} and set it to be the area of the triangle.

In this work, we consider geometric graphs embedded on the geographic plane and are associated with longitude and latitude coordinates. We compute the Euclidean length of edge σi\sigma_{i} as wi=RΔϕiw_{i}=R\Delta\phi_{i} where Δϕi\Delta\phi_{i} is the Euclidean normed distance between the geographic coordinates of the terminals of σi\sigma_{i} and RR is the radius of the earth. Similarly, the area of triangle τj\tau_{j} is computed as vj=R2ΔΩjv_{j}=R^{2}\Delta\Omega_{j} where ΔΩj\Delta\Omega_{j} is the solid angle subtended by the geographic coordinates of the vertices of τj\tau_{j}.

Using the fact that the objective function is piecewise linear in 𝐱\mathbf{x} and 𝐬\mathbf{s}, the minimization problem can be reformulated as an integer linear program (ILP) as follows:

𝔽λ(T)=min\displaystyle\mathbb{F}_{\lambda}\left(T\right)=\min i=1mwi(xi++xi)+λ(j=1nvj(sj++sj))\displaystyle\sum_{i=1}^{m}w_{i}\left(x_{i}^{+}+x_{i}^{-}\right)+\lambda\left(\sum_{j=1}^{n}v_{j}\left(s_{j}^{+}+s_{j}^{-}\right)\right) (4a)
s.t. 𝐱+𝐱=𝐭[](𝐬+𝐬)\displaystyle\quad\mathbf{x}^{+}-\mathbf{x}^{-}=\mathbf{t}-\left[\partial\right]\left(\mathbf{s}^{+}-\mathbf{s}^{-}\right) (4b)
𝐱+,𝐱0,𝐬+,𝐬0\displaystyle\quad\mathbf{x}^{+},\mathbf{x}^{-}\geq 0,\quad\mathbf{s}^{+},\mathbf{s}^{-}\geq 0 (4c)
𝐱+,𝐱m,𝐬+,𝐬n\displaystyle\quad\mathbf{x}^{+},\mathbf{x}^{-}\in\mathbb{Z}^{m},\quad\mathbf{s}^{+},\mathbf{s}^{-}\in\mathbb{Z}^{n} (4d)

The linear programming relaxation of the ILP in Eq. (4) is obtained by ignoring the integer constraints Eq. (4d). We refer to this relaxed linear program (LP) as the flat norm LP. Ibrahim et al. [10] showed that the boundary matrix [][\partial] is totally unimodular for our application setting. Hence the flat norm LP will solve the ILP, and hence the flat norm can be computed in polynomial time.

Algorithm 1 describes how we compute the distance between a pair of geometries with the associated embedding on a metric space \mathcal{M}. We assume that the geometries (networks) 𝒢1(𝒱1,1)\mathscr{G}_{1}\left(\mathscr{V}_{1},\mathscr{E}_{1}\right) and 𝒢2(𝒱2,2)\mathscr{G}_{2}\left(\mathscr{V}_{2},\mathscr{E}_{2}\right) with respective node sets 𝒱1,𝒱2\mathscr{V}_{1},\mathscr{V}_{2} and edge sets 1,2\mathscr{E}_{1},\mathscr{E}_{2} have no one-to-one correspondence between the 𝒱i\mathscr{V}_{i}’s or i\mathscr{E}_{i}’s. Note that each vertex v𝒱1,𝒱2v\in\mathscr{V}_{1},\mathscr{V}_{2} is a point and each edge e1,2e\in\mathscr{E}_{1},\mathscr{E}_{2} is a straight line segment in \mathcal{M}. We consider the collection of edges 1,2\mathscr{E}_{1},\mathscr{E}_{2} as input to our algorithm. First, we orient the edge geometries in a particular direction (left to right in our case) to define the currents T1T_{1} and T2T_{2}, which have both magnitude and direction. Next, we consider the bounding rectangle bound\mathscr{E}_{\textrm{bound}} for the edge geometries and define the set Σ\Sigma to be triangulated as the set of all edges in either geometry and the bounding rectangle. We perform a constrained Delaunay triangulation [30] on the set Σ\Sigma to construct the 22-dimensional simplicial complex KK. The constrained triangulation ensures that the set of edges in Σ\Sigma is included in the simplicial complex KK. Then we define the currents T1T_{1} and T2T_{2} corresponding to the respective edge geometries 1\mathscr{E}_{1} and 2\mathscr{E}_{2} as 11-chains in KK. Finally, the flat norm LP is solved to compute the simplicial flat norm.

Input: Geometries 1,2\mathscr{E}_{1},\mathscr{E}_{2}
Parameter: Scale λ\lambda
1:  Orient each edge in the edge sets from left to right: ~1:=𝖮𝗋𝗂𝖾𝗇𝗍(1);~2:=𝖮𝗋𝗂𝖾𝗇𝗍(2)\tilde{\mathscr{E}}_{1}:=\mathsf{Orient}\left(\mathscr{E}_{1}\right);~{}~{}\tilde{\mathscr{E}}_{2}:=\mathsf{Orient}\left(\mathscr{E}_{2}\right).
2:  Find bounding rectangle for the pair of geometries:
bound=𝗋𝖾𝖼𝗍(~1,~2)\mathscr{E}_{\textrm{bound}}=\mathsf{rect}\left(\tilde{\mathscr{E}}_{1},\tilde{\mathscr{E}}_{2}\right).
3:  Define the set of line segments to be triangulated:
Σ=~1~2bound\Sigma=\tilde{\mathscr{E}}_{1}\cup\tilde{\mathscr{E}}_{2}\cup\mathscr{E}_{\textrm{bound}}.
4:  Perform constrained triangulation on set Σ\Sigma to construct 22-dimensional simplicial complex KK.
5:  Define the currents T1,T2T_{1},T_{2} as 11-chains of oriented edges
~1\tilde{\mathscr{E}}_{1} and ~2\tilde{\mathscr{E}}_{2} in KK.
6:  Solve the flat norm LP to compute flat norm 𝔽λ(T1T2)\mathbb{F}_{\lambda}\left(T_{1}-T_{2}\right).
Output: Flat norm distance 𝔽λ(T1T2)\mathbb{F}_{\lambda}\left(T_{1}-T_{2}\right).
Algorithm 1 Distance between a pair of geometries

2.2 Normalized Flat Norm

Recall that in our context of synthetic power distribution networks, the primary goal of comparing a synthetic network to its actual counterpart is to infer the quality of the replica or the digital duplicate synthesized by the framework. The proposed approach using the flat norm for structural comparison of a pair of geometries provides us a method to perform global as well as local comparison. While we can produce a global comparison by computing the flat norm distance between the two networks, it may not provide us with complete information on the quality of the synthetic replicate. On the other hand, a local comparison can provide us details about the framework generating the synthetic networks. For example, a synthetic network generation framework might produce higher quality digital replicates of actual power distribution networks for urban regions as compared to rural areas. A local comparison highlights this attribute and identifies potential use case scenarios of a given synthetic network generation framework.

Furthermore, availability of actual power distribution network data is sparse due to its proprietary nature. We may not be able to produce a global comparison between two networks due to unavailability of network data from one of the sources. Hence, we want to restrict our comparison to only the portions in the region where data from either network is available, which also necessitates a local comparison between the networks.

For a local comparison, we consider uniform sized regions and compute the flat norm distance between the pair of geometries within the region. However, the computed flat norm is dependent on the length of edges present within the region from either network. Hence we define the normalized multiscale flat norm, denoted by 𝔽~λ\widetilde{\mathbb{F}}_{\lambda}, for a given region as

𝔽~λ(T1T2)=𝔽λ(T1T2)|T1|+|T2|.\widetilde{\mathbb{F}}_{\lambda}\left(T_{1}-T_{2}\right)=\frac{\mathbb{F}_{\lambda}\left(T_{1}-T_{2}\right)}{|T_{1}|+|T_{2}|}\,. (5)

For a given parameter ϵ\epsilon, a local region is defined as a square of size 2ϵ×2ϵ2\epsilon\times 2\epsilon steradians. Let T1,ϵT_{1,\epsilon} and T2,ϵT_{2,\epsilon} denote the currents representing the input geometries inside the local region characterized by ϵ\epsilon. Note that the “amount” or the total length of network geometries within a square region varies depending on the location of the local region. In this case, the lengths of the network geometries are respectively |T1,ϵ||T_{1,\epsilon}| and |T2,ϵ||T_{2,\epsilon}|. Therefore, we use the ratio of the total length of network geometries inside a square region to the parameter ϵ\epsilon to characterize this “amount” and denote it by |T|/ϵ|T|/\epsilon where

|T|/ϵ=|T1,ϵ|+|T2,ϵ|ϵ.|T|/\epsilon=\frac{|T_{1,\epsilon}|+|T_{2,\epsilon}|}{\epsilon}\,. (6)

Note that while performing a comparison between a pair of network geometries in a local region using the multiscale flat norm, we need to ensure that comparison is performed for similar length of the networks inside similar regions. Therefore, the ratio |T|/ϵ|T|/\epsilon, which indicates the length of networks inside a region scaled to the size of the region, becomes an important aspect of characterization while performing the flat norm based comparison.

3 Implementation of flat norm.

3.1 Geometry comparison using flat norm.

We show a simple example depicting the use of flat norm to compute the distance between a pair of geometries that are two line segments of equal length meeting at their midpoints in Fig. 2. As the angle between the two line segments decreases from 9090 to 1515 degrees, the computed flat norm also decreases.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 2: Variation in flat norm for pairs of geometries as the angle between them decreases. When the geometries are perpendicular to each other, flat norm distance is the maximum and it decreases as the angle decreases.

3.2 Flat norm computation for a pair of geometries.

We demonstrate the steps involved in computing the flat norm for a pair of input geometries in Fig. 3. The input geometries are a collection of line segments shown in blue and red (top left). We construct the set Σ\Sigma by combining all the edges of either geometry along with the bounding rectangle (top right). Thereafter, we perform a constrained triangulation to construct the 22-dimensional simplicial complex KK (bottom left). Finally, we compute the multiscale simplicial flat norm with λ=1\lambda=1 (bottom right). Note that this computation captures the length deviation (shown by green edges) and the lateral displacement (shown by the magenta patches).

Refer to caption
Figure 3: Steps in computing the flat norm for a pair of input geometries.

3.3 Flat norm computation for a pair of networks.

The steps involved in computing the multiscale flat norm distance between a pair of geometries are shown in Fig. 4. They include the actual power distribution network (red) for a region in a county from USA and the synthetic network (blue) constructed for the same region [17]. First, we orient each edge in either network from left to right. Thereafter, we find the enclosing rectangular boundary around the pair of networks. We perform a constrained Delaunay triangulation which ensures that the edges in the geometries and the convex boundary are selected as edges of the triangles. Finally the flat norm LP (relaxation of the ILP in Eq. (4)) is solved to compute the flat norm distance between the networks.

Refer to caption
Figure 4: Steps showing the flat norm distance computation between two networks (shown in blue and red in the top two plots). First, the convex rectangular boundary around the pair of networks is identified. A constrained triangulation is computed such that the edges in the networks and convex boundary are edges of triangles (middle). The flat norm LP is solved to compute the simplicial flat norm, which includes the sum of areas of the magenta triangles and lengths of green edges (bottom).
Refer to caption
Refer to caption
Refer to caption
Figure 5: The flat norm computed between the pair of network geometries for three values of the scale parameter λ\lambda ranging between λ=1000\lambda=1000 to λ=10000\lambda=10000.

The multiscale flat norm produces different distance values for different values of the scale parameter λ\lambda. Fig. 5 shows the flat norm distance between the actual and synthetic power network for the same region for multiple values of the scale parameter λ\lambda. We observe that as λ\lambda becomes larger, the 2D patches used in computing the flat norm become smaller as it becomes more expensive to use the area term in the flat norm LP minimization problem.

The variation of the computed flat norm for different values of the scale parameter is summarized in Fig. 6. As the scale parameter is increased, fewer area patches are considered in the simplicial flat norm computation. This is captured by the blue decreasing curve in the plot. The computed flat norm increases for larger values of the scale parameter λ\lambda as more and more individual currents contribute their unscaled length toward the flat norm value instead of becoming a boundary of some area component, which, if there are any, now also contribute more because of the increased scale λ\lambda. We show the plot with two different vertical scales: the left scale indicates the deviation in length (measured in km) and the right scale shows the deviation expressed through the area patches (measured in sq.km).

Refer to caption
Figure 6: Effect of varying the scale parameter λ\lambda in the flat norm computation. The flat norm for a 11-dimensional current consists of two parts: a length component and a scaled surface area component. The variations in the length component and the unscaled surface area component (right vertical scale) are also shown.

3.4 Comparing Network Geometries

The primary goal of computing the flat norm is to compare the pair of input geometries. As mentioned earlier, the flat norm provides an accurate measure of difference between the geometries by considering both the length deviation and area patches in between the geometries. Further, we normalize the computed flat norm to the total length of the geometries. In this section, we show examples where we computed the normalized flat norm for the pair of network geometries (actual and synthetic) for a few regions.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: Normalized flat norm (with scale λ=1000\lambda=1000) distances for pairs of regions in the network of same size (ϵ=0.001\epsilon=0.001) with similar |T|/ϵ|T|/\epsilon ratios (two pairs each in the top and bottom rows). The pairs of geometries for the first plot (on left) are quite similar, which is reflected in the low flat norm distances between them. The network geometries on the right plots are more dissimilar and hence the flat norm distances are high.

The top two plots in Fig. 7 show two regions characterized by ϵ=0.001\epsilon=0.001 and almost similar |T|/ϵ|T|/\epsilon ratios. This indicates that the length of network scaled to the region size is almost equal for the two regions. From a mere visual perspective, we can conclude that the first pair of network geometries resemble each other where as the second pair are fairly different. This is further validated from the results of the flat norm distance between the network geometries computed with the scale λ=1000\lambda=1000, since the first case produces a smaller flat norm distance compared to the latter. The bottom two plots show another example of two regions with almost similar |T|/ϵ|T|/\epsilon ratios and enable us to infer similar conclusions. The results strengthens our case of using flat norm as an appropriate measure to perform a local comparison of network geometries. We can choose a suitable γ>0\gamma>0 which differentiates between these example cases and use the proposed flat norm distance metric to identify structurally similar network geometries. However, the choice of γ\gamma has to be made empirically. This necessitates a statistical study of randomly chosen local regions in different sections of the networks, which is performed in the Section 5.

4 Notion of stability for the flat norm distance

We now investigate two approaches to define a notion of stability for the flat norm distance. For any measure of discrepancy between objects, the notion of stability is not only desirable from a theoretical standpoint but is also necessary for practical applications. The comparison metric is said to be stable if small changes in the input geometries lead to only small changes in the measured discrepancy. But such a formulation introduces a “chicken and an egg” problem—in order to evaluate the stability of a proposed metric we need an alternative baseline metric to measure the small change in the input. Of course, the baseline metric should be stable as well. This constitutes the first approach. The alternative approach is to derive directly an upper bound on the proposed metric under well-defined controlled perturbations of input geometries.

The Hausdorff distance metric 𝔻H\mathbb{D}_{H}, which has been extensively used in the literature for comparing geometrically embedded networks, is stable in this sense, and is hence a natural choice for use as the baseline metric. At the same time, the Hausdorff distance is not sensitive enough to adequately measure small changes in the input geometries. Let us consider a δ\delta-ball around each node in the network for a chosen perturbation radius δ>0\delta>0. We then uniformly sample a point in each circular region and use them as the perturbed embeddings of the nodes. The Hausdorff distance will change if the perturbation is either significantly large to overshadow the current value by moving some node far enough, or it is very specific and affects the maximizer nodes of 𝔻H\mathbb{D}_{H}. As will be shown in the next subsection (Sec. 4.1) on a few simple counter-examples and the real-world networks introduced in the previous section (Sec. 3.3), knowing the value of the Hausdorff distance or how it changed is not enough to infer any useful information about the flat norm distance, and vice versa. Even though our examples are 1-dimensional, the behavior can be observed in any dimension.

In the subsequent subsection (Sec. 4.3), under some mild assumptions about the scale and the radius of perturbation, we derive an upper bound on the flat norm distance for the case of simple piecewise linear curves in 2. We formalize these curves as a special class of integral 1-currents, namely the piecewise linear currents, and study a class of (positive) δ\delta-perturbations of their nodes. It allows us to track the change of the components of the flat norm distance while perturbing each node one by one, which in turn allows us to construct a non-trivial upper bound on 𝔽λ\mathbb{F}_{\lambda} between the original 1-current and its final perturbed version.

4.1 Comparing the Hausdorff and the flat norm distance

We refer the reader to standard textbooks on geometric measure theory [9, 21] for the formal definition of integral currents and other related concepts. For our purposes, it is sufficient to consider an integral current as a collection of oriented manifolds with or without boundary, and with integer multiplicities as well as orientations for each submanifold. Recall from Sec. 2.1 that 𝒞d(d+1)\mathcal{C}_{d}(^{d+1}) denotes the set of all oriented dd-dimensional integral currents (dd-current) embedded in d+1, and supp(T)d+1\operatorname*{supp}(T)\subset^{d+1} is the dd-dimensional support for T𝒞d(d+1)T\in\mathcal{C}_{d}(^{d+1}). Let X𝒞d(d+1)X\in\mathcal{C}_{d}(^{d+1}) with (d1)(d-1)-boundary X𝒞d1(d+1)\partial X\in\mathcal{C}_{d-1}(^{d+1}), then the set of all dd-currents embedded in d+1 spanned by the boundary of XX is denoted as 𝒞d[X;d+1]𝒞d(d+1)\mathcal{C}_{d}[\partial X;^{d+1}]\subset\mathcal{C}_{d}(^{d+1}), or simply 𝒞d[X]\mathcal{C}_{d}[\partial X] if the embedding space is clear from the context.

Let T0,T1𝒞d(d+1)T_{0},T_{1}\in\mathcal{C}_{d}(^{d+1}) be two integral dd-currents in d+1, and vud\left\|{v-u}\right\|_{d} be the Euclidean distance between u,vdu,v\in^{d}. The Hausdorff distance between currents T1T_{1} and T0T_{0} is given by

𝔻H(T1,T0)\displaystyle\mathbb{D}_{H}(T_{1},T_{0}) =max{supvsuppT0𝔻(v,T1),supvsuppT1𝔻(v,T0),}\displaystyle=\max\left\{\sup\limits_{v\in\operatorname*{supp}{T_{0}}}\mathbb{D}(v,T_{1}),\sup\limits_{v\in\operatorname*{supp}{T_{1}}}\mathbb{D}(v,T_{0}),\right\}

where 𝔻(v,T)\mathbb{D}(v,T) is the distance from a point to a current given as

𝔻(v,T)=infusuppTvud.\displaystyle\mathbb{D}(v,T)=\inf\limits_{u\in\operatorname*{supp}{T}}\left\|{v-u}\right\|_{d}\,.

Let X=T1T0SX=T_{1}-T_{0}-\partial S be the dd-component of the flat norm decomposition in Eq. (1). Note that X=T1T0S=T1T0\partial X=\partial T_{1}-\partial T_{0}-\partial\partial S=\partial T_{1}-\partial T_{0}, i.e., XCd[T1T0]X\in C_{d}[\partial T_{1}-\partial T_{0}], and it can be rendered to zero only if T1T00\partial T_{1}-\partial T_{0}\equiv 0. Hence, the volume of the minimal dd-current spanned by T1T0\partial T_{1}-\partial T_{0} provides a lower bound on 𝔽λ(T1T0)\mathbb{F}_{\lambda}(T_{1}-T_{0}). On the other hand the Hausdorff distance between boundaries 𝔻H(T1,T0)\mathbb{D}_{H}(\partial T_{1},\partial T_{0}) doesn’t provide any meaningful insights about the actual value of 𝔻H(T1,T0)\mathbb{D}_{H}(T_{1},T_{0}). This prompts us to suspect that the Hausdorff distance between two dd-currents does not have any meaningful relations with the value of the flat norm distance. In fact, we expect 𝔽λ(T1T0)\mathbb{F}_{\lambda}(T_{1}-T_{0}) to be more sensitive to perturbations of input geometries. Moreover, as can be seen from the examples below and the discussion in the following Section 4.3, given that the scale λ>0\lambda>0 is small enough, when T1T_{1} is perturbed within a δ\delta-tube the range of incurred changes of the flat norm distance depends, mainly, on the size of perturbation δ>0\delta>0 and the input volume 𝐕d(T1)\mathbf{V}_{d}(T_{1}), and not on 𝔻H(T1,T0)\mathbb{D}_{H}(T_{1},T_{0}). Although the examples in this Section are given for 1-currents in 2 for illustrative purposes, this conclusion holds in the general case of dd-currents as well.

Example 4.1 (Fixing 𝔻H(T~1,T0)\mathbb{D}_{H}(\widetilde{T}_{1},T_{0})).

Let T1T_{1} and T0T_{0} be two 1-currents in 2 with common boundaries, T1=T0\partial T_{1}=-\partial T_{0}, and let H=𝔻H(T1,T0)H=\mathbb{D}_{H}(T_{1},T_{0}) be the Hausdorff distance between them. For some small δ>0\delta>0, consider T~1\widetilde{T}_{1}—a perturbation of T1T_{1} within an δ\delta-tube—such that the boundaries and the Hausdorff distance do not change:

T~1=T1 and 𝔻H(T~1,T0)=H.\displaystyle\begin{array}[]{ccccc}\displaystyle\partial\widetilde{T}_{1}=\partial T_{1}&\text{ and }&\displaystyle\mathbb{D}_{H}(\widetilde{T}_{1},T_{0})=H.\end{array}

Note that 𝔻H(T1,T0)=𝔻H(T1,T~1)=0\mathbb{D}_{H}(\partial T_{1},\partial T_{0})=\mathbb{D}_{H}(\partial T_{1},\partial\widetilde{T}_{1})=0. See Fig. 8 for examples of perturbations T~1\widetilde{T}_{1}.

Refer to captionRefer to caption
Refer to captionRefer to caption
Figure 8: Top Left: The input curves T1T_{1} and T0T_{0} with shared endpoints and Hausdorff distance 𝔻H(T1,T0)=H\mathbb{D}_{H}(T_{1},T_{0})=H. At a small enough scale λ>0\lambda>0, the flat norm distance 𝔽λ(T1T0)\mathbb{F}_{\lambda}(T_{1}-T_{0}) corresponds to the orange patch in between them. Right: Example 1. The example perturbations T~1\widetilde{T}_{1} (solid blue) that lie within a δ\delta-neighborhood (gray) of T1T_{1}(dashed blue). The Hausdorff distance between T~1\widetilde{T}_{1} and T0T_{0} remains same, i.e., HH. The green patch (Right, Top) captures the increment Δ𝔽\Delta\mathbb{F}, and beneath it (Right, Bottom) the pink area corresponds to the decrement Δ𝔽\Delta\mathbb{F}. Bottom Left: Example 2. The example perturbation of T1T_{1} that moves only the maximizer of 𝔻H(T1,T0)\mathbb{D}_{H}(T_{1},T_{0}) further away from T0T_{0} so that 𝔻H(T~1,T0)=H>>H\mathbb{D}_{H}(\widetilde{T}_{1},T_{0})=H^{\prime}>>H. The flat norm distance increases by Δ𝔽\Delta\mathbb{F} that corresponds to the area of the created spike, which can be arbitrary small.

We could have cases where T~1\widetilde{T}_{1} lies mostly at the upper envelope of this δ\delta-tube, causing the flat norm distance to increase by Δ𝔽λ=|𝔽λ(T1T0)𝔽λ(T~1T0)|\Delta\mathbb{F}_{\lambda}=\left|{\mathbb{F}_{\lambda}(T_{1}-T_{0})-\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0})}\right| (highlighted in green), or mostly at the lower envelope causing a decrease in the flat norm distance, respectively (highlighted in pink). In both cases, one would expect the ideal measure of discrepancy between T~1\widetilde{T}_{1} and T0T_{0} to change significantly as well (compared to the one between T1T_{1} and T0T_{0}). The flat norm distance accurately captures all such changes (to keep the example simple, we consider the default flat norm distance and not the normalized version). At the same time, both such variations could have the same Hausdorff distance HH from T0T_{0} as T1T_{1}, which completely misses all the changes applied to T1T_{1} in either case.

Example 4.2 (Fixing 𝔽λ(T~1T0)\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0})).

A modification of this example can illustrate the other extreme case—when Hausdorff distance changes by a lot but the flat norm distance does not change much at all, see bottom row in Fig. 8. Consider moving only the highest point on T1T_{1} further away from T0T_{0} so that Hausdorff distance becomes H>>HH^{\prime}>>H, as shown on the bottom left figure of Fig. 8. We keep T1T_{1} a connected curve, thus creating a sharp spike in it. While the Hausdorff distance between the curves has increased dramatically, the flat norm distance sees only a minute increase as measured by the area under the spike. Moreover, the increment Δ𝔽λ\Delta\mathbb{F}_{\lambda} can be decreased to almost zero by narrowing the spike. Once again, the flat norm distance accurately captures the intuition that the curves have not changed much when just a single point moves away while the rest of the curve stays the same. Hence the flat norm provides a more robust metric that better captures significant changes while maintaining stability to small perturbations (also see Section 4.3 for theoretical bounds).

4.2 Empirical study

We observe similar behavior to those illustrated by the theoretical example (Fig. 8) in our computational experiments. Fig. 9 shows scatter plots denoting empirical distribution of percentage deviation of the two metrics from the original values (%Δ𝔻H,%Δ𝔽~λ)\left(\%\Delta\mathbb{D}_{H},\%\Delta\widetilde{\mathbb{F}}_{\lambda}\right) for a local region. The perturbations are considered for three different radii shown in separate plots. We note that the percentage deviations in the two metrics are comparable in most cases. In other words, neither metric behaves abnormally for a small perturbation in one of the networks.

Refer to caption
Figure 9: Scatter plots showing the effect of network perturbation on the normalized flat norm and Hausdorff distances for a local region. The percentage deviation in the metrics for the perturbations is shown along each axis. We do not observe significantly large deviations in any one metric for a given perturbation.

Next, we compare the sensitivity of the two metrics to outliers. Here, we consider a single random node in one of the networks and perturb it. Fig. 10 shows the sensitivity of the metrics to these outliers. The original normalized flat norm and Hausdorff distance metrics are shown by the horizontal and vertical dashed lines respectively. The points along the horizontal dashed line denote the cases where the Hausdorff distance metric is more sensitive to the outliers, while the normalized flat norm metric remains the same. These cases occur when the perturbed random node determines the Hausdorff distance, similar to the second Example where Hausdorff distance increased from HH to HH^{\prime}. On the flip side, the points along the vertical dashed line denote the Hausdorff distance remaining unchanged while the normalized flat norm metric shows variation. Just as in the theoretical example (Fig. 8), such variation in the normalized flat norm metric implies a variation in the network structure. However, such variation is not captured by the Hausdorff distance metric. Hence, our proposed metric is capable of identifying structural differences due to perturbations while remaining stable when widely separated nodes (which are involved in Hausdorff distance computation) are perturbed. The other points which are neither on the horizontal nor the vertical dashed lines indicate that either metric can identify the structural variation due to the perturbation.

Refer to caption
Figure 10: Scatter plots showing the effect of a few outliers on normalized flat norm and Hausdorff distance for a local region. The original normalized flat norm and Hausdorff distance are highlighted by the dashed horizontal and vertical lines. We observe multiple cases where the Hausdorff distance is more sensitive to outliers compared to 𝔽~λ\tilde{\mathbb{F}}_{\lambda}.

4.3 Stability of the Flat Norm

The results of the previous subsection are applicable in all dimensions, i.e., one cannot expect to bound changes in the flat norm by small functions of the changes in the Hausdorff distance. In this subsection, we adopt a more direct approach to the investigation of the stability of our discrepancy measure between geometric objects. Our goal is to construct an upper bound on 𝔽λ\mathbb{F}_{\lambda} from the bottom up based only on the input geometries and an appropriately defined radius of perturbation. To this end, we consider simple piecewise linear (PWL) curves spanned by a pair of points with no self-intersections embedded in 2. Here, simple means that there is no self intersection or branching in the curve that connects two points. Despite its apparent simplicity, this class of curves is of particular interest, since they can potentially approximate any continuous non-intersecting curve in 2. More directly, power grid networks that form the main motivated for our work can be seen as collections of such simple curves. Although, the results of this subsection are proven only for a pair of simple PWL currents, the empirical findings on the real-world networks, presented in the next section (Sec. 5) comply surprisingly well with the upper bounds established for simple curves (see Fig. 17).

We conceptualize a simple piecewise linear curve 𝒯2\mathcal{T}\subset^{2} between points ss and tt as the 1-current TT embedded in 2 that is equipped with an edge set 𝐄(T)={e1,e2,,en1,en}\mathbf{E}(T)=\left\{e_{1},e_{2},\ldots,e_{n-1},e_{n}\right\} given by the linear segments of 𝒯\mathcal{T}, and a node set 𝐍(T)={v0,v1,,vn1,vn}\mathbf{N}(T)=\left\{v_{0},v_{1},\ldots,v_{n-1},v_{n}\right\}, where v0=sv_{0}=s and vn=tv_{n}=t, and vi=eiei+1v_{i}=e_{i}\cap e_{i+1} for i=1,,n1i=1,\dots,n-1. Such currents are defined as a formal sum of their edges T=e1++enT=e_{1}+\ldots+e_{n}, and can be thought of as discretized linear approximations of simple continuous curves in 2. We refer to this type of currents as PWL currents, and denote the set of all PWL currents with nn edges spanned by ss and tt (i.e., starting at ss and ending and tt) as n[s,t]\mathcal{L}_{n}[s,t] or n[s,t;2]\mathcal{L}_{n}[s,t;^{2}]. A subcurrent of Tn[s,t]T\in\mathcal{L}_{n}[s,t] spanned by nodes viv_{i} and vjv_{j}, where i<ji<j, is denoted T[vi,vj]TT[v_{i},v_{j}]\subseteq T, and 𝐄(T[vi,vj])={ei+1,,ej}\mathbf{E}\big{(}T[v_{i},v_{j}]\big{)}=\left\{e_{i+1},\ldots,e_{j}\right\}, which implies T[vi,vj]ji[vi,vj]T[v_{i},v_{j}]\in\mathcal{L}_{j-i}[v_{i},v_{j}]. The length of TT is given by |T|=j=1n|ej||{T}|=\sum_{j=1}^{n}|{e_{j}}|.

We start with a PWL current T0T_{0} in 2 and its copy T1T_{1}. Next, we consider a sequence of perturbations of T1T_{1}’s nodes within some δ\delta-neighborhood to obtain T~1\widetilde{T}_{1}, while tracking the components of the flat norm distance between the original and the perturbed copy. Recall that the components of the flat norm distance between generic inputs T0T_{0} and T1T_{1} are the perimeter of the unfilled void |T1T0S||{T_{1}-T_{0}-\partial S}| and the area of a 2-current SS given as 𝒜(S)=𝐕2(S)\mathcal{A}\left(S\right)=\mathbf{V}_{2}\left(S\right), see Fig. 1. We refer to them as the length (or perimeter) and the area components of flat norm distance, respectively:

𝔽λ(T1T0)\displaystyle\mathbb{F}_{\lambda}\left(T_{1}-T_{0}\right) =minS𝒞2(2){|T1T0S|+λ𝒜(S)}.\displaystyle=\min\limits_{S\in\mathcal{C}_{2}(^{2})}\big{\{}\left|{T_{1}-T_{0}-\partial S}\right|+\lambda\mathcal{A}\left(S\right)\big{\}}. (7)

Since T0T_{0} and T1T_{1} are identical to start with, the flat norm distance between them is zero (for S=S=\emptyset). Now let us take a look at the components of the flat norm distance 𝔽λ(T~1T0)\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0}) between the original current and the perturbed copy T~1\widetilde{T}_{1}. Let X2X\subset^{2} be the 2-dimensional void with the boundary given by T~1T0\widetilde{T}_{1}-T_{0}, and SC2(2)S\in C_{2}(^{2}) be a 2-current that fills in, possibly partially, the void XX. The area component in the optimal decomposition of 𝔽λ(T~1T0)\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0}) is bounded by the area of XX, 𝒜(S)𝒜(X)\mathcal{A}(S)\leq\mathcal{A}(X), and is maximized when the void is fully filled, i.e., SXS\equiv X. On the other hand, the length component can be, potentially, arbitrarily large due to the complexity of S\partial S:

0|T~1T0S||T~1T0|+|S|.\displaystyle 0\leq|{\widetilde{T}_{1}-T_{0}-\partial S}|\leq|{\widetilde{T}_{1}-T_{0}}|+|{\partial S}|. (8)

Here we make an important assumption about the values of parameters λ>0\lambda>0 and δ>0\delta>0, formalized below, that allows us to circumvent the potential unboundedness of the perimeter component. We mention that the problem of identifying the ranges of values of parameters that fit the assumption is out of the scope of this paper, and will be the focus of future research.

Assumption 4.3 (Filled voids).

For any original current T0T_{0} embedded in 2, the scale parameter λ=λ(T0)>0\lambda=\lambda(T_{0})>0 is small enough such that for any size of the perturbation δ=δ(λ,T0)>0\delta=\delta(\lambda,T_{0})>0 taken within some range 0<δ<Mλ(T0)0<\delta<M_{\lambda}(T_{0}), the optimal decomposition of the flat norm distance between T0T_{0} and its consecutive perturbations always fills in all the voids that appear as a result of the perturbations.

We note that this assumption does not introduce any additional challenges in implementing the flat norm distance, since we always can find a small enough λ\lambda by scaling it by one-half until all gaps between the input geometries are filled.

Given parameters λ>0\lambda>0 and δ>0\delta>0 under Assumption 4.3, the minimization objective of 𝔽λ(T~1T0)\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0}) in Eq. (7) is achieved by a 2-current SXS\equiv X, where XX is a 2D-void in between T~1\widetilde{T}_{1} and T0T_{0}. Hence, S=X=T~1T0\partial S=\partial X=\widetilde{T}_{1}-T_{0}, i.e., S𝒞2[X]=𝒞2[T~1T0]S\in\mathcal{C}_{2}[\partial X]=\mathcal{C}_{2}[\widetilde{T}_{1}-T_{0}]. This implies that the length component in Eq. (8) renders to 0—its minimum value. Conversely, if we would leave the void unfilled, i.e., S0S\equiv 0, then the area component as well as its boundary become zero, 𝒜(S)=0\mathcal{A}(S)=0 and |S|=0|{\partial S}|=0, while the length component is equal to the void’s perimeter: |T~1T0S|=|T~1T0|=|X||{\widetilde{T}_{1}-T_{0}-\partial S}|=|{\widetilde{T}_{1}-T_{0}}|=|{\partial X}|. Hence, under Assumption 4.3, the optimization objective of 𝔽λ(T~1T0)\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0}) reduces to the scaled area of SS for S𝒞2[T~1T0]S\in\mathcal{C}_{2}[\widetilde{T}_{1}-T_{0}]:

𝔽λ(T~1T0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0}) =minS𝒞2[X]{|T~1T0S|+λ𝒜(S)}=λ𝒜(S).\displaystyle=\min\limits_{S\in\mathcal{C}_{2}[\partial X]}\left\{|{\widetilde{T}_{1}-T_{0}-\partial S}|+\lambda\mathcal{A}\left(S\right)\right\}=\lambda\mathcal{A}(S). (9)

This collapsed minimization objective implies that for the scale and the perturbation radius given by our assumption, the void produced by perturbing the copy of T0T_{0} is filled in by a 2-current SS spanned by T~1T0\widetilde{T}_{1}-T_{0}, such that the scaled area of SS is upper bounded by its (non-scaled) perimeter:

𝔽λ(T~1T0)=λ𝒜(S)<|S|=|T~1T0|.\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0})=\lambda\mathcal{A}(S)<|{\partial S}|=|{\widetilde{T}_{1}-T_{0}}|. (10)

4.3.1 δ\delta-perturbations of PWL currents

We consider a PWL current T0n[s,t]T_{0}\in\mathcal{L}_{n}[s,t] spanned by s,t2s,t\in^{2}, called the original current, and its copy T1=T0T_{1}=T_{0}, e.g., see Fig. 11. Obviously, 𝔽λ(T1T0)=0\mathbb{F}_{\lambda}\left(T_{1}-T_{0}\right)=0 at any scale λ>0\lambda>0.

Refer to caption
Figure 11: The original PWL current T0n[s,t]{\color[rgb]{1,0.125,0.1953125}\definecolor[named]{pgfstrokecolor}{rgb}{1,0.125,0.1953125}T_{0}}\in\mathcal{L}_{n}[s,t] and its copy T1T_{1}. The arrows show the orientations in T1T0T_{1}-T_{0}, and the gray disk is the δ\delta-ball δ(v)\mathcal{B}_{\delta}(v) centered at an interior vertex vv.

Let v=viT1v=v_{i}\in T_{1} be an interior vertex of T1T_{1}, 1in11\leq i\leq n-1. Given a radius of perturbation δ>0\delta>0, consider a mapping vv~v\to\widetilde{v} such that v~\widetilde{v} stays within an open δ\delta-ball centered at vv, i.e., v~δ(v)={x2vx2<δ}\widetilde{v}\in\mathcal{B}_{\delta}(v)=\left\{x\in^{2}\mid\left\|{v-x}\right\|_{2}<\delta\right\}. Let u=vi1u=v_{i-1} and w=vi+1w=v_{i+1} be the adjacent nodes of vv, and let the corresponding incident edges be euv=(u,v)=(vi1,vi)=eie_{uv}=(u,v)=(v_{i-1},v_{i})=e_{i} and evw=(v,w)=(vi,vi+1)=ei+1e_{vw}=(v,w)=(v_{i},v_{i+1})=e_{i+1}. Then, let e~uv=(u,v~)\widetilde{e}_{uv}=(u,\widetilde{v}) and e~vw=(v~,w)\widetilde{e}_{vw}=(\widetilde{v},w) be the edge-like currents connecting the neighbors of vv to its perturbation v~\widetilde{v}, see Fig. 13. We define a 𝜹\boldsymbol{\delta}-perturbation of vT1v\in T_{1} as the collection of mappings of vv along with its two connected edges under the assumption that the new edges e~uv\widetilde{e}_{uv} and e~vw\widetilde{e}_{vw} do not cross any of the original edges.

v𝛿v~={vv~,euve~uv,evwe~vw} subject to v~δ(v)\displaystyle v\overset{\delta}{\rightsquigarrow}\widetilde{v}=\big{\{}v\to\widetilde{v},e_{uv}\to\widetilde{e}_{uv},e_{vw}\to\widetilde{e}_{vw}\big{\}}~{}\text{ subject to }~{}\widetilde{v}\in\mathcal{R}_{\delta}(v)

where δ(v)δ(v)\mathcal{R}_{\delta}(v)\subseteq\mathcal{B}_{\delta}(v) is the allowed region of δ\delta-perturbation defined as a subregion of the δ\delta-ball centered at vv that is in the direct line of sight of the vertices adjacent to vv (see Fig. 12):

δ(v)=δ(v;T1)\displaystyle\mathcal{R}_{\delta}(v)=\mathcal{R}_{\delta}(v;T_{1}) ={v~δ(v)e~uvT1=,e~vwT1=}.\displaystyle=\big{\{}\widetilde{v}\in\mathcal{B}_{\delta}(v)\mid\widetilde{e}_{uv}\cap T_{1}=\emptyset,\,\widetilde{e}_{vw}\cap T_{1}=\emptyset\big{\}}. (11)
Refer to caption
Figure 12: The yellow and orange circular arcs indicate the subregions of the δ\delta-ball that are in the line of sight of nodes uu and ww. The allowed region δ(v)\mathcal{R}_{\delta}(v) of a δ\delta-perturbation v𝛿v~v\overset{\delta}{\rightsquigarrow}\widetilde{v} is shown in green, in which it is guaranteed that the perturbed edges will not cross any of the original edges (see Eq. (11)).

A δ\delta-perturbation of a vertex v𝛿v~v\overset{\delta}{\rightsquigarrow}\widetilde{v} maps T1T_{1} into T~1\widetilde{T}_{1} by acting on its node and edge sets as defined below, and defines a δ\delta-perturbation T1𝛿T~1T_{1}\overset{\delta}{\rightsquigarrow}\widetilde{T}_{1}. We say that the δ\delta-perturbation of a current T1𝛿T~1T_{1}\overset{\delta}{\rightsquigarrow}\widetilde{T}_{1} is induced by v𝛿v~v\overset{\delta}{\rightsquigarrow}\widetilde{v}. Note that the non-overlapping conditions in Eq. (11), implies that e~uvT~1=e~vwT~1=\widetilde{e}_{uv}\cap\widetilde{T}_{1}=\widetilde{e}_{vw}\cap\widetilde{T}_{1}=\emptyset, which means that T~1\widetilde{T}_{1} is injective (has no self intersections).

𝐍(T~1)\displaystyle\mathbf{N}(\widetilde{T}_{1}) =v𝛿v~(𝐍(T1))={s,v1,,u,v~,w,,vn1,t} and\displaystyle=v\overset{\delta}{\rightsquigarrow}\widetilde{v}\big{(}\mathbf{N}(T_{1})\big{)}=\left\{s,v_{1},\ldots,u,\widetilde{v},w,\ldots,v_{n-1},t\right\}~{}~{}~{}\text{ and }
𝐄(T~1)\displaystyle\mathbf{E}(\widetilde{T}_{1}) =v𝛿v~(𝐄(T1))={es,e2,,e~uv,e~vw,,en1,et}.\displaystyle=v\overset{\delta}{\rightsquigarrow}\widetilde{v}\big{(}\mathbf{E}(T_{1})\big{)}=\left\{e_{s},e_{2},\ldots,\widetilde{e}_{uv},\widetilde{e}_{vw},\ldots,e_{n-1},e_{t}\right\}.

The δ\delta-perturbations of boundaries are given by the following maps:

s𝛿s~\displaystyle s\overset{\delta}{\rightsquigarrow}\widetilde{s} ={ss~,ese~s} and\displaystyle=\big{\{}s\to\widetilde{s},e_{s}\to\widetilde{e}_{s}\big{\}}\text{\hskip 43.36243pt and } t𝛿t~\displaystyle t\overset{\delta}{\rightsquigarrow}\widetilde{t} ={tt~,ete~t},\displaystyle=\big{\{}t\to\widetilde{t},e_{t}\to\widetilde{e}_{t}\big{\}},

where s~δ(s)=δ(s){s~e~sT1=}\widetilde{s}\in\mathcal{R}_{\delta}(s)=\mathcal{B}_{\delta}(s)\cap\{\widetilde{s}\mid\widetilde{e}_{s}\cap T_{1}=\emptyset\} and t~δ(t)=δ(t){t~e~tT1=}\widetilde{t}\in\mathcal{R}_{\delta}(t)=\mathcal{B}_{\delta}(t)\cap\{\widetilde{t}\mid\widetilde{e}_{t}\cap T_{1}=\emptyset\}.

4.3.2 Flat Norm of δ\delta-perturbations

Refer to caption
Figure 13: The quadrilateral S=[u,v,w,v~]=Δv+1+Δv1S=[u,v,w,\widetilde{v}]=\Delta_{v+1}+\Delta_{v-1} appears as result of a δ\delta-perturbation T1𝛿T~1T_{1}\overset{\delta}{\rightsquigarrow}\widetilde{T}_{1} induced by a perturbation of a non-boundary vertex v𝛿v~v\overset{\delta}{\rightsquigarrow}\widetilde{v}.

Having specified the necessary definitions and procedures, we are now ready to prove our first result regarding a single δ\delta-perturbation of an interior node.

Theorem 4.4.

Let T0n[s,t;2]T_{0}\in\mathcal{L}_{n}[s,t;^{2}] for n2n\geq 2 and T1=T0T_{1}=T_{0} be its copy. Given a small enough scale λ>0\lambda>0 and an appropriate radius of perturbation δ>0\delta>0, consider a δ\delta-perturbation T1𝛿T~1T_{1}\overset{\delta}{\rightsquigarrow}\widetilde{T}_{1} induced by perturbation of an interior node v𝛿v~v\overset{\delta}{\rightsquigarrow}\widetilde{v}. Then the flat norm distance between T0T_{0} and T~1\widetilde{T}_{1} is upper bounded as follows:

𝔽λ(T~1T0)λδ2|T0[u,w]|\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0})\leq\frac{\lambda\delta}{2}\big{|}{T_{0}[u,w]}\big{|}

where uu and ww are vertices of T0T_{0} adjacent to vv.

Proof.

Recall that by the Assumption 4.3, the optimal 2-current SS is spanned by T~1T0\widetilde{T}_{1}-T_{0}, and by the construction of δ(v)\mathcal{R}_{\delta}(v) the new edges e~uv\widetilde{e}_{uv} and e~vw\widetilde{e}_{vw} do not overlap with the original current. Then S=T~1T0=euv+evwe~vwe~uv\partial S=\widetilde{T}_{1}-T_{0}=e_{uv}+e_{vw}-\widetilde{e}_{vw}-\widetilde{e}_{uv}, which is the boundary of a quadrilateral spanned by the vertices [u,v,w,v~][u,v,w,\widetilde{v}], see Fig. 13. To keep the notation simple, we just write S=[u,v,w,v~]𝒞2[T~1T0]S=[u,v,w,\widetilde{v}]\in\mathcal{C}_{2}[\widetilde{T}_{1}-T_{0}]. Consider the diagonal δ~v=(v,v~)1[v,v~]\widetilde{\delta}_{v}=(v,\widetilde{v})\in\mathcal{L}_{1}[v,\widetilde{v}] that connects vv to its perturbation. Its length is bounded by the radius of perturbation |δ~v|δ|{\widetilde{\delta}_{v}}|\leq\delta and it partitions SS into a pair of triangles Δv+1\Delta_{v+1} and Δv1\Delta_{v-1}:

S=[u,v,w,v~]=[v,w,v~]+[v~,u,v]=Δv+1+Δv1\displaystyle S=[u,v,w,\widetilde{v}]=[v,w,\widetilde{v}]+[\widetilde{v},u,{v}]=\Delta_{v+1}+\Delta_{v-1} (12)

with boundaries given by Δv+1=evwe~vwδ~v\partial\Delta_{v+1}=e_{vw}-\widetilde{e}_{vw}-\widetilde{\delta}_{v} and Δv1=euv+δ~ve~uv\partial\Delta_{v-1}=e_{uv}+\widetilde{\delta}_{v}-\widetilde{e}_{uv}. The corresponding areas are 𝒜(Δv+1)=12|δ~v||evw|sinθv+1\mathcal{A}(\Delta_{v+1})=\frac{1}{2}|{\widetilde{\delta}_{v}}||{e_{vw}}|\sin\theta_{v+1} and 𝒜(Δv1)=12|δ~v||euv|sinθv1\mathcal{A}(\Delta_{v-1})=\frac{1}{2}|{\widetilde{\delta}_{v}}||{e_{uv}}|\sin\theta_{v-1}, where θi\theta_{i} is the angle between the diagonal δ~\widetilde{\delta} and the original edge in the corresponding triangle, (see Fig. 13). Note that both Δv+1\Delta_{v+1} and Δv1\Delta_{v-1}, and consequently SS, attain the largest possible area when the perturbed vertex v~\widetilde{v} lands on the boundary of the δ\delta-ball δ(v)\mathcal{B}_{\delta}(v) and δ~v\widetilde{\delta}_{v} is perpendicular to the original edges, i.e., θv±1=π/2\theta_{v\pm 1}=\nicefrac{{\pi}}{{2}}. Let Δv\Delta_{v} be one of the triangles and ev=evwe_{v}=e_{vw} or ev=euve_{v}=e_{uv} be its original edge, then the following are the upper bounds on the area of Δv\Delta_{v} and SS:

𝒜(Δv)=12|δ~v||ev|sinθ\displaystyle\mathcal{A}(\Delta_{v})=\frac{1}{2}|{\widetilde{\delta}_{v}}||{e_{v}}|\sin\theta δ2|ev|\displaystyle\leq\frac{\delta}{2}|{e_{v}}| (13)
𝒜(S)=𝒜(Δv+1)+𝒜(Δv1)\displaystyle\mathcal{A}(S)=\mathcal{A}(\Delta_{v+1})+\mathcal{A}(\Delta_{v-1}) δ2(|evw|+|euv|)\displaystyle\leq\frac{\delta}{2}\big{(}|{e_{vw}}|+|{e_{uv}}|\big{)} (14)

Note that euv+evwe_{uv}+e_{vw} defines a subcurrent of T0T_{0} spanned by uu and ww, namely T0[u,w]T_{0}[u,w]. Finally, recall that under our main Assumption 4.3, 𝔽λ(T~1T0)\mathbb{F}_{\lambda}(\widetilde{T}_{1}-T_{0}) is given by λ𝒜(S)\lambda\mathcal{A}(S) (see Eq. (9)), which together with the upper bound on 𝒜(S)\mathcal{A}(S) in Eqn. 14 implies the main statement of the Theorem. ∎

Furthermore, observe that by the triangle inequality (see Fig. 13) the length of the perturbed edges in the boundary of SS is bounded as follows:

|ev|δ|ev||δ~||e~v||ev|+|δ~||ev|+δ\displaystyle|{e_{v}}|-\delta\leq|{e_{v}}|-|{\widetilde{\delta}}|\leq|{\widetilde{e}_{v}}|\leq|{e_{v}}|+|{\widetilde{\delta}}|\leq|{e_{v}}|+\delta (15)

which implies that |Δv|=|ev|+|e~v|+|δ~|2|ev||δ~|+|δ~|=2|ev|\big{|}{\partial\Delta_{v}}\big{|}=|{e_{v}}|+|{\widetilde{e}_{v}}|+|{\widetilde{\delta}}|\geq 2|{{e}_{v}}|-|{\widetilde{\delta}}|+|{\widetilde{\delta}}|=2|{e_{v}}|, and thus the following bounds hold:

2|ev|\displaystyle 2|{e_{v}}|\leq |Δv|2|ev|+2δ and\displaystyle~{}\big{|}{\partial\Delta_{v}}\big{|}\leq\displaystyle 2|{e_{v}}|+2\delta\quad\text{ and }
2(|evw|+|euv|)2δ\displaystyle 2\big{(}|{e_{vw}}|+|{e_{uv}}|\big{)}-2\delta\leq |S|2(|evw|+|euv|)+2δ.\displaystyle~{}~{}\big{|}{\partial S}\big{|}\,\leq\displaystyle 2\big{(}|{e_{vw}}|+|{e_{uv}}|\big{)}+2\delta.

4.3.3 Sequential δ\delta-perturbations

We now want to derive results similar to Theorem 4.4 for the case where perturbations are applied to a subcurrent of T1T_{1} given by a subset of its adjacent nodes. To this end, we consider a sequence of δ\delta-perturbations of the copy current T1𝛿T~1𝛿𝛿T~1n1T_{1}\overset{\delta}{\rightsquigarrow}\widetilde{T}_{1}\overset{\delta}{\rightsquigarrow}\ldots\overset{\delta}{\rightsquigarrow}\widetilde{T}^{n-1}_{1} induced by sequential perturbations of the interior points v1𝛿v~1,,vn1𝛿v~n1v_{1}\overset{\delta}{\rightsquigarrow}\widetilde{v}_{1},\ldots,v_{n-1}\overset{\delta}{\rightsquigarrow}\widetilde{v}_{n-1}, which we denote as [v1,,vn1]𝛿[v~1,,v~n1][v_{1},\ldots,v_{n-1}]\overset{\delta}{\rightsquigarrow}[\widetilde{v}_{1},\ldots,\widetilde{v}_{n-1}].

The procedure of δ\delta-perturbations described above does not guarantee “out of the box” additivity of the area components 𝒜(Si)\mathcal{A}(S_{i}) of the corresponding flat norm distances 𝔽λ(T~1iT~1i1)\mathbb{F}_{\lambda}(\widetilde{T}^{i}_{1}-\widetilde{T}^{i-1}_{1}). The problem arises when a δ\delta-perturbation v~i\widetilde{v}_{i} lands within a region SjS_{j} produced by vj𝛿v~jv_{j}\overset{\delta}{\rightsquigarrow}\widetilde{v}_{j} for some j<ij<i, where Sj=T~1jT~1j1\partial S_{j}=\widetilde{T}^{j}_{1}-\widetilde{T}^{j-1}_{1}. It means that SiSjS_{i}\cap S_{j}\neq\emptyset and 𝒜(Si+Sj)𝒜(Si)+𝒜(Sj)\mathcal{A}(S_{i}+S_{j})\neq\mathcal{A}(S_{i})+\mathcal{A}(S_{j}). But since SiS_{i} and SjS_{j} are embedded in 2 the area of a formal sum Si+SjS_{i}+S_{j} is not larger than the area of their union: 𝒜(Si+Sj)𝒜(SiSj)𝒜(Si)+𝒜(Sj)\mathcal{A}\big{(}S_{i}+S_{j}\big{)}\leq\mathcal{A}\left(S_{i}\cup S_{j}\right)\leq\mathcal{A}(S_{i})+\mathcal{A}(S_{j}). Therefore, to find an upper bound for 𝔽λ(T~1n1T0)\mathbb{F}_{\lambda}(\widetilde{T}^{n-1}_{1}-T_{0}) we can consider only perturbation sequences T1𝛿𝛿T~1n1T_{1}\overset{\delta}{\rightsquigarrow}\ldots\overset{\delta}{\rightsquigarrow}\widetilde{T}^{n-1}_{1} that produce non-overlapping regions SiS_{i}, and hence, additive area components 𝒜(Si)\mathcal{A}(S_{i}).

One way to achieve this is to force v~\widetilde{v} for any choice of vv to land on the same “side” of T1T_{1}, and hence of T0T_{0}, by restricting the δ\delta-ball δ(v)\mathcal{B}_{\delta}(v) to a positive cone δ+(v)\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v) given by the edges incident to vv in T1T_{1}. As an example in Fig. 14, the light green circular arc indicates the segment of the δ\delta-ball that corresponds to δ+(v)\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v). Let v=vi𝐍(T1)v=v_{i}\in\mathbf{N}(T_{1}) be an interior vertex of T1T_{1}, i.e., 1in11\leq i\leq n-1, and euv=ei=(vi1,vi)e_{uv}=e_{i}=(v_{i-1},v_{i}) and evw=ei+1=(vi,vi+1)e_{vw}=e_{i+1}=(v_{i},v_{i+1}) are the incident edges of vv in T1T_{1}. Then we get that

δ+(v)=δ+(v;T1)\displaystyle\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v)=\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v;T_{1}) ={xδ(v)|det[xv,euv]𝖳>0 and det[xv,evw]𝖳>0}\displaystyle=\bigg{\{}x\in\mathcal{B}_{\delta}(v)\,\bigg{|}\det\big{[}x-v,\,e_{uv}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}>0\text{ and }\det\big{[}x-v,\,e_{vw}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}>0\bigg{\}} (16)

where det[x,y]𝖳=x1y2x2y1\det[x,y]^{\scalebox{0.5}{$\mathsf{T}$}}=x_{1}y_{2}-x_{2}y_{1} for x=(x1,x2)2x=(x_{1},x_{2})\in^{2} and y=(y1,y2)2y=(y_{1},y_{2})\in^{2}. In the case of boundary vertices ss or tt, the positive cone reduces to the half-disk bounded by a line that contains ese_{s} or ete_{t}, respectively. As previously specified in Eq. (11), the allowed region of v~\widetilde{v} is specified by the non-overlapping conditions:

δ+(v)=δ+(v;T1)\displaystyle\mathcal{R}^{\scalebox{0.6}{+}}_{\delta}(v)=\mathcal{R}^{\scalebox{0.6}{+}}_{\delta}(v;T_{1}) ={v~δ+(v)|e~uvT1=e~vwT1=}=δ+(v;T1)δ(v;T1).\displaystyle=\bigg{\{}\widetilde{v}\in\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v)\,\bigg{|}\,\widetilde{e}_{uv}\cap T_{1}=\widetilde{e}_{vw}\cap T_{1}=\emptyset\bigg{\}}=\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v;T_{1})\cap\mathcal{R}_{\delta}(v;T_{1}). (17)
Refer to caption
Figure 14: The light green arc indicates the positive cone δ+(v)\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v) (see Eq. (16)). The allowed region δ+(v)\mathcal{R}^{\scalebox{0.6}{+}}_{\delta}(v) (see Eq. (17)) of a positive δ\delta-perturbation vδ+v~v\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{v} is shown in green. It ensures that the sequential perturbations produce non-overlapping regions SiS_{i} (Proposition 4.5). Compare this figure to the allowed region shown in Fig. 12.

We call a perturbation vi𝛿v~iv_{i}\overset{\delta}{\rightsquigarrow}\widetilde{v}_{i} a positive 𝜹\boldsymbol{\delta}-perturbation of viT~1i1v_{i}\in\widetilde{T}^{i-1}_{1} if v~iδ+(vi;T~1i1)\widetilde{v}_{i}\in\mathcal{R}^{\scalebox{0.6}{+}}_{\delta}(v_{i};\widetilde{T}^{i-1}_{1}). We denote it as viδ+v~iv_{i}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{v}_{i}, and it induces T~1i1δ+T~1i\widetilde{T}^{i-1}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{i}_{1}. It easy to see that a sequence of kk positive δ\delta-perturbations T1δ+δ+T~1kT_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k}_{1} produces non-overlapping regions S1,,SkS_{1},\ldots,S_{k}.

Proposition 4.5.

Let T1n[s,t;2]T_{1}\in\mathcal{L}_{n}[s,t;^{2}] for n2n\geq 2.Consider a sequence of perturbations T1δ+T~1k0δ+δ+T~1kT_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k_{0}}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k}_{1} induced by positive δ\delta-perturbations of adjacent nodes [vk0,,vk][v_{k_{0}},\ldots,v_{k}] for some 0k0<kn0\leq k_{0}<k\leq n. Then the regions Sk0,,SkS_{k_{0}},\ldots,S_{k} produced by the corresponding perturbations in the sequence do not overlap.

Proof.

First, when two adjacent nodes are perturbed, vi1δ+v~i1v_{i-1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{v}_{i-1} and viδ+v~iv_{i}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{v}_{i}, the edge ei=(vi1,vi)e_{i}=(v_{i-1},v_{i}) is perturbed twice: eie~ie~~ie_{i}\to\widetilde{e}_{i}\to\widetilde{\widetilde{e}}_{i}, where e~i1[v~i1,vi]\widetilde{e}_{i}\in\mathcal{L}_{1}[\widetilde{v}_{i-1},v_{i}] and e~~i1[v~i1,v~i]\widetilde{\widetilde{e}}_{i}\in\mathcal{L}_{1}[\widetilde{v}_{i-1},\widetilde{v}_{i}]. Therefore, the boundaries of corresponding regions Si1S_{i-1} and SiS_{i} are:

Si1=\displaystyle\partial S_{i-1}= T~1i1T~1i2=e~i1+eie~~i1e~i and\displaystyle\,\widetilde{T}^{i-1}_{1}-\widetilde{T}^{i-2}_{1}=\widetilde{e}_{i-1}+e_{i}-\widetilde{\widetilde{e}}_{i-1}-\widetilde{e}_{i}~{}\text{ and } (18)
Si=\displaystyle\partial S_{i}= T~1iT~1i1=e~i+ei+1e~~ie~i+1.\displaystyle~{}~{}\widetilde{T}^{i}_{1}-\widetilde{T}^{i-1}_{1}~{}=\widetilde{e}_{i}+e_{i+1}-\widetilde{\widetilde{e}}_{i}-\widetilde{e}_{i+1}. (19)

It is worth pointing out that when only interior nodes are perturbed, i.e., 1k01\leq k_{0} and knk\leq n, the edges ek0=(vk01,vk0)Sk0e_{k_{0}}=(v_{k_{0}-1},v_{k_{0}})\in S_{k_{0}} and ek+1=(vk,vk+1)Ske_{k+1}=(v_{k},v_{k+1})\in S_{k} are perturbed only once. Thus, the edges incident to the boundaries of T1T_{1}, namely ese_{s} and ete_{t}, are perturbed to e~~s\widetilde{\widetilde{e}}_{s} or e~~t\widetilde{\widetilde{e}}_{t} if and only if the boundary nodes ss and tt were perturbed together with all the nodes in between. In this case the corresponding area components S0S_{0} and SnS_{n} are given by triangles Δs=[s,v~1,s~]\Delta_{s}=[{s},\widetilde{v}_{1},\widetilde{s}] and Δt=[t~,v~n1,t]\Delta_{t}=[\widetilde{t},\widetilde{v}_{n-1},t], respectively.

Second, observe that any point xx within SiS_{i} is in δ+(vi)\mathcal{B}^{\scalebox{0.6}{+}}_{\delta}(v_{i}) as well, which means that det[xvi,e~i]𝖳>0\det\big{[}x-v_{i},\,\widetilde{e}_{i}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}>0 and det[xvi,ei+1]𝖳>0\det\big{[}x-v_{i},\,e_{i+1}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}>0. This point also belongs to δ(v~i)={xδ(v~i)det[xv~i,e~~i]𝖳<0 and det[xv~i,e~i+1]𝖳<0}\mathcal{B}^{-}_{\delta}(\widetilde{v}_{i})=\big{\{}x\in\mathcal{B}_{\delta}(\widetilde{v}_{i})\mid\det\big{[}x-\widetilde{v}_{i},\,\widetilde{\widetilde{e}}_{i}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}<0\text{ and }\det\big{[}x-\widetilde{v}_{i},\,\widetilde{e}_{i+1}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}<0\big{\}}, since xx is enclosed by Si\partial S_{i}.

Given a sequence T1δ+T~1k0δ+δ+T~1kT_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k_{0}}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k}_{1}, let us assume that v~kSj\widetilde{v}_{k}\in S_{j} for some k0j<kk_{0}\leq j<k, while v~iδ+(vi;T~1i1)\widetilde{v}_{i}\in\mathcal{R}^{\scalebox{0.6}{+}}_{\delta}(v_{i};\widetilde{T}^{i-1}_{1}) for all k0ikk_{0}\leq i\leq k. If kj>1k-j>1 then Sk\partial S_{k} and Sj\partial S_{j} do not share any edges, and hence placing v~k\widetilde{v}_{k} inside SjS_{j} requires an intersection of edges, which contradicts v~kδ+(vk;T~1k1)\widetilde{v}_{k}\in\mathcal{R}^{\scalebox{0.6}{+}}_{\delta}(v_{k};\widetilde{T}^{k-1}_{1}). If j=k1j=k-1 then v~kSkdet[v~kvk,e~k]𝖳>0\widetilde{v}_{k}\in S_{k}\iff\det\big{[}\widetilde{v}_{k}-v_{k},\,\widetilde{e}_{k}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}>0 and v~kSk1det[v~kv~k1,e~k]𝖳<0\widetilde{v}_{k}\in S_{k-1}\iff\det\big{[}\widetilde{v}_{k}-\widetilde{v}_{k-1},\,\widetilde{e}_{k}\big{]}^{\scalebox{0.5}{$\mathsf{T}$}}<0, where e~k=(v~k1,vk)\widetilde{e}_{k}=(\widetilde{v}_{k-1},v_{k}), which is also a contradiction. Therefore SkS_{k} does not overlap with any of the previously produced regions Sk0,,Sk1S_{k_{0}},\ldots,S_{k-1}. ∎

Corollary 4.6 (Additivity of 𝔽λ\mathbb{F}_{\lambda}).

Let T0n[s,t;2]T_{0}\in\mathcal{L}_{n}[s,t;^{2}] for n2n\geq 2, and T1=T0T_{1}=T_{0} is its copy. Given a small enough scale λ>0\lambda>0 and an appropriate radius of perturbation δ>0\delta>0, consider a sequence of perturbations T1δ+T~1k0δ+δ+T~1kT_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k_{0}}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k}_{1} induced by the positive δ\delta-perturbations of adjacent nodes [vk0,,vk][v_{k_{0}},\ldots,v_{k}] in T1T_{1} for some 0k0<kn0\leq k_{0}<k\leq n. Then the flat norm distance between the adjacent perturbations is additive and sums up to 𝔽λ(T~1kT0)\mathbb{F}_{\lambda}(\widetilde{T}^{k}_{1}-T_{0}):

𝔽λ(T~1kT0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{k}_{1}-{T}_{0}) =i=k0k𝔽λ(T~1iT~1i1)\displaystyle=\sum_{i=k_{0}}^{k}\mathbb{F}_{\lambda}(\widetilde{T}^{i}_{1}-\widetilde{T}^{i-1}_{1})

where we set T~k01=T1\widetilde{T}^{k_{0}-1}=T_{1}.

Proof.

Let S=Sk0++SkS=S_{k_{0}}+\ldots+S_{k} be the 2-current that combines all the regions produced by T1δ+T~1k0δ+δ+T~1kT_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k_{0}}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k}_{1}. Since SiSj=S_{i}\cap S_{j}=\emptyset for iji\neq j and Si=T~1iT~1i1\partial S_{i}=\widetilde{T}^{i}_{1}-\widetilde{T}^{i-1}_{1}, the boundary of SS is given by

S\displaystyle\partial S =Sk0+Sk0+1++Sk1+Sk\displaystyle=\partial S_{k_{0}}+\partial S_{k_{0}+1}+\ldots+\partial S_{k-1}+\partial S_{k}
=(T~1k0T1)+(T~1k0+1T~1k0)++(T~1k1T~1k2)+(T~1kT~1k1)=T~1kT1.\displaystyle=(\widetilde{T}^{k_{0}}_{1}-{T}_{1})+(\widetilde{T}^{k_{0}+1}_{1}-\widetilde{T}^{k_{0}}_{1})+\ldots+(\widetilde{T}^{k-1}_{1}-\widetilde{T}^{k-2}_{1})+(\widetilde{T}^{k}_{1}-\widetilde{T}^{k-1}_{1})=\widetilde{T}^{k}_{1}-T_{1}.

This means that SS is spanned by T~1kT1=T~1kT0\widetilde{T}^{k}_{1}-T_{1}=\widetilde{T}^{k}_{1}-T_{0}, which under the Assumption 4.3 implies that

𝔽λ(T~1kT0)=λ𝒜(S)=λi=k0k𝒜(Si)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{k}_{1}-T_{0})=\lambda\mathcal{A}(S)=\lambda\sum_{i=k_{0}}^{k}\mathcal{A}(S_{i}) =i=k0k𝔽λ(T~1iT~1i1).\displaystyle=\sum_{i=k_{0}}^{k}\mathbb{F}_{\lambda}(\widetilde{T}^{i}_{1}-\widetilde{T}^{i-1}_{1}).

Refer to caption
Figure 15: The ii-th positive δ\delta-perturbation T~1i1δ+T~1i\widetilde{T}^{i-1}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{i}_{1} induced by viδ+v~iv_{i}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{v}_{i}. The purple edges show history of the previously perturbed edges e~v\widetilde{e}_{v} that have been mapped to e~~v\widetilde{\widetilde{e}}_{v} in T~1i\widetilde{T}^{i}_{1}.

We now present in Theorem 4.7 an upper bound on the flat norm distance between the input current T0T_{0} and its perturbed version resulting from a sequence of perturbations of its internal nodes, i.e., nodes in 𝐍(T0){s,t}\mathbf{N}(T_{0})\setminus\{s,t\}. We subsequently extend the result to include perturbations of all nodes including the boundary nodes in Corollary 4.9.

Theorem 4.7.

Let T0n[s,t;2]T_{0}\in\mathcal{L}_{n}[s,t;^{2}] for n3n\geq 3, and T1=T0T_{1}=T_{0} is its copy. Given a small enough scale λ>0\lambda>0 and an appropriate radius of perturbation δ>0\delta>0, consider a sequence of perturbations T1δ+T~1k0δ+δ+T~1kT_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k_{0}}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k}_{1} induced by the positive δ\delta-perturbations of adjacent non-boundary nodes [vk0,,vk][v_{k_{0}},\ldots,v_{k}] in T1T_{1} for some 1k0<kn11\leq k_{0}<k\leq n-1. Then the flat norm distance between T0T_{0} and T~1k\widetilde{T}^{k}_{1} is upper bounded as follows:

𝔽λ(T~1kT0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{k}_{1}-{T}_{0}) λδ2(|T0[vk01,vk+1]|+|T0[vk0,vk]|+(kk0)δ).\displaystyle\leq\frac{\lambda\delta}{2}\bigg{(}\big{|}{T_{0}[v_{k_{0}-1},v_{k+1}]}\big{|}+\big{|}{T_{0}[v_{k_{0}},v_{k}]}\big{|}+(k-k_{0})\delta\bigg{)}.
Proof.

As previously was observed in Eq. (12), SiS_{i} can be partitioned by δ~i=(vi,v~i)\widetilde{\delta}_{i}=(v_{i},\widetilde{v}_{i}) into a pair of triangles Δvi+1=[vi,vi+1,v~i]\Delta_{v_{i+1}}=[v_{i},v_{i+1},\widetilde{v}_{i}] and Δ~vi=[v~i,v~i1,vi]\widetilde{\Delta}_{v_{i}}=[\widetilde{v}_{i},\widetilde{v}_{i-1},v_{i}], with the respective boundaries Δvi+1=ei+1e~i+1δ~i\partial\Delta_{v_{i+1}}=e_{i+1}-\widetilde{e}_{i+1}-\widetilde{\delta}_{i} and Δ~vi=e~ie~~i+δ~i\partial\widetilde{\Delta}_{v_{i}}=\widetilde{e}_{i}-\widetilde{\widetilde{e}}_{i}+\widetilde{\delta}_{i}. The area of these triangles is bounded as in Eq. (13), and the upper bound on the length of the perturbed edges e~v\widetilde{e}_{v} is given by corresponding triangle inequalities in Eq. 15. Hence, we get the following upper bound for 𝒜(Si)\mathcal{A}(S_{i}):

𝒜(Si)=𝒜(Δvi+1)+𝒜(Δ~vi)\displaystyle\mathcal{A}(S_{i})=\mathcal{A}(\Delta_{v_{i+1}})+\mathcal{A}(\widetilde{\Delta}_{v_{i}}) δ2(|ei+1|+|e~i|)δ2(|ei+1|+|ei|+δ)\displaystyle\leq\frac{\delta}{2}\big{(}|{e_{i+1}}|+|{\widetilde{e}_{i}}|\big{)}\leq\frac{\delta}{2}\big{(}|{e_{i+1}}|+|{{e}_{i}}|+\delta\big{)} (20)

Note that the upper bound on the flat norm distance for the first perturbation in the sequence T1δ+T~k0T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{k_{0}} is given by Theorem 4.4 instead of Eq. (20), namely 𝔽λ(T~1k0T1)=λ𝒜(Sk0)λδ2(|ek0|+|ek0+1|)\mathbb{F}_{\lambda}(\widetilde{T}^{k_{0}}_{1}-T_{1})=\lambda\mathcal{A}(S_{k_{0}})\leq\frac{\lambda\delta}{2}\big{(}|{e_{k_{0}}}|+|{e_{k_{0}+1}}|\big{)}. As was shown above in the Corollary 4.6 under the Assumption 4.3, we get additivity of the flat norm distances between consecutive perturbations, and hence the additivity of the upper bounds. Then we get the Theorem’s claim after doing some algebra:

𝔽λ(T~1kT0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{k}_{1}-T_{0}) =i=k0k𝔽λ(T~1iT~1i1)=λi=k0+1k𝒜(Si)+λ𝒜(Sk0)\displaystyle=\sum_{i=k_{0}}^{k}\mathbb{F}_{\lambda}(\widetilde{T}^{i}_{1}-\widetilde{T}^{i-1}_{1})=\lambda\sum_{i=k_{0}+1}^{k}\mathcal{A}(S_{i})+\lambda\mathcal{A}(S_{k_{0}})
λδ2(i=k0+1k(|e~i|+|ei+1|)+(|ek0|+|ek0+1|))\displaystyle\leq\frac{\lambda\delta}{2}\left(\sum_{i=k_{0}+1}^{k}(|{\widetilde{e}_{i}}|+|{e_{i+1}}|)+(|{e_{k_{0}}}|+|{e_{k_{0}+1}}|)\right)
λδ2(i=k0k+1|ei|+i=k0+1k|ei|+(kk0)δ)\displaystyle\leq\frac{\lambda\delta}{2}\left(\sum_{i=k_{0}}^{k+1}|{e_{i}}|+\sum_{i=k_{0}+1}^{k}|{{e}_{i}}|+(k-k_{0})\delta\right)
=λδ2(|T0[vk01,vk+1]|+|T0[vk0,vk]|+(kk0)δ).\displaystyle=\frac{\lambda\delta}{2}\bigg{(}\big{|}{T_{0}[v_{k_{0}-1},v_{k+1}]}\big{|}+\big{|}{T_{0}[v_{k_{0}},v_{k}]}\big{|}+(k-k_{0})\delta\bigg{)}.

We immediately get the flat norm bounds on the positive δ\delta-perturbations of all interior nodes, as well as for perturbations of all nodes as corollaries.

Corollary 4.8 (Complete perturbation of interior).

Given the setup of Theorem 4.7, consider a sequence of perturbations T1δ+δ+T~1n1T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n-1}_{1} induced by the positive δ\delta-perturbations of all interior nodes [v1,,vn1][v_{1},\ldots,v_{n-1}]. Then an upper bound on 𝔽λ(T~1n1T0)\mathbb{F}_{\lambda}\big{(}\widetilde{T}^{n-1}_{1}-T_{0}\big{)} is given by

𝔽λ(T~1n1T0)\displaystyle\mathbb{F}_{\lambda}\big{(}\widetilde{T}^{n-1}_{1}-T_{0}\big{)} λδ2(|T0|+|T0[v1,vn1]|+(n2)δ)\displaystyle\leq\frac{\lambda\delta}{2}\bigg{(}\big{|}{T_{0}}\big{|}+\big{|}{T_{0}[v_{1},v_{n-1}]}\big{|}+(n-2)\delta\bigg{)}
=λδ(|T0|(|es|+|et|2)+(n2)δ2).\displaystyle=\lambda\delta\bigg{(}\big{|}{T_{0}}\big{|}-\left(\frac{|{e_{s}}|+|{e_{t}}|}{2}\right)+(n-2)\frac{\delta}{2}\bigg{)}.
Corollary 4.9 (Complete perturbation of T1T_{1}).

Given the setup of Theorem 4.7 with n2n\geq 2, consider a sequence of perturbations T1δ+δ+T~1n+1T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n+1}_{1} induced by the positive δ\delta-perturbations of each node of T1T_{1}. Then an upper bound on 𝔽λ(T~1n+1T0)\mathbb{F}_{\lambda}\big{(}\widetilde{T}^{n+1}_{1}-T_{0}\big{)} is given by

𝔽λ(T~1n+1T0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{n+1}_{1}-T_{0}) λδ2(2|T0|+nδ)=λδ(|T0|+nδ2).\displaystyle\leq\frac{\lambda\delta}{2}\big{(}2|{T_{0}}|+n\delta\big{)}=\lambda\delta\left(|{T_{0}}|+\frac{n\delta}{2}\right). (21)
Proof.

Due to the additivity of the flat norm distance between adjacent δ+\delta^{\scalebox{0.4}{+}}-perturbations, we can perturb the interior nodes [v1,,vn1][v_{1},\ldots,v_{n-1}] first, and the boundary nodes after that, since now all edges will be perturbed twice and the order will not affect the upper bound. The perturbation of the interior nodes induces a sequence T1δ+δ+T~1n1T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n-1}_{1} from Corollary 4.8, while [s,t]δ+[s~,t~][s,t]\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}[\widetilde{s},\widetilde{t}] induces T~1n1δ+T~1nδ+T~1n+1\widetilde{T}^{n-1}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n+1}_{1}.

The area components produced by sδ+s~s\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{s} and tδ+t~t\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{t} are spanned by the differences of corresponding perturbations are given by triangles S0=Δs=[s,v~1,s~]𝒞2[T~1nT~1n1]S_{0}=\Delta_{s}=[{s},\widetilde{v}_{1},\widetilde{s}]\in\mathcal{C}_{2}[\widetilde{T}^{n}_{1}-\widetilde{T}^{n-1}_{1}] and Sn=Δt=[t~,v~n1,t]𝒞2[T~1n+1T~1n]S_{n}=\Delta_{t}=[\widetilde{t},\widetilde{v}_{n-1},t]\in\mathcal{C}_{2}[\widetilde{T}^{n+1}_{1}-\widetilde{T}^{n}_{1}]. Their boundaries are:

S0=Δs=e~se~~sδ~s\displaystyle\partial S_{0}=\partial\Delta_{s}=\widetilde{e}_{s}-\widetilde{\widetilde{e}}_{s}-\widetilde{\delta}_{s} and Sn=Δt=e~te~~t+δ~t\displaystyle\partial S_{n}=\partial\Delta_{t}=\widetilde{e}_{t}-\widetilde{\widetilde{e}}_{t}+\widetilde{\delta}_{t}

where δ~v1(v,v~)\widetilde{\delta}_{v}\in\mathcal{L}_{1}(v,\widetilde{v}) such that |δ~v|δ|{\widetilde{\delta}_{v}}|\leq\delta, see Fig. 16. Then the upper bound on the flat norm distance becomes

𝔽λ(T~1n+1T0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{n+1}_{1}-T_{0}) =𝔽λ(T~1n+1T~1n)+𝔽λ(T~1nT~1n1)+𝔽λ(T~1n1T0)\displaystyle=\mathbb{F}_{\lambda}(\widetilde{T}^{n+1}_{1}-\widetilde{T}^{n}_{1})+\mathbb{F}_{\lambda}(\widetilde{T}^{n}_{1}-\widetilde{T}^{n-1}_{1})+\mathbb{F}_{\lambda}(\widetilde{T}^{n-1}_{1}-T_{0})
λδ2(i=2n1|e~i|+i=1n|ei|+|es~|+|et~|)\displaystyle\leq\frac{\lambda\delta}{2}\left(\sum_{i=2}^{n-1}|{\widetilde{e}_{i}}|+\sum_{i=1}^{n}|{{e}_{i}}|+|{\widetilde{e_{s}}}|+|{\widetilde{e_{t}}}|\right)
=λδ2(i=in|e~i|+i=1n|ei|)\displaystyle=\frac{\lambda\delta}{2}\left(\sum_{i=i}^{n}|{\widetilde{e}_{i}}|+\sum_{i=1}^{n}|{{e}_{i}}|\right) (22)
λδ2(2|T0|+nδ)=λδ(|T0|+nδ2).\displaystyle\leq\frac{\lambda\delta}{2}\big{(}2|{T_{0}}|+n\delta\big{)}=\lambda\delta\left(|{T_{0}}|+\frac{n\delta}{2}\right).

Refer to caption
Figure 16: The positive δ\delta-perturbations T~1n1δ+T~1nδ+T~1n+1\widetilde{T}^{n-1}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n}_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n+1}_{1} induced by perturbations of the boundary vertices [s,t][s~,t~][s,t]\rightsquigarrow[\widetilde{s},\widetilde{t}]. The area components S0S_{0} and SnS_{n} that were produced in the result are given by triangles instead of quadrilaterals.

4.3.4 Normalized flat norm of δ\delta-perturbations

Recall from Section 2.2 (Eq. (5)) that the normalized flat norm distance is given as

𝔽~λ(T~1T0)\displaystyle\widetilde{\mathbb{F}}_{\lambda}\big{(}\widetilde{T}_{1}-T_{0}\big{)} =𝔽λ(T~1T0)|T0|+|T~1|.\displaystyle=\frac{\mathbb{F}_{\lambda}\big{(}\widetilde{T}_{1}-T_{0}\big{)}}{\big{|}{T_{0}}\big{|}+\big{|}{\widetilde{T}_{1}}\big{|}}.

Note that the normalized flat norm distance has a natural upper bound given by 𝔽λ(T~1n1T0)/|T0|\nicefrac{{\mathbb{F}_{\lambda}\big{(}\widetilde{T}^{n-1}_{1}-T_{0}\big{)}}}{{\big{|}{T_{0}}\big{|}}} that may be of interest when measurements of one of the input geometries are not available or are not reliable. The following corollary of Theorem 4.7 follows from Eq. (21) by dividing both sides by |T0|\left|{T_{0}}\right|.

Corollary 4.10.

Given the setup of Theorem 4.7 for n2n\geq 2, consider a sequence of perturbations T1δ+δ+T~1n+1T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n+1}_{1} induced by the positive δ\delta-perturbations of each node of T1T_{1}. Then an upper bound on the normalized flat norm distance 𝔽~λ(T~1n+1T0)\widetilde{\mathbb{F}}_{\lambda}\big{(}\widetilde{T}^{n+1}_{1}-T_{0}\big{)} is given by

𝔽~λ(T~1n+1T0)\displaystyle\widetilde{\mathbb{F}}_{\lambda}\big{(}\widetilde{T}^{n+1}_{1}-T_{0}\big{)} λδ(1+n2|T0|δ)=λδ(1+δ2e^)\displaystyle\leq\lambda\delta\left(1+\frac{n}{2\big{|}{T_{0}}\big{|}}\delta\right)=\lambda\delta\left(1+\frac{\delta}{2\hat{e}}\right)

where e^=1n|T0|\hat{e}=\frac{1}{n}|{T_{0}}| is the average edge length of T0T_{0}.

Theorem 4.11.

Let T0n[s,t;2]T_{0}\in\mathcal{L}_{n}[s,t;^{2}] for n2n\geq 2, and T1=T0T_{1}=T_{0} be its copy. Given a small enough scale λ>0\lambda>0 and an appropriate radius of perturbation δ>0\delta>0, consider a sequence of perturbations T1δ+δ+T~1n+1T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n+1}_{1} induced by the positive δ\delta-perturbations of each node of T1T_{1}. Then an upper bound on the normalized flat norm distance 𝔽λ(T~1n+1T0)\mathbb{F}_{\lambda}\big{(}\widetilde{T}^{n+1}_{1}-T_{0}\big{)} is given by

𝔽~λ(T~1n+1T0)\displaystyle\widetilde{\mathbb{F}}_{\lambda}\big{(}\widetilde{T}^{n+1}_{1}-T_{0}\big{)} λδ2(12+e^+δe^+e^n~)=λδ(14+e^+δ2(e^+e^n~))\displaystyle\leq\frac{\lambda\delta}{2}\left(\frac{1}{2}+\frac{\hat{e}+\delta}{\hat{e}+\hat{e}_{\widetilde{n}}}\right)=\lambda\delta\left(\frac{1}{4}+\frac{\hat{e}+\delta}{2(\hat{e}+\hat{e}_{\widetilde{n}})}\right) (23)

where e^n~=1n|T~1n+1|\hat{e}_{\widetilde{n}}=\frac{1}{n}\big{|}{\widetilde{T}^{n+1}_{1}}\big{|} is the average length of perturbed edges.

Proof.

To derive an upper bound for the normalized flat norm distance observe that by the triangle inequality in Δi\Delta_{i} and Δ~i\widetilde{\Delta}_{i} we get the following bounds:

Δi:eiδe~iei+δΔ~i:e~~iδe~ie~~+δei+e~~i2δe~iei+e~~i2+δ.\displaystyle\begin{array}[]{ccccc}\Delta_{i}:&&\displaystyle e_{i}-\delta\leq&\widetilde{e}_{i}&\leq e_{i}+\delta\\[10.00002pt] \widetilde{\Delta}_{i}:&&\displaystyle\widetilde{\widetilde{e}}_{i}-\delta\leq&\widetilde{e}_{i}&\leq\widetilde{\widetilde{e}}+\delta\\[10.00002pt] \Longrightarrow&&\displaystyle\frac{e_{i}+\widetilde{\widetilde{e}}_{i}}{2}-\delta\leq&\widetilde{e}_{i}&\displaystyle\leq\frac{e_{i}+\widetilde{\widetilde{e}}_{i}}{2}+\delta.\end{array}

Let T¯1=e~1++e~n\overline{T}_{1}=\widetilde{e}_{1}+\ldots+\widetilde{e}_{n}. Note that T0=e1++enT_{0}=e_{1}+\ldots+e_{n} and T~1n+1=e~~1++e~~n\widetilde{T}^{n+1}_{1}=\widetilde{\widetilde{e}}_{1}+\ldots+\widetilde{\widetilde{e}}_{n}. We then continue the derivation from the intermediate result in Eq. (22) obtained during the proof of Corollary 4.9 of Theorem 4.7:

𝔽λ(T~1n+1T0)\displaystyle\mathbb{F}_{\lambda}(\widetilde{T}^{n+1}_{1}-T_{0}) λδ2(i=1n|e~i|+i=1n|ei|)=λδ2(|T¯1|+|T0|)\displaystyle\leq\frac{\lambda\delta}{2}\left(\sum_{i=1}^{n}|{\widetilde{e}_{i}}|+\sum_{i=1}^{n}|{{e}_{i}}|\right)=\frac{\lambda\delta}{2}\bigg{(}\big{|}{\overline{T}_{1}}\big{|}+\big{|}{T_{0}}\big{|}\bigg{)}
λδ2(|T0|+|T~1n+1|2+|T0|+nδ).\displaystyle\leq\frac{\lambda\delta}{2}\left(\frac{\big{|}{T_{0}}\big{|}+\big{|}{\widetilde{T}^{n+1}_{1}}\big{|}}{2}+|{T_{0}}|+n\delta\right).

Recall that e^n~=|T~1n+1|/n\hat{e}_{\widetilde{n}}=\nicefrac{{\big{|}{\widetilde{T}^{n+1}_{1}}\big{|}}}{{n}}. Then,

𝔽λ(T~1n+1T0)|T0|+|T~1n+1|\displaystyle\frac{\mathbb{F}_{\lambda}(\widetilde{T}^{n+1}_{1}-T_{0})}{\big{|}{T_{0}}\big{|}+\big{|}{\widetilde{T}^{n+1}_{1}}\big{|}} λδ2(12+|T0||T0|+|T~1n+1|+n|T0|+|T~1n+1|δ)\displaystyle\leq\frac{\lambda\delta}{2}\left(\frac{1}{2}+\frac{\big{|}{T_{0}}\big{|}}{\big{|}{T_{0}}\big{|}+\big{|}{\widetilde{T}^{n+1}_{1}}\big{|}}+\frac{n}{|{T_{0}}|+|{\widetilde{T}^{n+1}_{1}}|}\delta\right)
=λδ2(12+e^e^+e^n~+δe^+e^n~).\displaystyle=\frac{\lambda\delta}{2}\left(\frac{1}{2}+\frac{\hat{e}}{\hat{e}+\hat{e}_{\widetilde{n}}}+\frac{\delta}{\hat{e}+\hat{e}_{\widetilde{n}}}\right).

Combining the generic upper bound in Eq. (10) that holds for small enough λ>0\lambda>0 and an appropriate δ>0\delta>0, we can rewrite Eq. (23) as

𝔽~λ(T~1n+1T0)\displaystyle\widetilde{\mathbb{F}}_{\lambda}(\widetilde{T}^{n+1}_{1}-T_{0}) min{λδ(14+e^+δ2(e^+e^n~)), 1}.\displaystyle\leq\min\left\{\lambda\delta\left(\frac{1}{4}+\frac{\hat{e}+\delta}{2(\hat{e}+\hat{e}_{\widetilde{n}})}\right),\,1\right\}. (24)
Corollary 4.12.

In the case of a non-shrinking perturbation sequence T1δ+δ+T~1n+1T_{1}\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\ldots\overset{\delta^{\scalebox{0.4}{+}}}{\rightsquigarrow}\widetilde{T}^{n+1}_{1} such that |T~1n+1||T1|=|T0|\big{|}{\widetilde{T}^{n+1}_{1}}\big{|}\geq\big{|}{T_{1}}\big{|}=\big{|}{T_{0}}\big{|}, the upper bound on the normalized flat norm distance is given as

𝔽~λ(T~1n+1T0)\displaystyle\widetilde{\mathbb{F}}_{\lambda}(\widetilde{T}^{n+1}_{1}-T_{0}) λδ(34+δ4e^).\displaystyle\leq\lambda\delta\left(\frac{3}{4}+\frac{\delta}{4\hat{e}}\right).

5 Statistical analysis of normalized flat norm

We use the proposed multiscale flat norm to compare a pair of network geometries from power distribution networks for a region in a county in USA. The two networks considered are the actual power distribution network for the region and the synthetic network generated using the methodology proposed by Meyur et al. [17]. We provide a brief overview of these networks.

Actual network. The actual power distribution network was obtained from the power company serving the location. Due to its proprietary nature, node and edge labels were redacted from the shared data. Further, the networks were shared as a set of handmade drawings, many of which had not been drawn to a well-defined scale. We digitized the drawings by overlaying them on OpenStreetMaps [24] and georeferencing to particular points of interest [8]. Geometries corresponding to the actual network edges are obtained as shape files.

Synthetic network. The synthetic power distribution network is generated using a framework with the underlying assumption that the network follows the road network infrastructure to a significant extent [17]. To this end, the residences are connected to local pole top transformers located along the road network to construct the low voltage (LV) secondary distribution network. The local transformers are then connected to the power substation following the road network leading to the medium voltage (MV) primary distribution network. That is, the primary network edges are chosen from the underlying road infrastructure network such that the structural and power engineering constraints are satisfied.

In this section we study the empirical distribution of the normalized flat norm 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} for different local regions and argue that it indeed captures the similarity between input geometries. We use Algorithm (2) to sample random square shaped regions of size 2ϵ×2ϵ2\epsilon\times 2\epsilon steradians from a given geographic location.

Input: Geometries 1,2\mathscr{E}_{1},\mathscr{E}_{2}, number of regions NN
Parameter: Size of region ϵ\epsilon
1:  Find bounding rectangle for the pair of geometries: bound=𝗋𝖾𝖼𝗍(1,2)\mathscr{E}_{\textrm{bound}}=\mathsf{rect}\left(\mathscr{E}_{1},\mathscr{E}_{2}\right).
2:  Initialize set of regions: {}\mathscr{R}\leftarrow\{\}.
3:  while ||N\left|\mathscr{R}\right|\leq N do
4:     Sample a point (x,y)\left(x,y\right) uniformly from region bounded by bound\mathscr{E}_{\textrm{bound}}.
5:     Define the square region r(x,y)r\left(x,y\right) formed by the corner points {(xϵ,yϵ),(x+ϵ,y+ϵ)}\left\{\left(x-\epsilon,y-\epsilon\right),\left(x+\epsilon,y+\epsilon\right)\right\}.
6:     if r(x,y)12r\left(x,y\right)\cap\mathscr{E}_{1}\cap\mathscr{E}_{2}\neq\emptyset then
7:        Add region r(x,y)r\left(x,y\right) to the set of sampled regions: {r(x,y)}\mathscr{R}\leftarrow\mathscr{R}\cup\left\{r\left(x,y\right)\right\}.
8:     end if
9:  end while
Output: Set of sampled regions: \mathscr{R}.
Algorithm 2 Sample square regions from location

We perform our empirical studies for two urban locations of a county in USA. These locations have been identified as ‘Location A’ and ‘Location B’ for the remainder of this paper. We consider local regions of sizes characterized by ϵ{0.0005,0.001,0.0015,0.002}\epsilon\in\{0.0005,0.001,0.0015,0.002\}. For each location, we randomly sample N=50N=50 local regions for each value of ϵ\epsilon using Algorithm (2) and hence we consider 50×4=20050\times 4=200 regions. For every sampled region, we use Algorithm (1) to compute the multiscale flat norm between the network geometries contained within the region with scale parameter λ{103,25×103,50×103,75×103,105}\lambda\in\{10^{3},25\times 10^{3},50\times 10^{3},75\times 10^{3},10^{5}\}. Thereafter, we normalize the computed flat norm using Eq. (5). Additionally, we compute the global normalized flat norm for the entire location and indicate it by 𝔽~λG\widetilde{\mathbb{F}}_{\lambda}^{G}. The corresponding square box bounding the entire location is characterized by ϵG\epsilon_{G}. We also denote the total length of networks in each location scaled by the size of the location by the ratio |TG|/ϵG|T_{G}|/\epsilon_{G}. The detailed statistical results for the experiments are included in the Appendix.

5.1 Empirical distribution of 𝔽~λ\widetilde{\mathbb{F}}_{\lambda}

First, we show the histogram of normalized flat norms for Location A and Location B with the five different values for the scale parameter λ\lambda, Fig. 17. Each histogram shows the empirical distribution of normalized flat norm values 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} for 200200 uniformly sampled local regions (5050 regions for each ϵ\epsilon). We also record the global normalized flat norm between the network geometries of the location 𝔽~λG\widetilde{\mathbb{F}}_{\lambda}^{G} and denote it by the solid blue line in each histogram. We show the mean normalized flat norm 𝔽^λ\widehat{\mathbb{F}}_{\lambda} using the solid green line, with dashed green lines indicating the standard deviation of the distribution. As we can see from the above histograms, the distribution is skewed toward the right for high values of the scale parameter λ\lambda. This follows from our previous discussion of the dependence of the flat norm on the scale parameter: for a large λ\lambda, the area patches are weighed higher in the objective function of the flat norm LP (Eq. (4)). Therefore, the contribution of lengths of the input currents T1T_{1} and T2T_{2} towards the flat norm distance becomes more dominant at the high values of the scale parameter λ\lambda, so that the flat norm 𝔽λ(T1T2)\mathbb{F}_{\lambda}\left(T_{1}-T_{2}\right) is slowly approaching the total network length |T1|+|T2||T_{1}|+|T_{2}|. Hence, the normalized flat norm is approaching 11. For the remainder of the paper, we will continue our discussion with scale parameter λ=1000\lambda=1000 since the empirical distributions of normalized flat norm corresponding to λ=1000\lambda=1000 indicate almost Gaussian distribution.

Refer to caption
Refer to caption
Figure 17: Distribution of normalized flat norm computed using five different values of λ\lambda for 200200 uniformly sampled local regions in Location A (top) and Location B (bottom). The blue line in each histogram denotes the global normalized flat norm 𝔽~λG\widetilde{\mathbb{F}}_{\lambda}^{G} computed for the location with the corresponding scale λ\lambda. The solid green line denotes the mean normalized flat norm 𝔽^λ\widehat{\mathbb{F}}_{\lambda} for the uniformly sampled local regions computed with scale λ\lambda. The dashed green lines show the spread of the distribution.

Next, we consider the empirical distribution of normalized flat norm computed with scale parameter λ=1000\lambda=1000 for uniformly sampled local regions in Location A and Location B, Fig. 18. We show separate histograms for four different-sized local regions (different values of ϵ\epsilon). Note that for small-sized local regions (low ϵ\epsilon), the distribution is skewed toward the right. This is because when we consider small regions, we often capture very isolated network geometries and the flat norm computation is close to the total network length 𝔽λ(T1T2)|T1|+|T2|\mathbb{F}_{\lambda}\left(T_{1}-T_{2}\right)\rightarrow|T_{1}|+|T_{2}|, which again leads the normalized flat norm to be close to 11. Such occurrences are avoided in larger local regions, and therefore we do not observe skewed distributions.

Refer to caption
Refer to caption
Figure 18: Distribution of normalized flat norm computed using λ=1000\lambda=1000 for 5050 uniformly sampled local regions with four different sizes ϵ\epsilon in Location A (top) and Location B (bottom). The blue line in each histogram denotes the global normalized flat norm 𝔽~λG\widetilde{\mathbb{F}}_{\lambda}^{G}. The solid green line denotes the mean normalized flat norm 𝔽^λ\widehat{\mathbb{F}}_{\lambda} for the uniformly sampled local regions. The dashed green lines show the spread of the distribution.

5.2 Distribution of 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} across local regions

Refer to caption
Figure 19: Plots showing normalized flat norm computed for entire Location A and few local regions within it. The scatter plot (top left plot) shows the empirical distribution of (|T|/ϵ,𝔽~λ)\left(|T|/\epsilon,\widetilde{\mathbb{F}}_{\lambda}\right) values with the global normalized flat norm (|TG|/ϵG,𝔽~λG)\left(|T_{G}|/\epsilon_{G},\widetilde{\mathbb{F}}_{\lambda}^{G}\right) for the region (blue star). Nine local regions (three with small 𝔽~λ\widetilde{\mathbb{F}}_{\lambda}, three with large 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} and three with (|T|/ϵ,𝔽~λ)\left(|T|/\epsilon,\widetilde{\mathbb{F}}_{\lambda}\right) values close to the global value (|TG|/ϵG,𝔽~λG)\left(|T_{G}|/\epsilon_{G},\widetilde{\mathbb{F}}_{\lambda}^{G}\right)) are additionally highlighted. The local regions are highlighted along with the pair of network geometries (top right plot). The normalized flat norm computation (with scale λ=1000\lambda=1000) for the local regions are shown in bottom plots.

The scatter plot in the top left of Fig. 19 shows the empirical distribution of (|T|/ϵ,𝔽~λ)\left(|T|/\epsilon,\widetilde{\mathbb{F}}_{\lambda}\right) values. The scatter plot highlights as a blue star the global value (|TG|/ϵG,𝔽~λG)\left(|T_{G}|/\epsilon_{G},\widetilde{\mathbb{F}}_{\lambda}^{G}\right) of Location A, which indicates the normalized flat norm computed for the entire location. The global normalized flat norm (with a scale parameter λ=1000\lambda=1000) for Location A is 𝔽~λG=0.439\widetilde{\mathbb{F}}_{\lambda}^{G}=0.439 and the ratio |TG|/ϵG=0.528|T_{G}|/\epsilon_{G}=0.528. Further, nine additional points are highlighted in the scatter plot denoting nine local regions within Location A. The solid green line denotes the mean of the normalized flat norm values and the dashed green lines indicate the spread of the values around the mean.

Refer to caption
Figure 20: Plots showing normalized flat norm computed for entire Location B and few local regions within it. The scatter plot (top left plot) shows the empirical distribution of (|T|/ϵ,𝔽~λ)\left(|T|/\epsilon,\widetilde{\mathbb{F}}_{\lambda}\right) values with the global normalized flat norm (|TG|/ϵG,𝔽~λG)\left(|T_{G}|/\epsilon_{G},\widetilde{\mathbb{F}}_{\lambda}^{G}\right) for the region (blue star). Nine local regions (three with small 𝔽~λ\widetilde{\mathbb{F}}_{\lambda}, three with large 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} and three with (|T|/ϵ,𝔽~λ)\left(|T|/\epsilon,\widetilde{\mathbb{F}}_{\lambda}\right) values close to the global value (|TG|/ϵG,𝔽~λG)\left(|T_{G}|/\epsilon_{G},\widetilde{\mathbb{F}}_{\lambda}^{G}\right)) are additionally highlighted. The local regions are highlighted along with the pair of network geometries (top right plot). The normalized flat norm computation (with scale λ=1000\lambda=1000) for the local regions are shown in bottom plots.

The nine local regions are selected such that three of them have the minimum 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} in the location (highlighted by cyan colored diamonds), three of them have the maximum 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} in the location (highlighted by purple triangles), and the remaining three local regions have the (|T|/ϵ,𝔽~λ)\left(|T|/\epsilon,\widetilde{\mathbb{F}}_{\lambda}\right) values close to the global value (|TG|/ϵG,𝔽~λG)\left(|T_{G}|/\epsilon_{G},\widetilde{\mathbb{F}}_{\lambda}^{G}\right) for the location (highlighted by tan plus symbols). The network geometries within each region and the flat norm computation with scale λ=1000\lambda=1000 are shown in the bottom plots. The computed flat norm 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} and ratio |T|/ϵ|T|/\epsilon values are shown above each plot. The local regions are also highlighted (cyan, purple, and tan colored boxes, respectively) in the top right plot where the actual and synthetic network geometries within the entire location are overlaid. Fig. 20 shows similar local regions from Location B.

From a mere visual inspection of both Figs. 19 and 20, we notice that the network geometries in each local region shown in the first row of the bottom plots resemble and almost overlap each other. The computed normalized flat norm 𝔽~λ\widetilde{\mathbb{F}}_{\lambda} values for these local regions agree with this observation. Similarly, the large value of the normalized flat norm justifies the observation that network geometries for local regions depicted in the second row of the bottom plots do not resemble each other. These observations validate our choice of using normalized flat norm as a suitable measure to compare network geometries for local regions.

6 Conclusions

We have proposed a fairly general metric to compare a pair of network geometries embedded on the same plane. Unlike standard approaches that map the geometries to points in a possibly simpler space and then measuring distance between those points [11], or comparing “signatures” for the geometries, our metric works directly in the input space and hence allows us to capture all details in the input. The metric uses the multiscale flat norm from geometric measure theory, and can be used in more general settings as long as we can triangulate the region containing the two geometries. It is impossible to derive standard stability results for this distance measure that imply only small changes in the flat norm metric when the inputs change by a small amount—there is no alternative metric to measure the small change in the input. For instance, our theoretical example (in Fig. 8) shows that the commonly used Hausdorff metric cannot be used for this purpose. Instead, we have derived upper bounds on the flat norm distance between a piecewise linear 1-current and its perturbed version as a function of the radius of perturbation under certain assumptions provided the perturbations are performed carefully (see Section 4.3). On the other hand, we do get natural stability results for our distance following the properties of the flat norm—small changes in the input geometries lead to only small changes in the flat norm distance between them [9, 20].

We use the proposed metric to compare a pair of power distribution networks: (i) actual power distribution networks of two locations in a county of USA obtained from a power company and (ii) synthetically generated digital duplicate of the network created for the same geographic location. The proposed comparison metric is able to perform global as well as local comparison of network geometries for the two locations. We discuss the effect of different parameters used in the metric on the comparison. Further, we validate the suitability of using the flat norm metric for such comparisons using computation as well as theoretical examples.

References

  • [1] Mahmuda Ahmed, Brittany Terese Fasy, Kyle S. Hickmann, and Carola Wenk. A path-based distance for street map comparison. ACM Trans. Spatial Algorithms Syst., 1(1), 2015.
  • [2] Mahmuda Ahmed, Brittany Terese Fasy, and Carola Wenk. Local persistent homology based distance between maps. In Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL ’14, page 43–52, New York, NY, USA, 2014. Association for Computing Machinery.
  • [3] Yunsheng Bai, Hao Ding, Song Bian, Ting Chen, Yizhou Sun, and Wei Wang. SimGNN: A Neural Network Approach to Fast Graph Similarity Computation. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, WSDM ’19, page 384–392, New York, NY, USA, 2019. Association for Computing Machinery.
  • [4] Antoine Bidel, Tom Schelo, and Thomas Hamacher. Synthetic distribution grid generation based on high resolution spatial data. In 2021 IEEE International Conference on Environment and Electrical Engineering and 2021 IEEE Industrial and Commercial Power Systems Europe (EEEIC / ICPS Europe), pages 1–6, Bari, Italy, 2021. IEEE.
  • [5] Maria Antonia Brovelli, Marco Minghini, Monia Molinari, and Peter Mooney. Towards an automated comparison of openstreetmap with authoritative road datasets. Transactions in GIS, 21(2):191–206, 2017.
  • [6] Alessio Cardillo, Salvatore Scellato, Vito Latora, and Sergio Porta. Structural properties of planar graphs of urban street patterns. Phys. Rev. E, 73:066107, 2006.
  • [7] David Eppstein. Subgraph isomorphism in planar graphs and related problems. In Proceedings of the Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’95, page 632–640, USA, 1995. Society for Industrial and Applied Mathematics.
  • [8] ESRI. Georeferencing a raster to a vector, 2022. https://desktop.arcgis.com/en/arcmap/ latest/manage-data/raster-and-images/georeferencing-a-raster-to-a-vector.html; last accessed 26 Sept 2022.
  • [9] Herbert Federer. Geometric Measure Theory. Die Grundlehren der mathematischen Wissenschaften, Band 153. Springer-Verlag, New York, 1969.
  • [10] Sharif Ibrahim, Bala Krishnamoorthy, and Kevin R. Vixie. Simplicial flat norm with scale. Journal of Computational Geometry, 4(1):133–159, 2013. arXiv:1105.5104.
  • [11] David G. Kendall, Dennis M. Barden, T. Keith Carne, and Huiling Le. Shape and Shape Theory. Wiley Series in Probability and Statistics. Wiley, Hoboken, NJ, USA, 2009.
  • [12] Venkat Krishnan, Bruce Bugbee, Tarek Elgindy, Carlos Mateo, Pablo Duenas, Fernando Postigo, Jean-Séebastien Lacroix, Tomás Gómez San Roman, and Bryan Palmintier. Validation of synthetic u.s. electric power distribution system data sets. IEEE Transactions on Smart Grid, 11(5):4477–4489, 2020.
  • [13] Hanyue Li, Jessica L. Wert, Adam Barlow Birchfield, Thomas J. Overbye, Tomas Gomez San Roman, Carlos Mateo Domingo, Fernando Emilio Postigo Marcos, Pablo Duenas Martinez, Tarek Elgindy, and Bryan Palmintier. Building highly detailed synthetic electric grid data sets for combined transmission and distribution systems. IEEE Open Access Journal of Power and Energy, 7:478–488, 2020.
  • [14] Ming Liang, Yao Meng, Jiyu Wang, David L. Lubkeman, and Ning Lu. Feedergan: Synthetic feeder generation via deep graph adversarial nets. IEEE Transactions on Smart Grid, 12(2):1163–1173, 2021.
  • [15] Sushovan Majhi and Carola Wenk. Distance measures for geometric graphs. Computational Geometry, 118:102056, 2024.
  • [16] Carlos Mateo, Fernando Postigo, Fernando de Cuadra, Tomás Gómez San Roman, Tarek Elgindy, Pablo Dueñas, Bri-Mathias Hodge, Venkat Krishnan, and Bryan Palmintier. Building large-scale u.s. synthetic electric distribution system models. IEEE Transactions on Smart Grid, 11(6):5301–5313, 2020.
  • [17] Rounak Meyur, Madhav Marathe, Anil Vullikanti, Henning Mortveit, Samarth Swarup, Virgilio Centeno, and Arun Phadke. Creating realistic power distribution networks using interdependent road infrastructure. In 2020 IEEE International Conference on Big Data (Big Data), pages 1226–1235, Atlanta, GA, USA, 2020. IEEE.
  • [18] Rounak Meyur, Anil Vullikanti, Samarth Swarup, Henning Mortveit, Virgilio Centeno, Arun Phadke, Vince Poor, and Madhav Marathe. Ensembles of realistic power distribution networks. Proceedings of the National Academy of Sciences, 119(26):e2123355119, 2022.
  • [19] Ignacio Morer, Alessio Cardillo, Albert Díaz-Guilera, Luce Prignano, and Sergi Lozano. Comparing spatial networks: A one-size-fits-all efficiency-driven approach. Phys. Rev. E, 101:042301, 2020.
  • [20] Frank Morgan. Geometric Measure Theory: A Beginner’s Guide. Academic Press, Cambridge, MA, USA, fourth edition, 2008.
  • [21] Frank Morgan. Geometric Measure Theory: A Beginner’s Guide. Academic Press, Cambridge, MA, USA, 5th edition, 2016.
  • [22] Simon P. Morgan and Kevin R. Vixie. L1L^{1}TV computes the flat norm for boundaries. Abstract and Applied Analysis, 2007:Article ID 45153,14 pages, 2007. arXiv:0612287.
  • [23] Seongmin Ok. A graph similarity for deep learning. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 1–12, Vancouver, BC, Canada, 2020. Curran Associates, Inc.
  • [24] Open Street Map Foundation. Open Street Maps, 2022. www.openstreetmap.org; Last accessed 20 Aug 2022.
  • [25] Benjamin Paaßen, Claudio Gallicchio, Alessio Micheli, and Barbara Hammer. Tree edit distance learning via adaptive symbol embeddings. In Jennifer Dy and Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pages 3976–3985. PMLR, 10–15 Jul 2018.
  • [26] Pau Riba, Andreas Fischer, Josep Lladós, and Alicia Fornés. Learning graph edit distance by graph neural networks. Pattern Recognition, 120:108132, 2021.
  • [27] Kamol Chandra Roy, Samiul Hasan, Aron Culotta, and Naveen Eluru. Predicting traffic demand during hurricane evacuation using real-time data from transportation systems and social media. Transportation Research Part C: Emerging Technologies, 131:103339, 2021.
  • [28] Shammya Shananda Saha, Eran Schweitzer, Anna Scaglione, and Nathan G. Johnson. A framework for generating synthetic distribution feeders using openstreetmap. In 2019 North American Power Symposium (NAPS), pages 1–6, Wichita, KS, USA, 2019. IEEE.
  • [29] Eran Schweitzer, Anna Scaglione, Antonello Monti, and Giuliano Andrea Pagani. Automated generation algorithm for synthetic medium voltage radial distribution systems. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 7(2):271–284, 2017.
  • [30] Hang Si. Constrained Delaunay tetrahedral mesh generation and refinement. Finite Elements in Analysis and Design, 46:33–46, 2010.
  • [31] Mattia Tantardini, Francesca Ieva, Lucia Tajoli, and Carlo Piccardi. Comparing methods for comparing networks. Scientific Reports, 9(1):17557, 2019.
  • [32] Hangjun Xu. An algorithm for comparing similarity between two trees. https://arxiv.org/abs/1508.03381, 2015. Last accessed 26 Sept 2022.