Minimum Cuts in Directed Graphs via $\sqrt{n}$ Max-Flows

Ruoxu Cen, Duke University
Jason Li, Carnegie Mellon University
Danupon Nanongkai, University of Copenhagen & KTH
Debmalya Panigrahi, Duke University
Thatchaphol Saranurak, University of Michigan, Ann Arbor

Abstract

We give an algorithm to find a mincut in an $n$ -vertex, $m$ -edge weighted directed graph using $\tilde{O}(\sqrt{n})$ calls to any maxflow subroutine. Using state of the art maxflow algorithms, this yields a directed mincut algorithm that runs in $\tilde{O}(m\sqrt{n}+n^{2})$ time. This improves on the 30 year old bound of $\tilde{O}(mn)$ obtained by Hao and Orlin for this problem.

1 Introduction

The minimum cut (or mincut) problem is one of the most widely studied problems in graph algorithms. In directed graphs (or digraphs), the goal of this problem is to find a minimum weight set of edges whose removal creates multiple strongly connected components in the graph. Equivalently, a minimum cut of a digraph is a bipartition of the vertices into two non-empty sets $(S,V\setminus S)$ such that the weight of edges from $S$ to $V\setminus S$ is minimized. This problem can be solved by using $O(n)$ maxflow calls by finding the minimum among all the $r-v$ mincuts and $v-r$ mincuts for all vertices $v\not=r$ . In a beautiful result, Hao and Orlin [HO94] showed that these maxflow calls can be amortized to match the running time of only polylog( $n$ ) calls to the push-relabel maxflow algorithm [GT88]. This leads to an overall running time of $\tilde{O}(mn)$ on an $m$ -edge, $n$ -vertex graph. Since their work, better maxflow algorithms have been designed (e.g., Goldberg and Rao [GR98]), but the amortization does not work for these algorithms. Using a different technique of duality between rooted mincuts and arborescences, Gabow [Gab95] obtained a running time of $\tilde{O}(m\lambda)$ for this problem, where $\lambda$ is the weight of a mincut (assuming integer weights). This is at least as good as the Hao-Orlin running time for unweighted simple graphs, but can be much worse for weighted graphs. Indeed, the Hao-Orlin bound of $\tilde{O}(mn)$ remains the state of the art for the directed mincut problem on arbitrary weighted graphs.

In this paper, we give an algorithm for the directed mincut problem that has a time complexity of $\tilde{O}(\sqrt{n})$ maxflow calls. Importantly, and unlike the Hao-Orlin algorithm, our algorithm can use any maxflow algorithm; in fact, it treats the maxflow algorithm as a black box. Using state of the art maxflow algorithms that run in $\tilde{O}(m+n^{3/2})$ time [vdBLL⁺21], this yields a directed mincut algorithm in $\tilde{O}(m\sqrt{n}+n^{2})$ time, thereby improving on the Hao-Orlin bound. Moreover, if one were to believe the widely held conjecture that maxflow would eventually be solved in $\tilde{O}(m)$ time, then the running time of our algorithm would automatically become $\tilde{O}(m\sqrt{n})$.

Theorem 1.

There is a randomized Monte Carlo algorithm that finds a minimum cut whp in $\tilde{O}(m\sqrt{n}+n^{2})$ time on an $n$ -vertex, $m$ -edge directed graph.

Our Techniques.

At a high level, our paper is inspired by Karger’s celebrated near-linear time mincut algorithm in undirected graphs [Kar00]. Karger’s algorithm has three main steps: (a) sparsify the graph by random sampling of edges to reduce the mincut value to $O(\log n)$ , (b) use a semi-duality between mincuts and spanning trees to pack $O(\log n)$ edge-disjoint spanning trees in the sparsifier, and (c) find the minimum weight cut among those that have only one or two edges in each such spanning tree using a dynamic program. But, directed graphs are substantially different from undirected graphs. In particular, steps (a) and (c) are not valid in a directed graph. We cannot hope to sparsify a directed graph since many directed graphs do not have sparsifiers even in an existential sense. Moreover, even if a mincut had just a single edge in a spanning tree, Karger’s dynamic program to recover this cut cannot be used in a directed graph.

To overcome these challenges, we adopt several ingredients that we outline below:

•

We consider two possibilities: either the mincut has $\tilde{\Omega}(\sqrt{n})$ vertices on the smaller side or fewer (let us call these balanced and unbalanced cuts respectively). If the mincut is a balanced cut, we use two random samples of $\tilde{O}(\sqrt{n})$ and $\tilde{O}(1)$ vertices each, and find $s-t$ mincuts for all pairs of vertices from the two samples. It is easy to see that whp, the two samples would respectively hit the smaller and larger sides of the mincut, and hence, one of these $s-t$ mincuts will reveal the overall mincut of the graph.
•

The main task, then, is to find the mincut when it is unbalanced. In this case, we use a sequence of steps. The first step is to use cut sparsification of the graph by random sampling of edges. This scales down the size of the mincut, but unlike in an undirected graph, all the cuts of a digraph do not necessarily converge to their expected values in the sample. However, crucially, the mincut can be scaled to $\tilde{O}(\sqrt{n})$ while ensuring that all the unbalanced cuts converge to their expected values.
•

Since only the unbalanced cuts converge to their expected values, it is possible that some balanced cut is the new mincut of the sampled graph, having been scaled down disproportionately by the random sampling. Our next step is to overlay this sampled graph with an expander graph, a technique inspired by recent work of Li [Li21]. Note that an expander has a larger weight for balanced cuts than for unbalanced cuts. We choose the expansion of the graph carefully so that the balanced cuts get sufficiently large weight of edges that they are no longer candidates for the mincut of the sample, while the unbalanced cuts are only distorted by a small multiplicative factor.
•

At this point, we have obtained a graph where the original mincut (which was unbalanced) is a near-mincut of the new graph. Next, we create a (fractional) packing of edge-disjoint arborescences in this graph using a multiplicative weights update procedure (e.g., [You95]). (Footnote 1: An arborescence is a spanning tree in a directed graph where all the edges are directed away from the root.) By duality, these arborescences have the following property: if we sample $O(\log n)$ random arborescences from this packing, then there will be at least one arborescence whp such that the original mincut $1$-respects the arborescence. (A cut $1$-respects an arborescence if the latter contains just one edge from the cut.)
•

Thus, our task reduces to the following: given an arborescence, find the minimum weight cut in the original graph among all those that $1$ -respect the arborescence. Our final technical contribution is to give an algorithm that solves this problem using $O(\log n)$ maxflow computations. For this purpose, we use a centroid-based recursive decomposition of the arborescence, where in each step, we use a set of maxflow calls that can be amortized on the original graph. The minimum cut returned by all these maxflow calls is eventually returned as the mincut of the graph.

We note that, unlike the Hao-Orlin algorithm and Gabow’s algorithm, which are both deterministic, our algorithm is randomized (Monte Carlo) and might yield the wrong answer with a small (inverse polynomial) probability. Derandomizing our algorithm, or matching our running time bound using a different deterministic algorithm, remains an interesting open problem.

Previous Work.

The mincut problem has been studied in directed graphs over several decades. For unweighted graphs, Even and Tarjan [ET75] gave an algorithm for this problem that runs in $O(mn\cdot\min(\sqrt{m},n^{2/3}))$ time. Schnorr [Sch79] improved this bound for certain graphs to $O(mn\lambda)$, where $\lambda$ is the value of the directed mincut. This was further improved by Mansour and Schieber [MS89] to $O(n\cdot\min(m,\lambda^{2}n))$ almost a decade after Schnorr’s work. Mansour and Schieber’s bound of $O(mn)$ was matched up to logarithmic factors for the more general case of weighted digraphs by Hao and Orlin [HO94]. Finally, Gabow [Gab95] gave an algorithm that runs in $\tilde{O}(m\lambda)$ time, which further refines this bound for graphs with small $\lambda$. These remained the fastest directed mincut algorithms for almost 30 years before our work.

Concurrent Work.

Two recent results on algorithms for finding mincuts in directed graphs were obtained concurrently and independently of our work. First, Chekuri and Quanrud [CQ21] showed an exact algorithm with running time $\tilde{O}(n^{2}U)$ if edge weights are integers between $1$ and $U$. (Footnote 2: We note that we can use the degree reduction technique in [CQ21] to speed up our algorithm from time $\tilde{O}(m\sqrt{n}+n^{2})$ to $\tilde{O}(n^{2})$, but we omit details of this improvement in this manuscript to avoid interdependence on concurrent, unpublished research.) Second, Quanrud [Qua21] has obtained a $(1+\epsilon)$-approximate algorithm that runs in $\tilde{O}(n^{2}/\epsilon^{2})$ time. (Footnote 3: Quanrud can also obtain $o(mn)$-time algorithms using the currently fastest maxflow algorithm on sparse graphs by Gao, Liu, and Peng [GLP21]. We can also obtain the same time but for exact mincuts.) Both papers also extend their ideas to obtain approximation results for other problems, such as the vertex mincut problem.

2 The Directed Min-Cut Algorithm

Given a directed graph $G=(V,E)$ with non-negative edge weights $w$ , we consider the problem of finding a (global) minimum directed cut in this graph. For simplicity, we assume that all edge weights $w$ are integers and are polynomially bounded. We denote $\overline{U}=V\setminus U$ . Let $\partial U$ be the set of edges in the cut $(U,\overline{U})$ , and let $\delta(U)$ be the weight of the cut, i.e., $\delta(U)=\sum_{e\in\partial U}w(e)$ . Our goal is to find $\arg\min_{\emptyset\subset U\subset V}\delta(U)$ . Let $MF(m,n)$ denote the time complexity of $s$ - $t$ maximum flow on a digraph with $n$ vertices and $m$ edges. The current record for this bound is $MF(m,n)=\tilde{O}(m+n^{3/2})$ [vdBLL⁺21]. We emphasize that our directed mincut algorithm uses maxflow subroutines in a black box manner and therefore, any maxflow algorithm suffices. Correspondingly, we express our running times in terms of $MF(m,n)$ .
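To make the notation concrete, the following minimal Python sketch evaluates $\delta(U)$ and finds $\arg\min_{\emptyset\subset U\subset V}\delta(U)$ by exhaustive search. It is only a reference oracle for tiny inputs, not part of the algorithm in this paper; the function names and the example graph are hypothetical.

```python
from itertools import combinations

def cut_weight(edges, U):
    """delta(U): total weight of edges whose tail is in U and whose head is not."""
    U = set(U)
    return sum(w for (a, b, w) in edges if a in U and b not in U)

def brute_force_mincut(vertices, edges):
    """Return (value, U) minimizing delta(U) over all nonempty proper subsets U.

    Exponential time: intended only as a correctness oracle on tiny graphs.
    """
    vertices = list(vertices)
    best = None
    for size in range(1, len(vertices)):
        for U in combinations(vertices, size):
            value = cut_weight(edges, U)
            if best is None or value < best[0]:
                best = (value, set(U))
    return best

# Hypothetical example: a directed 4-cycle plus one chord, edges as (tail, head, weight).
edges = [(0, 1, 3), (1, 2, 1), (2, 3, 4), (3, 0, 2), (0, 2, 5)]
value, U = brute_force_mincut(range(4), edges)
```

On this example the only edge leaving vertex 1 has weight 1, so the search returns the cut with source side $\{1\}$.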

Next we describe the algorithm. Let $S^{*}$ be the source side of a minimum cut. The algorithm considers the following two cases, computes a cut for each case and takes the smaller of the two cuts as its final output.

1.

The first case aims to compute the correct mincut in the event that $\min\{|S^{*}|,|\overline{S^{*}}|\}>\theta\cdot\sqrt{n}/\log n$. In this case, if we randomly sample two vertices $s,t\in V$, then with reasonable probability they will lie on opposite sides of the mincut, and we can simply compute the maxflow from $s$ to $t$. Repeating the sampling $O(\sqrt{n}\log^{2}n)$ times, we obtain the mincut whp. The total running time for this case is $O(MF(m,n)\sqrt{n}\log^{2}n)$ and is formalized in Lemma 2 below:

Lemma 2.

If $\min\{|S^{*}|,|\overline{S^{*}}|\}>r$ , then whp a mincut can be calculated in time $O(MF(m,n)\cdot(n/r)\cdot\log n)$ .

Proof.

Uniformly sample a list of $k=d\cdot(n/r)\cdot\lg n$ vertices $u_{1},\ldots,u_{k}$ , where $d$ is a large constant. Wlog, assume $|S^{*}|\leq|\overline{S^{*}}|$ , and let $\eta=\frac{|S^{*}|}{n}>\frac{r}{n}$ . With probability at least $1-2(1-\eta)^{k}\geq 1-2e^{-k\eta}\geq 1-2n^{-d}$ , the list $u_{1},\ldots,u_{k}$ contains at least one vertex from each of $S^{*}$ and $\overline{S^{*}}$ . Hence, there exists $i$ such that $u_{i}$ and $u_{i+1}$ are on different sides of the $(S^{*},\overline{S^{*}})$ cut. By calculating maxflows for all ( $u_{i}$ , $u_{i+1}$ ) and ( $u_{i+1}$ , $u_{i}$ ) pairs, and reporting the smallest $s-t$ mincut in these calls, we return a global mincut whp. ∎
2.

The second case takes care of the event that $\min\{|S^{*}|,|\overline{S^{*}}|\}\leq\theta\cdot\sqrt{n}/\log n$ . In this case, we select an arbitrary vertex $s$ , and give an algorithm for finding an $s$ -mincut defined as:

Definition 3.

An $s$ -mincut is a minimum weight cut among all those that have $s$ on the source side of the cut, i.e., $\arg\min_{\{s\}\subseteq S\subset V}\delta(S)$ .

Repeating this process with all edge directions reversed, and returning the smaller of the $s$ -mincuts in the original and the reversed graphs, yields the overall mincut.

We now describe the $s$-mincut algorithm, where we overload notation to denote the value of the $s$-mincut by $\lambda$. The algorithm first guesses $O(\log n)$ potential values of $\tilde{\lambda}$, our estimate of $\lambda$, as the powers of $2$, one of which lies in the range $[\lambda,2\lambda]$, and then, for each $\tilde{\lambda}$, sparsifies the graph using Lemma 7 from Section 3. For each such sparsifier $H$, the algorithm then applies Lemma 8 from Section 4 to pack $O(\log n)$ $s$-arborescences in $H$ in $O(m\sqrt{n}\log n)$ time, one of which will $1$-respect the $s$-mincut in $G$ (for the correct value of $\tilde{\lambda}$):

Definition 4.

An $s$ -arborescence is a directed spanning tree rooted at $s$ such that all edges are directed away from $s$ . A directed $s$ -cut $k$ -respects an $s$ -arborescence if there are at most $k$ cut edges in the arborescence.

Finally, for each of the $O(\log n)$ $s$ -arborescences, the algorithm computes the minimum $s$ -cut that $1$ -respects each arborescence; this algorithm is described in Algorithm 1 and proved in Theorem 12 from Section 5. It runs in $O((MF(m,n)+m)\cdot\log n)$ time for each of the $O(\log n)$ arborescences.

Combining both cases, the total running time becomes $\tilde{O}(m\sqrt{n}+MF(m,n)\sqrt{n})$ , which establishes Theorem 1.
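The balanced case (Lemma 2) can be sketched in Python as follows. This is an illustration under stated assumptions, not the implementation analyzed in the paper: the maxflow black box is replaced by a simple Edmonds-Karp routine on a dense capacity matrix, the constant `d=3` stands in for the "large constant" $d$, and all function names are hypothetical.

```python
import math
import random
from collections import deque

def max_flow(n, edges, s, t):
    """Edmonds-Karp on a dense capacity matrix; returns the s-t maxflow value."""
    cap = [[0] * n for _ in range(n)]
    for a, b, w in edges:
        cap[a][b] += w
    flow = 0
    while True:
        parent = [-1] * n
        parent[s] = s
        queue = deque([s])
        while queue and parent[t] == -1:      # BFS for a shortest augmenting path
            x = queue.popleft()
            for y in range(n):
                if parent[y] == -1 and cap[x][y] > 0:
                    parent[y] = x
                    queue.append(y)
        if parent[t] == -1:                   # no augmenting path is left
            return flow
        push, y = float("inf"), t
        while y != s:                         # bottleneck capacity along the path
            push = min(push, cap[parent[y]][y])
            y = parent[y]
        y = t
        while y != s:                         # update residual capacities
            cap[parent[y]][y] -= push
            cap[y][parent[y]] += push
            y = parent[y]
        flow += push

def sample_vertices(n, r, d=3, rng=None):
    """Uniformly sample k = ceil(d * (n / r) * lg n) vertices, as in Lemma 2."""
    rng = rng or random.Random()
    k = math.ceil(d * (n / r) * math.log2(n))
    return [rng.randrange(n) for _ in range(k)]

def mincut_from_samples(n, edges, samples):
    """Smallest maxflow value over consecutive sample pairs, in both directions."""
    best = float("inf")
    for a, b in zip(samples, samples[1:]):
        if a != b:
            best = min(best, max_flow(n, edges, a, b), max_flow(n, edges, b, a))
    return best

# Hypothetical example graph, edges as (tail, head, weight); its mincut value is 1.
edges = [(0, 1, 3), (1, 2, 1), (2, 3, 4), (3, 0, 2), (0, 2, 5)]
estimate = mincut_from_samples(4, edges, sample_vertices(4, r=1, rng=random.Random(0)))
```

The returned value is never below the true mincut value, and it equals the mincut value whenever the sample list contains vertices from both sides of $(S^{*},\overline{S^{*}})$, which is the event bounded in the proof of Lemma 2.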

3 Sparsification

This section aims to reduce mincut value to $\tilde{O}(\sqrt{n})$ while keeping $S^{*}$ a $(1+\epsilon)$ -approximate mincut for a constant $\epsilon>0$ that we will fix later. Our algorithm in this stage has two steps. First, we use random sampling to scale down the expected value of all cuts such that the expected value of the mincut $\delta(S^{*})$ becomes $\tilde{O}(\sqrt{n})$ . We also claim that $\partial S^{*}$ remains an approximate mincut among all unbalanced cuts by using standard concentration inequalities. However, since the number of balanced cuts far exceeds that of unbalanced cuts, it might be the case that some balanced cut has now become much smaller in weight than all the unbalanced cuts. This would violate the requirement that $\partial S^{*}$ should be an approximate mincut in this new graph. This is where we need our second step, where we overlay an expander on the sampled graph to raise the values of all balanced cuts above the expected value of $\partial S^{*}$ while only increasing the value of $\partial S^{*}$ by a small factor. This last technique is inspired by recent work of Li [Li21] for a deterministic mincut algorithm in undirected graphs.

We start with the specific expansion properties that we need, and a pointer to an existing construction of such an expander.

Definition 5 (Expander).

A $\psi$ -expander is an undirected graph $G(V,E)$ such that for any $S\subset V$ , $\delta_{G}(S)\geq\psi\cdot\min\{|S|,|\bar{S}|\}$ .
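As a small illustration of Definition 5, the sketch below computes by exhaustive search the largest $\psi$ for which a given unweighted undirected graph is a $\psi$-expander. It is exponential in $n$ and meant only for tiny examples; the function name is hypothetical.

```python
from itertools import combinations

def expansion(n, undirected_edges):
    """Largest psi with delta(S) >= psi * min(|S|, n - |S|) for every nonempty proper S.

    By symmetry of undirected cuts it suffices to try sets with |S| <= n / 2.
    """
    best = float("inf")
    for size in range(1, n // 2 + 1):
        for S in combinations(range(n), size):
            S = set(S)
            crossing = sum(1 for a, b in undirected_edges if (a in S) != (b in S))
            best = min(best, crossing / size)
    return best

cycle4 = [(0, 1), (1, 2), (2, 3), (3, 0)]
complete4 = [(a, b) for a in range(4) for b in range(a + 1, 4)]
psi_cycle = expansion(4, cycle4)        # two adjacent vertices are cut by 2 edges
psi_complete = expansion(4, complete4)  # any 2 vertices are cut by 4 edges
```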

Lemma 6 (Theorem 2.4 of [CGL⁺19]).

Given integer $n$ , we can construct in linear time an $\alpha_{0}$ -expander $X$ for some constant $\alpha_{0}>0$ , such that every vertex in $X$ has degree at most 9.

Now, we prove the main property of this section:

Lemma 7.

Given a digraph $G$ , a parameter $\tilde{\lambda}\in[\lambda,2\lambda]$ , and a constant $\epsilon\in(0,1)$ , we can construct in $O(m\log n)$ time a value $p\in(0,1]$ and a digraph $H$ with $O(m)$ edges such that the following holds whp for the value $p=\min\{\frac{\sqrt{n}}{\lambda},1\}$ .

1.

There is a constant $\theta>0$ (depending on $\epsilon$ ) such that for any set $\emptyset\neq S\subsetneq V$ with $\min\{|S|,|\bar{S}|\}\leq\theta\cdot\sqrt{n}/\log n$ , we have

$(1-\epsilon)\cdot p\cdot\delta_{G}(S)\leq\delta_{H}(S)\leq(1+\epsilon)\cdot p\cdot\delta_{G}(S);$
2.

For any set $\emptyset\neq S\subsetneq V$ , $\delta_{H}(S)\geq(1-\epsilon)p\lambda$ .

Proof.

If $\tilde{\lambda}\leq 2\sqrt{n}$ , then $\lambda\leq\tilde{\lambda}\leq 2\sqrt{n}$ as well, so we set $H$ to be $G$ itself, which satisfies all the properties for $p=1$ . For the rest of the proof, we assume that $\tilde{\lambda}>2\sqrt{n}$ , so that $\lambda\geq\sqrt{n}$ , and we set $p=\frac{\sqrt{n}}{\tilde{\lambda}}\leq 1$ . Throughout the proof, define $\epsilon^{\prime}=\epsilon/2$ , $r=\frac{\epsilon^{\prime 2}}{6}\sqrt{n}/\log n$ , $\alpha=\frac{\sqrt{n}}{\alpha_{0}r}$ , and $\theta=\frac{\epsilon^{\prime 3}\alpha_{0}}{54}$ , where $\alpha_{0}$ is the constant from Lemma 6.

We first construct digraph $\hat{G}$ by reweighting the edges of $G$ as follows. For each edge $e$ in $G$ , assign it a random new weight $w_{\hat{G}}(e)$ chosen according to binomial distribution $B(w(e),p)$ . (If $w_{\hat{G}}(e)=0$ , then remove $e$ from $\hat{G}$ .) For each set $\emptyset\neq S\subsetneq V$ with $\min\{|S|,|\bar{S}|\}\leq r$ , we have $\mathds{E}\delta_{\hat{G}}(S)=p\delta_{G}(S)$ , and by Chernoff bound, the probability that $\delta_{\hat{G}}(S)$ falls outside $[(1-\epsilon^{\prime})p\delta_{G}(S),(1+\epsilon^{\prime})p\delta_{G}(S)]$ is upper-bounded by $2e^{-\lambda\epsilon^{\prime 2}/3}\leq 2n^{-2r}$ . There are $O(n^{r})$ sets $S$ with $\min\{|S|,|\bar{S}|\}\leq r$ , so by a union bound, whp all such sets satisfy $(1-\epsilon^{\prime})p\delta_{G}(S)\leq\delta_{\hat{G}}(S)\leq(1+\epsilon^{\prime})p\delta_{G}(S)$ .

Construct graph $X$ according to Lemma 6 and split each undirected edge into two directed edges. Let $H$ be the “union” of $\hat{G}$ and $\alpha X$ , so that each edge $e$ in $H$ has weight $w_{H}(e)=w_{\hat{G}}(e)+\alpha w_{X}(e)$ , where we say $w(e)=0$ if $e$ does not exist in the corresponding graph.

We now show that $H$ satisfies the two desired properties.

1.

For any set $\emptyset\neq S\subsetneq V$ with $\min\{|S|,|\bar{S}|\}\leq\theta\cdot\sqrt{n}/\log n=\frac{\epsilon^{\prime}\alpha_{0}}{9}r\leq r$ , we have $\delta_{H}(S)\geq\delta_{\hat{G}}(S)\geq(1-\epsilon^{\prime})p\delta_{G}(S)$ from before, so $\delta_{H}(S)\geq(1-\epsilon)p\delta_{G}(S)$ as well. For the upper bound, we have

$\delta_{H}(S)=\delta_{\hat{G}}(S)+\alpha\delta_{X}(S)\leq(1+\epsilon^{\prime})p\delta_{G}(S)+9\alpha|S|\leq(1+\epsilon^{\prime})p\delta_{G}(S)+\epsilon^{\prime}\sqrt{n}\leq(1+\epsilon)p\delta_{G}(S).$

2.

For any set $\emptyset\neq S\subsetneq V$ with $\min\{|S|,|\bar{S}|\}\leq\theta\cdot\sqrt{n}/\log n=\frac{\epsilon^{\prime}\alpha_{0}}{9}r\leq r$, we have $\delta_{H}(S)\geq\delta_{\hat{G}}(S)\geq(1-\epsilon)p\delta_{G}(S)\geq(1-\epsilon)p\lambda$ as required by property (2). When $\min\{|S|,|\bar{S}|\}>r$, we have $\delta_{H}(S)\geq\alpha\delta_{X}(S)\geq\alpha\alpha_{0}r\geq\sqrt{n}$. Since $p=\sqrt{n}/\tilde{\lambda}$ and $\tilde{\lambda}\geq\lambda$, we have $\sqrt{n}\geq p\lambda\geq(1-\epsilon)p\lambda$, so property (2) holds in this case as well.

Finally, $H$ has $O(m)$ edges because $E(\hat{G})$ is a subset of $E(G)$ and $|E(X)|=O(n)$. ∎
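The construction in the proof of Lemma 7 can be sketched as follows. This is a minimal illustration under assumptions: the expander of Lemma 6 is not constructed here and is passed in as a hypothetical list `expander_edges`, the binomial draw is simulated by $w(e)$ coin flips (reasonable only because weights are assumed polynomially bounded), and the natural logarithm is assumed for $\log n$. All function names are hypothetical.

```python
import math
import random

def parameters(n, lam_tilde, eps, alpha0):
    """Parameters from the proof of Lemma 7, for the case lam_tilde > 2 * sqrt(n)."""
    eps1 = eps / 2                                   # epsilon'
    p = math.sqrt(n) / lam_tilde
    r = eps1 ** 2 / 6 * math.sqrt(n) / math.log(n)   # assumption: natural logarithm
    alpha = math.sqrt(n) / (alpha0 * r)
    theta = eps1 ** 3 * alpha0 / 54
    return p, r, alpha, theta

def sparsify(edges, p, expander_edges, alpha, rng):
    """Return the weights of H: binomially resampled G plus alpha times a bidirected X."""
    weight = {}
    for a, b, w in edges:
        # w_hat ~ Binomial(w, p), drawn as w independent coin flips.
        w_hat = sum(1 for _ in range(w) if rng.random() < p)
        if w_hat > 0:                                # edges with new weight 0 are removed
            weight[(a, b)] = weight.get((a, b), 0) + w_hat
    for a, b in expander_edges:
        # Each undirected expander edge is split into two directed edges of weight alpha.
        for e in ((a, b), (b, a)):
            weight[e] = weight.get(e, 0) + alpha
    return weight

# Hypothetical toy input; the single "expander" edge is only a placeholder.
H = sparsify([(0, 1, 3), (1, 2, 2)], 1.0, [(0, 1)], 0.5, random.Random(0))
p, r, alpha, theta = parameters(100, 50.0, 0.5, 0.1)
```

With these definitions, $\theta\cdot\sqrt{n}/\log n=\frac{\epsilon^{\prime}\alpha_{0}}{9}r$ and $\alpha\alpha_{0}r=\sqrt{n}$ hold by construction, which are the two identities used in the proof.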

4 Finding a 1-respecting Arborescence

In this section, we assume that there is an unbalanced mincut and show how to obtain an $s$ -arborescence that 1-respects the mincut. More formally, we prove the following:

Lemma 8.

Given weighted digraph $G$ and a fixed vertex $s$ such that $s$ is in the source side of a minimum cut $S^{*}$ and $\min\{|S^{*}|,|\overline{S^{*}}|\}\leq\theta\cdot\sqrt{n}/\log n$ where $\theta$ is defined in Lemma 7, in $O(m\sqrt{n}\log n)$ time we can find $O(\log n)$ $s$ -arborescences, such that whp a minimum cut 1-respects one of them.

The idea of this lemma is as follows. First, we apply Lemma 7 to our graph $G$ and obtain graph $H$ . Whp, a mincut $S^{*}$ in $G$ corresponds to a cut in $H$ of size $(1\pm\epsilon)p\lambda$ and no cut in $H$ has size less than $(1-\epsilon)p\lambda$ . That is, $S^{*}$ is a $(1+O(\epsilon))$ -approximate mincut in $H$ . It remains to find an arborescence in $H$ that 1-respects $S^{*}$ . To do this, we employ a multiplicative weight update (MWU) framework. The algorithm begins by setting all edge weights to be uniform (say, weight $1$ ). Then, we repeat for $O(\sqrt{n}\log(n)/\epsilon^{2})$ rounds. For each round, we find in near-linear time a minimum weight arborescence and multiplicatively increase the weight of every edge in the arborescence.

Using the fact that there is no duality gap between arborescence packing and mincut [Edm73, Gab95], a standard MWU analysis implies that these arborescences that we found, after some scaling, form a $(1+\epsilon)$-approximately optimal fractional arborescence packing. So our arborescence crosses $S^{*}$ at most $(1+O(\epsilon))<2$ times on average. Thus, if we sample $O(\log n)$ arborescences from our collections, whp, one of them will 1-respect $S^{*}$. (Footnote 4: This should be compared with Karger’s mincut algorithm in the undirected case, where there is a factor $2$ gap, and hence Karger can only guarantee a $2$-respecting tree in the undirected case.) Below, we formalize this high level description.

Definition 9 (Packing problem [You95]).

For convex set $P\subseteq\mathds{R}^{n}$ and nonnegative linear function $f:P\to\mathds{R}^{m}$ , let $\gamma^{*}=\min_{x\in P}\max_{j\in[m]}f_{j}(x)$ be the solution in $P$ that minimizes the maximum value of $f_{j}(x)$ over all $j$ , and define the width of the packing problem as $\omega=\max_{j\in[m],x\in P}f_{j}(x)-\min_{j\in[m],x\in P}f_{j}(x)$ .

The fractional arborescence packing problem conforms to this definition. Enumerate all the $s$-arborescences as $A_{1},A_{2},\ldots,A_{N}$. We represent a fractional packing of arborescences as a vector in $\mathds{R}^{N}$, where coordinate $i$ represents the fractional contribution of $A_{i}$ in the packing. Let $P=\{x\in\mathds{R}^{N}:x^{T}1=1,x\geq 0\}$ be the convex hull of all single arborescences. For each edge $j$ with capacity $w(j)$, $f_{j}(x)=\sum_{i\in[N]}x_{i}1[j\in A_{i}]/w(j)$ is the relative load of arborescence packing $x$ on edge $j$. It is easy to see that $\omega\leq 1/w_{\min}$ for arborescence packing. The objective function is to minimize the maximum load: $\gamma^{*}=\min_{x\in P}\max_{j\in[m]}f_{j}(x)$.

For any fractional arborescence packing $x\in\mathds{R}^{N}$ with value $x^{T}1=v$ where $f_{j}(x)\leq 1$ for all edges $j$ , we have $\frac{1}{v}x\in P$ . In particular, the maximum arborescence packing, once scaled down by its value, is exactly the vector in $P$ that minimizes the maximum load. Therefore, it suffices to look for the vector $x\in P$ achieving the optimal value $\gamma^{*}$ , and then scale the vector up by $1/\gamma^{*}$ to obtain the maximum arborescence packing.

Next we describe the packing algorithm (Figure 2 of [You95]). Maintain a vector $y\in\mathds{R}^{m}$ , initially set to $y=1$ . In each iteration, find $x=\arg\min_{x\in P}\sum_{j}y_{j}f_{j}(x)$ , and then add $x$ to set $S$ and replace $y$ by the vector $y^{\prime}$ defined by $y^{\prime}_{j}=y_{j}(1+\epsilon f_{j}(x)/\omega)$ . After a number of iterations, return $\bar{x}\in P$ , the average of all the vectors $x$ over the course of the algorithm. The lemma below upper bounds the number of iterations that suffice:

Lemma 10 (Corollary 6.3 of [You95]).

After $\lceil\frac{(1+\epsilon)\omega\ln m}{\gamma^{*}((1+\epsilon)\ln(1+\epsilon)-\epsilon)}\rceil$ iterations of the packing algorithm, $\bar{\gamma}=\max_{j}f_{j}(\bar{x})\leq(1+\epsilon)\gamma^{*}$ .
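The packing algorithm and the iteration bound of Lemma 10 can be sketched in Python as below. This is a toy illustration under assumptions: the minimum-cost arborescence oracle is replaced by brute-force enumeration of all $s$-arborescences (feasible only on tiny graphs), the width is taken to be $\omega=1/w_{\min}$, and $\gamma^{*}=1/2$ is supplied by hand for the example graph, whose minimum $s$-cut has value $2$. All names are hypothetical.

```python
import math
from itertools import product

def arborescences(n, edges, s):
    """All s-arborescences as tuples of edge indices (brute force, tiny graphs only)."""
    others = [v for v in range(n) if v != s]
    incoming = {v: [j for j, (a, b, w) in enumerate(edges) if b == v and a != v]
                for v in others}
    result = []
    for choice in product(*(incoming[v] for v in others)):
        parent = {v: edges[j][0] for v, j in zip(others, choice)}
        ok = True
        for v in others:                 # every vertex must reach s via parent pointers
            x, steps = v, 0
            while x != s and steps <= n:
                x, steps = parent[x], steps + 1
            if x != s:
                ok = False
                break
        if ok:
            result.append(choice)
    return result

def young_iterations(eps, omega, m, gamma_star):
    """Iteration count stated in Lemma 10."""
    denom = gamma_star * ((1 + eps) * math.log(1 + eps) - eps)
    return math.ceil((1 + eps) * omega * math.log(m) / denom)

def pack(n, edges, s, eps, iterations):
    """Multiplicative weights packing; returns (arborescences, x_bar, max load of x_bar)."""
    trees = arborescences(n, edges, s)
    omega = 1.0 / min(w for _, _, w in edges)        # width bound quoted in the text
    y = [1.0] * len(edges)
    count = [0] * len(trees)
    for _ in range(iterations):
        # A linear objective over P is minimized by a single arborescence.
        i = min(range(len(trees)),
                key=lambda i: sum(y[j] / edges[j][2] for j in trees[i]))
        count[i] += 1
        for j in trees[i]:                           # y_j <- y_j * (1 + eps * f_j(x) / omega)
            y[j] *= 1 + eps * (1.0 / edges[j][2]) / omega
    x_bar = [c / iterations for c in count]
    load = [0.0] * len(edges)
    for i, tree in enumerate(trees):
        for j in tree:
            load[j] += x_bar[i] / edges[j][2]
    return trees, x_bar, max(load)

# Hypothetical unit-weight example with root 0; its minimum 0-cut has value 2.
edges = [(0, 1, 1), (0, 2, 1), (1, 2, 1), (2, 1, 1)]
rounds = young_iterations(0.1, 1.0, len(edges), 0.5)
trees, x_bar, max_load = pack(3, edges, 0, 0.1, rounds)
```

In this example the maximum load of $\bar{x}$ stays between $\gamma^{*}=1/2$ and $(1+\epsilon)\gamma^{*}$, as Lemma 10 predicts. A real implementation would replace the enumeration by a minimum-cost arborescence routine.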

We will also make use of the (exact) duality between $s$ -arborescence packing and minimum $s$ -cut:

Lemma 11 (Corollary 2.1 of [Gab95]).

The value of maximum $s$ -arborescence packing is equal to the value of minimum $s$ -cut.

Proof of Lemma 8.

First, construct $H$ according to Lemma 7. By the duality above, the minimum $s$-cut on $H$ has value $\lambda_{H}=\frac{1}{\gamma^{*}}$. Since $\min\{|S^{*}|,|\overline{S^{*}}|\}\leq\theta\sqrt{n}/\log n$, we have $\lambda_{H}\leq\delta_{H}(S^{*})\leq(1+\epsilon)p\lambda\leq(1+\epsilon)\sqrt{n}$.

Run the aforementioned arborescence packing algorithm for $O(\lambda_{H}\ln m)$ iterations, after which Lemma 10 guarantees that $\bar{\gamma}\leq(1+\epsilon)\gamma^{*}$. Then $\bar{x}/\bar{\gamma}$ is a feasible fractional arborescence packing with value $1/\bar{\gamma}\geq\frac{1}{1+\epsilon}\lambda_{H}$.

Consider sampling a random arborescence $A$ from the distribution specified by $\bar{x}/\bar{\gamma}$, so we choose arborescence $A_{i}$ with probability $\bar{x}_{i}/\bar{\gamma}$. Since $\delta_{H}(S^{*})\leq(1+\epsilon)p\lambda\leq(1+\epsilon)^{2}\lambda_{H}$, the expected number of edges in $A\cap\delta_{H}(S^{*})$ is at most $\frac{(1+\epsilon)^{2}}{1-\epsilon}\leq 1+4\epsilon$ for small enough $\epsilon>0$. Since we always have $|A\cap\delta_{H}(S^{*})|\geq 1$, by Markov’s inequality $\Pr[|A\cap\delta_{H}(S^{*})|-1\geq 1]\leq 4\epsilon\leq 1/2$ for small enough $\epsilon$. Therefore, if we independently sample $O(\log n)$ arborescences from the distribution $\bar{x}/\bar{\gamma}$, at least one of the arborescences is 1-respecting whp.

It remains to compute $x=\arg\min_{x\in P}\sum_{j}y_{j}f_{j}(x)$ on each iteration. Since $\sum_{j}y_{j}f_{j}(x)$ is linear in $x$, the minimum must be achieved by a single arborescence. So the task reduces to computing the minimum cost spanning $s$-arborescence, which can be done in $O(m+n\log n)$ time [GGST86]. The total time complexity, over all iterations, becomes $O((m+n\log n)\lambda_{H}\log n)=O((m+n\log n)\sqrt{n}\log n)$. ∎
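For the sampling step in the proof above, the "whp" claim can be spelled out as follows, assuming the samples are independent and writing the number of samples as $c\lg n$ for a hypothetical constant $c\geq 1$. Each sample fails to 1-respect $S^{*}$ with probability at most $1/2$, so:

```latex
\Pr\bigl[\text{no sampled arborescence 1-respects } S^{*}\bigr]
  \;\le\; \left(\tfrac{1}{2}\right)^{c\lg n}
  \;=\; n^{-c}.
```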

5 Mincut Given 1-respecting Arborescence

We propose an algorithm (Algorithm 1) that uses $O(\log n)$ maxflow subroutines to find the minimum $s$ -cut that $1$ -respects a given $s$ -arborescence. The result is formally stated in Theorem 12.

Theorem 12.

Consider a directed graph $G=(V,E,w)$ with $n$ vertices, $m$ edges, and polynomially bounded edge weights $w_{e}>0$ . Fix a global (directed) mincut $S$ of $G$ . Given an arborescence $T$ rooted at $s\in S$ with $|T\cap(S,\overline{S})|=1$ , Algorithm 1 outputs a global minimum cut of $G$ in time $O((MF(m,n)+m)\cdot\log n)$ .

We first give some intuition for Algorithm 1. Because $s\in S$ , if we could find a vertex $t\in\overline{S}$ , then computing the $s$ - $t$ mincut using one maxflow call would yield a global mincut of $G$ . However, we cannot afford to run one maxflow between $s$ and every other vertex in $G$ . Instead, we carefully partition the vertices into $\ell=O(\log n)$ sets $(C_{i})_{i=1}^{\ell}$ . We show that for each $C_{i}$ , we can modify the graph appropriately so that it allows us to (roughly speaking) compute the maximum flow between $s$ and every vertex $c\in C_{i}$ using one maxflow call.

More specifically, Algorithm 1 has two stages. In the first stage, we compute a centroid decomposition of $T$ . Recall that a centroid of $T$ is a vertex whose removal disconnects $T$ into subtrees with at most $n/2$ vertices. This process is done recursively, starting with the root $s$ of $T$ . We let $P_{1}$ denote the subtrees resulting from the removal of $s$ from $T$ . In each subsequent step $i$ , we compute the set $C_{i}$ of the centroids of the subtrees in $P_{i}$ . We then remove the centroids and add the resulting subtrees to $P_{i+1}$ . This process continues until no vertices remain.
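Stage I can be sketched in Python as follows. This is a minimal illustration, assuming vertices are numbered $0,\ldots,n-1$ and the arborescence is given as a list of (parent, child) pairs; edge directions are ignored because only the tree structure matters for the decomposition. The function names are hypothetical.

```python
def centroid(component, adj):
    """A centroid of the tree induced on `component` (a set of vertices)."""
    root = next(iter(component))
    order, parent = [root], {root: None}
    for x in order:                          # BFS; `order` grows while we iterate
        for y in adj[x]:
            if y in component and y not in parent:
                parent[y] = x
                order.append(y)
    size = {x: 1 for x in order}
    for x in reversed(order[1:]):            # accumulate subtree sizes bottom-up
        size[parent[x]] += size[x]
    total = len(order)
    for x in order:
        heaviest = total - size[x]           # the piece containing x's parent
        for y in adj[x]:
            if parent.get(y) == x:
                heaviest = max(heaviest, size[y])
        if 2 * heaviest <= total:
            return x

def split(component, removed, adj):
    """Connected pieces of the tree on `component` after deleting `removed`."""
    pieces, seen = [], {removed}
    for start in adj[removed]:
        if start in component and start not in seen:
            piece, stack = set(), [start]
            seen.add(start)
            while stack:
                x = stack.pop()
                piece.add(x)
                for y in adj[x]:
                    if y in component and y not in seen:
                        seen.add(y)
                        stack.append(y)
            pieces.append(piece)
    return pieces

def centroid_layers(n, tree_edges, s):
    """Return (C, P) with C[i] the layer-i centroids and P[i-1] the subtrees of P_i."""
    adj = {v: [] for v in range(n)}
    for a, b in tree_edges:
        adj[a].append(b)
        adj[b].append(a)
    C = [{s}]
    P = [None, split(set(range(n)), s, adj)]     # P[0] is unused padding
    i = 1
    while P[i]:
        C.append(set())
        P.append([])
        for U in P[i]:
            u = centroid(U, adj)
            C[i].add(u)
            P[i + 1].extend(split(U, u, adj))
        i += 1
    return C, P[1:i]

# Hypothetical example: a path 0 -> 1 -> ... -> 6 rooted at s = 0.
C, P = centroid_layers(7, [(k, k + 1) for k in range(6)], 0)
```

On the path example, the layers $C_{0},C_{1},\ldots$ partition the vertex set and the number of layers after $C_{0}$ is 3, consistent with the $O(\log n)$ bound used later in the running time analysis.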

In the second stage, for each layer $i$ , we construct a directed graph $G_{i}$ and perform one maxflow computation on $G_{i}$ . The maxflow computation on $G_{i}$ would yield candidate cuts for every vertex in $C_{i}$ , and after computing the appropriate maximum flow across every layer, we output the minimum candidate cut as the minimum cut of $G$ . The details are presented in Algorithm 1.

Input: An arborescence $T$ rooted at $s\in S$ such that $S$ 1-respects $T$.

1 // Stage I: Build centroid decomposition.
2 Let $C_{0}=\{s\}$, $P_{1}=$ the set of subtrees obtained by removing $s$ from $T$, and $i=1$.
3 while $P_{i}\neq\varnothing$ do
4     Initialize $C_{i}$ (the centroids of $P_{i}$) and $P_{i+1}$ as empty sets.
5     for each subtree $U\in P_{i}$ do
6         Compute the centroid $u$ of $U$ and add it to $C_{i}$.
7         Add all subtrees generated by removing $u$ from $U$ to $P_{i+1}$.
8     Set $\ell=i$ and iterate $i=i+1$.
9 // Stage II: Calculate integrated maximum flow for each layer.
10 for $i=1$ to $\ell$ do
11     Construct a digraph $G_{i}=(V\cup\{t_{i}\},E_{1}\cup E_{2}\cup E_{3})$ as follows (see Figure 1):
12         1) Add edges $E_{1}=E\cap\cup_{U\in P_{i}}(U\times U)$ with capacity equal to their original weight.
13         2) Add edges $E_{2}=\{(s,v):(u,v)\in E\setminus E_{1}\}$ with capacity of $(s,v)$ equal to the original weight of $(u,v)$.
14         3) Add edges $E_{3}=\{(u,t_{i}):u\in C_{i}\}$ with infinite capacity.
15     Compute the maximum $s$-$t_{i}$ flow $f^{*}_{i}$ on $G_{i}$.
16     For each component $U\in P_{i}$ with centroid $u$, the value of $f^{*}_{i}$ on edge $(u,t_{i})$ is a candidate cut value, and the nodes in $U$ that can reach $u$ in the residue graph is a candidate for $\overline{S}$.
Return the smallest candidate cut as minimum $s$-cut and the corresponding $(S,\overline{S})$.

Algorithm 1 Finding the global minimum directed cut.

Figure 1: Construction of auxiliary graph $G_{i}$ in Algorithm 1. Solid lines represent the arborescence $T$. Dashed lines are other edges in the graph. Rectangles are sets formed by the first level of centroid decomposition. Left: The original graph. Right: The part of $G_{1}$ solving the case that the mincut separates root and the centroid of the middle subtree.
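The construction of the auxiliary graph $G_{i}$ in Stage II of Algorithm 1 can be sketched as below. This is a minimal illustration with hypothetical names: `subtrees` plays the role of $P_{i}$, `centroids` of $C_{i}$, and `sink` is a fresh vertex id standing for $t_{i}$. One convenience that is an assumption of this sketch rather than part of the algorithm as stated: redirected edges that would become self-loops $(s,s)$ are dropped, since they cannot carry any $s$-$t_{i}$ flow.

```python
def build_layer_graph(edges, s, subtrees, centroids, sink):
    """Edge list of G_i as (tail, head, capacity) triples.

    E1: edges with both endpoints in the same subtree of P_i, original weight.
    E2: every other edge (u, v), re-attached to start at s, original weight.
    E3: an infinite-capacity edge from each layer-i centroid to the sink t_i.
    """
    owner = {v: k for k, U in enumerate(subtrees) for v in U}
    out = []
    for a, b, w in edges:
        if a in owner and b in owner and owner[a] == owner[b]:
            out.append((a, b, w))                 # E1
        elif b != s:                              # assumption: skip (s, s) self-loops
            out.append((s, b, w))                 # E2
    for u in centroids:
        out.append((u, sink, float("inf")))       # E3
    return out

# Hypothetical example: root 0, first-layer subtrees {1} and {2, 3}, sink id 4.
edges = [(0, 1, 2), (0, 2, 3), (2, 3, 1), (3, 1, 4), (3, 2, 5), (1, 0, 6)]
layer_graph = build_layer_graph(edges, 0, [{1}, {2, 3}], [1, 2], 4)
```

Feeding `layer_graph` to any $s$-$t_{i}$ maxflow routine and reading off the flow on each edge $(u,t_{i})$ would give the candidate cut values described in the algorithm.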

We first state two technical lemmas that we will use to prove Theorem 12.

Lemma 13.

Recall that $P_{i}$ is the set of subtrees in layer $i$ and $C_{i}$ contains the centroid of each subtree in $P_{i}$ . If $C_{j}\subseteq S$ for every $0\leq j<i$ , then $\overline{S}$ is contained in exactly one subtree in $P_{i}$ , and consequently, at most one vertex $u\in C_{i}$ can be in $\overline{S}$ .

Lemma 14.

Let $G_{i}$ be the graph constructed in line 11 of Algorithm 1. Let $f^{*}_{i}$ be a maximum $s$-$t_{i}$ flow on $G_{i}$ as in line 15. For any $U\in P_{i}$ with centroid $u$, the amount of flow $f^{*}_{i}$ puts on edge $(u,t_{i})$ is equal to the value of the minimum cut between $\overline{U}$ and $u$.

We defer the proofs of Lemmas 13 and 14, and first use them to prove Theorem 12.

Proof of Theorem 12.

We first prove the correctness of Algorithm 1.

Because $C_{0}=\{s\}$ and $s\in S$ , and the $C_{i}$ ’s form a disjoint partition of $V$ , there must be a layer $i$ such that for the first time, we have a centroid $u\in C_{i}$ that belongs to $\overline{S}$ . By Lemma 13, we know that $\overline{S}$ must be contained in exactly one subtree $U\in P_{i}$ , and hence $u$ must be the centroid of $U$ . In summary, we have $u\in\overline{S}$ and $\overline{S}\subseteq U$ .

Consider the graph $G_{i}$ constructed for layer $i$. By Lemma 14, the amount of flow that $f^{*}_{i}$ puts on the edge $(u,t_{i})$ equals the value of the minimum (directed) cut between $\overline{U}$ and $u$. Because $\overline{S}\subseteq U$ (or equivalently $\overline{U}\subseteq S$) and $u\in\overline{S}$, the cut $(S,\overline{S})$ is one possible cut that separates $\overline{U}$ and $u$, so this value is at most $\delta(S)$. It is also at least $\delta(S)$, because every cut separating $\overline{U}$ from $u$ is a cut of $G$ and $(S,\overline{S})$ is a global mincut. Therefore, the flow that $f^{*}_{i}$ puts on the edge $(u,t_{i})$ is equal to the global mincut value in $G$.

In addition, the candidate cut value for any other centroid $u^{\prime}$ of a subtree $U^{\prime}\in P_{i}$ must be at least the mincut value between $s$ and $u^{\prime}$ . This is because the additional restriction that the cut has to separate $\overline{U^{\prime}}$ from $u^{\prime}$ can only make the mincut value larger, and the value of this cut in $G_{i}$ is equal to the value of the same cut in $G$ . Therefore, the minimum candidate cut value in all $\ell$ layers must be equal to the global mincut value of $G$ .

Now we analyze the running time of Algorithm 1. We can find the centroid of an $n$ -node tree in time $O(n)$ (see e.g., [MTZC81]). The total number of layers $\ell=O(\log n)$ because removing the centroids reduces the size of the subtrees by at least a factor of $2$ . Thus, the running time of Stage I of Algorithm 1 is $O(n\log n)$ . In Stage II, we can construct each $G_{i}$ in $O(m)$ time and every $G_{i}$ has $O(m)$ edges. Since there are $O(\log n)$ layers and the maximum flow computations take a total of $O(MF(m,n)\cdot\log n)$ time, the overall runtime is $O(n\log n+(MF(m,n)+m)\log n)=O((MF(m,n)+m)\log n)$ . ∎
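The layer structure used in the running time analysis can be illustrated with a short sketch. It is a minimal illustration under stated assumptions, not the paper's implementation: the tree is given as an undirected adjacency dictionary `adj`, layer $0$ is taken to be the root alone (matching $C_{0}=\{s\}$ ), and the helper names `component`, `centroid`, and `centroid_layers` are hypothetical. Each layer costs $O(n)$ time here, and each component at least halves from one layer to the next.

```python
from collections import deque

def component(adj, start, removed):
    """Vertices reachable from `start` once the `removed` vertices are deleted."""
    seen = {start}
    queue = deque([start])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in removed and w not in seen:
                seen.add(w)
                queue.append(w)
    return seen

def centroid(adj, comp):
    """A vertex of `comp` whose removal leaves pieces of size at most |comp|/2."""
    start = next(iter(comp))
    parent = {start: None}
    order = [start]
    for v in order:                      # BFS; `order` grows while we iterate
        for w in adj[v]:
            if w in comp and w not in parent:
                parent[w] = v
                order.append(w)
    size = {v: 1 for v in comp}
    for v in reversed(order):            # accumulate subtree sizes bottom-up
        if parent[v] is not None:
            size[parent[v]] += size[v]
    n = len(comp)
    for v in order:
        heaviest = n - size[v]           # the piece on the parent's side
        for w in adj[v]:
            if w in comp and parent[w] == v:
                heaviest = max(heaviest, size[w])
        if 2 * heaviest <= n:
            return v

def centroid_layers(adj, root):
    """Layer 0 is [root]; layer i holds one centroid per remaining subtree."""
    layers = [[root]]
    removed = {root}
    while len(removed) < len(adj):
        layer, seen = [], set()
        for v in adj:
            if v not in removed and v not in seen:
                comp = component(adj, v, removed)
                seen |= comp
                layer.append(centroid(adj, comp))
        removed.update(layer)
        layers.append(layer)
    return layers
```

On a path with $7$ vertices rooted at one end, this produces at most $4$ layers, and the layers together contain every vertex exactly once.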

Before proving Lemmas 13 and 14 we first prove the following lemma.

Lemma 15.

If $x$ and $y$ are vertices in $\overline{S}$ , then every vertex on the (undirected) path from $x$ to $y$ in the arborescence $T$ also belongs to $\overline{S}$ .

Proof.

Consider the lowest common ancestor $z$ of $x$ and $y$ . Because there is a directed path from $z$ to $x$ and a directed path from $z$ to $y$ , we must have $z\in\overline{S}$ . Otherwise, each of these two paths contains an edge from $S$ to $\overline{S}$ , so there are at least two edges in $T$ that go from $S$ to $\overline{S}$ , contradicting that $T$ is 1-respecting.

Because $s\in S$ and $z\in\overline{S}$ , there is already an edge in $T$ (on the path from $s$ to $z$ ) that goes from $S$ to $\overline{S}$ . Consequently, all other edges in $T$ cannot go from $S$ to $\overline{S}$ , which means the entire path from $z$ to $x$ (and similarly $z$ to $y$ ) must be in $\overline{S}$ . ∎
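Lemma 15 can be sanity-checked by brute force on a small arborescence. The sketch below is only an illustration under assumptions: the arborescence is encoded as a hypothetical `parent` dictionary (the root maps to `None`), and the function names are not from the paper. It enumerates every set $\overline{S}$ that excludes the root, keeps those crossed by exactly one tree edge from $S$ to $\overline{S}$ , and checks that $\overline{S}$ is connected in the undirected tree, which is equivalent to containing every path between two of its vertices.

```python
from itertools import combinations

def crossing_edges(parent, S_bar):
    """Tree edges (p, v) with p outside S_bar and v inside S_bar."""
    return [(p, v) for v, p in parent.items()
            if p is not None and p not in S_bar and v in S_bar]

def path_closed(parent, S_bar):
    """True if S_bar is connected in the undirected version of the tree."""
    adj = {v: [] for v in parent}
    for v, p in parent.items():
        if p is not None:
            adj[v].append(p)
            adj[p].append(v)
    start = next(iter(S_bar))
    seen = {start}
    stack = [start]
    while stack:                          # DFS that never leaves S_bar
        v = stack.pop()
        for w in adj[v]:
            if w in S_bar and w not in seen:
                seen.add(w)
                stack.append(w)
    return seen == S_bar

def check_lemma_15(parent, root):
    """Exhaustively test every candidate S_bar that excludes the root."""
    others = [v for v in parent if v != root]
    for k in range(1, len(others) + 1):
        for S_bar in map(set, combinations(others, k)):
            if len(crossing_edges(parent, S_bar)) == 1:
                if not path_closed(parent, S_bar):
                    return False
    return True
```

The enumeration is exponential in the number of vertices, so it is meant only for trees with a handful of vertices.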

Recall that Lemma 13 states that if all the centroids in previous layers are in $S$ , then $\overline{S}$ is contained in exactly one subtree $U$ in the current layer $i$ .

Proof of Lemma 13.

For contradiction, suppose that there exist distinct subtrees $U_{1}$ and $U_{2}$ in $P_{i}$ and vertices $x,y\in\overline{S}$ such that $x\in U_{1}$ and $y\in U_{2}$ .

By Lemma 15, any vertex on the (undirected) path from $x$ to $y$ also belongs to $\overline{S}$ . Consider the first time that $x$ and $y$ are separated into different subtrees. This must have happened because some vertex on the path from $x$ to $y$ is removed. However, the set of vertices removed at this point of the algorithm is precisely $\bigcup_{0\leq j<i}C_{j}$ , but our hypothesis assumes that none of them are in $\overline{S}$ . This leads to a contradiction and therefore $\overline{S}$ is contained in exactly one subtree of $P_{i}$ .

It follows immediately that at most one centroid $u\in C_{i}$ can be in $\overline{S}$ . ∎

Next we prove Lemma 14, which states that the maximum flow between $s$ and $t_{i}$ in the modified graph $G_{i}$ allows one to simultaneously compute a candidate mincut value for each vertex $u\in C_{i}$ .

Proof of Lemma 14.

First observe that the maxflow computation from $s$ to $t_{i}$ in $G_{i}$ can be viewed as multiple independent maxflow computations. The reason is that, for any two subtrees $U_{1},U_{2}\in P_{i}$ , there are only edges that go from $s$ into $U_{1}$ and from $U_{1}$ to $t_{i}$ in $G_{i}$ (similarly for $U_{2}$ ), but there are no edges that go between $U_{1}$ and $U_{2}$ .

The above observation allows us to focus on one subtree $U\in P_{i}$ . Consider how Algorithm 1 produces $G_{i}$ from $G$ . The edges with both ends in $U$ are intact (the edge set $E_{1}$ ). If we contract all vertices outside $U$ into $s$ , then all edges that enter $U$ would start from $s$ , which is precisely the effect of removing cross-subtree edges and adding the edges in $E_{2}$ . One final infinite-capacity edge $(u,t_{i})\in E_{3}$ connects the centroid of $U$ to the super sink $t_{i}$ .

Therefore, the maximum $s$ - $t_{i}$ flow $f^{*}_{i}$ computes the maximum flow between $\overline{U}$ and $u\in U$ simultaneously for all $U\in P_{i}$ , whose value is reflected on the edge $(u,t_{i})$ . It follows from the maxflow mincut theorem that the flow on edge $(u,t_{i})$ is equal to the mincut value between $\overline{U}$ and $u$ in $G$ (i.e., the minimum value $w(A,\overline{A})$ among all $A\subset V$ with $\overline{U}\subseteq A$ and $u\in\overline{A}$ ). ∎
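The construction described in this proof can be sketched as follows. This is a hedged illustration, not the paper's Algorithm 1: the edge sets $E_{1}$ , $E_{2}$ , $E_{3}$ are rebuilt only from their description in the proof, edges whose head lies outside every subtree are assumed to be dropped, and the maxflow routine is a plain Edmonds-Karp stand-in for whatever maxflow subroutine is actually used. The names `layer_graph` and `flow_into_sink` are hypothetical.

```python
from collections import defaultdict, deque

INF = float("inf")

def layer_graph(edges, parts, s, t):
    """Capacity map of G_i. `parts` is a list of (vertex set U, centroid u)."""
    owner = {v: k for k, (U, _) in enumerate(parts) for v in U}
    cap = defaultdict(lambda: defaultdict(int))
    for x, y, w in edges:
        if y not in owner:
            continue                      # head outside every subtree (assumed dropped)
        if owner.get(x) == owner[y]:
            cap[x][y] += w                # E1: both endpoints in the same subtree
        else:
            cap[s][y] += w                # E2: edge entering U now starts at s
    for _, u in parts:
        cap[u][t] = INF                   # E3: centroid to the super sink
    return cap

def flow_into_sink(cap, s, t):
    """Edmonds-Karp; returns the max-flow amount on each edge (u, t)."""
    res = defaultdict(lambda: defaultdict(int))
    for x in cap:
        for y, c in cap[x].items():
            res[x][y] += c
    into_t = defaultdict(int)
    while True:
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:  # BFS for a shortest augmenting path
            x = queue.popleft()
            for y, c in res[x].items():
                if c > 0 and y not in parent:
                    parent[y] = x
                    queue.append(y)
        if t not in parent:
            return into_t
        path, y = [], t
        while parent[y] is not None:
            path.append((parent[y], y))
            y = parent[y]
        push = min(res[x][y] for x, y in path)
        for x, y in path:
            res[x][y] -= push
            res[y][x] += push
        into_t[path[0][0]] += push        # path[0] is the edge entering t
```

As a small usage example, take the root $0$ , two subtrees $\{1,2\}$ and $\{3,4\}$ with centroids $1$ and $3$ , and the weighted edges `[(0,1,3), (0,2,1), (2,1,5), (1,3,2), (0,4,1), (4,3,1), (3,0,7), (0,3,1)]`. A single maxflow call reports $4$ units on each of the two sink edges, which matches the minimum cut between $\overline{U}$ and $u$ computed by hand for each subtree.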

References

[CGL⁺19] Julia Chuzhoy, Yu Gao, Jason Li, Danupon Nanongkai, Richard Peng, and Thatchaphol Saranurak. A deterministic algorithm for balanced cut with applications to dynamic connectivity, flows, and beyond. CoRR, abs/1910.08025, 2019.
[CQ21] Chandra Chekuri and Kent Quanrud. Faster algorithms for rooted connectivity in directed graphs, 2021.
[Edm73] Jack Edmonds. Edge-disjoint branchings. Combinatorial algorithms, 1973.
[ET75] Shimon Even and Robert Endre Tarjan. Network flow and testing graph connectivity. SIAM J. Comput., 4(4):507–518, 1975.
[Gab95] H.N. Gabow. A matroid approach to finding edge connectivity and packing arborescences. Journal of Computer and System Sciences, 50(2):259–273, 1995.
[GGST86] Harold Gabow, Zvi Galil, Thomas Spencer, and Robert Tarjan. Efficient algorithms for finding minimum spanning trees in undirected and directed graphs. Combinatorica, 6:109–122, 1986.
[GLP21] Yu Gao, Yang P. Liu, and Richard Peng. Fully dynamic electrical flows: Sparse maxflow faster than Goldberg-Rao. CoRR, abs/2101.07233, 2021.
[GR98] Andrew V. Goldberg and Satish Rao. Beyond the flow decomposition barrier. J. ACM, 45(5):783–797, 1998.
[GT88] Andrew V. Goldberg and Robert Endre Tarjan. A new approach to the maximum-flow problem. J. ACM, 35(4):921–940, 1988.
[HO94] Jianxiu Hao and James B. Orlin. A faster algorithm for finding the minimum cut in a directed graph. J. Algorithms, 17(3):424–446, 1994.
[Kar00] David R Karger. Minimum cuts in near-linear time. Journal of the ACM (JACM), 47(1):46–76, 2000.
[Li21] Jason Li. Deterministic mincut in almost-linear time. STOC, 2021.
[MS89] Yishay Mansour and Baruch Schieber. Finding the edge connectivity of directed graphs. Journal of Algorithms, 10(1):76–85, 1989.
[MTZC81] N. Megiddo, Arie Tamir, Eitan Zemel, and Ramaswamy Chandrasekaran. An $O(n\log^{2}n)$ algorithm for the $k$ th longest path in a tree with applications to location problems. SIAM Journal on Computing, 10, 1981.
[Qua21] Kent Quanrud. Fast approximations for rooted connectivity in weighted directed graphs, 2021.
[Sch79] Claus-Peter Schnorr. Bottlenecks and edge connectivity in unsymmetrical networks. SIAM Journal on Computing, 8(2):265–274, 1979.
[vdBLL⁺21] Jan van den Brand, Yin Tat Lee, Yang P. Liu, Thatchaphol Saranurak, Aaron Sidford, Zhao Song, and Di Wang. Minimum cost flows, MDPs, and $\ell_{1}$ -regression in nearly linear time for dense instances. CoRR, abs/2101.05719, 2021.
[You95] Neal E. Young. Randomized rounding without solving the linear program. In Proceedings of the Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’95, pages 170–178, USA, 1995.
