Blocking Trails for $f$ -factors of Multigraphs

Harold N. Gabow Department of Computer Science, University of Colorado at Boulder, Boulder, Colorado 80309-0430, USA. E-mail: [email protected]

Abstract

Blocking flows were introduced by Dinic [3] to speed up the computation of maximum network flows. They have been used in algorithms for problems such as maximum cardinality matching of bipartite graphs [19, Hopcroft and Karp] and general graphs [24, Micali and Vazirani], maximum weight matching of general graphs [17, Gabow and Tarjan], and many others. The blocking algorithm of [17] for matching is based on depth-first search. We extend the depth-first search approach to find $f$ -factors of general multigraphs. Here $f$ is an arbitrary integral-valued function on vertices, an $f$ -matching is a subgraph where every vertex $x$ has degree $\leq f(x)$ , and an $f$ -factor has equality in every degree bound. A set of blocking trails for an $f$ -matching $M$ is a maximal collection ${\cal A}$ of edge-disjoint augmenting trails such that $M\bigoplus_{A\in{{\cal A}}}A$ is a valid $f$ -matching.

Blocking trails are needed in efficient algorithms for maximum cardinality $f$ -matching [20], maximum weight $f$ -factors/matchings by scaling [4, 12], and approximate maximum weight $f$ -factors and $f$ -edge covers [20]. Since these algorithms find many sets of blocking trails, the time to find blocking trails is a dominant factor in the running time. Our blocking trail algorithm runs in linear time $O(m)$ . In independent work and using a different approach, Huang and Pettie [20] present a blocking trails algorithm using time $O(m\alpha(m,n))$ . As examples of the time bounds for the above applications, an approximate maximum weight $f$ -factor is found in time $O(m\,\alpha(m,n))$ using [20], and our algorithm eliminates the factor $\alpha(m,n)$ . Similarly a maximum weight $f$ -factor is found in time $O(\sqrt{\Phi\,{\rm log}\,\Phi}\,m\,\alpha(m,n)\,\,{\rm log}\,(\Phi W))\,$ using [20], ( $\Phi=\sum_{v\in V}f(v)$ , $W$ the maximum edge weight) and our algorithm eliminates the $\alpha(m,n)$ factor, making the time within a factor $\sqrt{\,{\rm log}\,{\Phi}}$ of the bound for bipartite multigraphs.

The technical difficulty for this work stems from the fact that previous algorithms for both matching and $f$ -matching use vertex contractions to form blossoms. The dfs approach necessitates using edge contractions. As an example difficulty, edge contractions require extending the definition of blossom to “skew blossoms”, i.e., configurations that must be reorganized in order to become valid blossoms. (The algorithm of [20] uses vertex contractions.)

1 Introduction

Blocking flows (aka blocking paths) are a fundamental tool for speeding up augmenting-type algorithms. They were introduced by Dinic [3] to obtain the first strongly polynomial algorithm for maximum network flow. Hopcroft and Karp [19] (and independently Karzanov [21]) applied the idea to maximum cardinality bipartite matching. They also showed its applicability to general graph matching, and Micali and Vazirani [24] gave the first blocking paths algorithm for general graph cardinality matching. Gabow and Tarjan applied blocking flows in cost-scaling algorithms for minimum cost network flow (aka degree-constrained subgraphs or $f$ -matchings) [16] and minimum cost general graph matching [17]. Here we have only cited some first applications of the idea; see reference [25, Sec. 9.6,10.8b,16.7a,17.5a,24.4a] for a comprehensive bibliography of the many uses of blocking paths.

We present an algorithm for blocking trails for $f$ -factors of general multigraphs. Here “general” is to be contrasted with “bipartite”, an important but much simpler special case. For the definitions of $f$ -factors, blocking trails, etc., please see the above abstract or the last paragraph of this section.

We turn to discussing recent work that applies blocking trails to multigraph $f$ -factors. Huang and Pettie [20] give algorithms for a host of problems on general multigraphs: They achieve time $O(\sqrt{\Phi}\,m\,\alpha(m,n))$ for maximum cardinality $f$ -matching. Here $\Phi=\sum_{v\in V}f(v)$ , i.e., twice the size of an $f$ -factor. The same bound holds for a minimum cardinality $f$ -edge cover (i.e., $f$ is a lower bound for degrees in the desired subgraph). These algorithms use a scaling algorithm that finds an approximation to the weighted version of the problem. The scaling algorithm uses time $O(m\,\alpha(m,n)\,\epsilon^{-1}\,{\rm log}\,\epsilon^{-1})$ to find a $(1-\epsilon)$ -approximate maximum weight $f$ -matching, or a $(1+\epsilon)$ -approximate minimum weight $f$ -edge cover.

The next applications are scaling algorithms for maximum weight $f$ -factors. To put the results in perspective first recall the classic time bounds for (unweighted) bipartite $f$ -factors [21, 8]:

$$\begin{cases}O(n^{2/3}\;m)&G\text{ a simple graph}\\ O(\sqrt{\Phi}\;m)&G\text{ a multigraph.}\end{cases}\tag{1.1}$$

For maximum weight bipartite $f$ -factors, Gabow and Tarjan [16] presented a scaling algorithm whose time bound is just a scaling factor above (1.1), i.e., time $O(n^{2/3}\;m\;\,{\rm log}\,(nW))$ for simple graphs and $O(\sqrt{\Phi}\;m\;\,{\rm log}\,(\Phi W))$ for multigraphs. Here $W$ is the maximum (integral) edge weight.

Now consider general graphs. Duan, He and Zhang [4] give a scaling algorithm for maximum weight $f$ -factors of simple general graphs. The time is only logarithmic factors above (1.1), i.e., $\widetilde{O}(n^{2/3}\,m\,\,{\rm log}\,W)$ . In fact when all weights are 1 this is the first algorithm to achieve the bound of (1.1), to within logarithmic factors, for unweighted $f$ -factors of simple general graphs. Returning to weighted $f$ -factors, Gabow [12] gives a scaling algorithm that achieves the multigraph bound of (1.1), to within logarithmic factors, specifically $O(\sqrt{\Phi\,{\rm log}\,\Phi}\,m\,\alpha(m,n)\,\,{\rm log}\,(\Phi W))$ .

All of the above algorithms use a subroutine that finds a set of blocking trails. The time bounds cited above assume the blocking set is found by the subroutine presented by Huang and Pettie [20]. It finds a blocking set (more specifically, a set of blocking trails and cycles) in time $O(m\alpha(m,n))$ . Our blocking trail algorithm runs in time $O(m)$ . Using our algorithm the above time bounds all decrease by a factor $\alpha(m,n)$ . (The decrease occurs in [4] but is hidden by the use of $\widetilde{O}$ .)

Our algorithm is based on the depth-first-search approach to blocking paths for matching, introduced in [17]; see also [11]. Dfs is a natural approach for finding blocking paths, since backing up prematurely may necessitate reexploring edges later on. However it introduces complications for managing blossoms – they are usually processed immediately on discovery, but they are necessarily postponed in a dfs regime [17, 11]. Extending the dfs approach to $f$ -factors of general multigraphs introduces new complications, which we now summarize.

Fundamentally, in previous settings blossoms are “shrunk” using vertex contraction. This holds even in previous $f$ -factor algorithms, e.g., see the definition of blossom in [14]. However dfs requires shrinking by edge contraction. A first consequence is a new way of discovering blossoms, which we call “skew blossoms”. Skew blossoms have the same properties as ordinary blossoms. However they require a reorganization in order to achieve valid blossom structure (Lemma 2.1).

A second consequence of dfs is incomplete blossoms, i.e., blossoms that do not get completely processed because of the discovery of an augmenting trail. Incomplete blossoms occur in ordinary matching, but they present additional complexity in $f$ -factors. Specifically because of edge contraction, an alternating trail from a later search can reenter an incomplete blossom. (See Sec.2.) The analysis of Sec.5 shows these reentries present no problem.

Finally edge-contracted blossoms are not immediately handled by the standard min-max formula for maximum cardinality $f$ -matching (which we require to prove the blocking property). We show the essential structure of vertex-contracted blossoms is preserved by our algorithm (e.g., Lemma 5.2).

These complications also reveal the high-level difference between our approach and that of Huang and Pettie [20]. Their algorithm uses vertex contraction rather than edge contraction. Its fundamental idea is cycle cancellation, which does not occur in our algorithm.

The paper is organized as follows. Section 2 discusses blossoms, first reviewing the definition of $f$ -factor blossoms and then discussing the new types of blossoms in our algorithm. Appendix A gives the depth-first search algorithm for blocking ordinary matchings [17, 11], for possible help in the reading of Section 2. Section 3 presents the blocking algorithm. Section 4 proves the algorithm constructs a valid search structure; most importantly, the blossoms are valid. Section 5 proves the augmenting trails found by the algorithm form a blocking set. This completes the analysis of the algorithm, which is summarized in Theorem 5.5.

Terminology and conventions

We often omit set braces from singleton sets, denoting $\{v\}$ as $v$ . So $S-v$ denotes $S-\{v\}$ . We abbreviate expressions $\{v\}\cup S$ to $v+S$ . We use a common summing notation: If $f$ is a function on elements and $S$ is a set of elements then $f(S)$ denotes $\sum_{s\in S}f(s)$ .

The trees in this paper are out-trees. Writing $xy$ for an arc of a tree implies the arc joins parent $x$ to child $y$ . We extend parent and child relations to tree arcs, e.g., arc $xy$ is the parent of $yz$ . Saying that a node $x$ is an ancestor of a node $y$ allows the possibility $x=y$ , unless $x$ is said to be a proper ancestor of $y$ . Similarly for arcs. A pendant edge has no children.

Graphs in this paper are undirected multigraphs. An edge joining vertices $x$ and $y$ is denoted $\{x,y\}$ . Note this notation ignores distinctions between multiple copies of an edge. Such distinctions are irrelevant to our algorithm. Also note that a loop at $x$ is denoted $\{x,x\}$ . Finally note that we use the usual shorthand notation $xy$ to denote edge $\{x,y\}$ if context makes it clear that $\{x,y\}$ is a graph edge, not a tree arc. (This point is reiterated at the start of Section 3.)

In graph $G=(V,E)$ for $S\subseteq V$ and $M\subseteq E$ , $\delta(S,M)$ ( $\gamma(S,M)$ ) denotes the set of edges of $M$ with exactly one (respectively two) endpoints in $S$ . We omit $M$ (writing $\delta(S)$ or $\gamma(S)$ ) when $M=E$ . A loop at $v\in S$ belongs to $\gamma(S)-\delta(S)$ . For multigraphs $G$ all of these sets are multisets.
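As a small illustration of this notation, the following Python sketch computes $\delta(S,M)$ and $\gamma(S,M)$ for a multigraph. The representation (a list of vertex pairs, with a loop written `(v, v)` and parallel edges repeated) and the function names `delta` and `gamma` are assumptions made for the example only.

```python
from typing import Hashable, Iterable, List, Set, Tuple

Edge = Tuple[Hashable, Hashable]  # an undirected edge {x, y}; a loop is (x, x)


def delta(S: Set[Hashable], M: Iterable[Edge]) -> List[Edge]:
    """Edges of M with exactly one endpoint in S (a multiset, kept as a list)."""
    return [(x, y) for (x, y) in M if (x in S) != (y in S)]


def gamma(S: Set[Hashable], M: Iterable[Edge]) -> List[Edge]:
    """Edges of M with both endpoints in S; a loop at v in S is counted here."""
    return [(x, y) for (x, y) in M if x in S and y in S]


# Parallel edges are separate list entries, so the results are multisets.
E = [("a", "b"), ("a", "b"), ("b", "c"), ("a", "a")]
print(delta({"a"}, E))  # the two parallel copies of {a, b}
print(gamma({"a"}, E))  # the loop at a
```

Keeping the results as lists rather than Python sets is what preserves multiplicities of parallel edges.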

Figures in this paper use the following conventions. Matched edges are drawn heavy, unmatched edges light. Free vertices are drawn as rectangles. Trails are indicated by arrowheads on their edges. A figure illustrating the algorithm can be drawn in the given graph $G$ or in an auxiliary graph $\cal T$ (which is essentially the search tree; Fig.5(d.1) and (d.2) show the two views). The view in $G$ is intuitive and informative but ambiguous: A given vertex $v$ can occur many times in the search, and an edge of the search leading to $v$ can potentially be used at any of these occurrences (again see Fig.5(d)). The same edge drawn in $\cal T$ goes to a new occurrence of $v$ , clearly less informative. Of course there is no such difficulty in ordinary matching.

For an undirected multigraph $G=(V,E)$ with function $f:V\to\mathbb{Z}_{+}$ , an $f$ -factor is a subgraph where each vertex $v\in V$ has degree exactly $f(v)$ . In a partial factor each $v$ has degree $\leq f(v)$ . $v$ is free if strict inequality holds.

We often call the edges of a partial $f$ -factor a matching or $f$ -matching. (Footnote 1: An $f$ -matching is not to be confused with a $b$ -matching [25]. This paper never discusses the latter.) So we refer to an ordinary matching as a 1-matching (i.e., $f$ is identically 1 for all vertices). $def(x)$ is the deficiency of vertex $x$ in the current matching $M$ , $def(x)=f(x)-|\delta(x,M)|-2|\gamma(x,M)|$ .
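The deficiency formula can be sketched in a few lines of Python. The edge-list representation and the name `deficiencies` are assumptions of this example. A loop at $x$ contributes 2 to the degree of $x$, which is the term $2|\gamma(x,M)|$.

```python
from collections import Counter
from typing import Dict, Hashable, Iterable, Tuple

Edge = Tuple[Hashable, Hashable]


def deficiencies(f: Dict[Hashable, int], M: Iterable[Edge]) -> Dict[Hashable, int]:
    """Return def(x) = f(x) - |delta(x, M)| - 2|gamma(x, M)| for every vertex x."""
    degree = Counter()
    for x, y in M:
        degree[x] += 1  # for a loop (x == y) both increments hit the same vertex
        degree[y] += 1
    return {x: f[x] - degree[x] for x in f}


f = {"a": 3, "b": 2, "c": 1}
M = [("a", "a"), ("a", "b")]  # a matched loop at a and a matched edge {a, b}
d = deficiencies(f, M)
free = [x for x in f if d[x] > 0]  # vertices where the degree bound is strict
```

Here vertex `a` has degree 3 (two from the loop, one from the edge), so it is not free, while `b` and `c` are.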

When discussing a matching $M$ , the M-type of an edge $e$ is $M$ or $\overline{M}$ depending on whether $e$ is matched or unmatched, respectively. We usually denote an arbitrary M-type as $\mu$ , and $\mu(e)$ denotes the M-type of edge $e$ .

Consider a graph $G$ with an $f$ -matching $M$ . An augmenting trail is an alternating trail $A$ that begins and ends at a free vertex, such that $M\oplus A$ is a valid matching, i.e., the two ends of the trail still satisfy their degree bound $f$ . (The trail may be closed, i.e., $A$ begins and ends at the same vertex. Alternating means consecutive edges of $A$ have opposite M-types.) An augmenting set is a collection of edge-disjoint augmenting trails ${\cal A}$ such that $M\bigoplus_{A\in{{\cal A}}}A$ is a valid $f$ -matching (i.e., rematching all the trails keeps every free vertex of $M$ within its degree bound $f$ .) A blocking trail set is a maximal augmenting set. It is the analog of a blocking flow.
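The definition of an augmenting trail can be checked mechanically. The sketch below is one hedged reading of it in Python: edges carry integer ids so that parallel copies are distinguishable, a trail is given by a start vertex and a list of edge ids, and the function name `is_augmenting_trail` is hypothetical. It additionally assumes the first and last edges are unmatched, which matches the augment step of the algorithm in Section 3 rather than the literal wording above.

```python
from collections import Counter
from typing import Dict, Hashable, List, Set, Tuple

Vertex = Hashable


def degrees(edges: Dict[int, Tuple[Vertex, Vertex]], chosen: Set[int]) -> Counter:
    """Degree of every vertex in the subgraph formed by `chosen` (loops count twice)."""
    deg = Counter()
    for e in chosen:
        x, y = edges[e]
        deg[x] += 1
        deg[y] += 1
    return deg


def is_augmenting_trail(edges, f: Dict[Vertex, int], M: Set[int],
                        start: Vertex, trail: List[int]) -> bool:
    if not trail or len(set(trail)) != len(trail):
        return False  # a trail is nonempty and never repeats an edge
    cur = start
    for e in trail:  # walk the trail, checking consecutive edges share a vertex
        x, y = edges[e]
        if cur == x:
            cur = y
        elif cur == y:
            cur = x
        else:
            return False
    end = cur
    # Alternating: consecutive edges have opposite M-types.
    if any((a in M) == (b in M) for a, b in zip(trail, trail[1:])):
        return False
    # Assumption of this sketch: the trail starts and ends with unmatched edges.
    if trail[0] in M or trail[-1] in M:
        return False
    deg = degrees(edges, M)
    if deg[start] >= f[start] or deg[end] >= f[end]:
        return False  # both ends must be free
    new_deg = degrees(edges, M ^ set(trail))  # the matching after rematching
    return all(new_deg[v] <= f[v] for v in f)


edges = {0: ("a", "b"), 1: ("b", "c"), 2: ("c", "d"), 3: ("a", "a")}
f = {"a": 1, "b": 1, "c": 1, "d": 1}
M = {1}
```

With these data the trail `[0, 1, 2]` from `a` is augmenting. The closed trail `[3]` (the loop at `a`) is augmenting only if `f["a"]` is at least 2, which illustrates why a closed trail needs two units of deficiency at its end.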

2 Blossoms in $d$

Blossoms are the main issue, and the main stumbling block, in any algorithm for matching general graphs. So it is appropriate to start with the new features of blossoms in our algorithm. This section starts by reviewing the natural definition of $f$ -factor blossoms presented in [14]. Then it introduces our new blossom variant, skew blossoms, and overviews the new potential difficulty for our algorithm, incomplete blossoms. Finally it presents a blossom substitute structure that allows our algorithm to treat weighted blossoms as ordinary vertices. Appendix A gives the predecessor of our algorithm and may be useful for Fig.2 showing incomplete blossoms.

Definition of Blossom, in $G$

Our algorithm constructs blossoms that satisfy the definition presented in [14]. We call these ordinary blossoms. We briefly restate their definition below. Section 3 gives a similar definition for ordinary blossoms, but as they occur in our auxiliary graph $\cal T$ rather than $G$ .

Let $G$ be a multigraph with an $f$ -matching. A blossom $B$ is a subgraph of $G$ that has a distinguished vertex, called the base vertex $\beta$ , and a distinguished incident edge, called the base edge $\eta$ , whose end in $B$ is $\beta$ . (If $\beta$ is free then $\eta$ is an artificial unmatched edge.) The detailed definition is recursive. We begin with a graph $\overline{G}$ , the original graph $G$ with zero or more recursively defined blossoms contracted. A vertex in $\overline{G}$ is either a contracted blossom or a vertex of $G$ called an atom. The new blossom $B$ is defined as a closed trail $C$ . $C$ begins and ends at a vertex of $\overline{G}$ called the starter. Removing the starter from $C$ gives the blossom trail. The blossom trail must be alternating. Alternating means two consecutive edges meeting at an atom have opposite M-types; there is no restriction on edges meeting at a contracted blossom. Any atom may occur arbitrarily many times in $C$ . However a contracted blossom $A$ occurs at most once (a starter blossom occurs as both ends of $C$ ). Further if $A$ is in the blossom trail its base edge must be one of its two incident $C$ -edges.

In the base case of the recursion the starter is an atom, which is taken to be $\beta$ . The first and last edges of $C$ must have the same $M$ -type, which is called the M-type of $B$ . $\eta$ is an edge incident to $C$ , whose M-type is opposite that of $B$ . (So a blossom whose base vertex is free has M-type $\overline{M}$ .)

In the recursive case the starter is a blossom (that is being enlarged). There is no restriction on the M-type of the starter’s two incident edges. The base vertex of $B$ is the base vertex of the starter. The same holds for the blossom’s M-type and its base edge, which must be incident to the trail $C$ . $\Box$
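To make the base case concrete, the following Python sketch tests whether a closed trail $C$ consisting only of atoms satisfies the definition, given just the M-types of its edges in trail order. The string encoding of M-types and the function name `base_case_blossom_type` are assumptions of the example. Contracted blossoms on $C$ and the base edge $\eta$ are not modeled.

```python
from typing import List, Optional

MATCHED, UNMATCHED = "M", "M-bar"


def base_case_blossom_type(types: List[str]) -> Optional[str]:
    """types[i] is the M-type of the i-th edge of the closed trail C, listed
    from the starter beta around C and back to beta.  Return the M-type of the
    blossom if C satisfies the base case of the definition, otherwise None."""
    if not types:
        return None
    # The blossom trail (C with the starter removed) alternates at every atom.
    for a, b in zip(types, types[1:]):
        if a == b:
            return None
    # The first and last edges, which meet at beta, have the same M-type.
    if types[0] != types[-1]:
        return None
    return types[0]


print(base_case_blossom_type([UNMATCHED, MATCHED, UNMATCHED]))  # a light triangle
print(base_case_blossom_type([MATCHED]))                        # a heavy loop
```

One consequence visible in the code: an alternating sequence whose first and last entries agree has odd length, so a base-case blossom always has an odd number of edges.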

Our definition of blossoms differs from [14] in an important way. The algorithms of [14] contract the vertex sets of blossoms. This is not compatible with the depth-first search strategy of our algorithm. Instead our definition of blossoms contracts edge sets. (The first example is Fig.5(d).) This means that a vertex of $G$ occurring in a contracted blossom of $C$ may also have atomic occurrences in $C$ . (Fig.7 illustrates the common case of pendant edges, introduced in Lemma 4.4( $i$ ).) In fact a vertex may also occur in many different blossoms of $C$ .

Our edge-contracted blossoms have three properties allowing them to function like vertex-contracted blossoms and make our algorithm correct. First is the most fundamental property of blossoms: An augmenting trail in $\overline{G}$ (graph $G$ with all blossoms contracted) gives an augmenting trail in $G$ , if we add appropriate trails through contracted blossoms. The appropriate trails are called $P_{i}(v,\beta)$ in [14]. Here $v$ is a vertex in a blossom $B$ with base vertex $\beta$ , and the edges of this trail are in the blossom subgraph of $B$ . The trails are constructed in Lemma 4.4 of [14]. This construction works for edge-contracted blossoms if we make one simple change: In every closed trail $C$ of a blossom, replace each occurrence of an atomic vertex $x$ by a new vertex $x_{i}$ . Clearly the edge contractions used in our definition correspond to vertex contractions in the modified graph. So the $P_{i}$ trails exist for edge-contracted blossoms.

The second property is a relaxed version of vertex-contracted blossoms: At any point in our algorithm a given vertex $x$ of $G$ occurs in at most one blossom. This is property $(\dagger)$ , stated after Proposition 4.8. $(\dagger)$ ensures a given blossom occurs at most once in $C$ . It is also crucial in establishing the blocking property (starting with the definition of labels in (5.3)).

The third property is that in the search structure, a blossom has one entering edge (its base) and every other edge is leaving. This property fails for edge-contracted blossoms. However it holds when our algorithm halts (Corollary 5.1). This allows us to use the min-max formula for maximum cardinality $f$ -matching in Section 5 to establish the blocking property of our algorithm.

We continue with several more comments on the definition of blossom. First note that the aforementioned trails $P_{i}(v,\beta)$ explain the interpretation of “alternating” in the definition of blossom (the alternating trails through the contracted blossom exist regardless of the edges incident to it in $C$ ). Second, a loop $xx$ qualifies as a closed trail. So a blossom may have base vertex $x$ , closed trail $xx$ , whose first and last edges are identical. Finally, a blossom is called heavy (light) when its M-type is $M$ ( $\overline{M}$ ), respectively.

Skew Blossoms

Skew blossoms extend the algorithmic definition of blossom. They have the same structure as ordinary blossoms after a reorganization. The analysis of skew blossoms is similar to the construction of $P_{i}$ trails in [14]. Skew blossoms are illustrated in Fig.1 and defined as follows.

Figure 1: Example blossoms: (a) A minimal (ordinary) blossom. (b) A skew blossom. Structure of a (heavy) skew blossom: (c) $x\neq\beta(A)$ . (d) $x=\beta(A)$ .

Consider a graph with various blossoms contracted, including a blossom $A$ that contains a vertex $x\in V(G)$ . Let $T$ be an alternating trail, with first vertex $x$ and last vertex (the contracted) blossom $A$ , first edge $xy$ and last edge $\eta(A)$ . $T$ is a skew blossom.

Note the first vertex $x$ occurs in $T$ as an atom, although $x$ also belongs to $A$ ; this is possible because blossoms are edge-contracted. Also it is possible that ${xy}=\eta(A)$ . We may also have $xy$ a loop at $x$ .

Lemma 2.1

$T$ is a valid blossom, with base vertex $x$ and M-type $\mu({xy})$ .

Proof: We wish to construct a blossom decomposition $B$ for the given skew blossom $S$ . This decomposition will be a sequence of closed trails $C_{i}$ , $i\geq 0$ , satisfying the definition of a valid blossom, and having the same set of edges as $S$ . The initial blossom trail $C_{0}$ will start with the given trail $T$ from the atom $x$ to $\eta(A)$ , and follow a trail in $A$ to an occurrence of $x$ on an edge of M-type $\mu({xy})$ .

To achieve this goal we will construct a sequence of alternating trails $T_{i}$ , $i=1,\ldots,k$ , where each $T_{i}$ is a prefix of $T_{i+1}$ . The last trail $T_{k}$ will be the above initial blossom trail $C_{0}$ . Along the way we also construct the desired sequence of blossom trails $C_{i},i>0$ . The construction maintains the invariant that starting with $C_{0}$ , adding the trails $C_{i}$ in order gives a valid blossom. Furthermore the construction ends with the $C_{i}$ consisting of the same edges as $S$ .

To begin observe that the vertex $x$ occurs as an atom in a unique closed trail $C$ of $A$ . Let $x$ refer to some fixed occurrence of $x$ in $C$ (chosen arbitrarily if there are more than one). Let $\beta$ and $\eta$ be the base vertex and base edge of the blossom corresponding to $C$ .

Suppose $x\neq\beta$ . Let $T_{1}$ be the subtrail of $C\cup\eta$ that starts with the edge of $(\delta(x)\cup\gamma(x))\cap\mu({xy})$ , follows $C$ to $\beta$ and then traverses $\eta$ . $T_{1}$ exists since $C$ alternates at $x$ . Let $C_{1}$ be the trail $C-T_{1}$ . Clearly adding $C_{1}$ to $T_{k}$ gives a valid blossom, which contains all of $C$ .

Now suppose $x=\beta$ . We proceed exactly as before. If the M-type of $C$ is $\mu({xy})$ then $T_{1}$ is the entire trail $C\cup\eta$ and $C_{1}$ is empty. If the M-type of $C$ is $\overline{\mu}({xy})$ then $T_{1}$ is the single edge $\eta$ and $C_{1}$ is $C$ .

Now inductively assume $T_{i-1}$ ends with the edge $\eta(A_{i-1})$ , where $A_{i-1}$ is a blossom in the closed trail $D$ of $A$ . Proceed similarly to the base case: Let $\beta$ and $\eta$ be the base vertex and base edge of $D$ . Let $T_{i}$ be the subtrail of $D\cup\eta$ that starts with edge $\eta(A_{i-1})$ , follows $D$ to $\beta$ and then traverses $\eta$ . Let $C_{i}$ be the trail $D-T_{i}$ . Adding $C_{i}$ to $T_{k}\cup\bigcup_{1}^{i-1}C_{j}$ gives a valid blossom, which contains all of $D$ . As a special case it is possible that $A_{i-1}$ is the starter blossom for $D$ . In that case $T_{i}=T_{i-1}$ and $C_{i}=D$ .

Eventually we have $A_{i-1}=A$ . In that case $i=k$ and $T_{k}$ is as specified above. $\Box$

Incomplete Blossoms

A successful search finds an augmenting trail. This leads to the possibility of incomplete blossoms. A blossom $B$ is incomplete if the blossom step has some $d(u_{i},f_{i})$ invocation leading to a free vertex, so not all of the vertices in $B$ get scanned. Fig.2 gives examples of incomplete blossoms.

In 1-matching incomplete blossoms $I$ present little problem. In detail, the augment step removes all edges in the augmenting path $P$ . In 1-matching this removes all vertices on $P$ . The remaining vertices in $I$ have all been completely scanned (by the dfs order). So $I$ has the same properties as an ordinary blossom. This is not the case for multigraphs, since as illustrated in Fig.2(a), a vertex $r$ on the augmenting trail remains in the graph with unscanned edges.

Eliminating Weighted Blossoms

Blocking trails are required in two types of applications of our algorithm: algorithms for maximum cardinality $f$ -factors and scaling algorithms for maximum weight $f$ -factors. Cardinality algorithms are handled directly by the algorithm of Section 3. But weighted algorithms require a modification of the graph, which we now describe.

In scaling algorithms, a blocking trail is found and rematched after each dual adjustment. The graph for the blocking trail is formed from the input graph by contracting every weighted blossom (i.e., blossom with positive dual variable $z(B)$ ). A blocking set trail can pass through such a blossom only once. In contrast an ordinary vertex that is not in a weighted blossom can appear in many different blocking set trails, and many different times in each trail. In order to have all vertices alike we replace each weighted blossom by a blossom substitute, illustrated in Fig.3.

In detail consider a contracted blossom $B$ with base vertex $\beta$ , and base edge $\eta={a\beta}$ . ( $\eta$ does not exist if $\beta$ is a free vertex.) Let $Bm$ ( $Bu$ ) denote a typical matched (unmatched) edge incident to $B$ other than $\eta$ . The blossom substitute for $B$ discards the vertices of $B-\beta$ and replaces them by a new vertex denoted $b$ and a new edge ${\beta b}$ . It defines $f(\beta)=f(b)=1$ . If $B$ is a light blossom, ${\beta b}$ is unmatched, each $Bm$ edge is replaced by matched ${bm}$ , and each $Bu$ edge is replaced by unmatched ${\beta u}$ . (Note that aside from $\eta$ , edges incident to $\beta$ in the original graph are treated as $Bm$ or $Bu$ edges in the substitute.) If $B$ is heavy then ${\beta b}$ is matched, each $Bm$ edge is replaced by matched ${\beta m}$ , and each $Bu$ edge is replaced by unmatched ${bu}$ .
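The replacement rules for a blossom substitute are easy to misread, so here is a hedged Python sketch that applies them. The input format (a list of `(other_endpoint, is_matched)` pairs for the edges incident to $B$ other than $\eta$), the tuple used to name the new vertex $b$, and the function name `blossom_substitute` are all assumptions of the example. The base edge $\eta$ is left untouched at $\beta$ and is therefore not part of the output.

```python
from typing import Hashable, List, Tuple


def blossom_substitute(beta: Hashable, heavy: bool,
                       incident: List[Tuple[Hashable, bool]]):
    """Return (b, new_edges, f) for the substitute of a contracted blossom B.

    new_edges is a list of (u, v, is_matched) triples.
    """
    b = ("b", beta)                      # hypothetical name for the new vertex b
    new_edges = [(beta, b, heavy)]       # edge beta-b is matched iff B is heavy
    for other, is_matched in incident:
        if heavy:
            end = beta if is_matched else b   # Bm -> beta m,  Bu -> b u
        else:
            end = b if is_matched else beta   # Bm -> b m,     Bu -> beta u
        new_edges.append((end, other, is_matched))
    f = {beta: 1, b: 1}                  # the substitute defines f(beta) = f(b) = 1
    return b, new_edges, f


b, light_edges, f_sub = blossom_substitute("beta", False, [("m1", True), ("u1", False)])
_, heavy_edges, _ = blossom_substitute("beta", True, [("m1", True), ("u1", False)])
```

In both cases the edge types around $\beta$ and $b$ force any alternating trail through the substitute to use $\eta$, which is how the substitute limits a weighted blossom to one pass.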

It is easy to see that blocking trails in the original graph $G$ correspond to blocking trails in the graph with substitutes, $G^{\prime}$ . In detail consider an alternating trail $T$ in $G$ and a weighted blossom $B$ . $T$ either contains $\beta$ or it does not. If $T$ contains $\beta$ , it contains $\eta$ (if it exists) and exactly one of the $Bm$ , $Bu$ edges. If $T$ does not contain $\beta$ it does not contain $\eta$ or any $Bm$ or $Bu$ edge. Let $T^{\prime}$ be an alternating trail in $G^{\prime}$ . We give the details for a light blossom; heavy blossoms are symmetric. If $T^{\prime}$ contains $\beta$ , it contains $\eta$ (if it exists), and it either contains ${\beta b}$ , exactly one ${bm}$ edge and no ${\beta u}$ edge, or it contains exactly one ${\beta u}$ edge and no ${bm}$ edge. If $T^{\prime}$ does not contain $\beta$ it does not contain $\eta$ , ${\beta b}$ or any ${\beta u}$ or ${bm}$ edge.

3 The blocking trail algorithm

This section presents the algorithm. It illustrates the algorithm’s execution and elaborates on details of the algorithm statement.

The overall algorithm is called find_trails. It uses a recursive depth-first search routine called $d$ . We begin by describing the data structures and giving an overview of $d$ . Each vertex $x$ has two lists: $GL(x)$ contains the edges $e$ incident to $x$ that can be used in a grow step from $x$ . $e$ may be matched or unmatched, and may be a loop. A nonloop $e$ starts out in two GL’s, and is removed from both lists when a grow step is executed from the first end. Similarly the second list $BL(x)$ contains the edges incident to $x$ that can trigger a blossom step. When find_trails begins the lists are initialized as

$$GL(x)=\delta(x)\cup\gamma(x),\ BL(x)=\emptyset.$$

Each search for an augmenting path begins with the GL’s and BL’s as they were at the end of the previous search. (We shall see that a BL entry is relevant only in the search that created it.)
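The list initialization can be sketched as follows in Python. Edges are referred to by their index in an edge list so that parallel copies stay distinct. The names `init_lists` and `remove_edge` are hypothetical. `list.remove` is used for brevity; a linear-time implementation would need constant-time deletion, for instance doubly linked lists or a per-edge "used" flag.

```python
from typing import Dict, Hashable, List, Tuple

Vertex = Hashable


def init_lists(vertices: List[Vertex], edges: List[Tuple[Vertex, Vertex]]):
    """Build GL(x) = delta(x) ∪ gamma(x) and BL(x) = ∅ for every vertex x."""
    GL: Dict[Vertex, List[int]] = {x: [] for x in vertices}
    BL: Dict[Vertex, List[int]] = {x: [] for x in vertices}
    for e, (x, y) in enumerate(edges):
        GL[x].append(e)
        if y != x:            # a nonloop starts out in two GL's, a loop in one
            GL[y].append(e)
    return GL, BL


def remove_edge(GL, edges, e: int) -> None:
    """Remove edge e from the GL of both ends, as done when a grow step uses e."""
    x, y = edges[e]
    GL[x].remove(e)
    if y != x:
        GL[y].remove(e)


edges = [("a", "b"), ("a", "b"), ("a", "a")]   # two parallel edges and a loop
GL, BL = init_lists(["a", "b"], edges)
remove_edge(GL, edges, 0)                       # a grow step along edge 0
```

After the removal, edge 0 is gone from both `GL["a"]` and `GL["b"]`, so it cannot be grown again from its other end.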

We manage each $BL(x)$ using a routine pop that removes edges from a list. Specifically pop $(L)$ removes and returns an element $e$ of list $L$ , where $e$ can be chosen arbitrarily with one exception: The first invocation of pop $(L)$ must return the first element that was added to $L$ . Obviously a FIFO queue satisfies this requirement, but we use pop for clarity and greater generality.
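A minimal Python sketch of a list with this pop discipline is shown below. The class name `PopList` and the use of a random choice for the later pops are assumptions of the example. The random choice only emphasizes that any element may be returned after the first invocation.

```python
import random


class PopList:
    """A list whose first pop returns the first element ever added;
    every later pop may return an arbitrary remaining element."""

    def __init__(self):
        self._items = []
        self._popped_once = False

    def add(self, e):
        self._items.append(e)

    def __len__(self):
        return len(self._items)

    def pop(self):
        if not self._items:
            raise IndexError("pop from empty PopList")
        if not self._popped_once:
            self._popped_once = True
            return self._items.pop(0)          # first invocation: first element added
        i = random.randrange(len(self._items))  # afterwards: any element is allowed
        self._items[i], self._items[-1] = self._items[-1], self._items[i]
        return self._items.pop()


L = PopList()
for e in ["e1", "e2", "e3"]:
    L.add(e)
first = L.pop()   # always "e1"
```

Replacing the random choice by `self._items.pop(0)` throughout gives an ordinary FIFO queue, which is one valid instance of the routine.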

The $d$ routine works in two phases. It starts by using its GL to do every possible grow step. Then it uses the BL to do every possible blossom step. In this second phase $BL(x)$ contains the edges where the dfs retreated from $x$ .

The algorithm constructs a search tree $\cal T$ consisting of the edges added in grow steps. To distinguish $\cal T$ from $G$ we use the terms node and arc for elements of $V({\cal T})$ and $E({\cal T})$ , respectively. We view $\cal T$ as an out-tree, so every arc is directed, from parent to child. Consider an edge $e={\{x,y\}}$ of $G$ . An occurrence of $e$ in $\cal T$ is denoted as $xy$ or $yx$ , where the arc is directed from the first vertex to the second, e.g., $xy$ means $x$ is the parent.

Since a vertex $x\in V(G)$ can occur multiple times in $\cal T$ , we identify nodes of $\cal T$ by an incident $\cal T$ -arc. Specifically let $yx$ be an arc in $\cal T$ . The notation $y\dot{x}$ refers to the node of $\cal T$ at the $x$ end of $yx$ , and $x\dot{y}$ is the node at the $y$ end. So node ${y\dot{x}}$ has $y$ the parent of $x$ or a child of $x$ .

The low-level algorithm represents blossoms using a data structure for set merging. The universe is the set of $\cal T$ -nodes. The sets are the vertex sets of the current blossoms. (So these blossoms are complete or incomplete.) In the pseudocode below, $B_{{x\dot{y}}}$ is maintained as the set of all nodes in the blossom currently containing node ${x\dot{y}}$ . It is a simple matter to transform this to a low-level linear-time set merging algorithm [15] (also see the simplified incremental-tree set-merging algorithm of [13]). (Footnote 2: A node ${z\dot{x}}$ not in any blossom has $B_{{z\dot{x}}}=\{{z\dot{x}}\}$ . This condition also holds for a loop $xx$ when edge $\{x,x\}$ is a singleton blossom. This double usage will not cause any confusion.) At any point in time $\overline{{\cal T}}$ denotes the graph with the current blossoms (i.e., the $B_{\cdot}$ sets) contracted.
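For readers who want something executable, the $B_{\cdot}$ sets can be prototyped with textbook union-find. This is a swapped-in technique, not the linear-time incremental-tree algorithm of [15, 13]: with path compression and union by size it costs an inverse-Ackermann factor per operation, which is exactly the factor the linear-time structure avoids. The class name `BlossomSets`, its methods, and the string labels used for $\cal T$ -nodes are assumptions of this sketch.

```python
class BlossomSets:
    """Disjoint sets over T-nodes; each set is the node set of a current blossom."""

    def __init__(self):
        self.parent = {}
        self.size = {}

    def make(self, node):
        """B_node <- {node}, as done for the new node of a grow step."""
        self.parent[node] = node
        self.size[node] = 1

    def find(self, node):
        root = node
        while self.parent[root] != root:
            root = self.parent[root]
        while self.parent[node] != root:   # path compression
            nxt = self.parent[node]
            self.parent[node] = root
            node = nxt
        return root

    def merge(self, a, b):
        """Merge the set containing b into the set containing a."""
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return
        if self.size[ra] < self.size[rb]:  # union by size
            ra, rb = rb, ra
        self.parent[rb] = ra
        self.size[ra] += self.size[rb]

    def same_blossom(self, a, b):
        return self.find(a) == self.find(b)


B = BlossomSets()
for node in ["eps.a", "a.b", "b.c"]:       # hypothetical labels for T-nodes
    B.make(node)
B.merge("eps.a", "a.b")                    # a blossom step merging two nodes
```

The representative returned by `find` is arbitrary here. An implementation that needs the base of a blossom would store it alongside each root.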

An invocation of $d$ either returns normally or gets terminated, i.e., it stops execution prematurely. The latter occurs when an augmenting trail is discovered. When this occurs every invocation of $d$ in the current call chain is terminated. All those terminated invocations correspond to edges on the augmenting trail. This trail is added to ${\cal A}$ , the set of all augmenting trails that have been discovered.

find_trails does not rematch the trails of ${\cal A}$ , leaving that to the calling routine. For instance in applications of our algorithm for weighted matching, the rematching must undo the blossom substitutes. Rematching is further discussed in detail after Theorem 5.5 of Section 5. The other advantage of postponing the rematching is that it simplifies wording in the analysis.

In contrast, the algorithm maintains the quantities $def(x)$ , $x\in V(G)$ , as the deficiency of vertex $x$ when the trails of ${\cal A}$ have been rematched. This allows proper identification of the free vertices.

procedure find_trails
    initialize $\cal T$ to an empty forest and ${\cal A}$ to an empty set
    for ( $x\in V(G)$ )
        initialize $def(x)$ to the deficiency of $x$ in the current matching
        initialize lists $GL(x)$ to $\delta(x)\cup\gamma(x)$ and $BL(x)$ to $\emptyset$
    for ( $\alpha\in V(G)$ )
        if ( $def(\alpha)>0$ and no invocation $d({z\dot{\alpha}})$ has returned) $d({\varepsilon\dot{\alpha}})$
            /* $\varepsilon$ is an artificial vertex, $\varepsilon\alpha$ a matched artificial arc */
    return ${\cal A}$

procedure d( $z\dot{x}$ )
    if ( $def(x)>0$ and $\mu(zx)=\overline{M}$ and $(x\neq\alpha$ or $def(\alpha)\geq 2)$ )
        /* augment step */
        add the $\overline{{\cal T}}$ -path from $\alpha$ to $x$ to ${\cal A}$
        decrement $def(\alpha)$ and $def(x)$
        terminate every currently executing invocation of $d$ , including this one
    while ( $\exists\text{ edge }xy\in GL(x)\cap\overline{\mu}(zx)$ )
        remove $xy$ from $GL(x)$ and $GL(y)$
        /* grow step */
        add an arc $xy$ , from node ${z\dot{x}}$ to a new node $y$ , to $\cal T$
        $B_{{x\dot{y}}}\leftarrow\{{x\dot{y}}\}$
        $d({x\dot{y}})$
    loop
        /* blossom base test */
        if (no occurrence of $x$ is in a blossom)
            if ( $\exists\text{ edge }xy\in BL(x)\cap\overline{\mu}(zx)$ ) $xy\leftarrow{\rm pop}(BL(x))$ else break
        /* blossom enlarge test */
        else if ( $z\dot{x}$ or some descendant $w\dot{x}$ is in a blossom)
            if ( $\exists\text{ edge }xy\in BL(x)$ ) $xy\leftarrow{\rm pop}(BL(x))$ else break
        /* blossom step */
        let $P$ be the path in $\overline{{\cal T}}$ from $B_{z\dot{x}}$ to $B_{{y\dot{x}}}$
        for (every arc $uv$ of $P$ ) merge $B_{{u\dot{v}}}$ into $B_{{z\dot{x}}}$
        for (every arc $uv$ of $P$ , as ordered in $P$ but skipping the first arc)
            /* blossom-invocation loop */
            $d({v\dot{u}})$
    add $zx$ to $BL(x)$
    return

Figure 4: Blocking algorithm for $f$ -factors.

Fig.4 gives the pseudocode for our algorithm. Fig.5 gives examples of searches of $d$ .
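Fig.4 does not say how the lists $GL(x)$ are stored. The following Python sketch shows one possible layout, given purely as an assumption: each vertex has one bucket per M-type, so that the test $GL(x)\cap\overline{\mu}(zx)\neq\emptyset$ reads a single bucket, and removal from both $GL(x)$ and $GL(y)$ is done lazily through a shared set of removed edge ids. The class name `GrowLists`, the method `take`, and the choice to list a loop once at its vertex are all hypothetical.

```python
class GrowLists:
    """Lists GL(x), split by M-type, with lazy deletion (illustrative only)."""

    def __init__(self, edges, matching):
        # edges: dict edge_id -> (u, v); matching: set of matched edge ids.
        self.removed = set()
        self.bucket = {}
        for e, (u, v) in edges.items():
            matched = e in matching
            for x in {u, v}:  # a loop is listed once at its vertex (assumption)
                self.bucket.setdefault((x, matched), []).append(e)

    def take(self, x, matched):
        """Remove and return some edge of GL(x) with the given M-type, or None."""
        stack = self.bucket.get((x, matched), [])
        while stack:
            e = stack.pop()
            if e not in self.removed:
                self.removed.add(e)  # this also deletes e from GL at its other end
                return e
        return None

# Hypothetical multigraph: two parallel edges a-b (ids 0, 1) and a loop at b (id 2).
gl = GrowLists({0: ("a", "b"), 1: ("a", "b"), 2: ("b", "b")}, matching={1})
```

Each edge id is appended at most twice and popped at most twice, so the total work spent in `take` over a whole run is proportional to the number of edges in this sketch.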

Let us comment on various statements in the algorithm. An invocation $d({\varepsilon\dot{\alpha}})$ made in the main routine is called a search (for an augmenting trail). This invocation is terminated iff an augmenting trail is discovered. So $d({\varepsilon\dot{\alpha}})$ returns if no augmenting trail is discovered. The discussion following Lemma 4.7 will show $\alpha$ remains free for the rest of the algorithm.
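The guard of the augment step can be restated as a small Python predicate. This is only a sketch: the function name and argument layout are ours. Our reading of the clause $def(\alpha)\geq 2$ is that a trail returning to its own root decrements the same counter twice, and the comments label this as an interpretation.

```python
def augment_step_applies(x, alpha, deficiency, arc_is_matched):
    """Guard of the augment step in d(z x), as we read Fig.4.

    x: vertex reached by the arc zx; alpha: root of the current search.
    deficiency: dict vertex -> current def value.
    arc_is_matched: True if mu(zx) = M, False if mu(zx) = M-bar.
    """
    if deficiency[x] <= 0:
        return False          # x is not free
    if arc_is_matched:
        return False          # the trail must end on an unmatched arc
    # A trail from alpha back to alpha uses two units of def(alpha)
    # (interpretation of the clause "x != alpha or def(alpha) >= 2").
    return x != alpha or deficiency[alpha] >= 2
```

Since the artificial arc $\varepsilon\alpha$ is matched, the predicate is false for the initial invocation $d({\varepsilon\dot{\alpha}})$, so a search never augments before leaving its root.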

We make some conventions to justify the terminology for $\cal T$ . As usual we call $\cal T$ a search tree even though it is a forest (each search starts at a different root). Also we allow $\cal T$ to contain loops $xx$ (added in grow steps). Similarly we allow the path $P$ in a blossom step to contain loops.

We continue discussing the blossom step. Fig.6 is a high level illustration of the structure of the three types of blossoms in $d$ . In the blossom base test, it is not obvious that the pop routine returns the correct edge (a priori the popped edge could have the wrong M-type). Lemma 4.1 will show correctness. In the blossom enlarge test, note that the condition that $z\dot{x}$ is in a blossom is easily checked. Also let us informally explain this test (the explanation is proved rigorously below). The possibility that $z\dot{x}$ is in a blossom covers ordinary blossoms (the possibility holds when either $z\dot{x}$ is already the base vertex of a blossom, or $zx$ or its reverse occurs on the blossom path). The possibility that $z\dot{x}$ is not in a blossom but $w\dot{x}$ is in one holds for skew blossoms. Next note that in the blossom step we identify each arc of $P$ by its nodes in $\cal T$ . The blossom step may have $P$ empty, specifically $B_{{y\dot{x}}}=B_{{z\dot{x}}}$ . We call such a blossom step a noop since nothing changes (there are no $B_{{u\dot{v}}}$ merges or $d({v\dot{u}})$ invocations). An invocation $d({x\dot{y}})$ in a grow step adds $xy$ to $\cal T$ , but $d({v\dot{u}})$ in the blossom step does not cause a similar addition. So the call chain of $d$ can be a proper subset of the current search path. (More precisely, an invocation $d({z\dot{x}})$ in the current call chain has the corresponding arc $xz$ or $zx$ in the current search path. Within blossoms, the call chain may omit search path edges.)

For more motivation let us explain the restriction on $uv$ in the blossom-invocation loop. (The explanation gets rigorously proved in our analysis below.) Let $ab$ be the first arc of $P$ , i.e., $d({a\dot{b}})$ is not called in the blossom-invocation loop. ( $x=a$ in Fig.6(a) and (c).) Let $B$ denote the blossom being formed. Consider two cases.

Suppose the first node of $P$ is not in a blossom. This means $B$ is either the first blossom formed with base edge $zx$ , or $B$ is a skew blossom (Fig.6(a) or (c)). There is no need to invoke $d({a\dot{b}})$ because the edges of $GL(x)$ have all been scanned before the blossom step. (In Fig.6(a) the scanning occurs because of edges $zx$ and $yx$ . In Fig.6(c) the scanning occurs in blossom $A$ or some subblossom.)

Suppose the first end of $P$ is in a blossom $A$ (Fig.6(b)). (We remark that this implies the last end of $P$ is also in $A$ .) There is no need for $d({a\dot{b}})$ since $a\dot{b}$ belongs to $A$ . So just like the skew blossom case, $A$ gives an invocation $d({v\dot{u}})$ with ${v\dot{u}}={a\dot{b}}$ and $\mu(vu)=\mu(ab)$ .

We turn to the issue of blossom completeness. A blossom $B$ becomes complete when $\eta(B)$ returns. (Recall there are two possibilities if $\beta(B)$ is a free vertex, say $b$ . In a search rooted at $b$ , i.e., $b=\alpha$ , $\eta(B)$ is the artificial arc $\varepsilon\alpha$ . Alternatively $b$ may occur in a search where it is not the root. In that case $\eta(B)$ can be a matched $G$ -edge. In both cases $B$ can be completed when $d(\eta(B))$ returns.)

4 Valid search structure

The goal of this section is to show find_trails constructs a valid search structure, i.e., all blossoms are valid and all search paths are alternating [14]. We adopt this as an invariant maintained by the algorithm:

(I1) Every step of the algorithm maintains a valid search structure, i.e., every blossom is valid and every path of the search structure is alternating.

The goal is achieved in Lemma 4.13.
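The alternation half of (I1) is easy to state operationally. The Python sketch below checks only the plain notion of an alternating trail (consecutive edges have opposite M-types and no edge repeats). It does not model blossoms or blossom validity, and the names are ours.

```python
def is_trail(trail):
    """A trail may repeat vertices but not edges; trail is a list of edge ids."""
    return len(set(trail)) == len(trail)

def is_alternating(trail, matched):
    """True if consecutive edges of the trail have opposite M-types.

    matched: set of edge ids belonging to the current f-matching.
    """
    return all((a in matched) != (b in matched)
               for a, b in zip(trail, trail[1:]))
```

For example, with matched set `{1}` the trail `[0, 1, 2]` alternates, while with matched set `{0, 1}` it does not.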

All the proofs and other arguments in this section make the implicit assumption that all previous steps satisfy (I1). We explicitly prove (I1) for every step that modifies the search structure. As part of this analysis we will verify that the algorithm is well-defined. Specifically, in the blossom step it is unclear why the path $P$ exists for arbitrary $yx\in BL(x)$ . The other statements of the algorithm present no problems in terms of their meaning.

We start with a fundamental concept of the algorithm. Consider a vertex $x$ . Let $d({z\dot{x}})$ be the first invocation of $d$ for $x$ that returns. Define $e_{1}(x)$ to be the edge $\{z,x\}$ . This definition requires that $d({z\dot{x}})$ is not terminated. For example an invocation $d({\varepsilon\dot{\alpha}})$ that gives an unsuccessful search makes $e_{1}(\alpha)=\varepsilon\alpha$ . If every search for a vertex $x$ is successful then $e_{1}(x)$ is an edge of $G$ reached in a search rooted at some free vertex $\neq x$ . $e_{1}(x)$ need not exist in this case.

$e_{1}(x)$ may be either arc $zx$ or $xz$ . In fact we will show the former always holds (Lemma 4.3) but this is not required at the moment. We usually abbreviate $e_{1}(x)$ to $e_{1}$ when context establishes the identity of the node $x$ .
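To make the definition of $e_{1}(x)$ concrete, the following Python sketch extracts it from a hypothetical log of invocation events. The log format and the function name are assumptions made for illustration; the algorithm itself keeps no such log.

```python
def first_returning_edge(events):
    """Compute e_1(x) for every vertex x from an event log.

    events: iterable of (kind, z, x), where kind is "invoke", "return" or
    "terminate" and (z, x) identifies the invocation d(z x).
    Returns a dict x -> (z, x) for the first invocation for x that returns.
    Vertices whose invocations were all terminated get no entry.
    """
    e1 = {}
    for kind, z, x in events:
        if kind == "return" and x not in e1:
            e1[x] = (z, x)
    return e1

# Hypothetical log: the first search (rooted at a) is successful, so its
# invocations are terminated; the second search (rooted at c) is unsuccessful.
log = [
    ("invoke", "eps", "a"), ("invoke", "a", "b"),
    ("terminate", "a", "b"), ("terminate", "eps", "a"),
    ("invoke", "eps", "c"), ("invoke", "c", "b"),
    ("return", "c", "b"), ("return", "eps", "c"),
]
e1 = first_returning_edge(log)
```

In this log $e_{1}(b)$ is the edge $\{c,b\}$ from the second search, $e_{1}(c)$ is the artificial arc, and $e_{1}(a)$ does not exist.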

We begin the analysis by showing correctness of the pop routine.

Lemma 4.1

When an invocation $d({z\dot{x}})$ satisfies the blossom base test, $\mu(zx)=\overline{\mu}(e_{1}(x))$ .

Remark: By definition $e_{1}(x)$ is the first edge added to $BL(x)$ . So in $d({z\dot{x}})$ pop returns $e_{1}(x)$ . The lemma shows $e_{1}(x)$ has the required M-type $\overline{\mu}(zx)$ . So pop operates correctly in the blossom base test. Correctness of pop is not an issue in the blossom enlarge test since any edge can be removed from $BL(x)$ .

Proof: First observe that no operation pop $(BL(x))$ has been performed previously in the blossom enlarge test, since that would mean a blossom already contains an occurrence of $x$ .

If the blossom base test is satisfied then $BL(x)$ is nonempty so $e_{1}(x)$ exists. Suppose for contradiction that $\mu(zx)=\mu(e_{1})$ . Let $xy$ be the edge that satisfies the blossom base test, i.e., $xy\in BL(x)\cap\overline{\mu}(zx)$ . So $\mu(xy)=\overline{\mu}(zx)=\overline{\mu}(e_{1})$ . $d({y\dot{x}})$ has returned (since $xy\in BL(x)$ ). $e_{1}$ was added to $BL(x)$ before $d({y\dot{x}})$ returned (by definition of $e_{1}$ ; also $e_{1}\neq yx$ since the edges have opposite M-types). So $d({y\dot{x}})$ satisfied the blossom base test before $d({z\dot{x}})$ . This triggered the operation pop $(BL(x))$ , contradiction. $\Box$
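The Remark relies on pop returning the first edge added to $BL(x)$. A minimal Python sketch of the blossom base test under that first-in, first-out assumption follows. The class `BlossomList` and the function `base_test_pop` are hypothetical names, and the linear scan in `has_type` ignores efficiency.

```python
from collections import deque

class BlossomList:
    """BL(x) served first-in, first-out (assumption drawn from the Remark)."""

    def __init__(self):
        self._q = deque()

    def add(self, edge, matched):
        self._q.append((edge, matched))

    def has_type(self, matched):
        # Linear scan; good enough for an illustration.
        return any(m == matched for _, m in self._q)

    def pop(self):
        return self._q.popleft()

def base_test_pop(bl, zx_matched):
    """Blossom base test for d(z x): pop if BL(x) holds an edge of type mu-bar(zx)."""
    wanted = not zx_matched
    if not bl.has_type(wanted):
        return None            # corresponds to "else break"
    return bl.pop()            # Lemma 4.1: this edge is e_1(x) with M-type `wanted`

bl = BlossomList()
bl.add(("z4", "x"), False)     # e_1(x), unmatched in this hypothetical run
bl.add(("b", "x"), True)
first = base_test_pop(bl, zx_matched=True)
second = base_test_pop(bl, zx_matched=True)
```

The sketch pops blindly from the front once the existence test succeeds. Lemma 4.1 is what guarantees that the popped edge has the M-type the test asked for.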

A blossom has a simple structure in $\cal T$ :

Proposition 4.2

The nodes in a blossom $B$ form a subtree of $\cal T$ . $\beta(B)$ is the subtree’s root and $\eta(B)$ is the parent arc of $\beta(B)$ .

Proof: We induct on the size of the blossom. Let $P$ be the path in $\overline{{\cal T}}$ forming blossom $B$ . Before the blossom step merges $B_{\cdot}$ values, the nodes on path $P$ form a subtree of $\overline{{\cal T}}$ . The inductive hypothesis, applied to each blossom on $P$ , implies this is a subtree of $\cal T$ . After the merge this subtree corresponds to the set of nodes of $B$ .

Let $d({z\dot{x}})$ be the invocation that forms blossom $B$ . Suppose the blossom base test is satisfied. $P$ is the path from $z\dot{x}$ to $B_{{y\dot{x}}}$ . So $z\dot{x}$ is $\beta(B)$ and $zx$ is $\eta(B)$ , as claimed in the proposition. Suppose the blossom enlarge test is satisfied. By induction $z\dot{x}$ is $\beta(B_{{z\dot{x}}})$ before the merge of $B_{\cdot}$ values. As in the blossom base test, $z\dot{x}$ remains $\beta(B)$ . $\Box$
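The subtree property of Proposition 4.2 can be checked mechanically on a parent-pointer representation of $\cal T$. The Python sketch below is only such a checker, with hypothetical names. It uses the fact that a nonempty node set of a rooted forest is connected exactly when one of its nodes has its parent outside the set.

```python
def subtree_root(parent, nodes):
    """Return the root if `nodes` forms a subtree of the forest, else None.

    parent: dict node -> parent node, with None for a root of the forest.
    nodes: iterable of nodes (e.g. the nodes of one blossom).
    """
    nodes = set(nodes)
    tops = [v for v in nodes if parent[v] not in nodes]
    return tops[0] if len(tops) == 1 else None

# Hypothetical forest: r -> a -> {b, c}, r -> d.
parent = {"r": None, "a": "r", "b": "a", "c": "a", "d": "r"}
```

When the check succeeds for the nodes of a blossom $B$, the returned node plays the role of $\beta(B)$ and the arc from its parent plays the role of $\eta(B)$.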

Now we continue to investigate $e_{1}(x)$ . In a slight abuse of notation we write $d(e_{1}(x))$ to denote $d({z\dot{x}})$ for the invocation defining $e_{1}(x)$ . We next show this invocation is for a $\cal T$ -arc $zx$ . More generally, every invocation for an occurrence of $x$ in the call chain to $d(e_{1}(x))$ corresponds to a $\cal T$ -arc. Fig.5(f) illustrates this with arcs $z_{i}x$ , $i=1,\ldots,4$ , $z_{4}x=e_{1}(x)$ , as well as $b^{\prime}z_{4}=e_{1}(z_{4})$ .

Lemma 4.3

Every invocation $d({z\dot{x}})$ made before $d(e_{1}(x))$ returns has $zx$ an arc of $\cal T$ .

Remark: The lemma even includes invocations made in searches before $d(e_{1})$ is invoked.

Proof: Suppose $d({z\dot{x}})$ is executed for an arc $xz$ . We show some invocation $d({w\dot{x}})$ returns before $d({z\dot{x}})$ is invoked. Clearly this implies $d(e_{1}(x))$ returns before $d({z\dot{x}})$ is invoked. The lemma follows.

Let $ax$ be the parent arc of $xz$ . ( $ax$ may be the artificial arc $\varepsilon\alpha$ .) We analyze two cases depending on whether $d({a\dot{x}})$ returns before or after ${a\dot{x}}$ has entered a blossom.

Case ${a\dot{x}}$ is not in a blossom when $d({a\dot{x}})$ returns: Since ${a\dot{x}}={z\dot{x}}$ this case implies $d({z\dot{x}})$ has not been invoked when $d({a\dot{x}})$ returns. So $ax$ is the desired edge $wx$ .

Case ${a\dot{x}}$ is in a blossom when $d({a\dot{x}})$ returns: Let $B$ be the blossom. We claim $ax=\eta(B)$ . To prove the claim suppose the contrary. Proposition 4.2 implies $ax$ is in some blossom path $P$ . The code for a blossom step implies $d({a\dot{x}})$ returns before the blossom is formed. This contradicts the case definition. So the claim holds.

Let $yx\in BL(x)$ be the edge triggering the blossom step that makes $a\dot{x}$ a base. $yx$ was added to $BL(x)$ when $d({y\dot{x}})$ returned. So $d({y\dot{x}})$ returns before $a\dot{x}$ becomes a base vertex. This is before blossom $B$ is formed. So it is before ${a\dot{x}}={z\dot{x}}$ is in a blossom. Thus it is before $d({z\dot{x}})$ is invoked. So $yx$ is the desired edge $wx$ . $\Box$

Consider the special case of the lemma for occurrences of $x$ in searches before $d(e_{1}(x))$ is invoked. Every such occurrence of $x$ is on a trail of ${\cal A}$ . In proof, suppose $zx$ is the arc leading to the occurrence of $x$ . The corresponding invocation $d({z\dot{x}})$ does not return before $d(e_{1})$ returns (definition of $e_{1}$ ). So it is terminated, i.e., $zx$ is on the augmenting path. Furthermore $zx$ is not in a blossom (that would also imply $d({z\dot{x}})$ has returned). So $zx\in E({{\cal A}})$ as claimed.

The next result establishes the significance of $e_{1}$ . We extend our notational convention to abbreviate $\mu(e_{1}(x))$ to $\mu$ when $x$ is clear. Let ${\cal A}$ be defined when $d(e_{1})$ is invoked, i.e., ${\cal A}$ contains the augmenting trails in the searches before $e_{1}$ is reached.

Lemma 4.4

Fix a vertex $x\in V(G)$ .

( $i$ ) Let $zx$ be an arc in $\cal T$ , $zx\neq e_{1}$ . There are two possibilities for $zx$ :

( $i.a$ ) $zx$ enters $\cal T$ before $d(e_{1})$ is invoked: Either $zx\in E({{\cal A}})$ or $zx$ is an ancestor of $e_{1}$ in $\cal T$ , as well as $\overline{{\cal T}}$ , the contraction of $\cal T$ when $e_{1}$ enters $\cal T$ .

( $i.b$ ) $zx$ enters $\cal T$ after $d(e_{1})$ returns: $zx$ is a pendant edge of $\cal T$ throughout the execution of find_trails. Furthermore $zx$ has M-type $\mu$ .

( $ii$ ) Consider an invocation $d({z\dot{x}})$ where $\mu(zx)=\overline{\mu}$ . If $d({z\dot{x}})$ pops an edge from $BL(x)$ then $GL(x)$ is empty. So no edge of $\delta(x)\cup\gamma(x)$ is added to $\cal T$ after the pop (this includes both the current search and future searches).

Remarks: ( $i.a$ ) applies even if $e_{1}$ does not exist. If $e_{1}$ exists, it satisfies ( $i.a$ ) ( $e_{1}$ is an ancestor of itself). It may also satisfy ( $i.b$ ) ( $e_{1}$ has M-type $\mu$ and it may be pendant).

Lemma 4.11( $iii$ ) extends ( $i.a$ ) to describe the ancestors of $e_{1}$ after $e_{1}$ is popped. In Fig.5(d.2) the leaf occurrence of $v$ illustrates ( $i.b$ ).

We later prove that ( $ii$ ) holds without the requirement on $\mu(zx)$ .

Proof: ( $i$ ) First observe that ( $i$ ) covers all possibilities for $zx$ : In proof the only case not covered is that $d({z\dot{x}})$ is invoked after $d(e_{1})$ is invoked and before $d(e_{1})$ returns. Such an invocation returns before $d(e_{1})$ returns. This contradicts the definition of $e_{1}$ .

( $i.a$ ) Assume $d({z\dot{x}})$ is invoked in the same search as $d(e_{1}(x))$ . (If not $d({z\dot{x}})$ is invoked in a previous search, and as shown above $zx\in E({{\cal A}})$ .) $d({z\dot{x}})$ is invoked before $d(e_{1})$ is invoked (assumption) and does not return before $d(e_{1})$ returns (definition of $e_{1}$ ). So $d(e_{1})$ returns during the execution of $d({z\dot{x}})$ . (Notice this accounts for $d({z\dot{x}})$ being terminated before it returns.) We have shown $e_{1}$ descends from $zx$ in $\cal T$ . $zx$ is not in a contracted blossom $B$ of $\overline{{\cal T}}$ : The contrary makes $zx$ an arc in the blossom path of $B$ . That implies $d({z\dot{x}})$ returns before $B$ is formed, contradiction. ( $z\dot{x}$ may be the base of a contracted blossom.)

( $i.b$ ) $d(e_{1})$ returns with $GL(x)\cap\overline{\mu}$ empty. $zx$ is added to $\cal T$ in a grow step so this implies it has M-type $\mu$ . Furthermore it implies $d({z\dot{x}})$ does not execute any grow steps, i.e., $zx$ is a pendant edge of tree $\cal T$ . Thus ( $i.b$ ) holds.

( $ii$ ) $zx\neq e_{1}$ since the two edges have opposite M-types. So $d({z\dot{x}})$ returns after $d(e_{1})$ . $d(e_{1})$ returns with $GL(x)\cap\overline{\mu}$ empty. $d({z\dot{x}})$ removes every edge of $GL(x)\cap\mu$ before it pops any edge from $BL(x)$ . So $GL(x)$ becomes empty before $d({z\dot{x}})$ pops an edge from $BL(x)$ . $\Box$

As an example suppose $x$ becomes a free vertex in an unsuccessful search. In ( $i.a$ ) observe that there may be a path of multiple occurrences of $x$ . ( $i.b$ ) shows that after the unsuccessful search, every occurrence of $x$ is a leaf. So $x$ will not be on an augmenting trail after the unsuccessful search.

The lemma contains the seeds of the entire algorithm, which we now sketch. (The sketch along with other structural details is proved in Lemmas 4.9–4.12.) Consider a fixed vertex $x$ . Each time an edge of $BL(x)$ is popped it triggers a blossom step. The blossom step may be a noop (i.e., the current blossom containing $x$ does not get enlarged). Ignoring these noops there are three successive “stages” for vertex $x$ : Stage 1 accounts for the time up until $e_{1}(x)$ is popped and its blossom is formed. After that $BL(x)$ -pops trigger blossoms for leaf occurrences of $x$ (corresponds to Lemma 4.4( $i.b$ )). We call this Stage 2. After that $BL(x)$ -pops form skew blossoms (corresponds to Lemma 4.4( $i.a$ )). This is Stage 3. The search may halt, because of an augmenting trail, at any point during this progression (e.g., before $e_{1}$ is defined, before it is popped, etc.). Also at any point in the progression arbitrarily many blossoms may be formed by pops from lists $BL(x^{\prime})$ for various vertices $x^{\prime}\neq x$ , and such blossoms may contain $x$ . For example in Stage 1 blossoms may absorb occurrences of vertex $x$ before $e_{1}$ is popped.

Vertex $x$ can occur in a blossom in only one search for an augmenting trail, the search where $d(e_{1})$ returns (Lemma 4.4( $i.b$ ) and its proof). So the pendant edge and skew blossom stages can only occur in the search where $d(e_{1})$ returns. (Regarding noops, the blossom step for $e_{1}$ may be a noop, e.g., consecutive arcs $ax,xz$ in a blossom path with $ax=e_{1}(x)$ – $d({z\dot{x}})$ pops $ax$ . Pendant edges and skew blossoms do not give noops. Other noops at arbitrary points may occur.)

As previously mentioned we must show the blossom step is well-defined. It is convenient to record that as the following property:

$(*)$ At the moment an invocation $d({z\dot{x}})$ pops an edge $yx$ from $BL(x)$ , ${y\dot{x}}$ descends from ${z\dot{x}}$ in $\overline{{\cal T}}$ .

Notice the contraction $\overline{{\cal T}}$ need not be the same as when $yx$ was added to $BL(x)$ .

It is helpful to give some illustrations for $(*)$ . Fig. 7 shows $(*)$ needn’t hold if we change $\overline{{\cal T}}$ to $\cal T$ .
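The difference between descending in $\cal T$ and descending in $\overline{{\cal T}}$ can be seen in a small Python sketch. It assumes each node stores the base node of its current blossom (itself if it is in no blossom), which is consistent with Proposition 4.2. The configuration is hypothetical and is not a reproduction of Fig.7.

```python
def descends_contracted(parent, rep, y, z):
    """True if the contracted node holding y descends from the one holding z.

    parent: dict node -> parent in T (None for a root).
    rep: dict node -> base node of the blossom containing it (itself if none).
    A node counts as a descendant of itself.
    """
    target = rep[z]
    v = rep[y]
    while v is not None:
        if v == target:
            return True
        p = parent[v]
        v = rep[p] if p is not None else None
    return False

# Hypothetical tree: beta -> zx, beta -> w -> yx; one blossom {beta, zx}.
parent = {"beta": None, "zx": "beta", "w": "beta", "yx": "w"}
contracted = {"beta": "beta", "zx": "beta", "w": "w", "yx": "yx"}
uncontracted = {v: v for v in parent}   # rep is the identity: plain T
```

With the contraction, node `yx` descends from the blossom containing `zx`. With the identity map, which models $\cal T$ itself, it does not.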

Next consider Fig.8. It might seem to violate (I1) because $d$ executes as follows: The invocation $d({z\dot{x}})$ executes a blossom step for edge $yx\in BL(x)$ . The blossom step invokes $d({y\dot{s}})$ . It executes a blossom step for edge $rs\in BL(s)$ . But $B_{{r\dot{s}}}$ is not a descendant of $B_{y\dot{s}}=B_{{z\dot{x}}}$ . This purported counterexample is incorrect because the picture in (a) is impossible; the true picture is (b).

In our analysis of a fixed vertex $x$ we assume $(*)$ for all previous blossoms. This implies an enhanced version of the assumption: Suppose we are considering a blossom formed when an invocation $d({z\dot{x}})$ pops an edge $yx$ . Let $d({s\dot{t}})$ be in the call chain from $d({z\dot{x}})$ . $(*)$ implies that when $d({s\dot{t}})$ returns, $\beta(B_{{s\dot{t}}})$ descends from $z\dot{x}$ .

$(*)$ implies any blossom formed after $d({z\dot{x}})$ is invoked but before the pop has its base vertex descending from $z\dot{x}$ .

To set the stage for the ensuing discussion we extend Section 2’s definition of blossom. That definition specifies the properties of a blossom $B$ in the given graph $G$ . We now include the corresponding properties as they occur in the search tree $\cal T$ .

Definition of Blossom, in $G$ and $\cal T$

$\bullet$ Starter: The starter can be a node $x$ of $\cal T$ where $x$ does not occur in the $\cal T$ -subtree of any blossom. $x$ becomes the blossom base $\beta$ . The new blossom $B$ is formed by adding a blossom path $P$ that begins at node $\beta$ and ends at another $\cal T$ -occurrence of $\beta$ . (In the code for $d$ , $P$ ends at $B_{{y\dot{x}}}$ , and ordinary blossoms have $B_{{y\dot{x}}}=\{{y\dot{x}}\}$ .) Blossom $B$ ’s occurrence in $G$ is formed by identifying the two node occurrences of $\beta$ . The first and last edges of $P$ must have the same M-type $\mu$ . The parent arc of $\beta$ is the base edge $\eta$ , which must have opposite M-type $\overline{\mu}$ . (A special case is where the blossom path $P$ is the loop $\beta\beta$ . $B$ ’s occurrence in $G$ is also that loop.)

A blossom $A$ can be enlarged by using $A$ as the starter. Blossom $B$ is formed by adding a blossom path $P$ that begins at some node $a$ of $A$ , and ends at an occurrence of some $G$ -vertex in subtree $A$ . (In the code for $d$ the starter is the blossom $A=B_{{z\dot{x}}}$ . As above $P$ ends at $B_{{y\dot{x}}}$ , and ordinary blossoms have $B_{{y\dot{x}}}=\{{y\dot{x}}\}$ .) Blossom $B$ ’s occurrence in $G$ is formed by identifying ${y\dot{x}}$ with node $z\dot{x}$ in $A$ . (Note from the code of $d$ that $P$ starts with some node $a$ in $A$ , not necessarily ${z\dot{x}}$ .) The M-types of the extreme edges of $P$ are arbitrary. The base vertex and edge of $B$ are inductively defined as those of $A$ .

For skew blossoms the starter is a node not in any blossom subtree, which becomes the base vertex $\beta$ . $P$ ends in a contracted blossom containing an occurrence of $\beta$ . (In the code for $d$ this blossom is $B_{{y\dot{x}}}\neq\{{y\dot{x}}\}$ .) All other requirements for skew blossoms are the same as ordinary blossoms.

$\bullet$ Blossom trail: In the code for a blossom step, $P$ is a path in $\overline{{\cal T}}$ , the contraction of $\cal T$ when blossom $B$ is triggered. Assuming $(*)$ , the $\cal T$ -path $P$ is guaranteed to have all required properties of the blossom trail. Specifically, $P$ is guaranteed to be alternating, since the arcs of $P$ are in $\cal T$ . If $A$ is a contracted blossom on $P$ , $P$ is guaranteed to contain $A$ ’s base edge, which is the parent arc of $A$ . This also guarantees that $A$ occurs only once in the trail $C$ of the blossom. This property extends to a starter blossom $A$ : Its base edge is the parent of the contracted $A$ , so $A$ does not reoccur in the blossom trail. A $G$ -vertex may occur arbitrarily many times in $P$ . $\Box$

We start with some simple properties of blossoms. Consider the moment in time when a blossom $B$ becomes complete, i.e., $d(\eta(B))$ returns. Let $x$ be a vertex that occurs in $B$ .

Proposition 4.5

When $B$ becomes complete, an invocation $d({w\dot{x}})$ has returned for two edges $wx$ of opposite M-type. Moreover $e_{1}(x)$ has been popped.

Proof: Wlog assume $B$ is the first blossom to contain an occurrence of vertex $x$ .

Case $x$ occurs as $\beta(B)$ : Let $yx$ be the first edge of $BL(x)$ to trigger a blossom (not necessarily $B$ ). The invocations $d({y\dot{x}})$ and $d(\eta(B))$ are as desired. Obviously $e_{1}(x)=yx$ was popped.

Case $x$ does not occur as $\beta(B)$ : $x$ is not the first or last vertex of the blossom path $P$ , since $x$ is not the base vertex and $x$ does not occur in a blossom when $B$ is formed. So $P$ contains an arc $ax$ . $ax$ is followed by an arc $xb$ in $P$ , since $a\dot{x}$ is not in a blossom on $P$ (again by the choice of $B$ as first blossom). The invocations $d({a\dot{x}})$ and $d({b\dot{x}})$ are as desired.

Consider the moment when $d({b\dot{x}})$ executes the blossom enlarge test. $GL(x)$ has been emptied. In particular $d(e_{1}(x))$ has been invoked and has returned, adding $e_{1}(x)$ to $BL(x)$ . So $d({b\dot{x}})$ pops $e_{1}(x)$ unless some previous invocation popped it. $\Box$

The next two results give the properties of the two blossom tests.

Proposition 4.6

Suppose $d({z\dot{x}})$ executes the blossom enlarge test.

( $i$ ) Suppose $z\dot{x}$ is not in a blossom. $d({z\dot{x}})$ satisfies the blossom enlarge test $\Longleftrightarrow$ the pop triggers a skew blossom step.

( $ii$ ) $z\dot{x}$ is in a blossom $\Longleftrightarrow$ some current blossom $B$ has $zx=\eta(B)$ or arc $xz$ in the blossom path of $B$ .

Proof: ( $i$ ) is clear. For ( $ii$ ) consider two cases:

$zx$ a $\cal T$ -arc: During the execution of $d({z\dot{x}})$ , the current blossom containing $z\dot{x}$ , $B_{{z\dot{x}}}$ , has base vertex $z\dot{x}$ (Proposition 4.2). So the first alternative ( $zx=\eta(B)$ ) holds.

$xz$ a $\cal T$ -arc: Since the test is executed by the invocation $d({z\dot{x}})$ , $xz$ is in the blossom path of a current blossom. So the second alternative holds. $\Box$

Lemma 4.7

( $i$ ) $d({z\dot{x}})$ with $zx\neq e_{1}(x)$ pendant is a noop, i.e., it returns without executing a grow or blossom step.

( $ii$ ) After a search that invokes $d(e_{1})$ , $x$ only occurs as a leaf of $\cal T$ . Such occurrences do not enter blossoms.

Proof: ( $i$ ) $d({z\dot{x}})$ goes directly to the blossom tests. Clearly $z\dot{x}$ is not in a blossom at this point. So the blossom enlarge test fails. Suppose the blossom base test is satisfied. Then $BL(x)$ contains an edge $yx$ with

$$\mu(yx)=\overline{\mu}(zx)=\overline{\mu}(e_{1}(x))\tag{4.1}$$

(Lemma 4.4( $i.b$ ) gives the second equality). $d({y\dot{x}})$ has returned. $d(e_{1}(x))$ returns before $d({y\dot{x}})$ returns ( $yx\neq e_{1}(x)$ by (4.1)). So $e_{1}(x)$ is in $BL(x)$ during the execution of $d({y\dot{x}})$ . Thus $d({y\dot{x}})$ satisfies the blossom base test (using (4.1)). It performs a blossom step. The occurrence of $y\dot{x}$ in a blossom means $d({z\dot{x}})$ does not execute the blossom base test. Contradiction.

( $ii$ ) Lemma 4.4( $i.b$ ) shows any occurrence of $x$ in a search after the search invoking $d(e_{1})$ is a leaf of $\cal T$ . A pendant edge $yx$ can enter a blossom only as the last edge of the blossom path, i.e., when it is popped from $BL(x)$ . But part ( $i$ ) shows the corresponding execution of $d$ is a noop, i.e., it does not trigger a blossom step. $\Box$

For intuition note this extension of part ( $ii$ ): The occurrences of $x$ in part ( $ii$ ) never enter augmenting trails. In proof no grow step is executed from the $x$ occurrence, by part ( $i$ ) and the nonoccurrence of $x$ in a blossom. So the $\cal T$ -path to $x$ is not extended and does not lead to a free vertex. (This property is not required in the logic of our algorithm analysis.) In particular this extension shows that if a free vertex $\alpha$ has a corresponding invocation $d({z\dot{\alpha}})$ that returns (as in the code for $d$ ) $\alpha$ will never enter an augmenting trail in the rest of the execution of find_trails.

We now begin to track the status of a fixed vertex $x\in V(G)$ during the execution of find_trails. The following result applies to searches at the start of find_trails, specifically, before $e_{1}(x)$ is discovered.

Proposition 4.8

Consider a search where no invocation $d({z\dot{x}})$ returns. Any $\cal T$ -arc $zx$ of the search is on an augmenting trail. Furthermore $x$ does not occur in any blossom of the search.

Remarks: The search may contain arcs $xz$ directed from $x$ .

The proposition may or may not apply to a search after $e_{1}(x)$ is defined.

Proof: Consider a $\cal T$ -arc $zx$ . (Such an arc always exists; for a free vertex $\alpha$ it is the artificial arc $\varepsilon\alpha$ .) The corresponding invocation $d({z\dot{x}})$ was terminated, i.e., $zx$ is on an augmenting trail.

$zx$ is not an arc in a blossom path, again since $d({z\dot{x}})$ did not return. $zx$ is not the base edge of a blossom, since the first blossom containing ${z\dot{x}}$ contains $e_{1}(x)$ . So as claimed, $x$ does not occur in a blossom. $\Box$

The following property of our edge-contracted blossoms is needed for both Sections 4 and 5:

$(\dagger)$ At any moment in find_trails a given vertex $x$ occurs in at most one blossom.

Observe that $x$ can enter a blossom only in the search that invokes $d(e_{1})$ : Proposition 4.8 shows this for searches before the invocation and Lemma 4.7( $ii$ ) shows it for after. To establish $(\dagger)$ we must show it holds throughout the search invoking $d(e_{1})$ . We have broken this search up into three time periods, Stages 1–3 for $x$ . The next several lemmas prove $(\dagger)$ in each of these stages. Specifically Lemma 4.9 proves $(\dagger)$ at any moment in Stage 1. Lemma 4.11( $i$ ) does this for Stage 2, and Lemma 4.12 for Stage 3.
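Property $(\dagger)$ is a statement about a snapshot of the search structure, so it can be phrased as a simple check. The Python sketch below assumes two hypothetical maps, one from $\cal T$ -nodes to $G$ -vertices and one from $\cal T$ -nodes to the id of their current (maximal) blossom. Neither map is part of the algorithm as stated.

```python
def at_most_one_blossom(vertex_of, blossom_of):
    """Check (dagger): every G-vertex occurs in at most one blossom.

    vertex_of: dict T-node -> G-vertex it is an occurrence of.
    blossom_of: dict T-node -> id of its current blossom, or None.
    """
    seen = {}
    for node, b in blossom_of.items():
        if b is None:
            continue
        x = vertex_of[node]
        if seen.setdefault(x, b) != b:
            return False   # x occurs in two different blossoms
    return True
```

Several occurrences of the same vertex inside one blossom are allowed by the check. Only occurrences spread over two distinct blossoms violate it.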

Now assume some invocation $d({z\dot{x}})$ in find_trails returns, i.e., $e_{1}(x)$ exists. We account for the time up to and including the formation of the blossom for the pop of $e_{1}(x)$ . Recall this time period is called Stage 1 for $x$ . It is possible that the pop of $e_{1}(x)$ is a noop, but it is still included in Stage 1. Fig. 9(a) gives an example. It is also possible that $e_{1}(x)$ does not get popped. In that case Stage 1 ends after the last blossom containing $x$ is formed. Again this may be a noop.

Lemma 4.9

$x$ occurs in at most one blossom at any moment in Stage 1.

Proof: Consider a point in Stage 1 where there is a unique blossom $B$ containing one or more occurrences of $x$ . Let $C$ be the next blossom formed in Stage 1. If $x$ occurs in $C$ , we will show $\eta(C)=\eta(B)$ . Clearly this implies the lemma holds throughout Stage 1. Note that Stage 1 ends when $C$ is formed by a pop of $e_{1}$ , unless $e_{1}$ is never popped because of an augmenting trail.

Observe that $\eta(C)$ descends from $\eta(B)$ . In proof Proposition 4.5 shows that when $d(\eta(B))$ returns, $e_{1}$ has been popped and so Stage 1 has ended. Thus $C$ is formed before $d(\eta(B))$ returns. This implies $\eta(C)$ descends from $\eta(B)$ .

$C$ is irrelevant unless it contains an occurrence of $x$ . If $C$ does not contain a new occurrence of $x$ then $\eta(C)=\eta(B)$ as desired. Suppose $C$ has a new occurrence of $x$ , say on edge $\{y,x\}$ .

$\{y,x\}$ is not a pendant edge $yx$ . In proof, $\{y,x\}$ is in the blossom path of $C$ . $yx$ pendant must be the last edge of this blossom path. So the blossom is triggered by the pop of $yx$ . $yx$ is popped in an invocation $d({z\dot{x}})$ . This pop was preceded by the pop of $e_{1}(x)$ . But that ended Stage 1.

We conclude $\{y,x\}$ is an ancestor of $e_{1}$ , by Lemma 4.4( $i$ ). Let $P_{1}$ be the $\cal T$ -path from $B$ to $e_{1}$ . We can assume $\{y,x\}$ is the arc $yx$ on $P_{1}$ . $\eta(C)$ is an ancestor of $e_{1}$ (since $C$ contains $x$ , an ancestor of $e_{1}$ ). $\eta(C)$ is not an edge of $P_{1}$ , since every invocation for an arc on $P_{1}$ returns before $B$ is formed. Thus $\eta(C)=\eta(B)$ , as desired. $\Box$

Note that if $e_{1}$ exists but does not get popped, any search after $d(e_{1})$ is invoked can add leaf occurrences of $x$ but no blossom containing $x$ (Lemma 4.7( $ii$ )).

Next we analyze the pop of $e_{1}$ . In particular part ( $ii$ ) of the next lemma shows that the pop of $e_{1}$ satisfies $(*)$ . Part ( $ii$ ) will also be used to show the Stage 2 blossoms satisfy $(*)$ .

Lemma 4.10

Suppose $d({z\dot{x}})$ pops $e_{1}(x)$ .

( $i$ ) $\mu(zx)=\overline{\mu}(e_{1})$ . So $GL(x)=\emptyset$ when $e_{1}$ is popped, and no edge of $\delta(x)\cup\gamma(x)$ will be added to $\cal T$ after that.

( $ii$ ) Let $\eta$ be the base edge of the first blossom containing an occurrence of $x$ . At every moment until $d(\eta)$ returns, every edge of $BL(x)$ descends from $\eta$ . So every edge popped from $BL(x)$ during the execution of $d({z\dot{x}})$ descends from $\eta$ .

Remark: Fig.9(a) gives an example of ( $ii$ ). For instance $BL(x)$ is the list $(e_{1},ex,ax,bx)$ when the first blossom is formed. In general $BL(x)$ may contain arcs directed to or from $x$ , e.g., after $d({d\dot{x}})$ returns $dx$ enters $BL(x)$ .

Proof: ( $i$ ) The second assertion follows immediately from the first using Lemma 4.4( $ii$ ). We turn to the first assertion.

Suppose $e_{1}$ is popped in the blossom base test. (In particular this implies $zx$ is a $\cal T$ -arc.) The blossom step makes $zx$ the base edge of a blossom, $zx=\eta(B)$ . The test implies $\mu(e_{1})=\overline{\mu}(zx)$ , as desired.

Now suppose $zx$ satisfies the blossom enlarge test. (Note that $d({z\dot{x}})$ may be invoked strictly after the moment $e_{1}$ enters a blossom. In that case the pop of $e_{1}$ is a noop.)

Claim: $x$ does not occur in a complete blossom.

Proof of Claim: Assume it does occur. Proposition 4.5 implies $d({w\dot{x}})$ has been invoked for edges $wx$ with opposite M-types. Choose an invocation $d({w\dot{x}})$ that has $\mu(wx)=\overline{\mu}(e_{1})$ . $e_{1}$ is popped before $d({w\dot{x}})$ returns, contradicting completeness. $\spadesuit$

$z\dot{x}$ must be in a blossom $B$ . If not, Proposition 4.6( $i$ ) implies $z\dot{x}$ is the base vertex of a skew blossom. But then the Claim gives a contradiction.

If $zx=\eta(B)$ the blossom base test has been executed and as shown above ( $i$ ) holds. So Proposition 4.6( $ii$ ) shows $xz$ is an arc in the blossom path $P$ . The node ${z\dot{x}}$ is not in a blossom on $P$ , again by the Claim. The algorithm statement shows $xz$ is not the first arc of $P$ . So $P$ contains an arc $ax$ preceding $xz$ . $ax$ is not a base edge so $\mu(ax)=\mu(e_{1})$ . The alternation at $x$ implies $\mu(zx)=\overline{\mu}(ax)=\overline{\mu}(e_{1})$ .

( $ii$ ) We first show $e_{1}$ descends from $\eta$ , i.e., $(*)$ holds for the pop of $e_{1}$ . Consider two cases. If $zx=\eta$ then Lemma 4.4( $i$ ) shows $e_{1}(x)$ descends from $zx$ . Suppose $zx\neq\eta$ . So $x$ occurs on the path of a blossom $B$ being formed, say on consecutive arcs $ax,xb$ , and $\eta=\eta(B)$ . (Here we use $(*)$ for $B$ .) The invocation $d({a\dot{x}})$ occurs before $B$ is formed and pops $e_{1}$ . (So $ax=zx$ .) Lemma 4.4( $i$ ) shows $ax$ is an ancestor of $e_{1}$ , and $\eta$ is an ancestor of $ax$ . Thus $\eta$ is an ancestor of $e_{1}$ and $(*)$ holds.

We continue with ( $ii$ ). Since $e_{1}$ descends from $\eta$ , $BL(x)$ is empty when $d(\eta)$ is invoked. From that moment on until $d(\eta)$ returns, every edge added to $BL(x)$ descends from $\eta$ .

The second assertion of ( $ii$ ) follows similarly: Lemma 4.9 shows that at the moment $d({z\dot{x}})$ forms the blossom for $e_{1}$ , $\eta$ is the base of the blossom containing ${z\dot{x}}$ . $\Box$

Note from part ( $i$ ) that if $e_{1}$ is popped, no search after this pop contains an occurrence of $x$ .

Before proceeding it is worthwhile to give an incorrect proof for the first part of ( $ii$ ), i.e., that $(*)$ holds for the pop of $e_{1}$ .

Incorrect Proof: Lemma 4.9 shows that at the moment the blossom for $e_{1}$ is formed, $\eta$ is the base of the unique blossom containing occurrences of $x$ . This makes $\eta$ an ancestor of $e_{1}$ since a blossom is a subtree of $\cal T$ (Proposition 4.2).

The argument fails since the use of Proposition 4.2 assumes the blossom has already been properly formed, i.e., $(*)$ held for the blossom popping $e_{1}$ .

We now analyze the time from when $d({z\dot{x}})$ pops $e_{1}(x)$ until it returns. This is Stage 2.

Lemma 4.11

Suppose $d({z\dot{x}})$ pops $e_{1}(x)$ . In parts ( $i$ ) and ( $ii$ ) $yx$ is an arbitrary edge popped from $BL(x)$ during the execution of $d({z\dot{x}})$ (i.e., from the moment $d({z\dot{x}})$ is invoked and until it returns).

( $i$ ) The pop of $yx$ makes it an edge of $B_{{z\dot{x}}}$ . So $B_{{z\dot{x}}}$ remains the only blossom containing an occurrence of $x$ .

( $ii$ ) The pop of $yx$ satisfies $(*)$ .

( $iii$ ) Suppose $d({z\dot{x}})$ returns. Every pendant edge $yx$ ever created in the execution of find_trails was popped from $BL(x)$ during the execution of $d({z\dot{x}})$ . So at the moment $d({z\dot{x}})$ returns, every occurrence of $x$ in $\cal T$ is either in $B_{{z\dot{x}}}$ or is a proper ancestor of $\beta(B_{{z\dot{x}}})$ not in any blossom.

Remarks: The arc corresponding to $d({z\dot{x}})$ can be $zx$ (blossom base test) or $xz$ (blossom enlarge test).

Stage 2 can have blossom steps for pendant edges as well as noops. For example in Fig.9(a), where we have noted $BL(x)$ as the list $(e_{1},ex,ax,bx)$ , $ex$ and $ax$ trigger noops and $bx$ is pendant. In general noops are for ancestors of $e_{1}(x)$ .

The edge $zx$ need not be in the first blossom containing an occurrence of $x$ . For instance in Fig.9(a) $d({d\dot{x}})$ pops $e_{1}(x)$ but $a\dot{x}$ is the first blossom occurrence of $x$ .

In ( $i$ ), Fig.10 shows a leaf $x$ can be popped in an invocation other than $d({z\dot{x}})$ .

Proof: ( $i$ ) Lemma 4.9 shows ( $i$ ) holds for $yx=e_{1}$ , the first pop of $BL(x)$ .

Inductively assume ( $i$ ) has always held up to some point in the execution of $d({z\dot{x}})$ . As previously observed (proof of Lemma 4.7( $ii$ )) a pendant edge $yx$ can enter a blossom only when it is popped from $BL(x)$ . Let $yx$ be the next edge to be popped from $BL(x)$ . $yx$ may be popped by an invocation $d({w\dot{x}})$ , $wx\neq zx$ . (The arc for $d({w\dot{x}})$ may be $wx$ or $xw$ ; Fig.10(a) shows the latter.) $wx$ is not pendant (Lemma 4.7( $i$ )). So ${w\dot{x}}$ is a $\cal T$ -ancestor of $e_{1}$ (Lemma 4.4( $i$ ); $wx$ may be $e_{1}$ ). $d({w\dot{x}})$ is on the call chain starting from $d({z\dot{x}})$ . Hence $wx$ was in $B_{{z\dot{x}}}$ after the pop of $e_{1}$ (and possibly before that). So the pop of $yx$ preserves ( $i$ ). This completes the induction.

( $ii$ ) Although blossom steps during the execution of $d({z\dot{x}})$ may enlarge $B_{{z\dot{x}}}$ without adding new occurrences of $x$ , $\eta(B_{{z\dot{x}}})$ remains $\eta$ . So Lemma 4.10( $ii$ ) shows every edge popped during the execution of $d({z\dot{x}})$ descends from $\eta$ . This gives ( $ii$ ).

( $iii$ ) Lemma 4.10( $i$ ) shows every pendant edge $yx$ is created before $e_{1}$ is popped. $yx$ is added to $BL(x)$ when $d({y\dot{x}})$ returns. So $yx$ is in $BL(x)$ before $e_{1}$ is popped.

$d({z\dot{x}})$ returns with $BL(x)$ empty. Thus every pendant edge gets popped from $BL(x)$ during the execution of $d({z\dot{x}})$ . With ( $i$ ) this implies $d({z\dot{x}})$ returns with $B_{{z\dot{x}}}$ containing every leaf occurrence of $x$ in $\cal T$ . ( $i$ ) also shows $B_{{z\dot{x}}}$ is the only blossom in which $x$ occurs. $\Box$

Continuing we will track the execution of find_trails for vertex $x$ after $d({z\dot{x}})$ of Lemma 4.11 returns. The return initiates Stage 3. First consider Stage 3 for the example of Fig.9: $d({d\dot{x}})$ (which popped $e_{1}$ ) returns with $BL(x)=(dx)$ . $d({f\dot{x}})$ is on the search path to $d({d\dot{x}})$ , so eventually it pops $dx$ , executes a noop blossom for $dx$ , and returns with $BL(x)=(fx)$ . In Fig.9(b) edge $cw$ triggers the skew blossom for $w$ and $fx$ triggers the skew blossom for $x$ .

For the general case we continue with previous notation: $d({z\dot{x}})$ pops $e_{1}(x)$ . $\eta$ is the base edge of the blossom formed by that pop.

Lemma 4.12

After $d({z\dot{x}})$ returns, the blossom steps triggered by pops of $BL(x)$ are as follows:

(a) Until $d(\eta)$ returns each blossom is a noop.

(b) If $\eta$ is directed to $x$ the blossom is also a noop.

At all times $x$ occurs in a unique blossom.

Remark: A blossom of (c) may be incomplete, as in Fig.5(f) if the parent of $b^{\prime}$ is changed to $z_{3}$ .

Proof: We will prove the following invariant holds after $d({z\dot{x}})$ returns and until the current search ends. Let $B$ be the current blossom $B_{{z\dot{x}}}$ . Recall $B$ is a subtree of $\cal T$ rooted at $\beta(B)$ .

(I2) Any occurrence of $x$ either belongs to $B$ or is a proper $\cal T$ -ancestor of $\beta(B)$ , not in any blossom. $BL(x)$ is a singleton whose edge is in $B\cup\eta(B)$ .

Note that (I2) implies the last assertion of the lemma, property $(\dagger)$ .

The invariant holds at the moment $d({z\dot{x}})$ returns. In proof, Lemma 4.11( $iii$ ) gives the condition on occurrences of $x$ . Also $BL(x)=(zx)$ , since the return adds $zx$ to the previously emptied $BL(x)$ , and $zx\in B\cup\eta(B)$ .

From this moment until $d(\eta)$ returns, noops may change the entry in $BL(x)$ to some other occurrence of $x$ in $B\cup\eta(B)$ . But (I2) is maintained. Now consider the return of $d(\eta)$ . If $\eta$ is an arc directed to $x$ , i.e., $x=\beta(B_{{z\dot{x}}})$ , then the blossom is a noop, $BL(x)$ changes to $(\eta)$ , and (I2) is preserved.

Now assume (I2) holds after $d(\eta)$ has returned, and a blossom step creates the next blossom $C$ . Let $b=\beta(C)$ . Let $a$ be the first proper ancestor of $\beta(B)$ that is an occurrence of $x$ , if such exists.

$BL(x)$ cannot change before control returns to $d(a)$ . Suppose $C$ is formed before that. There are two possibilities. If $b$ is not an ancestor of $\beta(B)$ , then no occurrence of $x$ enters $C$ (we use (I1) here). So (I2) is preserved. The other possibility is that $b$ properly descends from $a$ . $C$ is either disjoint from $B$ or contains $B$ . Again no new occurrence of $x$ enters a blossom and (I2) is preserved.

Now assume control returns to $d(a)$ . The blossom enlarge test is satisfied. The entry $yx$ in $BL(x)$ is an edge in $B\cup\eta(B)$ . So $C$ is a skew blossom with base vertex $a$ . After this blossom step $C$ may get enlarged in $d(a)$ but no occurrence of $x$ is added (again by (I1)). When $d(a)$ returns $\eta(C)$ (the arc directed to $a$ ) is added to $BL(x)$ . So (I2) is preserved.

To show $(*)$ for the skew blossom, observe $a$ is a $\cal T$ -ancestor of ${y\dot{x}}$ . So $a$ is an ancestor of $y\dot{x}$ in the current contraction $\overline{{\cal T}}$ . $\Box$

Lemma 4.12 completes the proof of $(\dagger)$ . As mentioned Lemma 4.10( $i$ ) shows $x$ does not occur in future searches if Stage 3 has been entered.

Note we have also verified that $(*)$ always holds: Lemmas 4.10( $ii$ ), 4.11( $ii$ ), and 4.12(c) establish $(*)$ for Stages 1,2, and 3 respectively.

We can now establish the validity of the search structure $\cal T$ and $\overline{{\cal T}}$ . It simply amounts to the validity of the algorithm’s blossoms.

Lemma 4.13

Any blossom formed in the algorithm is valid, i.e., it satisfies the above Definition of Blossom, in $G$ and $\cal T$ .

Proof: As noted in the Definition of Blossom in $G$ and $\cal T$ , the requisite properties of the blossom trail hold automatically. So we need only verify the properties of the Starter. The requirements are satisfied trivially but for completeness we step through them.

The blossom base test forms ordinary blossoms with singleton starters ${z\dot{x}}=\beta$ . The blossom path correctly begins with $B_{{z\dot{x}}}={z\dot{x}}=\beta$ and ends at $B_{{y\dot{x}}}={y\dot{x}}=\beta$ (recall $yx=e_{1}(x)$ ). $zx$ is the base edge and the blossom test ensures the first and last edges of the blossom trail have the same M-type, specifically $\overline{\mu}(zx)$ .

Now suppose $z\dot{x}$ is in a blossom $B_{{z\dot{x}}}\neq\{{z\dot{x}}\}$ . The blossom step is triggered by the blossom enlarge test. $B_{{z\dot{x}}}$ is the starter blossom $A$ . As specified in the code, $P$ starts at a node in $B_{{z\dot{x}}}=A$ , called vertex $a$ in the Definition. $(*)$ implies the blossom path $P$ descends from $A$ . It ends at $B_{{y\dot{x}}}$ . Consider two possibilities:

$B_{{y\dot{x}}}\neq\{{y\dot{x}}\}$ : This implies $B_{{y\dot{x}}}=A$ , since $A$ is the only blossom containing an occurrence of $x$ . So $P$ is empty and the blossom step is a noop.

$B_{{y\dot{x}}}=\{{y\dot{x}}\}$ : If $yx$ is the loop $xx$ and $xx$ is already a blossom, we again have a noop. Otherwise, identifying $y\dot{x}$ with $z\dot{x}$ gives a valid blossom.

In the blossom enlarge test, a skew blossom corresponds to the alternative that $z\dot{x}$ is a singleton but some descendant is in a blossom. $\Box$

5 Blocking set

This section proves find_trails returns a valid blocking set. This culminates in the main Theorem 5.5, which is followed by a brief discussion of the linear time bound.

5.1 Search structure

We first give several additional properties of the search structure that hold when find_trails halts. They are needed to establish the blocking property.

We start by fleshing out Proposition 4.5. Again consider a moment in time when a blossom $B$ becomes complete. Note that every $x$ occurring in $B$ is in stage 3. Let $\beta$ and $\eta$ be the base vertex and edge of $B$ , respectively. Let $e={\{x,y\}}$ be any edge incident to $B$ in $G$ , say $x\in V(B)\not\ni y$ .

Corollary 5.1

At the moment $B$ becomes complete, either

( $i$ ) $e$ is a proper $\overline{{\cal T}}$ -ancestor of $\eta$ , and $e$ is in the search path $SP$ of $d(\eta)$ , or

( $ii$ ) $e=yx=\eta$ , or

( $iii$ ) $e=xy$ and leaves either $B$ or $SP$ in $\overline{{\cal T}}$ .

Remark: Part ( $iii$ ) is illustrated by arc $xg$ in Fig.9(b).

Proof: $e$ is an arc of $\cal T$ (in one direction or the other) by Lemma 4.10( $i$ ). Lemma 4.11( $iii$ ) shows the $x$ -end of $e$ is either in $B$ or is a proper ancestor of $\beta$ not in any blossom.

Case ${y\dot{x}}$ is a node of $B$ : The nodes of $B$ form a subtree of $\cal T$ (Proposition 4.2). So if $e$ is the arc $yx$ then $e=\eta$ and ( $ii$ ) holds. If $e$ is arc $xy$ then it leaves the subtree of $B$ and ( $iii$ ) holds.

Case ${y\dot{x}}$ is a proper $\cal T$ -ancestor of $\beta$ : If $e$ is arc $yx$ then the invocation $d({y\dot{x}})$ precedes $d(\eta)$ in the call chain. Thus $e$ is in the current search path $SP$ . $e$ is not in a contracted blossom since $B$ is the only blossom containing $x$ . So ( $i$ ) holds.

Suppose $e=xy$ . Since ${y\dot{x}}$ is not in a blossom $xy$ is in $\overline{{\cal T}}$ . It is either in the $\overline{{\cal T}}$ -path to $\eta$ or it is incident to that path. In the first case $e$ is in $SP$ as before. In the second case $e$ is incident to $SP$ , giving ( $iii$ ). $\Box$

Now assume $x$ occurs in blossoms that are both complete and incomplete. Let $B_{0}$ denote the maximal complete blossom containing an occurrence of $x$ . Let the sequence of incomplete blossoms containing $x$ be $B_{1}\subset B_{2}\subset\ldots\subset B_{k}$ , $k>0$ . $\eta(B_{0})$ and $\eta(B_{1})$ may be identical or different. However the base edges $\eta(B_{i})$ , $i\geq 1$ , are identical. This follows since when $d(\eta(B_{1}))$ is terminated every invocation of $d$ in the call chain is terminated, i.e., no new blossoms are formed.

The above sequence has some simple special cases, for an arbitrary vertex $x$ : $B_{0}$ may exist with $k=0$ . If $B_{0}$ does not exist define $B_{0}=\{x\}$ . $k$ may or may not be 0.

Let ${\cal C}$ be the set of maximal complete blossoms when find_trails halts.

Lemma 5.2

A $\cal T$ -arc $zx$ not in any trail of ${\cal A}$ or any set $E(B)\cup\eta(B)$ , $B\in{{\cal C}}$ has M-type $\mu(e_{1}(x))$ .

Proof: First observe that $e_{1}(x)$ exists, since invocation $d({z\dot{x}})$ returns (i.e., it is not terminated). Suppose for the sake of contradiction that $zx$ has M-type $\overline{\mu}(e_{1}(x))$ .

Claim 1 $zx$ is not the base edge of any blossom formed in find_trails.

Proof of Claim 1: Suppose on the contrary that $zx$ is the base edge of a blossom $B$ .

Suppose $B$ is incomplete. By definition the search’s augmenting trail includes $\eta(B)$ . But this contradicts the lemma’s hypothesis on ${\cal A}$ .

Suppose $B$ is complete. Let $C$ be the maximal complete blossom containing $B$ . Either $zx=\eta(C)$ or $zx$ is on a blossom path contained in $C$ or a subblossom of $C$ . Both alternatives contradict the lemma’s hypothesis on ${\cal C}$ . $\spadesuit$

Claim 2 $x$ occurs in some blossom when $d({z\dot{x}})$ executes the blossom base test.

Proof of Claim 2: $zx\neq e_{1}(x)$ since the two edges have opposite M-type. Lemma 4.4( $i$ ) shows $zx$ is a proper ancestor of $e_{1}$ . Thus $e_{1}$ enters $BL(x)$ during the execution of $d({z\dot{x}})$ . In fact it enters before $d({z\dot{x}})$ executes the blossom test. (It enters during the execution of a grow step in $d({z\dot{x}})$ .)

Suppose Claim 2 fails. Then $x$ satisfies the first condition of the blossom base test. $e_{1}$ is still in $BL(x)$ . (If not it was popped, placing $e_{1}$ in a blossom. This contradicts the test’s first condition.) The second condition of the base blossom test is satisfied (since $\mu(zx)=\overline{\mu}(e_{1})$ ). So $e_{1}$ is popped and $zx$ becomes the base edge of a new blossom. This contradicts Claim 1. $\spadesuit$

Using Claim 2, let $wx$ be an arc (directed to $x$ ) with $w\dot{x}$ in a blossom when $d({z\dot{x}})$ executes the blossom base test. $wx$ and $zx$ are both ancestors of $e_{1}$ , so one is an ancestor of the other. For the moment assume $zx\neq wx$ . Let $B$ be the first blossom containing vertex $w\dot{x}$ . (It is possible that $B$ is incomplete.) There are four possibilities for $zx$ , but none can hold:

Case $zx$ an ancestor of $\eta(B)$ : The execution of $d({z\dot{x}})$ makes $zx$ the base of a skew blossom, contradicting Claim 1.

Case $zx=\eta(B)$ : Contradicts Claim 1 again.

Case $zx$ is on the blossom path of $B$ : $d({z\dot{x}})$ executes the blossom base test before $B$ is formed. This contradicts the definition of $wx$ . The same contradiction occurs if $zx=wx$ . (Note $wx$ is either $\eta(B)$ or it is on the blossom path of $B$ . The latter must hold if $wx=zx$ .)

Case $zx$ descends from $B$ : This implies $zx$ descends from $wx$ . Again $d({z\dot{x}})$ executes the blossom base test before $B$ is formed, contradiction. $\Box$

Lemma 5.3

A vertex $x$ that is free at the end of a search wherein it occurs in a complete blossom $B$ has

(5.1)

\text{$def(x)=1$ and $x=\alpha$}.

Furthermore choosing $B$ to be maximal makes $\eta(B)=\varepsilon\alpha$ .

Remarks: $x$ will be free when find_trails halts.

The hypothesis of maximality is necessary: In Fig.5( $ii$ ) assume $b=\alpha$ and $def(\alpha)=1$ . The blossom of the figure is complete but does not have base vertex $b$ . But the maximal complete blossom, formed as a skew blossom at $\alpha$ , has base vertex $\alpha$ .

Proof: Let $B$ be the first blossom formed with an occurrence of $x$ . $x$ occurs on an edge of the blossom path $P$ .

Case $x$ does not occur as an interior vertex of $P$ : Since $x$ is not in a prior blossom, $x$ is the base vertex of $B$ and the base edge, say arc $zx$ , is unmatched. $d({z\dot{x}})$ starts by executing the test for an augment step. The test fails (since $B$ has not been formed yet). This implies (5.1).

Case $x$ occurs as an interior vertex of $P$ : $P$ alternates at $x$ so $x$ is on an unmatched edge $\{z,x\}$ of $P$ . The invocation $d({z\dot{x}})$ is made before $B$ becomes complete. (The invocation may be for arc $zx$ or $xz$ .) As before it starts by executing the test for an augment, which fails ( $B$ has not been completed). So again (5.1) holds.

If $\eta(B)\neq\varepsilon\alpha$ , eventually control returns to the invocation $d(\varepsilon\alpha)$ . It executes a skew blossom step. This guarantees the second condition of the lemma. $\Box$

5.2 Min-max relation

For disjoint sets of vertices $S,T\subseteq V(G)$ let $E[S,T]$ be the set of all edges with one end in $S$ and the other in $T$ . The maximum size of an $f$ -matching (i.e., a subgraph where every vertex $x$ has degree $\leq f(x)$ ) is the minimum value of the expression

(5.2)

f(I)+|\gamma(O)|+\sum_{C}\bigg{\lfloor}\frac{f(C)+|E[C,O]|}{2}\bigg{\rfloor}

where $I$ and $O$ range over all pairs of disjoint vertex sets, and in the summation $C$ ranges over all connected components of $G-I-O$ . This min-max relation is proved in [25, Theorem 32.1] by reduction; [14, Sec. 5.3] gives an algorithmic proof. This section gives another self-contained algorithmic proof, in the process of establishing the blocking property of our algorithm.

We first give some intuition for the relation. The set names are mnemonics for the tight case of the bound: We will show that equality holds for our matching by taking $I$ as the set of inner atoms and $O$ as the set of outer atoms. The connected components $C$ are formed by the remaining vertices, specifically vertices in blossoms and vertices not reached in any search. In the special case of $f\equiv 1$ our min-max relation is Edmonds’ odd set cover formula for a maximum cardinality matching [6, 22, 23]. We view our relation as a “generalized odd set formula”. Just like Edmonds’ formula the nontrivial part of the bound is due to rounding odd sets down by $1/2$ . For 1-matching the $C$ components are the blossoms and one more, the unreached vertices.

Let us show the expression (5.2) always upper bounds the size of an $f$ -matching. Classify the edges of G as type I (one or both ends in $I$ ), $OO$ (both ends in $O$ ), and $\widehat{C}O$ (joins a C vertex to a C or O vertex). Fix an $f$ -matching $M$ . $M$ trivially contains at most $f(I)$ $I$ -edges and $|\gamma(O)|$ OO edges. Consider a connected component $C$ of $G-I-O$ . The number of edges of $M$ in $\widehat{C}O$ is $\frac{1}{2}(\sum_{x\in C\cup O}deg(x,M\cap{\widehat{C}O}))$ . The terms for $x\in O$ trivially sum to at most $|E[C,O]|$ . The terms for $x\in C$ sum to at most $f(C)$ . So $|M\cap{\widehat{C}O}|$ is at most $\frac{1}{2}(f(C)+|E[C,O]|)$ . Since $|M\cap{\widehat{C}O}|$ is integral we can take the floor, thus giving the last term of (5.2).
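The min-max relation can be checked by brute force on tiny instances. The following is a minimal sketch, not part of the algorithm of this paper: it assumes a loopless multigraph given as a list of vertex pairs (parallel edges repeated), and the helper names `max_f_matching`, `components`, `bound_5_2` and `min_bound` are hypothetical. It enumerates every edge subset to find the largest $f$ -matching and every pair of disjoint sets $I,O$ to minimize expression (5.2).

```python
from itertools import product

def max_f_matching(n, edges, f):
    """Largest f-matching, found by trying every subset of edges."""
    best = 0
    for mask in range(1 << len(edges)):
        deg = [0] * n
        size = 0
        for i, (u, v) in enumerate(edges):
            if mask >> i & 1:
                deg[u] += 1
                deg[v] += 1
                size += 1
        if all(deg[x] <= f[x] for x in range(n)):
            best = max(best, size)
    return best

def components(vertices, edges):
    """Connected components of the subgraph induced on `vertices`."""
    vertices = set(vertices)
    adj = {v: set() for v in vertices}
    for u, v in edges:
        if u in vertices and v in vertices:
            adj[u].add(v)
            adj[v].add(u)
    seen, comps = set(), []
    for s in vertices:
        if s in seen:
            continue
        seen.add(s)
        comp, stack = set(), [s]
        while stack:
            u = stack.pop()
            comp.add(u)
            for w in adj[u]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
        comps.append(comp)
    return comps

def bound_5_2(n, edges, f, I, O):
    """Value of expression (5.2) for one pair of disjoint sets I, O."""
    total = sum(f[x] for x in I)                           # f(I)
    total += sum(1 for u, v in edges if u in O and v in O)  # |gamma(O)|
    for C in components(set(range(n)) - I - O, edges):
        f_C = sum(f[x] for x in C)
        e_CO = sum(1 for u, v in edges
                   if (u in C and v in O) or (v in C and u in O))
        total += (f_C + e_CO) // 2                          # floor term
    return total

def min_bound(n, edges, f):
    """Minimum of (5.2) over all disjoint pairs I, O."""
    best = None
    for labels in product((0, 1, 2), repeat=n):  # 1 = in I, 2 = in O
        I = {x for x in range(n) if labels[x] == 1}
        O = {x for x in range(n) if labels[x] == 2}
        b = bound_5_2(n, edges, f, I, O)
        best = b if best is None else min(best, b)
    return best
```

For example, on a triangle with $f\equiv 1$ both functions return 1, and on two vertices joined by three parallel edges with $f\equiv 2$ both return 2. The enumeration is exponential and only meant for sanity checks on graphs with a handful of vertices and edges.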

We will show our algorithm finds a blocking set even if we make augmenting paths through complete blossoms residual. To be precise consider the graph when find_trails halts. Let ${\cal A}$ be the set of all augmenting trails at that point. As before ${\cal C}$ is the set of maximal complete blossoms. Define the residual graph to be $RG=(G-{{\cal A}})\cup{{\cal C}}$ , i.e., the edges of ${\cal A}$ are deleted except where they lie in blossoms of ${\cal C}$ . Let $M$ be the matching on $RG$ when find_trails halts. Notice $M$ contains the matched edges of augmenting trails when they occur in blossoms of ${\cal C}$ . For $x\in V(G)$ let $def(x)$ be the deficiency computed by find_trails, i.e., the deficiency of $x$ in the matching on $G$ after augmenting the trails of ${\cal A}$ . The degree constraint function on $RG$ is

f^{\prime}(x)=deg(x,M)+def(x).

Throughout this section we will write $f^{\prime}$ and $def^{\prime}$ as $f$ and $def$ . This causes no problem. For instance the definition of $f^{\prime}$ shows deficiencies are identical in $G$ and $RG$ . (We will use $def$ values when we apply (5.1) of Lemma 5.3.) We will show find_trails finds a maximum cardinality matching of $RG$ .
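As a small illustration of the definition of $f^{\prime}$ , the sketch below computes $deg(x,M)+def(x)$ for every vertex. The function name and the input representation (a list of matched residual edges and a dictionary of deficiencies) are assumptions made for the example; a loop is counted twice in the degree, which is the usual multigraph convention and is assumed here.

```python
from collections import Counter

def residual_degree_constraint(vertices, matched_edges, deficiency):
    """Return f'(x) = deg(x, M) + def(x) for every vertex x.

    matched_edges: list of pairs (u, v); parallel edges are repeated.
    deficiency:    dict vertex -> def(x); missing vertices have def 0.
    """
    deg = Counter()
    for u, v in matched_edges:
        deg[u] += 1
        deg[v] += 1  # a loop (u == v) therefore contributes 2
    return {x: deg[x] + deficiency.get(x, 0) for x in vertices}
```

By construction the deficiency of $x$ with respect to $f^{\prime}$ and $M$ is $f^{\prime}(x)-deg(x,M)=def(x)$ , which is why the text can write $f$ and $def$ for both graphs.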

An edge of $RG$ is a residual edge. Note that a blossom $B\in{{\cal C}}$ need not have a base edge in $RG$ . This occurs when a trail of ${\cal A}$ passes through $B$ (or it simply contains an edge incident of $\delta(\beta(B),\delta(B))$ ).

We will use (5.2) to show $M$ is a maximum cardinality matching of $RG$ . To define sets $I$ and $O$ consider a vertex $x\in V(G)$ where the invocation $d(e_{1}(x))$ returned but $x$ does not occur in any complete blossom.

(5.3)

x\in\begin{cases}I&\mu(x)=\overline{M}\\ O&\mu(x)=M.\end{cases}

As an example a free vertex with deficiency greater than 1 is in $O$ (Lemma 5.3), so its $f$ value is not used in (5.2). We often refer to I and O as vertex labels, and call the vertices of $G-I-O$ unlabelled. Mnemonically note that $x$ is labelled according to arc $e_{1}$ : $x$ is I (O) if $e_{1}$ makes $x$ an inner (outer) vertex. Intuition for the labelling scheme is given in Fig.12.
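The labelling rule (5.3) can be stated as a short function. This is only a sketch under stated assumptions: $\mu(x)$ is read as the M-type of $e_{1}(x)$ , the dictionary `e1_type` (hypothetical) holds `'M'` or `'Mbar'` exactly for the vertices whose invocation $d(e_{1}(x))$ returned, and `in_complete_blossom` (hypothetical) is the set of vertices occurring in a complete blossom.

```python
def label_vertices(vertices, e1_type, in_complete_blossom):
    """Assign 'I', 'O' or None (unlabelled) to each vertex, following (5.3)."""
    labels = {}
    for x in vertices:
        t = e1_type.get(x)
        if t is None or x in in_complete_blossom:
            labels[x] = None   # orphan, or vertex of a complete blossom
        elif t == 'M':
            labels[x] = 'O'    # e_1(x) matched: x is outer
        else:
            labels[x] = 'I'    # e_1(x) unmatched: x is inner
    return labels
```

The two ways of returning `None` correspond to the two kinds of unlabelled vertices distinguished in the text: vertices of complete blossoms and vertices for which $e_{1}(x)$ does not exist.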

There are two types of unlabelled vertices: If $e_{1}(x)$ exists then an unlabelled $x$ is in a complete blossom of $G$ . The second possibility for unlabelled $x$ is that $e_{1}(x)$ does not exist, i.e., no invocation $d({z\dot{x}})$ ever returned. Equivalently, every occurrence of $x$ in $\cal T$ was in an augmenting trail $A$ . Call such a vertex $x$ an orphan, i.e., $x$ does not occur in $\cal T$ or $x$ occurs in ${\cal T}\cap RG$ but has no parent, e.g., vertex $a$ in Fig.5(b). (Note that a free vertex has such a parent.) $\cal T$ may contain residual arcs directed from $x$ . There may also be edges incident to $x$ that are not in $\cal T$ , i.e., no corresponding grow step was executed.

We proceed to analyze the number of matched residual edges for each of the three types I, OO, $\widehat{C}O$ . An $I$ vertex is not free (a free vertex is O or unlabelled). So exactly $f(I)$ type I edges are matched. Thus $M$ achieves equality in (5.2) for type I edges.

Next consider an OO edge. Say it occurs as the $\cal T$ -arc $zx$ . It satisfies Lemma 5.2. (In particular $zx$ is not the base edge of a ${\cal C}$ -blossom $B$ , since $\beta(B)$ is unlabelled.) Thus $zx$ has M-type $\mu(e_{1}(x))=M$ . So every OO edge is matched and $M$ achieves equality in (5.2) for OO edges.

It remains to consider the $\widehat{C}O$ edges. Let $C$ be a connected component of unlabelled vertices. There are two similar cases, depending on whether or not $C$ contains an orphan. We analyze them in turn.

Case 1. $C$ consists of ${\cal C}$ -blossoms: $C$ is a subtree of $\cal T$ (Proposition 4.2, with augmenting trails included). $C$ consists of one or more ${\cal C}$ -blossoms joined by their base edges. The root of this subtree is the base vertex of the “root” blossom $B_{0}$ . So a blossom $B$ in $C$ has its base edge in $C$ iff $B\neq B_{0}$ .

Next we analyze the edges of $G$ incident to a blossom $B\in{{\cal C}}$ . Consider a residual edge $e={\{x,y\}}$ , $x\in B\not\ni y$ .

Claim Either $e=\eta(B)$ ( $e$ being arc $yx$ ) or $e$ is arc $xy$ with

(5.4)

\begin{cases}\text{$y$ labelled and $\mu(e)=\mu(e_{1}(y))$, or}\\ \text{$y$ unlabelled and $y=\beta(A)$ for a blossom $A\in{{\cal C}}$.}\end{cases}

Proof of Claim: We will apply Corollary 5.1 to blossom $B$ , adjusting for events that occur after $B$ becomes complete. Consider the corollary’s three alternatives ( $i$ )–( $iii$ ) for $e$ :

( $i$ ) $e$ is one of two oppositely directed arcs, say $ax$ and $xb$ . $B$ is maximal complete so the skew blossom $S$ for $x$ was not completed. $S$ was either never triggered or was triggered and is incomplete. The first case has both arcs on the augmenting trail, so neither arc is residual. In the second case $ax$ is on the augmenting trail. $xb$ is not, since it is not on the search path of any $d$ invocation made in the blossom step. $xb$ is residual, and in particular $b$ is not an orphan. So if $b$ is unlabelled then $xb$ is the base edge of a blossom $A\in{{\cal C}}$ . The Claim holds. If $b$ is labelled then Lemma 5.2 applies (with $zx$ taken as $xb$ ). It shows $\mu(xb)=\mu(e_{1}(b))$ . Again the Claim holds.

( $ii$ ) $e=\eta(B)$ as in the Claim.

( $iii$ ) $e$ is arc $xy$ . The analysis of $xb$ in ( $i$ ) applies, i.e., the same two possibilities of the Claim hold. Note that $e$ may originate from any number of occurrences of $x$ on $SP$ . $\spadesuit$

We now show the component $C$ achieves equality in (5.2). The only fact we will use about edges incident to $C$ is the Claim. We will reuse the Claim when we analyze components $C$ that contain an orphan.

Let $M$ be the set of matched residual edges with one or both ends in $C$ . Define $M_{I}$ to be the set of $M$ -edges of type $I$ , and similarly for $M_{O}$ and $M_{\widehat{C}O}$ . We will show the number of matched edges of type $\widehat{C}O$ for our component $C$ equals its corresponding term in the summation of (5.2). To do this we show the number of ends of matched $\widehat{C}O$ edges is exactly

(5.5)

2|M_{\widehat{C}O}|=f(C)+|E[C,O]|-\epsilon

where $\epsilon\in\{0,1\}$ . (Recall $f$ is the residual degree constraint function.) Since the left-hand side is even, $\epsilon=1$ implies $f(C)+|E[C,O]|$ is odd. So the desired equality always holds.
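The parity step can be checked mechanically. The sketch below (the function name is hypothetical) takes the total $f(C)+|E[C,O]|$ and a value $\epsilon\in\{0,1\}$ for which the right-hand side of (5.5) is even, and confirms that halving it gives the floor term of (5.2).

```python
def matched_count_from_5_5(total, eps):
    """Solve 2|M| = total - eps for |M|, where total = f(C) + |E[C,O]|.

    The left-hand side of (5.5) is even, so a valid pair (total, eps)
    must make total - eps even; this is checked first.
    """
    if eps not in (0, 1) or (total - eps) % 2 != 0:
        raise ValueError("total - eps must be even with eps in {0, 1}")
    return (total - eps) // 2

# For every admissible pair the result equals the floor term of (5.2).
for total in range(0, 40):
    for eps in (0, 1):
        if (total - eps) % 2 == 0:
            assert matched_count_from_5_5(total, eps) == total // 2
```

In other words $\epsilon=0$ forces an even total and $\epsilon=1$ an odd one, and in both cases the count of matched edges is $\lfloor (f(C)+|E[C,O]|)/2\rfloor$ .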

Let $\beta$ be the base vertex of the root blossom $B_{0}$ , and let $\eta$ be its base edge, also denoted as arc $a\beta$ . The above quantity $\epsilon$ is the sum of three quantities $\epsilon_{i}\in\{0,1\},i=1,2,3$ , defined by

\epsilon_{i}=1{\Longleftrightarrow\ }\eta\text{ is }\begin{cases}\text{the artificial edge $\varepsilon\alpha$}&i=1\\ \text{matched with $a$ labelled I}&i=2\\ \text{unmatched with $a$ labelled O}&i=3.\end{cases}

Clearly at most one of these quantities is 1. Fig. 13 illustrates the possibilities for $\epsilon$ .

The number of ends of matched edges of type $\widehat{C}O$ is exactly

(5.6)

2|M_{\widehat{C}O}|=\sum_{x\in C}deg(x,M)-deg(x,M_{I})+|M_{\widehat{C}O}|.

To analyze this let $e={\{x,y\}}$ be a residual edge incident to $C$ , with $x\in C$ . The Claim and (5.4) show either $e=\eta(B_{0})$ for the root blossom $B_{0}$ or $e$ occurs as arc $xy$ with $y$ labelled and $e$ is either matched with $y$ labelled O or unmatched with $y$ labelled I.

This implies

(5.7)

\sum_{x\in C}deg(x,M_{I})=\epsilon_{2}\text{ and }|E[C,O]|=|M_{\widehat{C}O}|+\epsilon_{3}.

Recall (Lemma 5.3) $C$ contains at most one free vertex, which must be $\beta$ , with $\beta=\alpha$ and $f(\beta)=deg(\beta,M)+1$ . Thus

(5.8)

\sum_{x\in C}deg(x,M)=f(C)-\epsilon_{1}.

Combining (5.6) with (5.7) and (5.8) gives (5.5).
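The combination can be written out as follows. This is a sketch under one stated assumption: the last term of (5.6) and the corresponding term of (5.7) are read as the number of matched edges joining $C$ to $O$ , written here with the hypothetical symbol $M_{CO}$ , since those edges supply the ends of matched $\widehat{C}O$ edges that lie in $O$ .

```latex
\begin{align*}
2|M_{\widehat{C}O}|
  &= \sum_{x\in C}\bigl(deg(x,M)-deg(x,M_{I})\bigr) + |M_{CO}|
     && \text{by (5.6)}\\
  &= \bigl(f(C)-\epsilon_{1}\bigr) - \epsilon_{2}
     + \bigl(|E[C,O]|-\epsilon_{3}\bigr)
     && \text{by (5.8) and (5.7)}\\
  &= f(C)+|E[C,O]|-(\epsilon_{1}+\epsilon_{2}+\epsilon_{3})\\
  &= f(C)+|E[C,O]|-\epsilon .
\end{align*}
```

Since at most one $\epsilon_{i}$ equals 1, $\epsilon\in\{0,1\}$ as required by (5.5).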

Case 2. $C$ contains an orphan: We start by analyzing the adjacencies of an orphan.

Lemma 5.4

Consider a residual edge $e={\{x,y\}}$ incident to an orphan $x$ . Either $y$ is unlabelled and an orphan or $y$ satisfies (5.4).

Proof: Let $e$ denote edge $\{x,y\}$ .

Case $y$ is labelled: We will show $\mu(e)=\mu(e_{1}(y))$ , so (5.4) holds. If edge $e$ occurs as a $\cal T$ -arc it must be arc $xy$ ( $x$ is an orphan). Lemma 5.2 applies (in particular $e$ is not the base edge of a ${\cal C}$ -blossom). It proves $\mu(e)=\mu(e_{1}(y))$ as claimed. Suppose $e$ is not a $\cal T$ -arc. The invocation $d(e_{1}(y))$ exists and returns (because of $y$ ’s label). $e$ was not removed in $d(e_{1}(y))$ ( $x$ is an orphan). So again $\mu(e)=\mu(e_{1}(y))$ .

Case $y$ is unlabelled: If $y$ is an orphan the lemma’s condition holds. The other possibility is that $y$ is in a complete blossom $B^{\prime}$ . We can assume $B^{\prime}\in{{\cal C}}$ . We have $y\in B^{\prime}\not\ni x$ and $\{x,y\}$ is the arc $xy$ ( $x$ an orphan). The Claim above shows $xy=\eta(B^{\prime})$ . So (5.4) holds for $A$ taken as $B^{\prime}$ . $\Box$

Define an orphanage to be a connected component of the graph induced on the residual graph by the set of orphans. (Its edges are the residual edges joining two orphans.) Clearly $C$ contains an orphanage, say $R$ . Lemma 5.4 and (5.4) show $R$ can be joined to other unlabelled vertices only by $\cal T$ -arcs directed from $R$ to the base vertex of a complete blossom. Thus $C$ consists of $R$ and 0 or more subtrees $S$ that consist entirely of complete blossoms and are joined to $R$ by the root blossom’s base edge $\eta$ . We will show $C$ satisfies the tightness relation (5.5) with $\epsilon=0$ . To do this we examine the contribution to the right-hand side made from the orphanage $R$ and from a typical subtree $S$ .

First consider $S$ . Case 1 applies and shows (5.5). Furthermore $\eta$ does not leave the current component $C$ . So each $\epsilon_{i}=0$ and $\epsilon=0$ .

Now consider $R$ . (5.6) holds by definition. Since (5.4) holds for $R$ , (5.7) holds with $\epsilon_{2}=\epsilon_{3}=0$ . $R$ does not contain any free vertex, since a free vertex occurs in an unsuccessful search. So (5.8) holds with $\epsilon_{1}=0$ . As before combining these equations gives (5.5) with $\epsilon=0$ .

Now combining the instances of (5.5) for $R$ and all subtrees $S$ gives (5.5) for $C$ with $\epsilon=0$ . We have now verified the tightness of (5.2) for our labelling and the residual graph matching of find_trails. So our main result follows:

Theorem 5.5

find_trails finds a blocking trail set ${\cal A}$ . In fact find_trails halts with a maximum cardinality matching of the residual graph. $\Box$

Linear-time bound

It is straightforward to see that find_trails operates in linear time $O(m)$ . Here $m$ counts each edge according to its multiplicity. Note the blossom enlarge test is easy to implement using Proposition 4.6( $i$ ), since testing for a skew blossom is trivial.

For an algorithm to use our blocking set the trails of ${\cal A}$ must be rematched. The inclusion of ${\cal C}$ in the residual graph allows any alternating trail through a complete blossom to be rematched. Thus the $P_{i}$ trails of [14] may be used (even for skew blossoms). So postprocessing the augmenting trails uses $O(m)$ time.

A Blocking 1-matchings

Fig.14 is a verbatim statement of the blocking algorithm for 1-matching of [17, 11]. It is the jumping-off point for our $f$ -factor algorithm.

procedure find_ap_set
    initialize $\cal S$ to an empty graph and $\cal P$ to an empty set
    for (each vertex $v\in V$ ) $b(v)\leftarrow v$ /* $b(v)$ maintains the base vertex of $B_{v}$ */
    for (each free vertex $f$ )
        if $f\notin V({\cal P})$ then
            add $f$ to $\cal S$ as the root of a new search tree
            find_ap $(f)$
    return $\cal P$

procedure find_ap( $x$ ) /* $x$ is an outer vertex */
    for (each edge $xy\notin M$ ) /* scan $xy$ from $x$ */
        if ( $y\notin V({{\cal S}})$ )
            if ( $y$ is free) /* $y$ completes an augmenting path */
                add $xy$ to $\cal S$ and add path $yP(x)$ to $\cal P$
                terminate every currently executing recursive call to find_ap
            else /* grow step */
                add $xy,yy^{\prime}$ to $\cal S$ , where $yy^{\prime}\in M$
                find_ap $(y^{\prime})$
        else if ( $b(y)$ is an outer, proper descendant of $b(x)$ in $\cal S^{-}$ ) /* blossom step */
            /* equivalent test: $b(y)$ became outer strictly after $b(x)$ */
            let $u_{i}$ , $i=1,\ldots,k$ be the inner vertices in ${\overline{\cal S}}(B_{y},B_{x})$ , ordered so $u_{i}$ precedes $u_{i-1}$
            for ( $i\leftarrow 1\ {\bf to}\ k$ )
                for (every vertex $v$ with $b(v)\in\{u_{i},u^{\prime}_{i}\}$ , where $u_{i}u^{\prime}_{i}\in M$ ) $b(v)\leftarrow b(x)$
                /* this adds $B_{u_{i}}\cup B_{u^{\prime}_{i}}$ to the new outer blossom */
            for ( $i\leftarrow 1\ {\bf to}\ k$ ) find_ap $(u_{i})$ /* process $u_{i}$ in order of increasing depth */
    return

Figure 14: Depth-first search blocking algorithm for ordinary matching.
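To make the control flow of Fig.14 concrete, here is a minimal Python sketch restricted to bipartite graphs. It is a deliberate simplification and not the algorithm of the figure: the blossom step is omitted entirely (it is never needed when the graph is bipartite), so the code is incorrect for general graphs. The names `adj`, `mate` and `in_S` are assumptions of the sketch; left and right vertices must have distinct names, and each path is recorded from the free root to the free endpoint.

```python
def find_ap_set(left, adj, mate):
    """Maximal set of vertex-disjoint augmenting paths, bipartite case only.

    left: iterable of left-side vertices
    adj:  dict left vertex -> list of right-side neighbours
    mate: dict vertex -> matched partner (absent or None if free)
    """
    in_S = set()   # vertices already added to the search structure S
    paths = []     # the path set P

    def find_ap(x, path):
        # x is an outer (left) vertex; path runs from the root to x.
        for y in adj[x]:
            if mate.get(x) == y:
                continue              # scan unmatched edges only
            if y in in_S:
                continue              # bipartite: no blossom step needed
            in_S.add(y)
            if mate.get(y) is None:   # y is free: augmenting path found
                paths.append(path + [y])
                return True           # unwinds every active find_ap call
            y2 = mate[y]              # grow step along the matched edge
            in_S.add(y2)
            if find_ap(y2, path + [y, y2]):
                return True
        return False

    for r in left:
        if mate.get(r) is None and r not in in_S:
            in_S.add(r)
            find_ap(r, [r])
    return paths
```

Returning `True` up the recursion plays the role of "terminate every currently executing recursive call". Because `in_S` persists across searches, every vertex is scanned at most once, which is the source of the linear time bound in this simplified setting.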

References

[1]
[2] W.J. Cook, W.H. Cunningham, W.R. Pulleyblank, and A. Schrijver, Combinatorial Optimization, Wiley and Sons, NY, 1998.
[3] E.A. Dinic, “Algorithm for solution of a problem of maximum flow in a network with power estimation”, Soviet Mathematics Doklady, 11, 1970, pp. 1277–1280. (In Russian.)
[4] R. Duan, H. He, and T. Zhang, “A scaling algorithm for weighted f-factors in general graphs”, Proc. of the 47th International Colloquium on Automata, Languages, and Programming (ICALP 2020), Vol. 168 of LIPIcs, pp. 41:1–41:17, 2020.
[5] R. Duan, S. Pettie, and H-H. Su, “Scaling algorithms for weighted matching in general graphs”, ACM Trans. Algorithms 14, 1, 2018, Article 8, 35 pages.
[6] J. Edmonds, “Paths, trees, and flowers”, Canad. J. Math. 17, 1965, pp. 449–467.
[7] J. Edmonds, “Maximum matching and a polyhedron with 0,1-vertices”, J. Res. Nat. Bur. Standards 69B, 1965, pp. 125–130.
[8] S. Even and R.E. Tarjan, “Network flow and testing graph connectivity”, SIAM J. Comput., 4, 1975, pp. 507–518.
[9] H.N. Gabow, “An efficient implementation of Edmonds’ algorithm for maximum matching on graphs”, J. ACM, 23, 2, 1976, pp. 221–234.
[10] H.N. Gabow, “A scaling algorithm for weighted matching on general graphs,” Proc. 26th Annual Symp. on Found. of Comp. Sci., 1985, pp. 90–100.
[11] H.N. Gabow, “The weighted matching approach to maximum cardinality matching,” Fundamenta Informaticae 154, 1-4, 2017, pp. 109–130.
[12] H.N. Gabow, “A weight-scaling algorithm for $f$ -factors of multigraphs”, arXiv:2010.01102, 2020.
[13] H.N. Gabow, “A data structure for nearest common ancestors with linking”, ACM Trans. on Algorithms, 13, 4, 2017, Article 45, 28 pages.
[14] H.N. Gabow, “Data structures for weighted matching and extensions to $b$ -matching and $f$ -factors,” ACM Trans. on Algorithms, 14, 3, 2018, Article 39, 80 pages.
[15] H.N. Gabow and R.E. Tarjan, “A linear-time algorithm for a special case of disjoint set union”, J. Comp. and System Sci., 30, 2, 1985, pp. 209–221.
[16] H.N. Gabow and R.E. Tarjan, “Faster scaling algorithms for network problems,” SIAM J. Comput., 18, 5, 1989, pp. 1013–1036.
[17] H.N. Gabow and R.E. Tarjan, “Faster scaling algorithms for general graph matching problems”, J. ACM 38, 4, 1991, pp. 815–853.
[18] Z. Galil, S. Micali and H.N. Gabow, “An $O(EV\,{\rm log}\,V)$ algorithm for finding a maximal weighted matching in general graphs”, SIAM J. Comput., 15, 1, 1986, pp. 120–130.
[19] J. Hopcroft and R. Karp, “An $n^{5/2}$ algorithm for maximum matchings in bipartite graphs”, SIAM J. Comput., 2, 4, 1973, pp. 225–231.
[20] D. Huang and S. Pettie, “Approximate generalized matching: $f$ -matchings and $f$ -edge covers”, arXiv:1706.05761, 2017.
[21] A.V. Karzanov, “On finding maximum flows in network with special structure and some applications”, Matematicheskie Voprosy Upravleniya Proizvodstvom, Vol 5, 1973, pp. 81–94. (In Russian.)
[22] E.L. Lawler, Combinatorial Optimization: Networks and Matroids, Holt, Rinehart and Winston, New York, 1976.
[23] L. Lovász and M.D. Plummer, Matching Theory, North-Holland Mathematics Studies 121, North-Holland, New York, 1986.
[24] S. Micali and V.V. Vazirani, “An $O(\sqrt{|V|}\cdot|E|)$ algorithm for finding maximum matching in general graphs”, Proc. 21st Annual Symp. on Found. of Comp. Sci., 1980, pp. 17–27.
[25] A. Schrijver, Combinatorial Optimization: Polyhedra and Efficiency, Springer, NY, 2003.
