Computing the Union Join and Subset Graph of Acyclic Hypergraphs in Subquadratic Time

Arne Leitert

Abstract

We investigate the two problems of computing the union join graph as well as computing the subset graph for acyclic hypergraphs and their subclasses. In the union join graph $G$ of an acyclic hypergraph $H$ , each vertex of $G$ represents a hyperedge of $H$ and two vertices of $G$ are adjacent if there exits a join tree $T$ for $H$ such that the corresponding hyperedges are adjacent in $T$ . The subset graph of a hypergraph $H$ is a directed graph where each vertex represents a hyperedge of $H$ and there is a directed edge from a vertex $u$ to a vertex $v$ if the hyperedge corresponding to $u$ is a subset of the hyperedge corresponding to $v$ .

For a given hypergraph $H=(V,\mathcal{E})$ , let $n=|V|$ , $m=|\mathcal{E}|$ , and $N=\sum_{E\in\mathcal{E}}|E|$ . We show that, if the Strong Exponential Time Hypothesis is true, both problems cannot be solved in $\mathcal{O}\bigl{(}N^{2-\varepsilon}\bigr{)}$ time for $\alpha$ -acyclic hypergraphs and any constant $\varepsilon>0$ , even if the created graph is sparse. Additionally, we present algorithms that solve both problems in $\mathcal{O}\bigl{(}N^{2}/\log N+|G|\bigr{)}$ time for $\alpha$ -acyclic hypergraphs, in $\mathcal{O}\bigl{(}N\log(n+m)+|G|\bigr{)}$ time for $\beta$ -acyclic hypergaphs, and in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time for $\gamma$ -acyclic hypergraphs as well as for interval hypergraphs, where $|G|$ is the size of the computed graph.

1 Introduction

A hypergraph $H=(V,\mathcal{E})$ is a generalisation of a graph in which each edge $E\in\mathcal{E}$ , called hyperedge, can contain an arbitrary positive number of vertices from $V$ . One may also see a hypergraph $H$ as a family $\mathcal{E}$ of subsets of some set $V$ . Indeed, we say that the family $\mathcal{F}$ of sets forms the hypergraph $H=(V,\mathcal{E})$ if $V=\bigcup_{S\in\mathcal{F}}S$ and $\mathcal{E}=\mathcal{F}$ . We use $n=|V|$ , $m=|\mathcal{E}|$ , and $N=\sum_{E\in\mathcal{E}}|E|$ to respectively denote the cardinality of the vertex set, the cardinality of the hyperedge set, and the total size of all hyperedges of $H$ .

1.1 Acyclic Hypergraphs

A tree $T$ is called a join tree for $H$ if the hyperedges of $H$ are the nodes of $T$ and, for each vertex $v\in V$ , the hyperedges containing $v$ induce a subtree of $T$ . That is, if $v\in E_{i}\cap E_{j}$ , then $v$ is contained in each hyperedge (i. e., node of $T$ ) on the path from $E_{i}$ to $E_{j}$ in $T$ . A hypergraph is acyclic if it admits a join tree. There is a linear-time algorithm which determines if a given hypergraph is acyclic and, in that case, constructs a corresponding join tree for it [27].

Acyclic hypergraphs have various applications. They are, for example, a desired structure when designing relational databases [3]. There is also a close relation between acyclic hypergraphs and chordal as well as dually chordal graphs. Namely, a graph is chordal if and only if its maximal cliques form an acyclic hypergraph [18], and a graph is dually chordal if and only if its closed neighbourhoods form an acyclic hypergraph [8].

Tree-decompositions are another application. The idea is to decompose a graph $G=(V,E)$ into multiple induced subgraphs, usually called bags, where each vertex can be in multiple bags. The set of bags $\mathcal{B}$ forms a tree $T$ in such a way that the following requirements are fulfilled: Each vertex is in at least one bag, each edge is in at least one bag, and $T$ is a join tree for the hypergraph $(V,\mathcal{B})$ . Usually tree-decompositions are considered with additional restrictions. The most known is called tree-width; it limits the maximum cardinality of each bag. For a graph class with bounded tree-width, many NP-complete problems can be solved in polynomial or even linear time. Alternatively, one may limit the distances between vertices inside a bag. Such a tree-decomposition can be used, for example, for constructing tree-spanners [13, 14] and efficient routing schemes [12].

An inclusion-maximal subset of vertices of a graph $G$ is called an atom if it induces a connected subgraph of $G$ without a clique separator. It is known that the atoms of a graph form an acyclic hypergraph [23]. The corresponding join tree is then called atom tree.

The most general acyclic hypergraphs are called $\alpha$ -acyclic (i. e. ., each acyclic hypergraph is $\alpha$ -acyclic). They are closely related to chordal graphs and to dually chordal graphs. Subclasses of $\alpha$ -acyclic hypergraphs are $\beta$ -acyclic hypergraphs which are closely related to strongly chordal graphs and $\gamma$ -acyclic hypergraphs which are closely related to ptolemaic graphs (graphs that are chordal and distance-hereditary). We also consider interval hypergraphs. These are acyclic hypergrpahs for which one of their join trees forms a path. As the name suggests, they are closely related to interval graphs. We give formal definitions and more information about each subclass later in their respective sections.

A class of hypergraphs closely related to acyclic hypergraphs are so-called hypertrees. These hypergraphs are defined in the same way as acyclic hypergraphs, except that the roles of vertices and hyperedges are exchanged. That is, a hypergraph is a hypertee if its vertices admit a tree $T$ such that each hyperedge induces a subtree of $T$ . The hypergraph resulting from exchanging the roles of vertices and hyperedges is called the dual hypergraph. (See Section 2 for a more formal definition.) Subsequently, a hypergraph is a hypertree if and only if it is the dual of an acyclic hypergraph.

Figure 1 shows the hierarchy of acyclic hypergraphs. See Brandstädt and Dragan [7] for a summary of known properties of acyclic hypergraphs as well as their relations to various graph classes.

Figure 1: Hierarchy of acyclic hypergraphs. An edge from class

X

to class

Y

states that

X

is a proper subset of

Y

1.2 Union Join Graph

Note that the join tree of an acyclic hypergraph is not always unique. For example, each tree with $n$ nodes is a valid join tree for the hypergraph formed by $\bigl{\{}\{0,1\},\{0,2\},\ldots,\{0,n\}\bigr{\}}$ . The union join graph $G$ of a given acyclic hypergraph $H$ is the union of all its join trees. That is, each vertex of $G$ represents a hyperedge of $H$ and two vertices of $G$ are adjacent if there exits a join tree $T$ for $H$ such that the corresponding hyperedges are adjacent in $T$ . The union join graph of a hypergraph $H$ may also be called clique graph if $H$ represents the maximal cliques of a chordal graph [17, 20], or atom graph if $H$ represents the atoms of some graph [21]. In [5], Berry and Simonet present algorithms which compute the union join graph of an acyclic hypergraph in $\mathcal{O}(Nm)$ time.

1.3 Subset Graph

The subset graph of a hypergraph $H$ is a directed graph $G$ where each vertex represents a hyperedge of $H$ and there is a directed edge from a vertex $u$ to a vertex $v$ if the hyperedge corresponding to $u$ is a subset of the hyperedge corresponding to $v$ . Pritchard presents an algorithm in [26] that computes the subset graph for a given hypergraph in $\mathcal{O}\bigl{(}N^{2}/\log N\bigr{)}$ time. They also show that any subset graph has at most $\mathcal{O}\bigl{(}N^{2}/\log^{2}N\bigr{)}$ many edges. There are various publications that present algorithms for special cases and different computational models; see for example [15, 25] and the work cited therein.

The Strong Exponential Time Hypothesis, SETH for short, states that there is no algorithm that solves the Boolean satisfiability problem (without limitation on clause size) for some constant $\varepsilon>0$ in $\mathcal{O}\bigl{(}(2-\varepsilon)^{n}\bigr{)}$ time where $n$ is the number of variables in the given instance. A function $f(n)$ is called truly subquadratic if $f(n)\in\mathcal{O}\bigl{(}n^{2-\varepsilon}\bigr{)}$ for some constant $\varepsilon>0$ . Borassi et al. [6] show that, if SETH holds, then there is no algorithm to compute the subset graph of an arbitrary hypergraph in truly subquadratic time, even if the output is sparse. Note that the results in [6] and [26] are not conflicting, since $N^{2-\varepsilon}\in o\bigl{(}N^{2}/\log N\bigr{)}$ .

1.4 Our Contribution

In this paper, we investigate the two problems of computing the union join graph as well as computing the subset graph for acyclic hypergraphs and their subclasses. We show in Section 3 that there is a close relation between both problems by presenting reductions in both directions. It then follows that the result by Borassi et al. still holds when restricted to $\alpha$ -acyclic hypergraphs and also applies to computing a union join graph. We then develop efficient algorithms to solve both problems for acyclic hypergraphs and their subclasses. In particular, we show that, if $|G|$ denotes the size of the computed graph $G$ , then both problems can be solved in $\mathcal{O}\bigl{(}N^{2}/\log N+|G|\bigr{)}$ time for $\alpha$ -acyclic hypergraphs (Section 3), in $\mathcal{O}\bigl{(}N\log(n+m)+|G|\bigr{)}$ time for $\beta$ -acyclic hypergaphs (Section 4), and in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time for $\gamma$ -acyclic hypergraphs (Section 5) as well as for interval hypergraphs (Section 6).

2 Preliminaries

Two graphs $G=(V,E)$ and $G^{\prime}=(V^{\prime},E^{\prime})$ are isomorphic if there is a bijective function $f\colon V\rightarrow V^{\prime}$ such that $uv\in E$ if and only if $f(u)f(v)\in E^{\prime}$ . For simplicity, we write $G=G^{\prime}$ if they are isomorphic.

Let $H=(V,\mathcal{E})$ be a hypergraph. The incidence graph $\mathcal{I}(H)=\bigl{(}U_{V}\cup U_{\mathcal{E}},E_{\mathcal{I}}\bigr{)}$ of $H$ is a bipartite graph were $U_{V}$ represents the vertices of $H$ , $U_{\mathcal{E}}$ represents the hyperedges of $H$ , and there is an edge between two vertices $u_{v}\in U_{V}$ and $u_{E}\in U_{\mathcal{E}}$ if the corresponding vertex $v$ (of $H$ ) is in the corresponding hyperedge $E$ . That is, $U_{V}=\{\,u_{v}\mid v\in V\,\}$ , $U_{\mathcal{E}}=\{\,u_{E}\mid E\in\mathcal{E}\,\}$ , and $E_{\mathcal{I}}=\{\,u_{v}u_{E}\mid v\in E\,\}$ . Note that $\bigl{|}E_{\mathcal{I}}\bigr{|}=N$ . If not stated or constructed otherwise, the incidence graphs of all hypergraphs occurring in this paper are connected, finite, undirected, and without multiple edges. Additionally, whenever a hypergraph is given, it is given as its incidence graph; hence, the input size is in $\Theta(N)$ . We say two hyperedges of $H$ are distinct if they are represented by two different vertices in $\mathcal{I}(H)$ , even if both hyperedges contain the same vertices.

Let $\bigl{(}U_{V}\cup U_{\mathcal{E}},E_{\mathcal{I}}\bigr{)}$ be the incidence graph of some hypergraph $H=(V,\mathcal{E})$ . One can then exchange the roles of $U_{V}$ and $U_{\mathcal{E}}$ to interpret $U_{\mathcal{E}}$ as vertices and $U_{V}$ as hyperedges. We call the resulting hypergraph the dual hypergraph of $H$ and denote it as $H^{*}$ . Observe that, by definition, $(H^{*})^{*}=H$ .

The 2-section graph $\operatorname{2Sec}Sec(H)$ of $H$ is the graph with the vertex set $V$ where two vertices $u$ and $v$ are adjacent if there is a hyperedge $E\in\mathcal{E}$ with $u,v\in E$ . The line graph $L(H)$ of $H$ is the intersection graph of its hyperedges. That is, $L(H)=(\mathcal{E},\mathcal{E}_{L})$ with $\mathcal{E}_{L}=\{\,E_{i}E_{j}\mid E_{i},E_{j}\in\mathcal{E};E_{i}\cap E_{j}\neq\emptyset\,\}$ . It directly follows from these definitions that $\operatorname{2Sec}Sec(H)=L(H^{*})$ .

A sequence $\langle v_{1},v_{2},\ldots,v_{k}\rangle$ of vertices of $H$ forms a path in $H$ if, for each $i$ with $1\leq i<k$ , $H$ contains a hyperedge $E$ with $v_{i},v_{i+1}\in E$ . Let $X$ , $Y$ , and $Z$ be sets of vertices of $H$ . $X$ separates $Y$ form $Z$ if $X\neq\emptyset$ and each sequence of vertices that forms a path from $Y$ to $Z$ in $H$ contains a vertex from $X$ .

Let $T$ be the join tree of some acyclic hypergraph $H$ and let $E_{i}$ and $E_{j}$ be two hyperedges of $H$ which are adjacent in $T$ . We then call the set $S=E_{i}\cap E_{j}$ a separator of $H$ with respect to $T$ . If $T$ is rooted and $E_{i}$ is the parent of $E_{j}$ , we call $S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{j}):=E_{i}\cap E_{j}$ the up-separator of $E_{j}$ . Note that each separator corresponds to an edge of $T$ and vice versa. We call the hypergraph formed by the set of all separators of $H$ the separator hypergraph $\mathcal{S}(H)$ for $H$ with respect to $T$ . It follows from properties ii and iii of Lemma 3 (see Section 3) that $\mathcal{S}(H)$ is always the same for a given $H$ , independent of the used join tree.

3 $\alpha$ -Acyclic Hypergraphs

In this section, we investigate the problems of computing a union join graph and computing a subset graph for the most general case of acyclic hypergraphs. We first show that computing these graphs cannot be done in truly subqadratic time if the SETH is true. For that, we use a problem called Sperner Family problem. It asks whether a family of sets contains two sets $S$ and $S^{\prime}$ such that $S\subseteq S^{\prime}$ . If the SETH is true, then there is no algorithm that solves it truly subquadratic time [6]. Afterwards, we give an algorithm that allows to quickly compute the union join graph if a fast algorithm for the subset graph problem is given. Lastly, we give some additional notes on the Sperner Family problem and its generalisation.

3.1 Hardness Results

Let $\mathcal{F}=\{S_{1},S_{2},\ldots,S_{m}\}$ be a family of sets. We create an acyclic hypergraph $H$ from $\mathcal{F}$ as follows. Create a new vertex $u$ (i. e., $u$ is not contained in any set $S_{i}$ ) and, for each set $S_{i}$ , create a hyperedge $E_{i}=S_{i}\cup\{u\}$ . Additionally, create a hyperedge $\mathcal{S}$ which is the union of all hyperedges $E_{i}$ . Formally, we have that $H=(V,\mathcal{E})$ with $V=\mathcal{S}$ and $\mathcal{E}=\big{\{}\,E_{i}\bigm{|}S_{i}\in\mathcal{F}\,\big{\}}\cup\{\mathcal{S}\}$ . One can create a join tree $T$ for $H$ by starting with $\mathcal{S}$ and then making each hyperedge $E_{i}$ adjacent to it. Thus, $H$ is acyclic. Note that one can create $H$ and $T$ from $\mathcal{F}$ in linear time.

For the remainder of this subsection, assume that we are given a family $\mathcal{F}$ , a hypergraph $H$ , and a corresponding join tree $T$ for $H$ as defined above. Our results in this subsection are based on the following observation.

Lemma 1

$\mathcal{F}$ contains two distinct sets $S_{i}$ and $S_{j}$ with $S_{i}\subseteq S_{j}$ if and only if there is a join tree for $H$ that contains the edge $E_{i}E_{j}$ .

Proof

First, assume that $\mathcal{F}$ contains two distinct sets $S_{i}$ and $S_{j}$ with $S_{i}\subseteq S_{j}$ . In that case, we can create a new join tree $T^{\prime}$ as follows. Remove the edge $E_{i}\mathcal{S}$ from $T$ and make $E_{i}$ adjacent to $E_{j}$ instead. Since $S_{i}\subseteq S_{j}$ , each element $x\in E_{i}\cap\mathcal{S}$ is also contained in $E_{j}$ . Thus, $T^{\prime}$ is a join tree for $H$ and contains the edge $E_{i}E_{j}$ .

Next, assume that there is a join tree $T^{\prime}$ for $H$ with the edge $E_{i}E_{j}$ . Without loss of generality, let $E_{j}$ be closer to $\mathcal{S}$ in $T^{\prime}$ than $E_{i}$ . Recall that $E_{i}\subseteq\mathcal{S}$ . Therefore, by properties of join trees, each vertex in $E_{i}$ is also in $E_{j}$ . It then directly follows from the construction of $H$ that $S_{i}\subseteq S_{j}$ . $\square$

We use the Sperner Family problem to show that there is no truly subquadratic-time algorithm to compute the union join graph of a given acyclic hypergraph. To do so, we first show the following.

Lemma 2

If the SETH is true, then there is no algorithm which decides in $\mathcal{O}\big{(}N^{2-\varepsilon}\big{)}$ time whether or not a given acyclic hypergraph has a unique join tree.

Proof

Recall that we can create a join tree $T$ for $H$ by making each hyperedge $E_{i}$ adjacent to the hyperedge $\mathcal{S}$ . To prove Lemma 2, we show that $\mathcal{F}$ contains two distinct sets $S_{i}$ and $S_{j}$ with $S_{i}\subseteq S_{j}$ if and only if $T$ is not a unique join tree for $H$ .

First, assume that $\mathcal{F}$ contains two such sets $S_{i}$ and $S_{j}$ . In that case, Lemma 1 implies that there is a join tree $T^{\prime}$ for $H$ with the edge $E_{i}E_{j}$ . Since $E_{i}E_{j}$ is not an edge in $T$ , $T$ is not unique. Next, assume that $T$ is not unique. Then, there is a join tree $T^{\prime}$ and a hyperedge $E_{i}$ such that $E_{i}$ is not adjacent to $\mathcal{S}$ in $T^{\prime}$ . Hence, $E_{i}$ is adjacent to some hyperedge $E_{j}$ that is closer to $\mathcal{S}$ in $T^{\prime}$ than $E_{i}$ . Since $E_{i}\subseteq\mathcal{S}$ , properties of join trees imply that $E_{i}\subseteq E_{j}$ . Subsequently, due to Lemma 1, $S_{i}\subseteq S_{j}$ .

It follows that a truly subquadratic-time algorithm which determines if an acyclic hypergraph has a unique join tree would imply an equally fast algorithm to solve the Sperner Family problem for any family of sets. $\square$

Note that, by definition of a union join graph, $H$ has a unique join tree if and only if the union join graph of $H$ is a tree. Therefore, we get the following.

Theorem 3.1

If the SETH is true, then there is no algorithm which constructs the union join graph of a given acyclic hypergraph in $\mathcal{O}\big{(}N^{2-\varepsilon}\big{)}$ time, even if that graph is sparse.

We now show that computing the subset graph of an acyclic hypergraph is as hard as computing the subset graph for a general family of sets.

Theorem 3.2

If the SETH is true, then there is no algorithm which constructs the subset graph of a given acyclic hypergraph in truly subquadratic time.

Proof

Let $G$ be the subset graph for $H$ and $G_{\mathcal{F}}$ be the subset graph for $\mathcal{F}$ . Since, by construction of $H$ , $E_{i}\subseteq E_{j}$ if and only if $S_{i}\subseteq S_{j}$ , $G$ contains the edge $(E_{i},E_{j})$ if and only if $G_{\mathcal{F}}$ contains the edge $(S_{i},S_{j})$ . We can therefore construct $G_{\mathcal{F}}$ from $G$ by simply removing the vertex representing $\mathcal{S}$ from $G$ (and its incident edges).

Recall that we can construct $H$ from $\mathcal{F}$ in linear time. Therefore, a truly subquadratic-time algorithm to construct the subset graph of a given acyclic hypergraph would imply an equally fast algorithm to construct a subset graph of a given family of sets. $\square$

3.1.1 Note on Hypertrees.

Observe that, in the hypergraph $H$ as constructed above, each hyperedge contains the vertex $u$ . We can therefore create a tree $T$ by making each other vertex a leaf adjacent to $u$ . Each hyperedge of $H$ now induces a subtree of $T$ , i. e., $H$ is a hypertree.

It follows that Lemma 2 and Theorem 3.1 still hold if the given hypergraph is both acyclic and a hypertree. Therefore, there is no truly subquadratic-time algorithm which, in general, computes the union join graph of such a hypergraph or determines if has a unique join tree.

3.2 Union Join Graph via Subset Graph

In the previous subsection, we show how to compute the subset graph using the union join graph of an acyclic hypergraph. We now present an algorithm that computes the union join graph of a given acyclic hypergraph with the help of a subset graph. The runtime of our algorithm then depends on the runtime required to compute that subset graph.

For the remainder of this subsection, assume that we are given an acyclic hypergraph $H=(V,\mathcal{E})$ and let $G$ be the union join graph of $H$ (with for us unknown edges). Lemma 3 below gives various characterisations for $G$ .

Lemma 3

For any distinct $E_{i},E_{j}\in\mathcal{E}$ , the following are equivalent.

(i)

$E_{i}E_{j}$ is an edge of $G$ .
(ii)

$H$ has a join tree with the edge $E_{i}E_{j}$ .
(iii)

Each join tree $T$ of $H$ has an edge $E_{i}^{\prime}E_{j}^{\prime}$ on the path from $E_{i}$ to $E_{j}$ in $T$ such that $E_{i}\cap E_{j}=E_{i}^{\prime}\cap E_{j}^{\prime}$ .
(iv)

Each join tree $T$ of $H$ has a separator $S$ on the path $P_{ij}$ from $E_{i}$ to $E_{j}$ in $T$ with $S\subseteq S_{i}$ and $S\subseteq S_{j}$ where $S_{i}$ and $S_{i}$ are the separators in $P_{ij}$ which are respectively closest to $E_{i}$ and $E_{j}$ .
(v)

$E_{i}\cap E_{j}$ separates $E_{i}\setminus E_{j}$ from $E_{j}\setminus E_{i}$ .

Most of the properties in Lemma 3 repeat, generalise, or paraphrase existing results (see [5, 17, 20]). Property iv is, to the best of our knowledge, a new observation. For completeness, however, we prove all of them.

Proof

By definition of $G$ , properties i and ii are equivalent. It follows from properties of join trees that ii implies v.

We next show that v implies iii. Assume that $E_{i}$ and $E_{j}$ are not adjacent in a join tree $T$ . Then there is a path $\langle E_{i}=X_{1},X_{2},\ldots,X_{k}=E_{j}\rangle$ of hyperedges from $E_{i}$ to $E_{j}$ in $T$ . For each $p$ with $1\leq p<k$ , let $S_{p}=X_{p}\cap X_{p+1}$ be the separator corresponding to the edge $X_{p}X_{p+1}$ of $T$ . By properties of join trees, $E_{i}\cap E_{j}\subseteq S_{p}$ for each $S_{p}$ . Now assume that each $S_{p}$ contains a vertex $v_{p}\notin E_{i}\cap E_{j}$ . Then, $\langle v_{1},v_{2},\ldots,v_{k-1}\rangle$ would form a path in $H$ from $v_{1}\in E_{i}\setminus E_{j}$ to $v_{k-1}\in E_{j}\setminus E_{i}$ . That contradicts with property v. Therefore, there is at least one separator $S_{p}$ with $S_{p}\subseteq E_{i}\cap E_{j}$ , i. e., there is an edge $X_{p}X_{p+1}$ in $T$ with $E_{i}\cap E_{j}=X_{p}\cap X_{p+1}$ .

To show that iii implies ii, consider a join tree $T$ where $E_{i}$ and $E_{j}$ are not adjacent. We can create a join tree $T^{\prime}$ by removing the edge $E_{i}^{\prime}E_{j}^{\prime}$ and adding the edge $E_{i}E_{j}$ instead. Since $E_{i}$ and $E_{j}$ are on different sides of $E_{i}^{\prime}E_{j}^{\prime}$ in $T$ , $T^{\prime}$ is also a tree. Additionally, because $E_{i}\cap E_{j}=E_{i}^{\prime}\cap E_{j}^{\prime}$ , $T^{\prime}$ is a valid join tree for $H$ .

It remains to show that iv is equivalent to iii. We first assume property iii. Let $S=E_{i}\cap E_{j}$ be a separator on the path from $E_{i}$ to $E_{j}$ in some join tree $T$ . Since, by properties of join trees, each vertex in $S=E_{i}\cap E_{j}$ is also in $S_{i}$ and $S_{j}$ , it follows that $S\subseteq S_{i}$ and $S\subseteq S_{j}$ . Now assume property iv. Because $S\subseteq S_{i}\subseteq E_{i}$ and $S\subseteq S_{j}\subseteq E_{j}$ , it is also the case that $S\subseteq E_{i}\cap E_{j}$ . Since $S$ is on the path from $E_{i}$ to $E_{j}$ in $T$ , each vertex that is in both $E_{i}$ and $E_{j}$ also has to be in $S$ , i. e., $S\supseteq E_{i}\cap E_{j}$ . Therefore, $S=E_{i}\cap E_{j}$ . $\square$

Based on Lemma 3, we can construct $G$ as follows. Compute a join tree $T$ for $H$ , the separator hypergraph $\mathcal{S}(H)$ (with respect to $T$ ), and its subset graph $G_{\mathcal{S}}$ . Next, use $G_{\mathcal{S}}$ to find all triples $S_{i},S_{j},S$ of separators which satisfy property iv of Lemma 3. Since their corresponding hyperedges are then adjacent in some join tree of $H$ , make the corresponding vertices adjacent in $G$ .

Before analysing our approach further, we address some needed preprocessing. Assume that $H$ contains two hyperedges $E_{i}$ and $E_{j}$ which are not adjacent in $T$ , but are adjacent in some other join tree. There might then be multiple separators $S$ on the path from $E_{i}$ to $E_{j}$ in $T$ which satisfy property iv of Lemma 3. Our algorithm would, therefore, add the edge $E_{i}E_{j}$ to $G$ multiple times, once for each such $S$ . While it is easy to remove redundant edges from $G$ afterwards, we still want to ensure that the time needed to create and remove these edges does not become too much. To achieve that, Algorithm 1 modifies $T$ such that each hyperedge becomes adjacent to its highest possible ancestor in $T$ . As by-product, Algorithm 1 also computes the up-separator of each hyperedge (and, thus, the separator hypergraph $\mathcal{S}(H)$ ).

Input: An acyclic hypergraph

H=(V,\mathcal{E})

and a join tree

T

for

H

Output: A modified join tree

T^{\prime}

for

H

and the separator hypergraph

\mathcal{S}(H)

4Root

T

in an arbitrary hyperedge

R

and then run a pre-order on

T

. Let

\sigma=\langle R=E_{1},E_{2},\ldots,E_{m}\rangle

be the resulting order.

5For each vertex

v

, set

\lambda(v):=\min\{\,i\mid v\in E_{i}\,\}

6for $i:=2$ to $m$ do

7 Set

S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i}):=\bigl{\{}\,v\in E_{i}\bigm{|}\lambda(v)<i\,\bigr{\}}

8 Let

j=\max\bigl{\{}\,\lambda(v)\bigm{|}v\in S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i})\,\bigr{\}}

and make

E_{j}

the parent of

E_{i}

Let

\mathcal{S}(H)

be the hypergraph formed by the family

\bigl{\{}\,S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i})\bigm{|}E_{i}\in\mathcal{E},E_{i}\neq R\,\bigr{\}}

Algorithm 1 Modifies the join tree of a given acyclic hypergraph such that each hyperedge becomes adjacent to its highest possible ancestor.

Lemma 4

Algorithm 1 runs in linear time.

Proof

Line 1 runs in $\mathcal{O}(m)$ time, since the nodes of $T$ are the hyperedges of $H$ . Recall that $H$ is given as an incidence graph $\mathcal{I}(H)$ . Hence, the following are equivalent (with respect to runtime): (i) for each vertex, iterating over all hyperedges containing it; (ii) for each hyperedge, iterating over all vertices it contains; and (iii) iterating over all edges of $\mathcal{I}(H)$ . Therefore, line 1, line 1, and line 1 (and subsequently Algorithm 1) run in $\mathcal{O}(N)$ total time. $\square$

Lemma 5

The tree $T^{\prime}$ created by Algorithm 1 is a valid join tree for $H$ .

Proof

Let $T_{i}$ be the tree after processing $E_{i}$ , i. e., $T=T_{1}$ and $T_{m}=T^{\prime}$ . Thus, $T_{1}$ is a valid join tree for $H$ . Assume, by induction, that $T_{i-1}$ (with $i\geq 2$ ) is a valid join tree for $H$ too. Recall that, by definition of join trees, the set of hyperedges containing a vertex $v$ form a subtree $T_{v}$ of $T$ . The roots of all such $T_{v}$ where $v\in S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i})$ are ancestors of $E_{i}$ in $T$ and, thus, form a path. By definition of $j$ (line 1), $E_{j}$ is the lowest of such roots in $T$ . It therefore follows that $S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i})\subseteq E_{j}$ . Subsequently, for each $v\in S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i})$ , the hyperedges containing $v$ still form a subtree of $T_{i}$ after changing the parent of $E_{i}$ if they did so in $T_{i-1}$ . Note that each subtree $T_{u}$ of a vertex $u\notin S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(E_{i})$ remains unchanged, since it does not contain the edge $E_{i}E_{k}$ . Therefore, for each vertex, the hyperedges containing it form a subtree of $T_{i}$ and, thus, $T_{i}$ is a join tree for $H$ . $\square$

Lemma 6

Let $E_{i}$ and $E_{j}$ be two hyperedges of $H$ , $T^{\prime}$ be the tree computed by Algorithm 1, and $P_{ij}$ be the path from $E_{i}$ to $E_{j}$ in $T^{\prime}$ . Additionally, let $S_{i}$ and $S_{j}$ be the separators on $P_{ij}$ which are closest to $E_{i}$ and $E_{j}$ , respectively. There are at most two separators $S$ on $P_{ij}$ such that $S\subseteq S_{i}$ and $S\subseteq S_{j}$ .

Proof

Let $E_{k}$ be the lowest common ancestor of $E_{i}$ and $E_{j}$ in $T^{\prime}$ . Although $T^{\prime}$ has a potentially different structure than $T$ , it is still the case that the parent of a hyperedge in $T^{\prime}$ was an ancestor of it in $T$ . Thus, $k\leq i,j$ . Note that $P_{ij}$ goes through $E_{k}$ and let $P_{ik}$ and $P_{kj}$ be the respective subpaths of $P_{ij}$ . If $P_{ij}$ contains more than two separators $S$ as defined in Lemma 6, at least two of them are either part of $P_{ik}$ or $P_{kj}$ . Without loss of generality, let them be on $P_{kj}$ and let $S$ be the lowest such separator. Additionally, let $X$ be the hyperedge directly below $S$ , i. e., $S^{\raisebox{0.3014pt}{$\scriptscriptstyle\uparrow$}}(X)=S$ . It follows that $X$ is not adjacent to $E_{k}$ in $T^{\prime}$ .

Since $S\subseteq S_{i}$ , each vertex in $S$ is in all hyperedges on the path from $X$ to $E_{i}$ in $T^{\prime}$ , including $E_{k}$ . Therefore, $S\subseteq E_{k}$ and $\max\bigl{\{}\,\lambda(v)\bigm{|}v\in S\,\bigr{\}}\leq k$ . That is a contradiction, since Algorithm 1 would have made $X$ adjacent to $E_{k}$ or one of its ancestors. $\square$

Algorithm 2 now implements the approach described above. It also uses Algorithm 1 as preprocessing. Therefore, due to Lemma 6, the algorithm adds each edge $E_{i}E_{j}$ at most two times into $G$ .

Input: An acyclic hypergraph

H=(V,\mathcal{E})

and an algorithm

\mathcal{A}

that computes the subset graph for a given family of sets.

Output: The union join graph

G

H

4Find a join tree for

H

(see [27]) and call Algorithm 1. Let

T

be the resulting join tree and

\mathcal{S}

the resulting family of separators (i. e., the hyperedges of

\mathcal{S}(H)

5Use algorithm

\mathcal{A}

to compute the subset graph

G_{\mathcal{S}}

\mathcal{S}

6Create a new graph

G=(\mathcal{E},E_{G})

with

E_{G}=\emptyset

7foreach $S\in\mathcal{S}$ do

8 Use

G_{\mathcal{S}}

to determine all separators

S^{\prime}

with

S\subseteq S^{\prime}

(including

S

itself).

9 For each such

S^{\prime}

, let

EE^{\prime}

be the edge of

T

which

S^{\prime}

represents and let

E

be the hyperedge farther away from

S

T

. Add

E

to a set

\mathbb{E}

of hyperedges. If

S

and

S^{\prime}

represent the same edge of

T

, also add

E^{\prime}

10 Partition

\mathbb{E}

into two sets

\mathbb{E}_{1}

and

\mathbb{E}_{2}

based on which side of

S

they are in

T

11 For each pair

E_{1},E_{2}

with

E_{1}\in\mathbb{E}_{1}

and

E_{2}\in\mathbb{E}_{2}

, add

E_{1}E_{2}

into

E_{G}

Algorithm 2 Computes the union join graph of an acyclic hypergraph.

Theorem 3.3

Algorithm 2 computes the union join graph $G$ of a given acyclic hypergraph $H$ in $\mathcal{O}\bigl{(}T_{\mathcal{A}}(H)+N+|G|\bigr{)}$ time where $T_{\mathcal{A}}(H)$ is the runtime of a given algorithm $\mathcal{A}$ with the separator hypergraph of $H$ as input.

Proof (Correctness)

Let $E_{i}$ and $E_{j}$ be two hyperedges of $H$ . Additionally, let $S_{i}$ and $S_{j}$ be the separators on the path from $E_{i}$ to $E_{j}$ in $T$ (computed in line 2) which are closest to $E_{i}$ and $E_{j}$ , respectively. We show the correctness of Algorithm 2 by showing that $E_{i}E_{j}$ is an edge of $G$ if and only if there is a join tree for $H$ with the edge $E_{i}E_{j}$ .

First, assume that there is a join tree for $H$ with the edge $E_{i}E_{j}$ . Lemma 3 then implies that there is a separator $S\in\mathcal{S}$ such that $S\subseteq S_{i}$ , $S\subseteq S_{j}$ , and $E_{i}$ and $E_{j}$ are on different sides of $S$ in $T$ . Therefore, when processing $S$ , the algorithm finds $S_{i}$ and $S_{j}$ (line 2) and consequently adds $E_{i}$ and $E_{j}$ into $\mathbb{E}$ (line 2). Since both hyperedges are on different sides of $S$ , Algorithm 2 then also adds the edge $E_{i}E_{j}$ to $G$ (line 2).

We now assume that $E_{i}E_{j}$ is an edge of $G$ . Note that Algorithm 2 only adds edges to $G$ in line 2. Thus, there is a separator $S\in\mathcal{S}$ for which the algorithm adds $E_{i}E_{j}$ to $G$ . For that $S$ , one of $E_{i}$ and $E_{j}$ is in $\mathbb{E}_{1}$ and the other is in $\mathbb{E}_{2}$ (line 2) and, hence, $E_{i}$ and $E_{j}$ are on different sides of $S$ in $T$ (line 2). This implies that $S\subseteq S_{i}$ and $S\subseteq S_{j}$ (line 2 and line 2). Therefore, by Lemma 3, there is a join tree for $H$ with the edge $E_{i}E_{j}$ . $\square$

Proof (Complexity)

Creating a join tree for a given acyclic hypergraph $H$ can be implemented in $\mathcal{O}(N)$ time [27]. Modifying that join tree (thereby computing $T$ ) and computing $\mathcal{S}(H)$ using Algorithm 1 can also be done in $\mathcal{O}(N)$ time (Lemma 4). Thus, line 2 runs in total $\mathcal{O}(N)$ time. Computing the subset graph $G_{\mathcal{S}}$ in line 2 requires $\mathcal{O}\bigl{(}T_{\mathcal{A}}(H)\bigr{)}$ time. Since the hyperedges of $H$ form the vertices of $G$ and since $G$ is created without edges, line 2 runs in $\mathcal{O}(m)$ time.

We show next that a single iteration of the loop starting in line 2 runs in $\mathcal{O}\bigl{(}|\mathbb{E}_{1}|\cdot|\mathbb{E}_{2}|\bigr{)}$ time. That is, the runtime for a single iteration is (asymptotically) equivalent to the number of edges of $G$ created. Note that each iteration creates at least one such edge, namely the edge in $T$ that $S$ represents. Additionally, Lemma 3 and Lemma 6 imply that each edge $E_{i}E_{j}$ is added at most twice to $G$ . Therefore, line 2 to line 2 run in $\mathcal{O}\bigl{(}|G|\bigr{)}$ total time.

For a separator $S\in\mathcal{S}$ , let $\mathbb{S}$ denote the set of separators $S^{\prime}$ with $S\subseteq S^{\prime}$ . Since the subset graph $G_{\mathcal{S}}$ is given, one can compute $\mathbb{S}$ (line 2) in $\mathcal{O}\bigl{(}|\mathbb{S}|\bigr{)}$ time by determining all incoming edges of $S$ in $G_{\mathcal{S}}$ . For each $S^{\prime}\in\mathbb{S}$ , the algorithm adds, in line 2, exactly one hyperedge into $\mathbb{E}$ plus one additional hyperedge for $S$ . Thus, $|\mathbb{E}|=|\mathbb{S}|+1$ .

One can determine the hyperedges $E$ and $E^{\prime}$ that form a separator $S^{\prime}$ , which one is farther from $S$ , and on which side of $S$ they are in $T$ as follows. When creating $S^{\prime}$ , add a reference to both hyperedges and include which is the parent and which is the child in $T$ . Now assume that each $S^{\prime}$ is also a node of $T$ adjacent to $E$ and $E^{\prime}$ . Root $T$ in an arbitrary hyperedge, run a pre-order and post-order on $T$ , and let $\mathrm{pre}(x)$ and $\mathrm{post}(x)$ be the indices of a node $x$ in that respective order. For two distinct nodes $x$ and $y$ of $T$ (representing either separators or hyperedges), $x$ is then a descendant of $y$ if and only if $\mathrm{pre}(x)>\mathrm{pre}(y)$ and $\mathrm{post}(x)<\mathrm{post}(y)$ . There are four cases when determining which of $E$ and $E^{\prime}$ to add into $\mathbb{E}$ : if $S$ and $S^{\prime}$ represent the same edge of $T$ , add both hyperedges; if $S^{\prime}$ is a descendant of $S$ , add the child-hyperedge; if $S^{\prime}$ is an ancestor of $S$ , add the parent-hyperedge; and if $S^{\prime}$ is neither an ancestor nor a descendant of $S$ , add the child-hyperedge. Clearly, one side of $S$ contains all its descendants and the other side all remaining hyperedges and separators. That allows us, after a $\mathcal{O}(m)$ -time preprocessing, to determine in constant time on which side of $S$ a give a hyperedge is. Therefore, line 2 and line 2 run in $\mathcal{O}\bigl{(}|\mathbb{E}|\bigr{)}$ time.

Line 2 clearly runs in $\mathcal{O}\bigl{(}|\mathbb{E}_{1}|\cdot|\mathbb{E}_{2}|\bigr{)}$ time. Recall that $|\mathbb{S}|+1=|\mathbb{E}|=|\mathbb{E}_{1}|+|\mathbb{E}_{2}|$ . Therefore, a single iteration of the loop starting in line 2 also runs in $\mathcal{O}\bigl{(}|\mathbb{E}_{1}|\cdot|\mathbb{E}_{2}|\bigr{)}$ time. $\square$

Recall that there is an algorithm which computes the subset graph for any given hypergraph in $\mathcal{O}\bigl{(}N^{2}/\log N\bigr{)}$ time [26]. Thus, we have the following.

Theorem 3.4

There is an algorithm that computes the union join graph $G$ of an acyclic hypergraph in $\mathcal{O}\bigl{(}N^{2}/\log N+|G|\bigr{)}$ time.

The upper bound of at most $\Theta\bigl{(}N^{2}/\log^{2}N\bigr{)}$ many edges for any subset graph [26] does not apply to union join graphs. Consider a hypergraph $H=(V,\mathcal{E})$ with $V=\{u,v_{1},\ldots,v_{n}\}$ and $\mathcal{E}=\{\,E_{i}\mid 1\leq i\leq n\,\}$ where $E_{i}=\{u,v_{i}\}$ . Note that $N=2n$ and that each tree with $\mathcal{E}$ as nodes is a valid join tree for $H$ . Hence, the union join graph of $H$ is a complete graph with $\Theta\bigl{(}N^{2}\bigr{)}$ edges.

3.3 Notes on the Sperner Family Problem and its Generalisation

An interesting question that remains is the complexity of solving the Sperner Family problem for acyclic hypergraphs and hypertrees. We first answer this question for hypertrees.

Theorem 3.5

If the SETH is true, then there is no algorithm which solves the Sperner Family problem for a given hypertree in $\mathcal{O}\big{(}N^{2-\varepsilon}\big{)}$ time.

Proof

We prove the theorem by making a simple linear-time reduction. Consider a family $\mathcal{F}=\{S_{1},S_{2},\ldots,S_{m}\}$ of sets. Create a new vertex $u$ and add it to each set $S_{i}\in\mathcal{F}$ . Let $S^{\prime}_{i}$ be the resulting set. The family $\mathcal{F}^{\prime}=\{S^{\prime}_{1},S^{\prime}_{2},\ldots,S^{\prime}_{m}\}$ then forms a hypertree. Clearly, adding $u$ to each set does not change any subset relations. Therefore, $\mathcal{F}$ contains two distinct sets $S_{i}$ and $S_{j}$ with $S_{i}\subseteq S_{j}$ if and only if $\mathcal{F}^{\prime}$ contains two distingt sets $S^{\prime}_{i}$ and $S^{\prime}_{j}$ with $S^{\prime}_{i}\subseteq S^{\prime}_{j}$ . $\square$

For acyclic hypergraphs we have the following result.

Theorem 3.6

There is a linear-time algorithm to solve the Sperner Family problem for acyclic hypergraphs.

Proof

Let $H$ be an acyclic hypergraph with a join tree $T$ . We first show that the following are equivalent: (i) $H$ has two distinct hyperedges $E_{i}$ and $E_{j}$ with $E_{i}\subseteq E_{j}$ , and (ii) $T$ has an edge $EE^{\prime}$ with $E\subseteq E^{\prime}$ . Clearly, (ii) implies (i). To show that (i) implies (ii), let $E_{i}$ and $E_{j}$ be two distinct hyperedges of $H$ with $E_{i}\subseteq E_{j}$ . It follows from the definition of join trees that $E_{i}\subseteq E_{k}$ for each hyperedge $E_{k}$ on the path from $E_{i}$ to $E_{j}$ in $T$ . Therefore, $T$ contains an edge $E_{i}E_{k}$ with $E_{i}\subseteq E_{k}$ and, thus, (i) is equivalent to (ii).

Let $EE^{\prime}$ be an edge of $T$ with $E\subseteq E^{\prime}$ , and let $S=E\cap E^{\prime}$ be the separator corresponding to that edge. Note that $|S|=|E|$ in such a case. Hence, it follows that (i) is true if and only if $H$ admits a separator $S=E\cap E^{\prime}$ such that $|S|=|E|$ or $|S|=|E^{\prime}|$ .

We can now solve the Sperner Family problem for a given acyclic hypergraph $H$ in linear time as follows. Construct the separator hypergraph for $H$ (see Algorithm 1). For each separator $S=E\cap E^{\prime}$ , determine if $|S|=|E|$ or $|S|=|E^{\prime}|$ . In that case, return True. Otherwise, if no such separator is found, return False. $\square$

Theorem 3.2 and Theorem 3.6 together give an interesting observation: Let $\mathcal{C}$ he a class of hypergraphs. Existence of an algorithm that solves the Sperner Family Problem for $\mathcal{C}$ in truly subquadratic time does not imply that there is such an algorithm to compute a subset graph for hypergraphs in $\mathcal{C}$ , even if the resulting graph is sparse.

One can generalise the Sperner Family Problem as follows: How many pairs of distinct sets $S_{i},S_{j}$ with $S_{i}\subseteq S_{j}$ does a given a family $\mathcal{F}$ contain? Let $p$ be that number. The Sperner Family problem is then equal to the question whether or not $p\geq 1$ . Thus, Theorem 3.6 gives linear-time algorithm to determine if $p\geq 1$ for acyclic hypergraphs. The reduction for Lemma 2, however, implies that there is no truly subquadratic-time algorithm that determines if $p\geq m$ . What remains an open question is the required runtime to determine if $p\geq k$ for any fixed $k$ with $1<k<m$ .

4 $\beta$ -Acyclic Hypergraphs

A hypergraph $H=(V,\mathcal{E})$ is $\beta$ -acyclic if each subset of $\mathcal{E}$ forms an acyclic hypergraph. They are also known as totally balanced hypergraphs [11]. See [16] for more definitions. In this section, we present an algorithm to compute the subset graph $G$ of $\beta$ -acyclic hypergraphs in $\mathcal{O}\bigl{(}N\log(n+m)+|G|\bigr{)}$ time. Afterwards, we show that one can use that algorithm together with Algorithm 2 to compute the union join graph in the same amount of time.

4.1 Constructing the Subset Graph

A matrix is binary if its entries are either $0$ or $1$ . The binary matrix $\bigl{[}\begin{smallmatrix}1&1\\ 1&0\end{smallmatrix}\bigr{]}$ is called $\mathrm{\Gamma}$ . A matrix is $\mathrm{\Gamma}$ -free if it contains no $\mathrm{\Gamma}$ as submatrix. Note that the rows and columns which form a $\mathrm{\Gamma}$ submatrix do not need to be adjacent in the original matrix. One can use a binary $n\times m$ matrix $M$ to represent a given hypergraph $H=(V,\mathcal{E})$ as follows. Let each row $i$ represent a vertex $v_{i}\in V$ and each column $j$ represent a hyperedge $E_{j}\in\mathcal{E}$ . An entry $M_{i,j}$ is then $1$ if and only if $v_{i}\in E_{j}$ . That matrix is called the incidence matrix of $H$ .

A matrix is doubly lexically ordered if rows and columns are permuted in such a way that rows vectors and columns are both in non-decreasing lexicographic order (rows from left to right and columns from top to bottom). Within a row, priorities of entries are decreasing from right to left, and, within a column, priorities of entries are decreasing from bottom to top. One can compute such an ordering in $\mathcal{O}\bigl{(}N\log(n+m)\bigr{)}$ time [24]. Note that the algorithm in [24] does not compute the actual matrix; it only computes the corresponding ordering of vertices and hyperedges, thereby avoiding a quadratic runtime.

Lemma 7

[4, 11] A hypergraph is $\beta$ -acyclic if and only if its doubly lexically ordered incidence matrix is $\mathrm{\Gamma}$ -free.

For the remainder of this subsection, assume that we are given a $\beta$ -acyclic hypergraph $H=(V,\mathcal{E})$ . Let $M$ be a doubly lexically ordered (hence, $\mathrm{\Gamma}$ -free) incidence matrix for $H$ . We assume that we know the ordering of vertices and hyperedges in $M$ , even though we are not given $M$ itself. For two hyperedges $E_{i}$ and $E_{j}$ of $H$ , we say $E_{i}\preceq E_{j}$ if the column of $E_{i}$ is lexicographically smaller than or equal to the column of $E_{j}$ with respect to $M$ . Accordingly, we write $E_{i}\prec E_{j}$ to exclude equality.

Lemma 8

Let $E_{i}$ and $E_{j}$ be two hyperedges of $H$ and let $v$ be the vertex in $E_{i}$ which is earliest in the doubly lexical ordering (i. e., highest in $M$ ). Then, $E_{i}\subseteq E_{j}$ if and only if $E_{i}\preceq E_{j}$ and $v\in E_{j}$ .

Proof

We first show that $E_{i}\preceq E_{j}$ and $v\in E_{j}$ implies $E_{i}\subseteq E_{j}$ . Clearly, $E_{i}\subseteq E_{j}$ if $E_{i}=E_{j}$ . Assume now that $E_{i}\nsubseteq E_{j}$ , i. e., $E_{i}$ contains a vertex $u\notin E_{j}$ . By definition of $v$ , $u$ is lower in $M$ than $v$ . $E_{i}\preceq E_{j}$ and $v\in E_{j}$ then imply that $E_{i}$ , $E_{j}$ , $u$ , and $v$ form a $\mathrm{\Gamma}$ in $M$ . That contradicts with $M$ being $\mathrm{\Gamma}$ -free (see Lemma 7). Therefore, $E_{i}\succ E_{j}$ or $v\notin E_{j}$ .

Clearly, $v\notin E_{j}$ implies $E_{i}\nsubseteq E_{j}$ . Now assume that $E_{i}\succ E_{j}$ . Since $E_{i}\neq E_{j}$ , there is a lowest vertex $u$ in $M$ which is in one of these hyperedges but not in both. Recall that $M$ is ordered lexicographically. Therefore, $E_{i}\succ E_{j}$ implies that $u\in E_{i}$ ( $1$ in $M$ ) and $u\notin E_{j}$ ( $0$ in $M$ ), i. e., $E_{i}\nsubseteq E_{j}$ . $\square$

Lemma 8 allows to compute the subset graph $G$ of a $\beta$ -acyclic hypergraph as follows. First, find doubly lexicographical ordering of vertices and hyperedges. For each hyperedge $E$ , determine all hyperedges $E^{\prime}$ with $E\preceq E^{\prime}$ which contain $v$ as defined in Lemma 8. Then, add the edge $(E,E^{\prime})$ to $G$ for each such pair $E$ and $E^{\prime}$ . Algorithm 3 implements that approach.

Input: A

\beta

-acyclic hypergraph

H=(V,\mathcal{E})

Output: The subset graph

G

H

4Find doubly lexicographical ordering

\sigma

of vertices and hyperedges (see [24]) and order the adjacency list of

\mathcal{I}(H)

according to

\sigma

5Create a new directed graph

G=(\mathcal{E},E_{G})

with

E_{G}=\emptyset

6foreach $E\in\mathcal{E}$ do

7 Let

v

be the vertex in

E

which is first in

\sigma

8 foreach hyperedge $E^{\prime}$ containing $v$ with $E\preceq E^{\prime}$ do (

\preceq

with respect to

\sigma

)

9 Add

(E,E^{\prime})

E_{G}

Algorithm 3 Computes the subset graph of a given

\beta

-acyclic hypergraph.

Theorem 4.1

Algorithm 3 computes the subset graph $G$ of a given $\beta$ -acyclic hypergraph in $\mathcal{O}\bigl{(}N\log(n+m)+|G|\bigr{)}$ time.

Proof (Correctness)

We show the correctness of Algorithm 3 by showing that $G$ contains an edge $(E_{i},E_{j})$ if and only if $E_{i}\subseteq E_{j}$ . First assume that $G$ contains an edge $(E_{i},E_{j})$ . Note that Algorithm 3 only adds $(E_{i},E_{j})$ to $G$ (line 3) if $E_{i}\preceq E_{j}$ and $v\in E_{j}$ (line 3). Therefore, due to Lemma 8, $G$ containing an edge $(E_{i},E_{j})$ implies $E_{i}\subseteq E_{j}$ .

Next, assume that $H$ contains two hyperedges $E_{i}$ and $E_{j}$ with $E_{i}\subseteq E_{j}$ . It then follows from Lemma 8 that $E_{i}\preceq E_{j}$ and $v\in E_{j}$ . Since the algorithm checks all pairs of hyperedges satisfying this condition (line 3 and line 3), it eventually finds $E_{i}$ and $E_{j}$ and adds $(E_{i},E_{j})$ as edge to $G$ (line 3). $\square$

Proof (Complexity)

One can compute a doubly lexicographical ordering $\sigma$ (line 3) in $\mathcal{O}\bigl{(}N\log(n+m)\bigr{)}$ time [24]. Creating the graph $G$ (line 3) can easily be done in $\mathcal{O}(m)$ time.

There are various ways to then order the adjacency list of $\mathcal{I}(H)$ (line 3) in $\mathcal{O}(N)$ time. One option is to reconstruct $\mathcal{I}(H)$ as follows. Iterate over all vertices $v$ of $H$ as ordered in $\sigma$ . For each hyperedge $E$ containing $v$ , add $v$ to the new list of $E$ . Afterwards, the list of vertices in $E$ is ordered with respect to $\sigma$ . The same approach (with hyperedges and vertices swapped) allows to sort, for each vertex $v$ , the list of hyperedges containing it.

We now show that the loop starting in line 3 runs is in $\mathcal{O}\bigl{(}|G|\bigr{)}$ total time. Note that the hyperedges $\mathcal{E}$ form the vertices of $G$ . Hence, there is exactly one iteration of the loop starting in line 3 for each vertex of $G$ . Since the adjacency list of $\mathcal{I}(H)$ is ordered according to $\sigma$ (line 3), we can determine $v$ (line 3) in constant time for each hyperedge $E$ . For the same reason, we can determine the set $\mathbb{E}_{v}=\{\,E^{\prime}\in\mathcal{E}\mid v\in E^{\prime},E\preceq E^{\prime}\,\}$ in $\mathcal{O}\bigl{(}|\mathbb{E}_{v}|\bigr{)}$ time by iterating backwards over the hyperedges containing $v$ . Since we add exactly one edge $(E,E^{\prime})$ to $G$ for each $E^{\prime}$ in such $\mathbb{E}_{v}$ , line 3 and line 3 run in $\mathcal{O}\bigl{(}|G|\bigr{)}$ total time. $\square$

4.2 Constructing the Union Join Graph

We now address how to compute the union join graph for $\beta$ -acyclic hypergraphs. For that, we do not present a new algorithm. Instead, we show that one can use Algorithm 2 together with Algorithm 3. This is possible due to Lemma 9 below.

Lemma 9

If a hypergraph is $\beta$ -acyclic, then its separator hypergraph is $\beta$ -acyclic, too.

Before proving Lemma 9, we need a few auxiliary definitions. Assume we are given a graph $G=(V,E)$ . A clique is a set of vertices of $G$ such that all these vertices are pairwise adjacent. Such a clique $K$ is maximal if no vertex in $G$ is adjacent to all vertices in $K$ . For a cycle $\langle v_{1},v_{2},\ldots,v_{k},v_{1}\rangle$ , a chord is an edge between two non-consecutive vertices. A graph is called chordal if each cycle with four or more vertices has a chord. A hypergraph $H$ is conformal if, for each maximal clique $K$ of $\operatorname{2Sec}Sec(H)$ , $H$ contains a hyperedge $E$ with $K\subseteq E$ .

Proof (Lemma 9)

Let $H=(V,\mathcal{E})$ be a $\beta$ -acyclic hypergraph with a join tree $T$ and let $\mathcal{S}$ be the hyperedges of $\mathcal{S}(H)$ . To prove that $\mathcal{S}(H)$ is $\beta$ -acyclic, we show that each $\mathcal{S}^{\prime}\subseteq\mathcal{S}$ forms an acyclic hypergraph. It is known that a hypergraph $H$ is acyclic if and only if $H$ is conformal and $\operatorname{2Sec}Sec(H)$ is chordal [3]. It is therefore sufficient for us to show that each $\mathcal{S}^{\prime}$ is conformal and its 2-section graph is chordal.

We start by showing that $G_{2}=\operatorname{2Sec}Sec\bigl{(}\mathcal{S}^{\prime}\bigr{)}$ is chordal. Let us assume that $G_{2}$ is not chordal. It then contains a chordless cycle $C=\{v_{1},v_{2},\ldots,v_{k}\}$ with $k\geq 4$ . Since each edge $v_{i}v_{i+1}$ of $G_{2}$ implies that its vertices are in a common hyperedge, there is a sequence of separators $\sigma=\langle S_{1},S_{2},\ldots S_{k}\rangle$ that form $C$ in $G_{2}$ . In particular, we have that $v_{i}\in S_{i}\cap S_{i+1}$ (with index arithmetic modulo $k$ ). Recall that each separator $S_{i}$ corresponds to an edge of $T$ . Let $T_{\sigma}$ be the smallest subtree of $T$ that contains all separators of $\sigma$ . Note that $v_{i}\notin S_{j}$ for all $j\notin\{i,i+1\}$ ; otherwise $C$ would have a chord. Therefore, by properties of join trees, there is no $i$ and $j$ such that $S_{j}$ separates $S_{i}$ and $S_{i+1}$ in $T$ . Hence, each $S_{i}$ in $\sigma$ corresponds to a leaf $E_{i}$ of $T_{\sigma}$ . By properties of join trees, $E_{i}\cap E_{j}=S_{i}\cap S_{j}$ . Hence, $\operatorname{2Sec}Sec\bigl{(}\{E_{1},E_{2},\ldots,E_{k}\}\bigr{)}$ is not chordal implying that $\{E_{1},E_{2},\ldots,E_{k}\}$ does not form an acyclic hypergraph. This contradicts with $H$ being $\beta$ -acyclic. Therefore, $\operatorname{2Sec}Sec\bigl{(}\mathcal{S}^{\prime}\bigr{)}$ is chordal.

We now show that each $\mathcal{S}^{\prime}$ forms a conformal hypergraph. Gilmore’s Theorem [4] states that a hypergraph $H$ is conformal if and only if, for all its hyperedges $E_{1}$ , $E_{2}$ , and $E_{3}$ , $H$ contains a hyperedge $E$ such that $(E_{1}\cap E_{2})\cup(E_{2}\cap E_{3})\cup(E_{1}\cap E_{3})\subseteq E$ . $\mathcal{S}^{\prime}$ is therefore clearly conformal if $|\mathcal{S}^{\prime}|\leq 2$ . Now let $|\mathcal{S}^{\prime}|\geq 3$ and let $S_{1}$ , $S_{2}$ , and $S_{3}$ be three hyperedges in $\mathcal{S}^{\prime}$ . We distinguish between two cases. Case 1: $S_{1}$ , $S_{2}$ , and $S_{3}$ are on a path in $T$ . Without loss of generality, let $S_{2}$ be between $S_{1}$ and $S_{3}$ . Then, by properties of join trees, $S_{1}\cap S_{3}\subseteq S_{2}$ . Case 2: There is a hyperedge $E$ in $H$ such that $S_{1}$ , $S_{2}$ , and $S_{3}$ are in different subtrees when removing $E$ from $T$ . For all $i\in\{1,2,3\}$ , let $S_{i}$ represent the edge $E_{i}E^{\prime}_{i}$ of $T$ and let $E^{\prime}_{i}$ be closer to $E$ in $T$ than $E_{i}$ . Since $H$ is $\beta$ -acyclic, the set $\{E_{1},E_{2},E_{3}\}$ also forms an acyclic and, hence, conformal hypergraph. Without loss of generality, let $E_{3}$ be the hyperedge that satisfies Gilmore’s Theorem for $\{E_{1},E_{2},E_{3}\}$ , i. e., let $E_{1}\cap E_{2}\subseteq E_{3}$ . Note that, by properties of join trees, $E_{i}\cap E_{j}=S_{i}\cap S_{j}$ for all $i,j\in\{1,2,3\}$ , and $v\in S_{3}$ if $v\in(E_{1}\cup E_{2})\cap E_{3}$ . Therefore, $S_{1}\cap S_{2}\subseteq S_{3}$ . Overall, it then follows each $\mathcal{S}^{\prime}$ forms a conformal hypergraph. $\square$

Due to Lemma 9, we can conclude this section as follows.

Theorem 4.2

There is an algorithm that computes the union join graph $G$ of a given $\beta$ -acyclic hypergraph in $\mathcal{O}\bigl{(}N\log(n+m)+|G|\bigr{)}$ time.

Proof

Let $H$ be the given hypergraph. Because the separator hypergraph $\mathcal{S}(H)$ is $\beta$ -acyclic (Lemma 9), we can use Algorithm 3 to compute its subset graph $G^{\prime}$ in $\mathcal{O}\bigl{(}N\log(n+m)+|G^{\prime}|\bigr{)}$ time. Thus, when using $H$ and Algorithm 3 as input, Algorithm 2 computes the union join graph $G$ of $H$ in $\mathcal{O}\bigl{(}N\log(n+m)+|G^{\prime}|+|G|\bigr{)}$ time. Consider again line 2 to line 2 of Algorithm 2. Note that for each edge $(S,S^{\prime})$ of $G^{\prime}$ , there is at least one edge added to $G$ . It follows that $|G^{\prime}|\leq|G|$ and, therefore, one can compute $G$ in $\mathcal{O}\bigl{(}N\log(n+m)+|G|\bigr{)}$ total time. $\square$

5 $\gamma$ -Acyclic Hypergraphs

In [16], Fagin gives various definitions of $\gamma$ -acyclic hypergraphs and presents a polynomial-time recognition algorithm for them. The definition for $\gamma$ -acyclic hypergraphs we give below uses a strong relation between these hypergraphs and distance-hereditary graphs. Before that, we give a few auxiliary definitions and an interesting property of distance-hereditary graphs.

Let $G=(V,E)$ be a connected, undirected, and simple graph without loops or multiple edges. The open and closed neighbourhood of a vertex $v\in V$ are respectively defined as $N(v)=\{\,u\mid uv\in E\,\}$ and $N[v]=N(v)\cup\{v\}$ . A vertex $v$ is pendant if $\bigl{|}N(v)\bigr{|}=1$ . Two vertices $u$ and $v$ are false twins if $N(u)=N(v)$ , and are true twins if $N[u]=N[v]$ . A graph $G$ is distance-hereditary if every induced subgraph is distance preserving, i. e., the distance between two vertices $u$ and $v$ remains the same in every connected induced subgraph of $G$ that contains $u$ and $v$ .

An ordering $\sigma=\langle v_{1},v_{2},\ldots,v_{n}\rangle$ for a graph $G$ is called a pruning sequence if each $v_{i}$ with $i>1$ satisfies one of the following conditions in the subraph of $G$ induced by $\{v_{1},v_{2},\ldots,v_{i}\}$ : (i) $v_{i}$ is pendant, (ii) $v_{i}$ is a true twin of some vertex $v_{j}$ , or (iii) $v_{i}$ is a false twin of some vertex $v_{j}$ . A graph $G$ is distance-hereditary if and only if $G$ admits a pruning sequence [2].

The recognition algorithm in [16] decides whether or not a given hypergraph is $\gamma$ -acyclic by determining if the corresponding incidence graph admits a pruning sequence. Additionally, Ausiello et al. [1] show that the incidence graphs of $\gamma$ -acyclic hypergraphs are so-called (6, 2)-chordal bipartite graph which are known to be equivalent to bipartite distance-hereditary graphs [10]. Therefore, we can define $\gamma$ -acyclic hypergraphs as follows.

Corollary 1

[1, 10] [16] A hypergraph is $\gamma$ -acyclic if and only if its incidence graph is distance-hereditary.

5.1 Constructing the Union Join Graph

Lemma 10

An acyclic hypergraph is $\gamma$ -acyclic if and only if its line graph is isomorphic to its union join graph.

Proof

Let $H$ be an acyclic hypergraph with two distinct hyperedges $E_{i}$ and $E_{j}$ , and let $G$ be the union join graph of $H$ . Consider the following statements: (i) $E_{i}E_{j}$ is an edge of $L(H)$ , (ii) $E_{i}\cap E_{j}\neq\emptyset$ , (iii) $E_{i}\cap E_{j}$ separates $E_{i}\setminus E_{j}$ from $E_{j}\setminus E_{i}$ , and (iv) $E_{i}E_{j}$ is an edge of $G$ . Due to definitions and due to Lemma 3, it follows that (i) is equivalent to (ii), that (iii) implies (ii), and that (iii) is equivalent to (iv).

To prove Lemma 10, we first assume that $H$ is $\gamma$ -acyclic. It is know [16] that a hypergraph is $\gamma$ -acyclic if and only if (ii) implies (iii) for all distinct hyperedges $E_{i}$ and $E_{j}$ . Therefore, if $H$ is $\gamma$ -acyclic, the statements (i), (ii), (iii), and (iv) are equivalent and, subsequently, $L(H)=G$ .

Now assume that $L(H)=G$ , i. e., that (i) and (iv) are equivalent for all distinct hyperedges $E_{i}$ and $E_{j}$ . It then follows that (ii) and (iii) are equivalent and, as a result, that (ii) implies (iii). The same observation from [16] then implies that $H$ is $\gamma$ -acyclic if $L(H)=G$ . $\square$

Theorem 5.1

There is an algorithm that computes the union join graph $G$ of a given $\gamma$ -acyclic hypergraph in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time.

Proof

Due to Lemma 10, we can compute the union join graph $G$ of a $\gamma$ -acyclic hypergraph $H$ by computing its line graph. Note that, by definition, $L(H)=\operatorname{2Sec}Sec(H^{*})$ . It follows from Corollary 1 that the dual hypergraph $H^{*}$ is $\gamma$ -acyclic too. Therefore, we can compute $G=\operatorname{2Sec}Sec(H^{*})$ as follows.

Let $T$ be a join tree of $H^{*}$ rooted in an arbitrary node. Process each hyperedge of $H^{*}$ according to a pre-order on $T$ . When processing a hyperedge $E$ of $H^{*}$ , pick a vertex $v\in E$ that has not been flagged, make $v$ adjacent (in $G$ ) to all flagged vertices in $E$ , and then flag $v$ . Repeat that until all vertices in $E$ are flagged and afterwards continue with the next hyperedge.

By flagging vertices, we ensure that an edge $uv$ is added to $G$ at most once even if both vertices are together in multiple hyperedges of $H^{*}$ . Therefore, since we can construct $T$ in $\mathcal{O}(N)$ time [27], we can construct $G$ from $H$ in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time. $\square$

5.2 Constructing the Subset Graph

5.2.1 Bachman Diagrams.

Consider a hypergraph $H=(V,\mathcal{E})$ , let $\mathcal{E}^{\prime}$ be a subset of $\mathcal{E}$ , and let $\mathfrak{X}$ be the intersection of all hyperedges in $\mathcal{E}^{\prime}$ . We then define $\mathcal{X}$ as the set of all such $\mathfrak{X}$ which are non-empty, i. e.,

\mathcal{X}=\bigcup_{\mathcal{E}^{\prime}\subseteq\mathcal{E}}\Bigl{\{}\,\mathfrak{X}\Bigm{|}{\textstyle\mathfrak{X}=\bigcap_{E\in\mathcal{E}^{\prime}}E},\mathfrak{X}\neq\emptyset\,\Bigr{\}}.

The Bachman diagram $\mathcal{B}(H)$ of $H$ is a directed graph with the node set $\mathcal{X}$ such that there is an edge from $\mathfrak{X}$ to $\mathfrak{Y}$ if $\mathfrak{X}\supset\mathfrak{Y}$ and there is no $\mathfrak{Z}$ with $\mathfrak{X}\supset\mathfrak{Z}\supset\mathfrak{Y}$ . Note that, if $H$ contains two distinct hyperedges $E_{i}$ and $E_{j}$ with the same vertices, they are represented by the same node in $\mathcal{B}(H)$ .

Lemma 11

[16] A hypergraph is $\gamma$ -acyclic if and only if its Bachman diagram forms a tree.

In a Bachman diagram $\mathcal{B}(H)$ as defined above, a vertex $v$ of $H$ is often contained in multiple nodes. A technique from [28] allows us to construct a more compact representation of $\mathcal{B}(H)$ . Let $N(\mathfrak{X})$ be the set of nodes $\mathfrak{Y}$ such that $(\mathfrak{X},\mathfrak{Y})$ is an edge of $\mathcal{B}(H)$ . We then define the label of $\mathfrak{X}$ as $\ell(\mathfrak{X}):=\mathfrak{X}\setminus\bigcup_{\mathfrak{Y}\in N(\mathfrak{X})}\mathfrak{Y}$ . As a result, a vertex $v$ of $H$ is only in the label of the “smallest” node $\mathfrak{Y}$ containing it. Consider now a Bachman diagram $\mathcal{B}(H)$ with the node set $\mathcal{X}$ where we replace each node $\mathfrak{X}\in\mathcal{X}$ with $X=\ell(\mathfrak{X})$ . We call the resulting graph $B$ a simplified Bachman diagram of $H$ . Figure 2 gives an example.

(a)

(b)

Figure 2: The Bachman diagram LABEL:sub@fig:Bachman and its simplified version LABEL:sub@fig:BachmanSimple for a

\gamma

-acyclic hypergraph

H

with the hyperedges

\{a,b,c\}

\{a,d\}

\{b,c\}

, and

\{c,e,f\}

. Nodes which represent hyperedges of

H

are drawn as rectangles; other nodes are drawn as circles.

Let $B$ be a simplified Bachman diagram for a hypergraph $H=(V,\mathcal{E})$ . We use the following functions and notations when working with $B$ and $H$ . The function $\phi$ maps $\mathcal{E}$ onto the nodes of $B$ such that $\phi(E)$ returns the node which represents $E$ . Accordingly, we define $\Phi(X):=\bigl{\{}\,E\in\mathcal{E}\bigm{|}\phi(E)=X\,\}$ for all nodes $X$ of $B$ . Similar to $\phi$ , we define $\psi$ as a function that maps $V$ onto the nodes of $B$ such that $\psi(v)$ returns the node which contains $v$ . For two nodes $X$ and $Y$ , we write $X\leadsto Y$ to state that there is a path form $X$ to $Y$ in $B$ . Note that we assume that $X\leadsto X$ . Lastly, we define $\mathbb{V}(X)=\{\,v\in Y\mid X\leadsto Y\,\}$ . Note that $\mathbb{V}$ is effectively the inverse of the label function $\ell$ we used above.

5.2.2 Subset Graph via Simplified Bachman Diagrams.

We can make the following observation: For two hyperedges $E_{i}$ and $E_{j}$ of $H$ , $E_{i}\subseteq E_{j}$ if and only if $\phi(E_{j})\leadsto\phi(E_{i})$ in the (simplified) Bachman diagram of $H$ . In the remainder of this subsection, we present algorithms which first constructs a simplified Bachman diagram for a given $\gamma$ -acyclic hypergraph $H$ and then uses the previous observation to compute the subset graph $G$ of $H$ in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time.

To the best of our knowledge, there exist only two published algorithms which compute (simplified) Bachman diagrams. Kumar et al. [22] present an $\mathcal{O}\bigl{(}nm^{2}\bigr{)}$ -time algorithm to compute a Bachman diagram for a $\gamma$ -acyclic database schema. Uehara and Uno [28] present a linear-time algorithm that computes a simplified Bachman diagram for the maximal cliques of a ptolemaic graph; these cliques form a $\gamma$ -acyclic hypergraph [11]. Using that algorithm would require us to first compute the 2-section graph of $H$ . That may result in overall quadratic runtime for some hypergraphs. We therefore use neither of these algorithms. Instead, we present a new algorithm which computes a simplified Bachman diagram for a given $\gamma$ -acyclic hypergraph in $\mathcal{O}(N)$ time.

Recall that the incidence graph of a $\gamma$ -acyclic hypergraph $H$ is distance-hereditary. It therefore admits a pruning sequence $\sigma=\langle x_{1},x_{2},\ldots,x_{n+m}\rangle$ . Note that each $x_{i}$ in $\sigma$ can represent either a vertex or a hyperedge of $H$ . The idea for our algorithm is to iterate over $\sigma$ and to step by step construct $B$ . For that, let $\mathcal{I}_{i}$ denote the subgraph of $\mathcal{I}(H)$ induced by $\{x_{1},x_{2},\ldots,x_{i}\}$ .

We start the construction with $x_{1}$ and $x_{2}$ . Note that one of them has to represent a vertex $v$ and the other a hyperedge $E$ of $H$ . Therefore, we initialise $B$ with a single node $X=\{v\}$ and set $\phi(E):=X$ and $\psi(v):=X$ .

Next, we iterate over $\sigma$ , starting with $x_{3}$ . Since incidence graphs are bipartite, it is never the case that $x_{i}$ is the true twin of some $x_{j}$ (with the exception of $i=2$ ). Hence, we have four possible cases for each $x_{i}$ : (i) $x_{i}$ represents a vertex of $H$ and is a false twin in $\mathcal{I}_{i}$ , (ii) $x_{i}$ represents a hyperedge and is a false twin, (iii) $x_{i}$ represents a vertex and is pendant, or (iv) $x_{i}$ represents a hyperedge and is pendant.

If $x_{i}$ is a twin (cases i and ii), the idea is to make the new vertex or hyperedge behave as its twin. For a vertex $v$ , that means to add $v$ into the same node of $B$ . In case of a hyperedge $E$ , it is represented by the same node of $B$ as its twin.

If $x_{i}$ is pendant, adding it may affect the structure of $B$ . For example, let $x_{i}$ represent a vertex $v$ added to a hyperedge $E$ (case iii). If, with respect to $\mathcal{I}_{i-1}$ , $E$ is not subset of another hyperedge (including not being a twin), then we can simply add $v$ into $\phi(E)$ . However, if $E$ is subset of some hyperedge, it is no longer after adding $v$ . We subsequently need to update the structure of $B$ . To do so, we add a new node $Y$ , make it the representative of $E$ , and add an edge from $Y$ to the node of $B$ which previously represented $E$ . We handle case iv in a similar way.

Algorithm 4 implements the approach above and describes in detail how to handle each of the four cases for $x_{i}$ .

Input: A

\gamma

-acyclic hypergraph

H=(V,\mathcal{E})

Output: A simplified Bachman diagram

B

for

H

4Compute a pruning sequence

\sigma=\langle x_{1},x_{2},\ldots,x_{n+m}\rangle

for

\mathcal{I}(H)

(see [9]).

5Create a new empty graph

B

6Let

x_{1}

and

x_{2}

represent the vertex

v

and hyperedge

E

H

. Create a new set

X=\{v\}

, add it as node to

B

, and set

\phi(E):=X

and

\psi(v):=X

7for $i:=3$ to $n+m$ do

8 if $x_{i}$ represents a vertex $v\in V$ and is a false twin in $\mathcal{I}_{i}$ then

9 Let

u

be the vertex represented by a twin of

x_{i}

\mathcal{I}_{i}

and let

X=\psi(u)

10 Add

v

into

X

, i. e., set

\psi(v):=X

and

X:=X\cup\{v\}

13 if $x_{i}$ represents a hyperedge $E\in\mathcal{E}$ and is a false twin in $\mathcal{I}_{i}$ then

14 Let

E^{\prime}

be the hyperedge represented by a twin of

x_{i}

\mathcal{I}_{i}

15 Set

\phi(E):=\phi(E^{\prime})

18 if $x_{i}$ represents a vertex $v\in V$ and is pendant in $\mathcal{I}_{i}$ then

19 Let

E

be the hyperedge represented by the neighbour of

x_{i}

\mathcal{I}_{i}

and let

X=\phi(E)

20 if $\bigl{|}\Phi(X)\bigr{|}=1$ and $X$ has no incoming edges in $B$ then

21 Add

v

into

X

, i. e., set

\psi(v):=X

and

X:=X\cup\{v\}

22 else

23 Create a new set

Y=\{v\}

, add it as node to

B

, set

\psi(v):=Y

and

\phi(E):=Y

, and add the edges

(Y,X)

B

27 if $x_{i}$ represents a hyperedge $E\in\mathcal{E}$ and is pendant in $\mathcal{I}_{i}$ then

28 Let

v

be the vertex represented by the neighbour of

x_{i}

\mathcal{I}_{i}

and let

X=\psi(v)

29 if $|X|=1$ and $X$ has no outgoing edges in $B$ then

30 Set

\phi(E):=X

31 else

32 Create a new set

Y=\{v\}

, add it as node to

B

, set

X:=X\setminus\{v\}

, set

\psi(v):=Y

and

\phi(E):=Y

, and add the edge

(X,Y)

B

Algorithm 4 Computes the Bachman diagram for a given

\gamma

-acyclic hypergraph.

Lemma 12

Algorithm 4 computes a simplified Bachman diagram for a given $\gamma$ -acyclic hypergraph in linear time.

Proof (Correctness)

We start by showing that $B$ forms a tree. Algorithm 4 starts constructing $B$ with a single node (line 4). Whenever the algorithm adds a new node to $B$ (line 4 and line 4), it is incident to exactly one edge. Additionally, no other edge is ever added to or removed from $B$ . Therefore, $B$ forms a tree.

To show that $B$ is a simplified Bachman diagram for $H$ , we show that it satisfies the following two properties:

(1)

For each vertex $v$ of $H$ , $B$ contains exactly one node $X$ with $v\in X$ ; additionally, $\psi(v)=X$ .
(2)

There is a bijection $f$ mapping $\mathcal{X}$ onto the nodes of $B$ such that ((2.)) $f(\mathfrak{X})=X$ if and only if $\mathfrak{X}=\mathbb{V}(X)$ , and ((2.)) $\mathfrak{X}=E$ for some hyperedge $E$ implies $f(\mathfrak{X})=\phi(E)$ .

Property 2 ensures that the nodes of $B$ represent the nodes of a Bachman diagram for $H$ . Property 1 then enforces that the nodes of $B$ are connected properly. Without it, one could satisfy 2 by constructing a graph $B=(\mathcal{X},\emptyset)$ . Additionally, since $B$ forms a tree, it does not contain transitive edges.

Observe that whenever a new vertex $v$ is added (lines 4, 4, 4, and 4), Algorithm 4 adds it into a node $X$ and sets $\psi(v)$ accordingly. In the case that an existing vertex $v$ is added into a new node $Y$ (line 4), the algorithms removes it from its previous node $X$ and updates $\psi(v)$ accordingly. Therefore, the graph $B$ constructed by Algorithm 4 satisfies property 1.

In the remainder of this proof, we show that $B$ satisfies property 2 via an induction over $i$ . For that purpose, let $H_{i}$ denote the hypergraph formed by $\mathcal{I}_{i}$ and let $B_{i}$ be graph constructed after processing $x_{i}$ . We also use subscript $i$ to indicate that we refer to a version of a set, node, hyperedge, or function with respect to $B_{i}$ or $H_{i}$ ; for larger expressions $\varepsilon$ , we may write $[\varepsilon]_{i}$ .

Since $H_{2}$ has only one hyperedge and one vertex, $B_{2}$ (constructed in line 4) is clearly a simplified Bachman diagram for $H_{2}$ and satisfies property 2. In the following, we therefore assume that $i\geq 3$ and that $B_{i-1}$ satisfies property 2. We distinguish between four possible cases for $x_{i}$ .

Case i: $x_{i}$ represents a vertex $v\in V$ and is a false twin in $\mathcal{I}_{i}$ .

Let $u$ be the vertex represented by a twin of $x_{i}$ . Since $u$ and $v$ are twins, it follows that $v\in E_{i}$ if and only if $u\in E_{i-1}$ for each hyperedge $E$ of $H_{i}$ Subsequently, the only change to $\mathcal{X}$ is that $v$ is added to the sets $\mathfrak{X}$ which contain $u$ . That is, for each $\mathfrak{X}\in\mathcal{X}_{i}$ , $\mathfrak{X}_{i}=\mathfrak{X}_{i-1}$ if $u\notin\mathfrak{X}_{i-1}$ and $\mathfrak{X}_{i}=\mathfrak{X}_{i-1}\cup\{u\}$ if $u\in\mathfrak{X}_{i-1}$ . Observe that the algorithm neither adds any nodes nor any edges to the graph. It only adds $v$ into $\psi(u)_{i-1}$ . Hence, for each node $X$ of $B_{i}$ , $\mathbb{V}(X)_{i}=\mathbb{V}(X)_{i-1}$ if $u\notin\mathbb{V}(X)_{i-1}$ and $\mathbb{V}(X)_{i}=\mathbb{V}(X)_{i-1}\cup\{v\}$ if $u\in\mathbb{V}(X)_{i-1}$ . Therefore, $B_{i}$ satisfies property 2.

Case ii: $x_{i}$ represents a hyperedge $E\in\mathcal{E}$ and is a false twin in $\mathcal{I}_{i}$ .

Recall that we defined Bachman diagrams and the family $\mathcal{X}$ in such a way that $\mathcal{X}$ does not contain two equal sets, even if $H$ contains multiple equal hyperedges. Hence, adding a hyperedge $E$ which is equivalent to an existing hyperedge $E^{\prime}$ does neither change $\mathcal{X}$ nor any of the sets contained in it. It follows that setting $\phi(E)_{i}=\phi(E^{\prime})_{i-1}$ is the only change needed for $B_{i}$ to satisfy property 2 (otherwise $B_{i}$ would violate 2(2.)). Algorithm 4 does exactly that in line 4.

Case iii: $x_{i}$ represents a vertex $v\in V$ and is pendant in $\mathcal{I}_{i}$ .

Let $E$ be the hyperedge represented by the neighbour of $x_{i}$ in $\mathcal{I}_{i}$ (i. e., $E_{i}=E_{i-1}$ ), and let $X=\phi_{i-1}(E)$ . Assume that, for each hyperedge $E^{\prime}$ of $H_{i-1}$ which is distinct from $E$ , $E_{i-1}\nsubseteq E^{\prime}_{i-1}$ . In that case, $\bigl{|}\Phi(X)\bigr{|}_{i-1}=1$ and $X$ has incoming edge in $B_{i-1}$ . (As result, Algorithm 4 calls line 4.) Since $v$ is only added into $E$ , $\mathcal{X}_{i}$ is almost identical to $\mathcal{X}_{i-1}$ except that the set $\mathfrak{X}$ which represents $E$ now contains $v$ . Because $X$ has incoming edge in $B_{i-1}$ , adding $v$ into it (line 4) does not affect other nodes. In particular, $\mathbb{V}(Y)_{i}=\mathbb{V}(Y)_{i-1}$ for all nodes $Y$ of $B_{i}$ which are distinct from $X$ , and $\mathbb{V}(X)_{i}=\mathbb{V}(X)_{i-1}\cup\{v\}$ . Therefore, $B_{i}$ satisfies property 2.

Assume now that $H_{i-1}$ contains a hyperedge $E^{\prime}$ distinct from $E$ with $E_{i-1}\subseteq E^{\prime}_{i-1}$ . In that case, $\bigl{[}\phi(E^{\prime})\leadsto X\bigr{]}_{i-1}$ and, thus, $\bigl{|}\Phi(X)\bigr{|}_{i-1}>1$ (if $\phi(E^{\prime})_{i-1}=X$ ) or $X$ has incoming edge in $B_{i-1}$ . (As result, Algorithm 4 calls line 4.) Since $v$ is only added into $E$ but not $E^{\prime}$ , $E_{i}\nsubseteq E^{\prime}_{i}$ . However, for all $\mathcal{E}^{\prime}\subseteq\mathcal{E}_{i}$ with $E\in\mathcal{E}^{\prime}$ and $|\mathcal{E}^{\prime}|>1$ , $\Bigl{[}\bigcap_{E\in\mathcal{E}^{\prime}}E\Bigr{]}_{i}=\Bigl{[}\bigcap_{E\in\mathcal{E}^{\prime}}E\Bigr{]}_{i-1}$ . Therefore, $\mathcal{X}_{i}=\mathcal{X}_{i-1}\cup\{\mathfrak{Y}\}$ with $\mathfrak{Y}_{i}=E_{i}$ . For each $\mathfrak{X}\in\mathcal{X}_{i-1}$ , let $f_{i}(\mathfrak{X})=f_{i-1}(\mathfrak{X})$ . Additionally, let $f_{i}(\mathfrak{Y})=Y$ where $Y=\{v\}$ is the node added to $B$ in line 4. Thus, $f_{i}$ is a bijection mapping $\mathcal{X}_{i}$ onto the nodes of $B_{i}$ . Since the added edge $(Y,X)$ points towards $X$ , $\mathbb{V}(Z)_{i}=\mathbb{V}(Z)_{i-1}$ for all nodes $Z$ of $B_{i-1}$ and $\mathbb{V}(Y)_{i}=\mathfrak{Y}$ . Hence, $B_{i}$ satisfies property 2(2.). Additionally, since the algorithm also sets $\phi(E)_{i}=Y$ , $B_{i}$ also satisfies property 2(2.).

Case iv: $x_{i}$ represents a hyperedge $E\in\mathcal{E}$ and is pendant in $\mathcal{I}_{i}$ .

Let $v$ be the vertex represented by the neighbour of $x_{i}$ in $\mathcal{I}_{i}$ (i. e., $E_{i}=\{v\}$ ), and let $X=\psi(v)_{i-1}$ . Assume that $\mathcal{X}_{i-1}$ contains a set $\mathfrak{X}$ with $\mathfrak{X}_{i-1}=\{v\}$ . In that case, adding $E$ does neither change $\mathcal{X}$ nor any of the sets contained in it. Additionally, $|X_{i}|=1$ and $X$ has no outgoing edges in $B_{i-1}$ . It follows that setting $\phi(E)_{i}=X$ is the only change needed for $B_{i}$ to satisfy property 2 (similar to case ii). Algorithm 4 does exactly that in line 4.

Assume now that, for each set $\mathfrak{X}\in\mathcal{X}_{i-1}$ , $\mathfrak{X}_{i-1}\neq\{v\}$ . In that case, $\mathcal{X}_{i}=\mathcal{X}_{i-1}\cup\{\mathfrak{Y}\}$ with $\mathfrak{Y}_{i}=E_{i}=\{v\}$ . Additionally, $|X_{i}|>1$ or $X$ has an outgoing edge in $B_{i-1}$ . Let $f_{i}(\mathfrak{X})=f_{i-1}(\mathfrak{X})$ for each $\mathfrak{X}\in\mathcal{X}_{i-1}$ , and let $f_{i}(\mathfrak{Y})=Y$ where $Y=\{v\}$ is the node added to $B$ in line 4. Thus, $f_{i}$ is a bijection mapping $\mathcal{X}_{i}$ onto the nodes of $B_{i}$ . Note that Algorithm 4 (in line 4) moves $v$ from node $X$ into the new node $Y$ . However, since the added edge $(X,Y)$ points towards $Y$ , $\mathbb{V}(Z)_{i}=\mathbb{V}(Z)_{i-1}$ for all nodes $Z$ of $B_{i-1}$ and $\mathbb{V}(Y)_{i}=\mathfrak{Y}$ . Therefore, due to the algorithm setting $\phi(E)_{i}=Y$ , $B_{i}$ satisfies property 2. $\square$

Proof (Complexity)

One can compute a pruning sequence for a given distance-hereditary graph in linear time [9] and, thus, for $\mathcal{I}(H)$ (line 4) in $\mathcal{O}(N)$ time. Creating $B$ and adding the first node (lines 4 and 4) can then be done in constant time. For each node $X$ of $B$ , we create two lists. One stores the vertices in $X$ and one the hyperedges in $\Phi(X)$ . For the functions $\phi$ and $\psi$ , we store the node $X$ they map on and a reference to where the hyperedge or vertex is stored in the corresponding list of $X$ . That way, we can perform each of the following operations in constant time: adding new nodes and edges into $B$ (lines 4 and 4), assigning a hyperedge to a node and setting $\phi$ (lines 4, 4, and 4), changing the assignment of a hyperedge to a different node and updating $\phi$ (line 4), adding a vertex into a node and setting $\psi$ (lines 4, 4, and 4), and moving a vertex from one node into another and updating $\psi$ (line 4). Therefore, each iteration of the loop starting in line 4 run in constant time and, subsequently, Algorithm 4 run in overall linear time. $\square$

Lemma 13

Each node $X$ of $B$ with $\Phi(X)=\emptyset$ has an in-degree of at least $2$ .

Proof

We first assume that $X$ has in-degree $0$ . Then, there is no hyperedge $E$ with $\phi(E)\leadsto X$ and, subsequently, no such $E$ with $\mathbb{V}(X)\subseteq E$ . That contradicts with the definition of Bachman diagrams.

Now assume that $X$ has at least one incoming edge $(Y,X)$ . Let $\mathcal{E}_{X}=\bigl{\{}\,E\bigm{|}\phi(E)\leadsto X\,\bigr{\}}$ and $\mathcal{E}_{Y}=\bigl{\{}\,E\bigm{|}\phi(E)\leadsto Y\,\bigr{\}}$ . Since $\bigcap_{E\in\mathcal{E}_{X}}E=\mathbb{V}(X)\subset\mathbb{V}(Y)=\bigcap_{E\in\mathcal{E}_{Y}}$ , there is a hyperedge $E\in\mathcal{E}_{X}\setminus\mathcal{E}_{Y}$ with $\phi(E)\leadsto X$ and $\phi(E)\not\leadsto Y$ . Hence, since $\phi(E)\neq X$ , there is a path from $\phi(E)$ to $X$ in $B$ that does not contain $Y$ and, therefore, $X$ has an in-degree of at least $2$ . $\square$

Input: A

\gamma

-acyclic hypergraph

H=(V,\mathcal{E})

Output: A subset graph

G

for

H

4Compute a simplified Bachman diagram

B

for

H

with the corresponding functions

\phi

and

\Phi

(see Algorithm 4).

5Create a new directed graph

G=(\mathcal{E},\emptyset)

6foreach $E\in\mathcal{E}$ do

7 Let

X=\phi(E)

. Compute

\mathbb{E}_{X}=\bigcup_{Y\leadsto X}\Phi(Y)

8 For each

E^{\prime}\in\mathbb{E}_{X}

distinct from

E

, add the edge

(E,E^{\prime})

G

Algorithm 5 Computes a subset graph for a given

\gamma

-acyclic hypergraph.

Theorem 5.2

Algorithm 5 computes the subset graph $G$ of a given $\gamma$ -acyclic hypergraph in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time.

Proof (Correctness)

Let $E$ and $E^{\prime}$ be two distinct hyperedges of $H$ . By definition of (simplified) Bachman diagrams, $B$ (computed in line 5) contains two nodes $X=\phi(E)$ and $Y=\phi(E^{\prime})$ such that $Y\leadsto X$ if and only if $E\subseteq E^{\prime}$ . Additionally, Algorithm 5 adds the edge $(E,E^{\prime})$ to $G$ (line 5) if and only if $Y\leadsto X$ . Therefore, for any distinct hyperedges $E$ and $E^{\prime}$ of $H$ , $(E,E^{\prime})$ is an edge of $G$ if and only if $E\subseteq E^{\prime}$ . $\square$

Proof (Complexity)

Computing the simplified Bachman diagram $B$ (line 5) can be done in $\mathcal{O}(N)$ time (Lemma 12). Creating the graph $G$ (line 5) can be done in $\mathcal{O}(m)$ time. Additionally, once the sets $\mathbb{E}_{X}$ are known for all $X$ , we can add the edges of $G$ (line 5) in $\mathcal{O}\bigl{(}|G|\bigr{)}$ total time. It remains to show that we can compute the sets $\mathbb{E}_{X}$ in the desired runtime. To do that, we show that we can compute $\mathbb{E}_{X}$ for a given $X$ in $\mathcal{O}\bigl{(}|\mathbb{E}_{X}|\bigr{)}$ time.

Recall that $B$ is a directed graph which forms a tree (Lemma 11). Hence, the the nodes of $B$ from which there is a path to $X$ form a tree $T_{X}$ rooted in $X$ where each edge points from a child to its parent. One can compute $T_{X}$ in $\mathcal{O}\bigl{(}|T_{X}|\bigr{)}$ time by, for example, reversing the edges of $B$ and then performing a BFS or DFS starting at $X$ .

Assume that we partition the nodes of $T_{X}$ into two sets $\mathbb{Y}_{\!X}$ and $\mathbb{Z}_{X}$ where $\mathbb{Y}_{\!X}=\bigl{\{}\,Y\bigm{|}\Phi(Y)\neq\emptyset\,\}$ and $\mathbb{Z}_{X}$ contains all remaining nodes. It follows from Lemma 13 that each node $Y$ of $T_{X}$ with at most one child (including leaves) is in $\mathbb{Y}_{\!X}$ , and each node in $\mathbb{Z}_{X}$ has at least two children. Now assume that we, step by step, remove each node $Y$ from $T_{X}$ which has exactly one child $Y^{\prime}$ , and make $Y^{\prime}$ the child of $Y$ ’s parent. Let $T^{\prime}_{X}$ be the resulting tree. Each node of $T^{\prime}_{X}$ then has at least two children. Thus, at least half of the nodes of $T^{\prime}_{X}$ are leaves. Since each leaf is in $\mathbb{Y}_{\!X}$ and $T^{\prime}_{X}$ contains all nodes in $\mathbb{Z}_{X}$ , it follows that $|\mathbb{Z}_{X}|\leq|\mathbb{Y}_{\!X}|$ and, subsequently, $|T_{X}|\in\Theta\bigl{(}|\mathbb{Y}_{\!X}|\bigr{)}$ .

Recall that $\Phi(Y)\neq\emptyset$ for all $Y\in\mathbb{Y}_{\!X}$ and that each hyperedge of $H$ is associated with at most one such $Y$ . It follows that $|\mathbb{Y}_{\!X}|\leq|\mathbb{E}_{X}|$ . Therefore, we can compute $\mathbb{E}_{X}$ for a given $X$ in $\mathcal{O}\bigl{(}|\mathbb{E}_{X}|\bigr{)}$ time, and line 5 runs in $\mathcal{O}\bigl{(}|G|\bigr{)}$ total time. $\square$

6 Interval Hypergraphs

An acyclic hypergraph $H=(V,\mathcal{E})$ is an interval hypergraph if it admits a join tree that forms a path. That is, there is an order $\sigma=\langle E_{1},E_{2},\ldots,E_{m}\rangle$ for the hyperedges of $H$ such that, for each vertex $v\in V$ , $v\in E_{i}\cap E_{j}$ implies that $v\in E_{k}$ for all $k$ with $i\leq k\leq j$ . Interval hypergraphs are closely related to interval graphs which are a subset of chordal graphs. In particular, a graph is an interval graph if and only if its maximal cliques form an interval hypergraph, and an acyclic hypergraph is an interval hypergraph if and only if its 2-section graph is an interval graph.

Algorithm 9 in [19] allows to recognise interval hypergraphs in linear time. It also produces an order $\sigma$ as defined above. Note that the first step of that algorithm is to compute a clique tree $T$ and a vertex ordering $\phi$ for a given graph. We replace that step by first computing a join tree $T$ of the given hypergraph and then perform Algorithm 10 from [19] to compute $\phi$ .

There are multiple ways to compute the subset graph and union join graph once $\sigma$ is known for a given hypergraph $H$ . One may order the vertices of $H$ based on the right-most hyperedge containing them (with respect to $\sigma$ ). Note that we can compute such an ordering in linear time from $\sigma$ . Both orders together then form a doubly lexically order and allow to construct a $\mathrm{\Gamma}$ -free matrix for $H$ . Note that Algorithm 3 and the algorithm described in Theorem 4.2 only have a logarithmic overhead in runtime because they compute a doubly lexically order. If such an order is given, both algorithm run in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time.

For an alternative approach, we first determine for each vertex $v$ the index of the left-most hyperedge containing it (with respect to $\sigma$ ). Let $\phi(v)$ be that number, i. e., if $E_{i}$ is the left-most hyperedge containing $v$ , then $\phi(v)=i$ . Next, we compute the separators between consecutive hyperedges (see Algorithm 1). Let $S_{i}$ denote the separator between $E_{i-1}$ and $E_{i}$ and let $\phi(S_{i})=\max_{v\in S_{i}}\phi(v)$ . Then, for each $E_{j}$ with $j<i$ , it holds that (i) $E_{j}\supseteq E_{i}$ if and only if $|E_{i}|=|S_{i}|$ and $j\geq\phi(S_{i})$ , and (ii) $E_{i}E_{j}$ is an edge of the union join graph of $H$ if and only if $j\geq\phi(S_{i})$ . Running the same approach again using the reverse of $\sigma$ therefore allows to compute the subset graph and union join graph in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time.

Theorem 6.1

There is an algorithm that computes the union join graph and subset graph of a given interval hypergraph in $\mathcal{O}\bigl{(}N+|G|\bigr{)}$ time, respectively, where $|G|$ is the size of the computed graph.

Acknowledgements

We would like to thank Feodor F. Dragan and Rachel Walker for stimulating discussions.

References

[1] Ausiello, G., D’Atri, A., Moscarini, M.: Chordality properties on graphs and minimal conceptual connections in semantic data models. Journal of Computer and System Sciences 33 (2), 179–202, 1986.
[2] Bandelt, H.-J., Mulder, H.M., Distance-hereditary graphs. Journal of Combinatorial Theory, Series B 41, 182–208, 1986.
[3] Beeri, C., Fagin, R., Maier, D., Yannakakis, M.: On the Desirability of Acyclic Database Schemes. In: Journal of the ACM 30, 479–513, 1983.
[4] Berge, C.: Hypergraphs: Combinatorics of Finite Sets. Elsevier Publishing Co., North-Holland, 1989.
[5] Berry, A., Simonet, G.: Computing the atom graph of a graph and the union join graph of a hypergraph. CoRR abs/1607.02911, 2016.
[6] Borassi, M., Crescenzi, P., Habib, M.: Into the Square: On the Complexity of Some Quadratic-time Solvable Problems. Electronic Notes in Theoretical Computer Science 322, 51–67, 2016.
[7] Brandstädt, A., Dragan, F.F.: Tree-Structured Graphs. In Thulasiraman, K., Arumugam, S., Brandstädt, A., Nishizeki, T. (Eds.): Handbook of Graph Theory, Combinatorial Optimization, and Algorithms, 751–826, CRC Press, 2015.
[8] Brandstädt, A., Dragan, F.F., Chepoi, V., Voloshin, V.I.: Dually Chordal Graphs. SIAM Journal on Discrete Mathematics 11 (3), 437–455, 1998.
[9] Damiand, G., Habib, M., Paul, C., A simple paradigm for graph recognition: application to cographs and distance hereditary graphs. Theoretical Computer Science 263 (1-2), 99–111, 2001.
[10] D’Atri, A., Moscarini, M.: Distance-hereditary graphs, Steiner trees, and connected domination. SIAM Journal of Computing 17 (3), 521–538, 1988.
[11] D’Atri, A., Moscarini, M.: On hypergraph acyclicity and graph chordality. Information Processing Letters 29, 271–274, 1988.
[12] Dourisboure, Y.: Compact Routing Schemes for Generalised Chordal Graphs. Journal of Graph Algorithms and Applications 9 (2), 277–297, 2005.
[13] Dourisboure, Y., Dragan, F.F., Gavoille, C., Yan, C.: Spanners for bounded tree-length graphs. Theoretical Computer Science 383 (1), 34–44, 2007.
[14] Dragan, F.F., Köhler, E.: An Approximation Algorithm for the Tree t-Spanner Problem on Unweighted Graphs via Generalized Chordal Graphs. Algorithmica 69 (4), 884–905, 2014.
[15] Elmasry, A.: Computing the subset partial order for dense families of sets. Information Processing Letters 109, 1082–1086, 2009.
[16] Fagin, R.: Degrees of Acyclicity for hypergraphs and relational database schemes. Journal of the ACM 30, 514–550, 1983.
[17] Galinier, P., Habib, M., Paul, C.: Chordal Graphs and Their Clique Graphs. WG 1995, Lecture Notes in Computer Science 1017, 358–371, 1995.
[18] Gavril, F.: The intersection graphs of subtrees in trees are exactly the chordal graphs. Journal of Combinatorial Theory, Series B 16 (1), 47–56, 1974.
[19] Habib, M., McConnell, R., Paul, C., Viennot, L.: Lex-BFS and partition refinement, with applications to transitive orientation, interval graph recognition and consecutive ones testing. Theoretical Computer Science 234 (1–2), 59–84, 2000.
[20] Habib, M., Stacho, J.: Reduced clique graphs of chordal graphs. European Journal of Combinatorics 33 (5), 712–735, 2012.
[21] Kaba, B., Pinet, N., Lelandais, G., and Berry, B.: Clustering gene expression data using graph separators. In Silico Biology 7 (0031), 2007.
[22] Kumar, T.V.V., Shridhar, A., Ghoshal, A.: Computing full disjunction using COJO. Information Technology and Management 10 (1), 3–20, 2009.
[23] Leimer, H.-G.: Optimal decomposition by clique separators. Discrete Mathematics 113 (1–3), 99–123, 1993.
[24] Paige, R., Tarjan, R.E.: Three Partition Refinement Algorithms. SIAM Journal of Computing 16 (6), 973–989, 1987.
[25] Pritchard, p.: Opportunistic algorithms for eliminating supersets. ActaInformatica 28, 733–754, 1991.
[26] Pritchard, P.: On Computing the Subset Graph of a Collection of Sets. Journal of Algorithms 33 (2), 187–203, 1999.
[27] Tarjan, R.E., Yannakakis, M.: Simple Linear-Time Algorithms to Test Chordality of Graphs, Test Acyclicity of Hypergraphs, and Selectively Reduce Acyclic Hypergraphs. SIAM Journal of Computing 13 (3), 566–579, 1984.
[28] Uehara, R., Uno, Y.: Laminar structure of ptolemaic graphs with applications. Discrete Applied Mathematics 157, 1533–1543, 2009.

Computing the Union Join and Subset Graph of Acyclic Hypergraphs in Subquadratic Time

Abstract

1 Introduction

1.1 Acyclic Hypergraphs

1.2 Union Join Graph

1.3 Subset Graph

1.4 Our Contribution

2 Preliminaries

3 α\alpha-Acyclic Hypergraphs

3.1 Hardness Results

Lemma 1

Proof

Lemma 2

Proof

Theorem 3.1

Theorem 3.2

Proof

3.1.1 Note on Hypertrees.

3.2 Union Join Graph via Subset Graph

Lemma 3

Proof

Lemma 4

Proof

Lemma 5

Proof

Lemma 6

Proof

Theorem 3.3

Proof (Correctness)

Proof (Complexity)

Theorem 3.4

3.3 Notes on the Sperner Family Problem and its Generalisation

Theorem 3.5

Proof

Theorem 3.6

Proof

4 β\beta-Acyclic Hypergraphs

4.1 Constructing the Subset Graph

Lemma 7

Lemma 8

Proof

Theorem 4.1

Proof (Correctness)

Proof (Complexity)

4.2 Constructing the Union Join Graph

Lemma 9

Proof (Lemma 9)

Theorem 4.2

Proof

5 γ\gamma-Acyclic Hypergraphs

Corollary 1

5.1 Constructing the Union Join Graph

Lemma 10

Proof

Theorem 5.1

Proof

5.2 Constructing the Subset Graph

5.2.1 Bachman Diagrams.

Lemma 11

5.2.2 Subset Graph via Simplified Bachman Diagrams.

Lemma 12

Proof (Correctness)

Case i: xix_{i} represents a vertex v∈Vv\in V and is a false twin in ℐi\mathcal{I}_{i}.

Case ii: xix_{i} represents a hyperedge E∈ℰE\in\mathcal{E} and is a false twin in ℐi\mathcal{I}_{i}.

Case iii: xix_{i} represents a vertex v∈Vv\in V and is pendant in ℐi\mathcal{I}_{i}.

Case iv: xix_{i} represents a hyperedge E∈ℰE\in\mathcal{E} and is pendant in ℐi\mathcal{I}_{i}.

Proof (Complexity)

Lemma 13

Proof

Theorem 5.2

Proof (Correctness)

Proof (Complexity)

6 Interval Hypergraphs

Theorem 6.1

Acknowledgements

References

3 $\alpha$ -Acyclic Hypergraphs

4 $\beta$ -Acyclic Hypergraphs

5 $\gamma$ -Acyclic Hypergraphs

Case i: $x_{i}$ represents a vertex $v\in V$ and is a false twin in $\mathcal{I}_{i}$ .

Case ii: $x_{i}$ represents a hyperedge $E\in\mathcal{E}$ and is a false twin in $\mathcal{I}_{i}$ .

Case iii: $x_{i}$ represents a vertex $v\in V$ and is pendant in $\mathcal{I}_{i}$ .

Case iv: $x_{i}$ represents a hyperedge $E\in\mathcal{E}$ and is pendant in $\mathcal{I}_{i}$ .