This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

On the existence of funneled orientations for classes of rooted phylogenetic networks

Janosch Döcker and Simone Linz School of Computer Science, University of Auckland, Auckland, New Zealand [email protected] School of Computer Science, University of Auckland, Auckland, New Zealand [email protected]
Abstract.

Recently, there has been a growing interest in the relationships between unrooted and rooted phylogenetic networks. In this context, a natural question to ask is if an unrooted phylogenetic network 𝒰{\mathcal{U}} can be oriented as a rooted phylogenetic network such that the latter satisfies certain structural properties. In a recent preprint, Bulteau et al. claim that it is computational hard to decide if 𝒰{\mathcal{U}} has a funneled (resp. funneled tree-child) orientation, for when the internal vertices of 𝒰{\mathcal{U}} have degree at most 5. Unfortunately, the proof of their funneled tree-child result appears to be incorrect. In this paper, we present a corrected proof and show that hardness remains for other popular classes of rooted phylogenetic networks such as funneled normal and funneled reticulation-visible. Additionally, our results hold regardless of whether 𝒰{\mathcal{U}} is rooted at an existing vertex or by subdividing an edge with the root.

Key words and phrases:
graph orientation, network classes, phylogenetic network, rooted, unrooted
We thank the New Zealand Marsden Fund for their financial support.

1. Introduction

Phylogenetic networks are commonly used to represent evolutionary relationships between taxa such as species, individuals of a population, or viruses. In general terms, phylogenetic networks are graphs whose vertices represent taxa and edges represent inferred evolutionary relationships. Unrooted phylogenetic networks are undirected graphs that do not contain any explicit information about the direction of evolution such as ancestor-descendant relationships. To include such information, rooted phylogenetic networks are used which are directed acyclic graphs with a single source (the root) and, as in the unrooted case, leaves representing extant taxa, and internal vertices representing hypothetical or extinct taxa. In addition, there also exists a wide range of well-studied classes of phylogenetic networks that are each characterised by certain structural properties such as level-kk and tree-based networks, which have been defined for rooted and unrooted phylogenetic networks, as well as tree-child and reticulation-visible networks, which have been defined for rooted phylogenetic networks only. For a comprehensive overview of phylogenetic network classes, we refer the interested reader to [9]. Network classes that are used throughout this paper are formally defined in the next section.

Initiated by a paper by Huber et al. [6], there has recently been a growing interest in the relationships between unrooted and rooted phylogenetic networks. More specifically, the authors have investigated several questions in the context of orienting an unrooted phylogenetic network 𝒰{\mathcal{U}} as a rooted phylogenetic network. The process of orienting 𝒰{\mathcal{U}} consists of subdividing an edge of 𝒰{\mathcal{U}} with a new vertex ρ\rho and assigning a direction to each edge such that the resulting directed graph is a rooted phylogenetic network with root ρ\rho. Given 𝒰{\mathcal{U}}, results for the following decision problems have been established in [6]:

  1. (i)

    Constrained Orientation: If the desired in-degree for each vertex of 𝒰{\mathcal{U}} is given as well as an edge of 𝒰{\mathcal{U}} to be subdivided with ρ\rho, can 𝒰{\mathcal{U}} be oriented as a rooted phylogenetic network with root ρ\rho that satisfies all in-degree constraints?

  2. (ii)

    𝒞{\mathcal{C}}-Orientation: Can 𝒰{\mathcal{U}} be oriented such that the resulting digraph belongs to a given class 𝒞{\mathcal{C}} of phylogenetic networks?

More specifically, Huber et al. [6] present a polynomial-time algorithm for Constrained Orientation that computes an orientation of 𝒰{\mathcal{U}} satisfying the given in-degree constraints or outputs that there is none. Furthermore, they have shown that, if 𝒰{\mathcal{U}} is binary (i.e., each internal vertex has degree 3), then it is NP-complete to decide if 𝒰{\mathcal{U}} has a tree-based orientation and, lastly, if 𝒰{\mathcal{U}} is binary and 𝒞{\mathcal{C}} satisfies certain properties, then 𝒞{\mathcal{C}}-Orientation is fixed-parameter tractable with respect to the so-called level of 𝒰{\mathcal{U}}. Notably, the latter result includes the popular class of binary tree-child networks. In the same paper, the authors also state two open questions.

  1. Q1

    Given an unrooted binary phylogenetic network 𝒰{\mathcal{U}}, can it be decided in polynomial time if 𝒰{\mathcal{U}} has a tree-child orientation?

  2. Q2

    Given an unrooted phylogenetic network 𝒰{\mathcal{U}}, can it be decided in polynomial time if 𝒰{\mathcal{U}} has a funneled orientation?

The term funneled was coined by Huber et al. [6, p. 26] and refers to the restriction that each reticulation (i.e., a vertex of a rooted phylogenetic network with in-degree at least 2) has out-degree 1. As noted by the authors, it is common in the literature to define reticulations in this way.

Recently Bulteau et al. [1, Cor. 4] have shown that it is NP-complete to decide if 𝒰{\mathcal{U}} has a funneled orientation if each vertex of 𝒰{\mathcal{U}} has degree at most 5 and a vertex of 𝒰{\mathcal{U}} is chosen as root (instead of subdividing an edge of 𝒰{\mathcal{U}} with the root). We remark that Bulteau et al. refer to a funneled orientation as a valid orientation and that their construction to establish NP-completeness allows for vertices of degree 2 in 𝒰{\mathcal{U}} which are explicitly excluded in the definition of such a network in [6] as well as in most other literature on phylogenetic networks because degree-2 vertices do not carry any biological meaning. Relatedly, Garvardt et al. [5] have analysed the parameterised complexity of a variant of the Constraint Orientation problem, where the in-degree of each vertex vv is not fixed but can take on a value of a list that is associated with vv. On the positive side, Bulteau et al. [1, Cor. 3] have shown that Q2 can be answered affirmatively if 𝒰{\mathcal{U}} has degree at most 3. In particular, they have established a linear-time algorithm that computes a funneled orientation of 𝒰{\mathcal{U}} if it exists or, otherwise, that correctly concludes that no such orientation exists. This last result is in line with an earlier result by Janssen et al. [8, Lem. 4.13 and 4.14] who characterised those unrooted binary phylogenetic networks that can be oriented as rooted binary phylogenetic networks. Inspecting the proof of their characterisation [8, Lem. 4.13], their result also yields a polynomial-time algorithm for deciding if an unrooted binary phylogenetic network can be oriented as a rooted binary phylogenetic network.

Turning to Q1, Maeda et al. [11] have recently presented several necessary conditions for when an unrooted binary phylogenetic network has a tree-child orientation. It is worth noting that, in the binary case, any orientation as a rooted phylogenetic network is also funneled. Moreover, given an unrooted phylogenetic network 𝒰{\mathcal{U}} whose non-leaf vertices have degree at least 2 and at most 5 and given a designated root vertex vv of 𝒰{\mathcal{U}}, Bulteau et al. [1, Cor. 5] claim that it is NP-hard to decide if 𝒰{\mathcal{U}} has a funneled tree-child orientation with root vv. Unfortunately, as we will show later, the proof of this corollary is incorrect. In summary, Q1 remains open.

In this paper, we make a step towards answering Q1 by presenting a corrected proof that establishes NP-completeness for deciding if an unrooted phylogenetic network with maximum degree 5 has a funneled tree-child orientation. In comparison to Bulteau et al. [1], our result does not require a designated root to be part of the input and our construction yields unrooted phylogenetic networks without degree-2 vertices. Although our result builds on Bulteau et al., correcting their proof requires novel gadgets and careful reasoning. We also show that our result extends to several other classes of funneled orientations such as funneled tree-sibling and funneled normal networks.

The remainder of the paper is organised as follows. Section 2 contains definitions and formal problem statements. In Section 3, we show that, although the proof by Bulteau et al. [1, Cor. 5] is incorrect, the result that it is NP-complete to decide if an unrooted phylogenetic network 𝒰{\mathcal{U}} with degree at most 5 has a funneled tree-child orientation can be recovered. We then show in Section 4 that hardness remains if each non-leaf vertex of 𝒰{\mathcal{U}} has degree exactly 5. This latter result is subsequently used to establish NP-completeness for deciding if 𝒰{\mathcal{U}} has a funneled tree-sibling, funneled reticulation-visible, or funneled normal orientation. We remark that all results hold under two rooting variants: Variant AA as introduced by Huber et al. [6] subdivides an edge of 𝒰{\mathcal{U}} with the root, and Variant BB chooses a vertex of 𝒰{\mathcal{U}} to be the root. If 𝒰{\mathcal{U}} is binary, one typically chooses a leaf of 𝒰{\mathcal{U}} to be the root since, otherwise, the resulting orientation is not binary. Indeed, all constructions presented in this paper choose a leaf of 𝒰{\mathcal{U}} as the root when establishing hardness under Variant BB.

2. Preliminaries

In this section, we introduce notation and terminology that is used throughout the rest of the paper, and formally state the decision problems whose computational complexity we investigate in subsequent sections.

2.1. Phylogenetic networks.

Let XX be a non-empty finite set. An unrooted phylogenetic network 𝒰{\mathcal{U}} on XX is a simple undirected graph with no degree-2 vertex and with a bijection between the vertices in 𝒰{\mathcal{U}} that have degree 1 and the set XX, that is, the leaves of 𝒰{\mathcal{U}} are bijectively labelled with elements from XX. We will freely refer to leaves using their labels. Two distinct vertices vv and ww of 𝒰{\mathcal{U}} are called neighbours if {v,w}\{v,w\} is an edge in 𝒰{\mathcal{U}}.

A rooted phylogenetic network 𝒩{\mathcal{N}} on XX is a directed acyclic graph with no loop and no parallel arcs that satisfies the following properties:

  1. (i)

    there is a unique vertex ρ\rho, the root, with in-degree 0 and out-degree at least 11,

  2. (ii)

    a vertex of out-degree 0 has in-degree 1 and the set of vertices with out-degree 0 is XX, and

  3. (iii)

    each internal vertex has either in-degree 1 and out-degree at least 2, or in-degree at least 2 and out-degree at least 1.

For technical reasons, we sometimes consider a class of networks that is more general than the class of rooted (resp. unrooted) phylogenetic networks. Specifically, we refer to a network that is a subdivision of a rooted phylogenetic network on XX (resp. unrooted phylogenetic network on XX) as a rooted pseudo network on XX (resp. unrooted pseudo network on XX).

Now, let 𝒩{\mathcal{N}} be a rooted pseudo network on XX. For an arc e=(u,v)e=(u,v) in 𝒩{\mathcal{N}}, we say that uu is a parent of vv and vv is a child of uu. If two distinct vertices vv and ww of 𝒩{\mathcal{N}} have a common parent, we say that vv and ww are siblings. Moreover, a vertex of 𝒩{\mathcal{N}} is called a tree vertex if it has in-degree 11 and out-degree at least 11 and is called a reticulation if it has in-degree at least 22 and out-degree at least 11. Lastly, an arc (u,v)(u,v) is called a shortcut in 𝒩{\mathcal{N}} if there is a directed path from uu to vv in 𝒩{\mathcal{N}} that does not contain (u,v)(u,v).

We next define five classes of rooted pseudo networks. These classes are typically defined in the context of rooted phylogenetic networks. However, their definitions naturally carry over to rooted pseudo networks as follows. Let 𝒩{\mathcal{N}} be a rooted pseudo network. We say that 𝒩{\mathcal{N}} is

  1. (i)

    tree-child if each non-leaf vertex of 𝒩{\mathcal{N}} has a child that is a tree vertex or a leaf,

  2. (ii)

    normal if 𝒩{\mathcal{N}} is tree-child and does not contain a shortcut,

  3. (iii)

    tree-sibling if each reticulation of 𝒩{\mathcal{N}} has a sibling that is a tree vertex or a leaf,

  4. (iv)

    reticulation-visible if, for each reticulation vv of 𝒩{\mathcal{N}}, there is a leaf \ell such that each directed path from the root of 𝒩{\mathcal{N}} to \ell contains vv, and

  5. (v)

    funneled if each reticulation of 𝒩{\mathcal{N}} has out-degree 1.

Since the above five classes are only defined for rooted pseudo networks, we will omit the adjective ‘rooted’ when referring to them. For more details on these network classes, see [2, 3, 7, 9, 12].

2.2. Orientations.

A connector network 𝒢k{\mathcal{G}}_{k} with k0k\geq 0 is a graph that can be obtained from an unrooted pseudo network 𝒰{\mathcal{U}} by deleting the label of kk leaves in 𝒰{\mathcal{U}}. We call a degree-1 vertex of 𝒢k{\mathcal{G}}_{k} a connector leaf or unlabelled leaf if it has no label, and a non-connector leaf or labelled leaf otherwise. Furthermore, we refer to 𝒰{\mathcal{U}} as the partner network of 𝒢k{\mathcal{G}}_{k}. Note that 𝒰{\mathcal{U}} is the unique partner network of 𝒢k{\mathcal{G}}_{k} up to those leaf labels that exist in 𝒰{\mathcal{U}} and not in 𝒢k{\mathcal{G}}_{k}. Lastly, for k=1k=1, let 𝒢1{\mathcal{G}}_{1} be a connector network. We say that an unrooted pseudo network 𝒰{\mathcal{U}} contains 𝒢1{\mathcal{G}}_{1} as a pending subgraph if 𝒰{\mathcal{U}} can be obtained from 𝒢1{\mathcal{G}}_{1} by identifying its connector leaf with a vertex vv of some unrooted pseudo network 𝒰{\mathcal{U}}^{\prime} and deleting the label if vv is a leaf of 𝒰{\mathcal{U}}^{\prime}. For example, the unrooted pseudo network that underlies the network shown in Figure 1(ii) has both 𝒢1{\mathcal{G}}_{1} and 𝒢1{\mathcal{G}}_{1}^{\prime} that are shown in (i) of the same figure as a pending subgraph.

For the purpose of the upcoming definitions, let 𝒢k{\mathcal{G}}_{k} be a connector network with k0k\geq 0. Furthermore, let LL be the set of all labelled leaves of 𝒢k{\mathcal{G}}_{k}, and let UU be the set of all unlabelled leaves of 𝒢k{\mathcal{G}}_{k}. We next define a process that assigns a direction to each edge of 𝒢k{\mathcal{G}}_{k} and, if k1k\leq 1, also introduces a root vertex ρ\rho.

First, for k1k\leq 1, an orientation of 𝒢k{\mathcal{G}}_{k} is obtained by either

Variant AA. subdividing an edge of 𝒢k{\mathcal{G}}_{k} with a new root vertex ρ\rho and then assigning a direction to each edge such that ρ\rho has in-degree 0 and each element in LL has out-degree 0, or

Variant BB. choosing a vertex uu of 𝒢k{\mathcal{G}}_{k} to be the root ρ\rho by setting u=ρu=\rho, deleting the label of uu if uu is a labelled leaf, and then assigning a direction to each edge such that ρ\rho has in-degree 0 and each element in LL if uLu\notin L (resp. L\{u}L\backslash\{u\} if uLu\in L) has out-degree 0.

Note that such an orientation always exists. Now, let 𝒞{\mathcal{C}} be a class of rooted (pseudo) networks and let R{A,B}R\in\{A,B\}. For k=0k=0, we say that an unrooted pseudo network 𝒢0{\mathcal{G}}_{0} has a 𝒞R{\mathcal{C}}_{R}-orientation or, equivalently, that 𝒢0{\mathcal{G}}_{0} is 𝒞R{\mathcal{C}}_{R}-orientable if there exists an orientation 𝒪{\mathcal{O}} of 𝒢0{\mathcal{G}}_{0} such that the following properties are satisfied.

  1. (i)

    𝒪{\mathcal{O}} is obtained from 𝒢0{\mathcal{G}}_{0} by following Variant RR,

  2. (ii)

    𝒪{\mathcal{O}} is a network in 𝒞{\mathcal{C}} with root ρ\rho,

  3. (iii)

    if ρL\rho\notin L, then 𝒪{\mathcal{O}} has |L||L| leaves, and

  4. (iv)

    if ρL\rho\in L, then 𝒪{\mathcal{O}} has |L|1|L|-1 leaves.

Turning to k=1k=1, we say that a connector network 𝒢1{\mathcal{G}}_{1} with a unique connector leaf rr has a 𝒞R{\mathcal{C}}_{R}-orientation or, equivalently, that 𝒢1{\mathcal{G}}_{1} is 𝒞R{\mathcal{C}}_{R}-orientable if its partner network has a 𝒞R{\mathcal{C}}_{R}-orientation. For R=AR=A, 𝒢1{\mathcal{G}}_{1} is called 𝒞A{\mathcal{C}}_{A}-root-forcing if there is no 𝒞A{\mathcal{C}}_{A}-orientation of 𝒢1{\mathcal{G}}_{1} that subdivides the (unique) edge incident with rr with a new root vertex. Similarly, for R=BR=B, 𝒢1{\mathcal{G}}_{1} is called 𝒞B{\mathcal{C}}_{B}-root-forcing if there is no 𝒞B{\mathcal{C}}_{B}-orientation of 𝒢1{\mathcal{G}}_{1} that chooses rr to be the root.

Refer to caption
Figure 1. (i) Two connector networks 𝒢1{\mathcal{G}}_{1} and 𝒢1{\mathcal{G}}^{\prime}_{1} each with a single connector leaf rr. (ii) A rooted pseudo network 𝒩{\mathcal{N}} obtained by identifying rr in 𝒢1{\mathcal{G}}_{1} with rr in 𝒢1{\mathcal{G}}^{\prime}_{1} and orienting the resulting unrooted pseudo network according to Variant AA. Observe that 𝒩{\mathcal{N}} is not tree-child. (iii) An orientation of 𝒢1{\mathcal{G}}_{1} following Variant AA. (iv) An orientation of 𝒢1{\mathcal{G}}_{1} following Variant BB. As we will see in Lemma 3.1, for 𝒞{\mathcal{C}} being the class of tree-child network, 𝒢1{\mathcal{G}}_{1} is 𝒞A{\mathcal{C}}_{A}-root-forcing because every 𝒞A{\mathcal{C}}_{A}-orientation of 𝒢1{\mathcal{G}}_{1} places the root on an edge that is not {r,v}\{r,v\}.

To illustrate, see Figure 1 for an example of a connector network with a single connector leaf that is 𝒞A{\mathcal{C}}_{A}-root-forcing for when 𝒞{\mathcal{C}} is the class of tree-child networks. In Figure 1 as well as in all other figures of this paper, the root is indicated by a small square, connector leaves are indicated by open circles whereas all other vertices are indicated by filled circles. Moreover, we frequently label internal vertices in figures. Their only purpose is to make references. Indeed, they should not be regarded as genuine labels as those used for leaves of pseudo networks and non-connector leaves of connector networks. Furthermore, labels of leaves of pseudo networks and non-connector leaves of connector networks are sometimes omitted from figures to facilitate a clear presentation unless they are of importance (e.g. if they are explicitly mentioned in a proof).

Second, for k2k\geq 2, an orientation 𝒪{\mathcal{O}} of 𝒢k{\mathcal{G}}_{k} is obtained by assigning a direction to each edge in 𝒢k{\mathcal{G}}_{k}. Let 𝒪{\mathcal{O}} be an acyclic orientation of 𝒢k{\mathcal{G}}_{k} such that each vertex in LL has out-degree 0, and each vertex that is not in LUL\cup U has in-degree at least 1 and out-degree at least 1, then 𝒪{\mathcal{O}} is called

  1. (i)

    {\mathcal{F}}-compatible if each vertex of 𝒪{\mathcal{O}} with in-degree at least 2 has out-degree 1,

  2. (ii)

    𝒯𝒞\mathcal{TC}-compatible if each vertex of 𝒪{\mathcal{O}} that is not in LUL\cup U has a child that is either in LUL\cup U or a vertex with in-degree 1 and out-degree at least 1, and

  3. (iii)

    strongly 𝒯𝒞\mathcal{TC}-compatible if each vertex of 𝒪{\mathcal{O}} that is not in LUL\cup U has a child that is either in LL or a vertex with in-degree 1 and out-degree at least 1.

2.3. Problem statements

Let 𝒞{\mathcal{C}} be a class of rooted phylogenetic networks, and let R{A,B}R\in\{A,B\}. In this paper, we analyse the following decision problem.

Funneled 𝒞R{\mathcal{C}}_{R}-Orientation
Instance. An unrooted phylogenetic network 𝒰{\mathcal{U}}.
Question. Does there exist a funneled 𝒞R{\mathcal{C}}_{R}-orientation of 𝒰{\mathcal{U}}?

If R=BR=B, we emphasise that any vertex of 𝒰{\mathcal{U}} can be chosen as root. In particular, no preselected root is part of the input to Funneled 𝒞B{\mathcal{C}}_{B}-Orientation. To obtain NP-hardness results for Funneled 𝒞R{\mathcal{C}}_{R}-Orientation in the next section, we design three gadgets, which are connector networks. Multiple copies of these gadgets are then combined such that the resulting graph is an unrooted pseudo network or an unrooted phylogenetic network 𝒰{\mathcal{U}}. We then orient each gadget separately such that, collectively, these orientations will result in a funneled 𝒞R{\mathcal{C}}_{R}-orientation 𝒩{\mathcal{N}} of 𝒰{\mathcal{U}} if such an orientation exists. To provide some intuition for some of the more technical definitions given in Section 2.2, if a gadget is a connector network 𝒢1{\mathcal{G}}_{1} with exactly one connector leaf, then the upcoming hardness constructions enforce that the root of 𝒩{\mathcal{N}} is contained in the subgraph of 𝒩{\mathcal{N}} that is obtained by orienting 𝒢1{\mathcal{G}}_{1}. On the other hand, if a gadget is a connector network 𝒢k{\mathcal{G}}_{k} with k2k\geq 2, then the hardness constructions enforce that the root of 𝒩{\mathcal{N}} is not contained in the subgraph of 𝒩{\mathcal{N}} that is obtained by orienting 𝒢k{\mathcal{G}}_{k}.

We now turn to the statement of two Boolean satisfiability problems that we later use to establish NP-hardness of Funneled 𝒞R{\mathcal{C}}_{R}-Orientation. Let V={x1,x2,,xn}V=\{x_{1},x_{2},\ldots,x_{n}\} be a set of variables. A literal is a variable xix_{i} or its negation x¯i\bar{x}_{i}, and a clause is a disjunction of a subset of {xi,x¯i:i{1,2,,n}}\{x_{i},\bar{x}_{i}:i\in\{1,2,\ldots,n\}\}. If a clause contains exactly kk distinct literals for k1k\geq 1, then it is called a kk-clause. We say that a clause is positive if it is a subset of VV. A Boolean formula in conjunctive normal form (or short, Boolean formula) is a conjunction of clauses, i.e., an expression of the form φ=j=1mcj\varphi=\bigwedge_{j=1}^{m}c_{j}, where cjc_{j} is a clause for all jj. Now, let φ\varphi be a Boolean formula. We say that φ\varphi is positive if each clause of φ\varphi is positive, i.e., no clause contains an element in {x¯1,x¯2,,x¯n}\{\bar{x}_{1},\bar{x}_{2},\ldots,\bar{x}_{n}\}. A truth assignment for VV is a mapping β:V{T,F}\beta\colon V\rightarrow\{T,F\}, where TT represents the truth value True and FF represents the truth value False. A truth assignment β\beta satisfies φ\varphi if at least one literal of each clause evaluates to TT under β\beta. If β\beta satisfies φ\varphi and has the additional property that at least one literal of each clause evaluates to FF, we say that β\beta nae-satisfies φ\varphi.

Positive Not-All-Equal (2,3)(2,3)-SAT
Instance. A set V={x1,x2,,xn}V=\{x_{1},x_{2},\ldots,x_{n}\} of variables and a collection C={c1,c2,,cm}C=\{c_{1},c_{2},\ldots,c_{m}\} of positive clauses over VV such that each clause consists of either two or three distinct variables.
Question. Is there a truth assignment for VV that nae-satisfies CC?

Positive 1-in-3 SAT
Instance. A set V={x1,x2,,xn}V=\{x_{1},x_{2},\ldots,x_{n}\} of variables and a collection C={c1,c2,,cm}C=\{c_{1},c_{2},\ldots,c_{m}\} of positive clauses over VV such that each clause consists of 3 distinct variables.
Question. Is there a truth assignment for VV that sets exactly one variable in each clause in CC to TT?

3. Funneled Tree-Child Orientations

In this section, we analyse the aforementioned result by Bulteau et al. [1] and establish NP-completeness of Funneled 𝒞R{\mathcal{C}}_{R}-Orientation for when an unrooted phylogenetic network with maximum degree 5 is given and 𝒞{\mathcal{C}} is the class of rooted phylogenetic networks that are tree-child.

3.1. Construction by Bulteau et al.

As mentioned in the introduction, an incorrect proof has recently appeared in the literature [1] that claims NP-completeness for the following decision problem which is slightly different from Funneled 𝒞R\mathcal{C}_{R}-Orientation: Given an unrooted pseudo network 𝒰{\mathcal{U}} on XX with maximum degree 5 and a vertex vv of 𝒰{\mathcal{U}}, does there exist an orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} such that 𝒪{\mathcal{O}} is a rooted pseudo network on XX (resp. X\{v}X\backslash\{v\} if vXv\in X) with root vv that is funneled and tree-child?

We next describe the high-level idea of the construction that is presented in [1] and then explain why the tree-child orientation result does not hold. Given an instance of Positive Not-All-Equal (2,3)(2,3)-SAT, the authors construct an unrooted pseudo network 𝒰{\mathcal{U}} with leaf set X={1,2,,n}X=\{\ell_{1},\ell_{2},\ldots,\ell_{n}\}. The construction of 𝒰{\mathcal{U}} is based on the bipartite incidence graph which has an internal variable vertex xix_{i} for each variable, an internal clause vertex cjc_{j} for each clause, and an edge connecting xix_{i} with cjc_{j} if and only if xix_{i} appears in cjc_{j}. Additionally, a leaf i\ell_{i} is attached to each xix_{i} and a vertex rr is introduced that is adjacent to each xix_{i}. Lastly, to guarantee that rr has degree at most 5, additional edges and vertices are added. Importantly, the introduction of these additional edges and vertices does not change the edges that connect a variable vertex with a clause vertex. To keep the upcoming discussion as simple as possible, we omit this last step in the construction from the discussion and instead view rr as a high-degree vertex. As an example, consider the following yes-instance of Positive Not-All-Equal (2,3)(2,3)-SAT, which is also detailed in [1].

(1) (x1x2x3)(x2x4)(x1x4)(x1x3)(x2x3x4).(x_{1}\vee x_{2}\vee x_{3})\wedge(x_{2}\vee x_{4})\wedge(x_{1}\vee x_{4})\wedge(x_{1}\vee x_{3})\wedge(x_{2}\vee x_{3}\vee x_{4}).

For this instance, the construction results in the unrooted pseudo network that is shown in Figure 2(a).

Refer to caption
Figure 2. (a) The construction of an unrooted pseudo network 𝒰{\mathcal{U}} for the Positive Not-All-Equal (2,3)(2,3)-SAT instance (1), and (b) an orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} that is rooted at rr. Observe that 𝒪{\mathcal{O}} is funneled but not tree-child.

Now, let 𝒰{\mathcal{U}} be an unrooted pseudo network obtained from the above construction for some instance {\mathcal{I}} of Positive Not-All-Equal (2,3)(2,3)-SAT, and let 𝒪{\mathcal{O}} be an orientation of 𝒰{\mathcal{U}}. Bulteau at al. essentially thought to have proved that 𝒪{\mathcal{O}} is a rooted pseudo network on XX with root rr that is funneled if and only if 𝒪{\mathcal{O}} is a rooted pseudo network on XX with root rr that is funneled and tree-child. Suppose that 𝒪{\mathcal{O}} is a rooted pseudo network on XX with root rr that is funneled. We next briefly describe some properties of 𝒪{\mathcal{O}}. Clearly, there are arcs (r,xi)(r,x_{i}) and (xi,i)(x_{i},\ell_{i}) for each variable vertex xix_{i} with i{1,2,,n}i\in\{1,2,\ldots,n\}. Since 𝒪{\mathcal{O}} is funneled, the edges that join a variable vertex xix_{i} with a clause vertex cjc_{j} are either all directed away from xix_{i} or all directed into xix_{i}. Moreover, there is no clause vertex cjc_{j} that has all arcs directed into it or all arcs directed away from it in 𝒪{\mathcal{O}}. Intuitively, 𝒪{\mathcal{O}} can be translated into a truth assignment that nae-satisfies CC, where arcs directed into a clause vertex correspond to true variables and arcs directed out of a clause vertex correspond to false variables. Hence, each clause contains at least one true variable and at least one false variable. Returning to the Positive Not-All-Equal (2,3)(2,3)-SAT instance (1) that is described above, Figure 2(b) shows an orientation that is a rooted pseudo network on four leaves and with root rr that is funneled. However, this orientation is not tree-child because, for example, c2c_{2} has no child that is a tree vertex or a leaf. In general, let uu be a child of cjc_{j} in 𝒪{\mathcal{O}}. Note that u=xiu=x_{i} for some variable vertex xix_{i}. Since xix_{i} is also a child of rr, it follows that the in-degree of xix_{i} is at least 2. Hence, xix_{i} is a reticulation and, because the same argument applies to any other child of cjc_{j}, it follows that cjc_{j} has no child that is a tree vertex or a leaf. The erroneous statement in [1, p. 10] is the following:

“Since each clause contains at least one true variable, each clause vertex has at least one child that is a tree node.”

The correct conclusion would be that each clause vertex cjc_{j} has at least one parent that is a tree vertex.

3.2. Funneled tree-child orientations for pseudo networks with maximum degree 5

Let 𝒞{\mathcal{C}} be the class of rooted pseudo networks that are tree-child. In this subsection, we establish hardness for the following decision problem.

Funneled 𝒞A\mathcal{C}_{A}-Orientation (5\leq 5, pseudo)
Instance. An unrooted pseudo network 𝒰{\mathcal{U}} with maximum degree 5.
Question. Does there exist a funneled 𝒞A\mathcal{C}_{A}-orientation of 𝒰{\mathcal{U}}?

We start by introducing three gadgets that we call the root gadget, connection gadget, and caterpillar gadget. These gadgets are shown on the left-hand side of Figure 34, and 5, respectively. They will play a central role in reducing Positive Not-All-Equal (2,3)(2,3)-SAT to Funneled 𝒞A\mathcal{C}_{A}-Orientation (5\leq 5, pseudo), thereby establishing NP-completeness for the latter.

The next three lemmas establish properties of the root, connection, and caterpillar gadget. Throughout the proofs, we use the vertex labels as shown in Figures 34, and 5, when referring to vertices of these gadgets.

Refer to caption
Figure 3. Left: Root gadget with a single connector leaf rr, and two non-connector leaves \ell and \ell^{\prime}. Right: An orientation of the root gadget.
Lemma 3.1.

Let 𝒢{\mathcal{G}} be the root gadget, and let 𝒞{\mathcal{C}} be the class of rooted pseudo networks that are tree-child. Then

  1. (i)

    𝒢{\mathcal{G}} has a 𝒞A\mathcal{C}_{A}-orientation whose root ρ\rho subdivides {w,x}\{w,x\}, and

  2. (ii)

    𝒢{\mathcal{G}} is 𝒞A{\mathcal{C}}_{A}-root-forcing.

Proof.

The orientation that is shown on the right-hand side of Figure 3 is a 𝒞A\mathcal{C}_{A}-orientation of the partner network of 𝒢{\mathcal{G}} which only differs from 𝒢{\mathcal{G}} by viewing rr as a non-connector leaf. Hence, 𝒢{\mathcal{G}} has a 𝒞A\mathcal{C}_{A}-orientation whose root ρ\rho subdivides {w,x}\{w,x\}. This establishes (i). Now, assume towards a contradiction that 𝒢{\mathcal{G}} does not satisfy (ii). Then there exists a 𝒞A{\mathcal{C}}_{A}-orientation 𝒪{\mathcal{O}} of 𝒢{\mathcal{G}} whose root ρ\rho subdivides {r,v}\{r,v\}. Then (ρ,v)(\rho,v) is an arc in 𝒪{\mathcal{O}}. Furthermore, as \ell and \ell^{\prime} are non-connector leaves, (y,)(y,\ell) and (z,)(z,\ell^{\prime}) are arcs in 𝒪{\mathcal{O}}. We next distinguish two cases.

First, suppose that vv has in-degree 1 and out-degree 2 in 𝒪{\mathcal{O}}. Then, ρ\rho is the parent of vv, and ww and xx are the children of vv. Observe that either (w,x)(w,x) or (x,w)(x,w) is an arc in 𝒪{\mathcal{O}}. By symmetry, we may assume that (x,w)(x,w) is an arc in 𝒪{\mathcal{O}}. This implies that ww has in-degree 2 and out-degree 1. Moreover, since 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}_{A}-orientation of 𝒢{\mathcal{G}}, it follows that each of xx and yy has in-degree 1 and out-degree 2. In turn (y,z)(y,z) and (x,z)(x,z) are arcs in 𝒪{\mathcal{O}}. Hence, zz has in-degree 2 and, thus, both children ww and zz of xx have in-degree 2; thereby contradicting that 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}_{A}-orientation of 𝒢{\mathcal{G}}. Second, suppose that vv has in-degree 2 and out-degree 1 in 𝒪{\mathcal{O}}. By symmetry, we may assume that xx is the second parent of vv and ww is the child of vv. Again, as 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}_{A}-orientation of 𝒢{\mathcal{G}}, ww has in-degree 1 and out-degree 2. This implies that (w,x)(w,x) is an arc in 𝒪{\mathcal{O}} and, so, (x,v),(v,w),(x,v),(v,w), and (w,x)(w,x) are the arcs of a directed cycle; thereby contradicting that 𝒪{\mathcal{O}} is acyclic. Combining both cases establishes the lemma. ∎

Refer to caption
Figure 4. Connection gadget (a) with two connector leaves ss and tt, and two non-connector leaves \ell and \ell^{\prime}. Two orientations (b) and (c) of the connection gadget. Both orientations are {\mathcal{F}}-compatible and strongly 𝒯𝒞\mathcal{TC}-compatible.

The next corollary is a generalisation of Lemma 3.1(ii).

Corollary 3.2.

Let 𝒞{\mathcal{C}} be the class of rooted pseudo networks that are tree-child. Let 𝒰{\mathcal{U}} be an unrooted pseudo network that contains a pending subgraph isomorphic to the root gadget (up to the labelling of the two non-connector leaves of that gadget). If 𝒰{\mathcal{U}} has a funneled 𝒞A{\mathcal{C}}_{A}-orientation, then the root subdivides an edge of the root gadget that is not {r,v}\{r,v\}.

Proof.

Observe that the connector leaf rr is either an internal vertex or a leaf of 𝒰{\mathcal{U}}. Assume that there exists a funneled 𝒞A{\mathcal{C}}_{A}-orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} whose root ρ\rho subdivides {r,v}\{r,v\} or an edge that is not an edge of the root gadget. Since there exists a directed path from ρ\rho to leaf \ell of the root gadget, it follows that (r,v)(r,v) is an arc of 𝒪{\mathcal{O}}. Now the proof can be established as the proof of Lemma 3.1(ii). ∎

Lemma 3.3.

Let 𝒢{\mathcal{G}} be the connection gadget. Then 𝒢{\mathcal{G}} satisfies the following properties:

  1. (i)

    There exists an orientation of 𝒢{\mathcal{G}} that is {\mathcal{F}}-compatible and strongly 𝒯𝒞\mathcal{TC}-compatible in which each of uu and vv has in-degree 1 and out-degree 2, and (s,u)(s,u) and (v,t)(v,t) are arcs.

  2. (ii)

    There exists an orientation of 𝒢{\mathcal{G}} that is {\mathcal{F}}-compatible and strongly 𝒯𝒞\mathcal{TC}-compatible in which each of uu and vv has in-degree 1 and out-degree 2, and (u,s)(u,s) and (t,v)(t,v) are arcs.

  3. (iii)

    Each 𝒯𝒞\mathcal{TC}-compatible orientation of 𝒢{\mathcal{G}} has either arcs (s,u)(s,u) and (v,t)(v,t) or arcs (u,s)(u,s) and (t,v)(t,v).

Proof.

Since 𝒢{\mathcal{G}} has two connector leaves, recall that, by definition, an orientation of 𝒢{\mathcal{G}} is obtained by assigning a direction to each edge of 𝒢{\mathcal{G}}. The orientations of 𝒢{\mathcal{G}} that are shown in Figure 4(b) and (c) establish (i) and (ii), respectively. Next we show that 𝒢{\mathcal{G}} also satisfies (iii). Let 𝒪{\mathcal{O}} be a 𝒯𝒞\mathcal{TC}-compatible orientation of 𝒢{\mathcal{G}}. Due to (i), such an orientation exists. Furthermore, by definition of 𝒯𝒞\mathcal{TC}-compatibility, (w,)(w,\ell) and (w,)(w^{\prime},\ell^{\prime}) are arcs in 𝒪{\mathcal{O}}.

First, suppose that (u,s)(u,s) and (v,t)(v,t) are arcs in 𝒪{\mathcal{O}}. By symmetry, we may assume that (u,v)(u,v) is an arc in 𝒪{\mathcal{O}}. In turn, this implies the arc (w,u)(w,u) exists because, otherwise, uu has in-degree 0. Using the same argument again, (w,w)(w^{\prime},w) and (v,w)(v,w^{\prime}) are also arcs in 𝒪{\mathcal{O}}. Now (u,v),(v,w),(w,w)(u,v),(v,w^{\prime}),(w^{\prime},w) and (w,u)(w,u) are the arcs of a directed cycle in 𝒪{\mathcal{O}}; thereby contradicting that 𝒪{\mathcal{O}} is acyclic. Second, suppose that (s,u)(s,u) and (t,v)(t,v) are arcs in 𝒪{\mathcal{O}}. By symmetry, we may again assume that (u,v)(u,v) is an arc in 𝒪{\mathcal{O}}. Then, (v,w)(v,w^{\prime}) is an arc in 𝒪{\mathcal{O}} because, otherwise vv has out-degree 0. Since 𝒪{\mathcal{O}} is 𝒯𝒞\mathcal{TC}-compatible and vv has in-degree 2 and out-degree 1, ww^{\prime} has in-degree 1 and out-degree 2. Hence (w,w)(w^{\prime},w) is an arc in 𝒪{\mathcal{O}}. Lastly, (w,u)(w,u) is an arc in 𝒪{\mathcal{O}} because, otherwise, uu has no child that is in LUL\cup U or has in-degree 1 and out-degree at least 1. Now (u,v),(v,w),(w,w)(u,v),(v,w^{\prime}),(w^{\prime},w) and (w,u)(w,u) are again the arcs of a directed cycle in 𝒪{\mathcal{O}}; another contradiction. By combining both cases, we deduce that, each 𝒯𝒞\mathcal{TC}-compatible orientation of 𝒢{\mathcal{G}} has either arcs (s,u)(s,u) and (v,t)(v,t), or arcs (u,s)(u,s) and (t,v)(t,v). This establishes (iii) and, therefore, the lemma. ∎

While the root and connection gadgets are new to this paper, the caterpillar gadget is based on what Bulteau et al. [1] refer to as Rule 1. This rule is a graph operation that reduces the degree of an unrooted phylogenetic pseudo network 𝒰{\mathcal{U}} while preserving the existence of certain orientations of 𝒰{\mathcal{U}}. Essentially, Rule 1 is based on the observation that a vertex vv with two adjacent leaves and an arc that is directed into vv forces directions on all other edges incident with vv in every funneled orientation. More precisely, we have the following:

Observation 3.4.

Let 𝒪{\mathcal{O}} be an {\mathcal{F}}-compatible orientation of a connector network 𝒢k{\mathcal{G}}_{k} with k0k\geq 0. Furthermore, let vv be a vertex of 𝒪{\mathcal{O}} that is incident with dd arcs such that one arc is directed into vv and two arcs are directed out of vv. Then vv has in-degree 1 and out-degree d1d-1.

Before establishing properties of the caterpillar gadget in the next lemma, we note that the non-connector leaf \ell of this gadget is not necessary to show that the lemma holds. Its only purpose is to turn p1p_{1} into a degree-5 vertex.

Refer to caption
Figure 5. Left: Caterpillar gadget with n+1n+1 connector leaves, and 1+2n1+2n non-connector leaves. Right: An {\mathcal{F}}-compatible and strongly 𝒯𝒞\mathcal{TC}-compatible orientation of the caterpillar gadget.
Lemma 3.5.

For n1n\geq 1, let 𝒢{\mathcal{G}} be the caterpillar gadget with degree-5 vertices p1,p2,,pnp_{1},p_{2},\ldots,p_{n}. Then 𝒢{\mathcal{G}} has a unique \mathcal{F}-compatible orientation 𝒪\mathcal{O} with arc (r,pn)(r,p_{n}). Furthermore,

  1. (i)

    𝒪\mathcal{O} is strongly 𝒯𝒞\mathcal{TC}-compatible and

  2. (ii)

    for each 1in1\leq i\leq n, (pi,xi)(p_{i},x_{i}) is an arc in 𝒪{\mathcal{O}}.

Proof.

Suppose that (r,pn)(r,p_{n}) is an arc in an \mathcal{F}-compatible orientation 𝒪{\mathcal{O}} of 𝒢{\mathcal{G}}. We first show that 𝒪{\mathcal{O}} is unique. For each i{1,2,,n}i\in\{1,2,\ldots,n\}, let i\ell_{i} and i\ell_{i}^{\prime} be the two non-connector leaves adjacent to pip_{i} (these leaf labels are omitted in Figure 5). Since (r,pn)(r,p_{n}), (pn,n)(p_{n},\ell_{n}), and (pn,n)(p_{n},\ell_{n}^{\prime}) are arcs in 𝒪{\mathcal{O}} and 𝒪{\mathcal{O}} is \mathcal{F}-compatible, it follows from Observation 3.4 that (pn,xn)(p_{n},x_{n}) and (pn,pn1)(p_{n},p_{n-1}) are arcs in 𝒪{\mathcal{O}}. Now, since (pn,pn1)(p_{n},p_{n-1}), (pn1,n1)(p_{n-1},\ell_{n-1}), and (pn1,n1)(p_{n-1},\ell_{n-1}^{\prime}) are arcs in 𝒪{\mathcal{O}}, it again follows by Observation 3.4 that (pn1,xn1)(p_{n-1},x_{n-1}) and (pn1,pn2)(p_{n-1},p_{n-2}) are also arcs in 𝒪{\mathcal{O}}. Continuing in this way and noting that p1p_{1} has, in addition to 1\ell_{1} and 1\ell_{1}^{\prime}, a third non-connector leaf \ell, it is easily checked that there exists exactly one \mathcal{F}-compatible orientation for 𝒢{\mathcal{G}} with arc (r,pn)(r,p_{n}). Moreover, this orientation, which is illustrated on the right-hand side of Figure 5, also satisfies (ii) and, as no vertex in 𝒪{\mathcal{O}} has in-degree at least 2, 𝒪{\mathcal{O}} satisfies (i). ∎

The proof of the following theorem uses a construction that is based on ideas presented in Bulteau et al. [1] and summarised in Section 3.1. However, we use a different root gadget and introduce a connection gadget that enables the tree-child property for orientations corresponding to nae-satisfying truth assignments.

Theorem 3.6.

Let 𝒞{\mathcal{C}} be the class of rooted pseudo networks that are tree-child. Then Funneled 𝒞A\mathcal{C}_{A}-Orientation (5\leq 5, pseudo) is NP-complete.

Proof.

Let 𝒰{\mathcal{U}} be an unrooted pseudo network, and let 𝒪{\mathcal{O}} be an orientation of 𝒰{\mathcal{U}} following Variant AA. Since it can be checked in polynomial time if 𝒪{\mathcal{O}} is a 𝒞A\mathcal{C}_{A}-orientation of 𝒰{\mathcal{U}}, it follows that Funneled 𝒞A\mathcal{C}_{A}-Orientation (5\leq 5, pseudo) is in NP. We next establish NP-hardness using a polynomial-time reduction from the variant of Positive Not-All-Equal (2,3)-SAT, where each variable appears exactly three times (see Dehghan et al. [4] for a proof that the problem remains NP-complete under this restriction).

Let =(V,C)\mathcal{I}=(V,C) be an instance of Positive Not-All-Equal (2,3)-SAT such that each variable appears in exactly three clauses. Let m2m_{2} (resp. m3m_{3}) be the number of clauses in CC that contain exactly two (resp. three) distinct variables. Let 𝒢n+1c{\mathcal{G}}_{n+1}^{c} be the caterpillar gadget with connector leaves r,x1,x2,,xnr,x_{1},x_{2},\ldots,x_{n}, let 𝒢1r{\mathcal{G}}_{1}^{r} be the root gadget with connector leaf rr, and let 𝒢21,𝒢22,,𝒢22m2+3m3{\mathcal{G}}_{2}^{1},{\mathcal{G}}_{2}^{2},\ldots,{\mathcal{G}}_{2}^{2m_{2}+3m_{3}} be copies of the connection gadget each with connector leaves ss and tt. We next construct an unrooted pseudo network in the following way.

Refer to caption
Figure 6. The unrooted pseudo network that is constructed from the Boolean formula (1) of Positive Not-All-Equal (2,3)-SAT. Details are given in the proof of Theorem 3.6. Connection gadgets are shown as grey shaded circles.
  1. (1)

    Identify the connector leaf rr of 𝒢1r{\mathcal{G}}_{1}^{r} with the connector leaf rr of 𝒢n+1c{\mathcal{G}}_{n+1}^{c}. We will continue to refer to the resulting degree-2 vertex as rr.

  2. (2)

    Attach a new leaf i\ell_{i} to each connector leaf xix_{i} with i{1,2,,n}i\in\{1,2,\ldots,n\} via a new edge.

  3. (3)

    Let c1,c2,,cmc_{1},c_{2},\ldots,c_{m} be new vertices. Furthermore, let i{1,2,,n}i\in\{1,2,\ldots,n\}, and let j{1,2,,m}j\in\{1,2,\ldots,m\}. For each xix_{i} and cjc_{j} such that xix_{i} is a variable of cjc_{j}, add one of the 2m2+3m32m_{2}+3m_{3} connection gadgets by identifying the connector leaf ss with xix_{i} and identifying the connector leaf tt with cjc_{j}.

Let 𝒰{\mathcal{U}} be the resulting unrooted pseudo network. We may assume for the remainder of the proof that all leaves of 𝒰{\mathcal{U}} have a distinct leaf label and that the leaf set of 𝒰{\mathcal{U}} is XX. Note that |X|=3+3n+2(2m2+3m3)|X|=3+3n+2(2m_{2}+3m_{3}). By construction and since each variable appears exactly three times in CC, the maximum degree of a vertex in 𝒰{\mathcal{U}} is 5. Figure 6 shows an example of this construction for the Boolean formula given in (1).

3.6.1.

{\mathcal{I}} is a yes-instance if and only if 𝒰{\mathcal{U}} has a funneled 𝒞A\mathcal{C}_{A}-orientation.

Proof.

First, suppose that {\mathcal{I}} is a yes-instance. Let β:V{T,F}\beta\colon V\rightarrow\{T,F\} be a truth assignment that nae-satisfies each clause in CC. We construct a funneled 𝒞A{\mathcal{C}}_{A}-orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}}. Instead of referring to edges of 𝒰{\mathcal{U}}, we refer to edges of the root, connection, and caterpillar gadgets as shown in Figures 35 when assigning directions because each edge in 𝒰{\mathcal{U}}, except for the edges incident with i\ell_{i} with i{1,2,,n}i\in\{1,2,\ldots,n\}, corresponds to a unique edge in one of the gadgets that make up 𝒰{\mathcal{U}}. We start by subdividing the edge {w,x}\{w,x\} of 𝒢1r{\mathcal{G}}_{1}^{r} with a new root vertex ρ\rho and direct the edges of the resulting root gadget as shown on the right-hand side of Figure 3. In particular, we have the arc (v,r)(v,r) in 𝒪{\mathcal{O}}. Turning to the edges of 𝒢n+1c{\mathcal{G}}_{n+1}^{c}, we direct the edges of this gadget as shown on the right-hand side of Figure 5. In particular, we have the arc (r,pn)(r,p_{n}) in 𝒪{\mathcal{O}}. Now, for each i{1,2,,n}i\in\{1,2,\ldots,n\}, let ui,uiu_{i},u_{i}^{\prime}, and ui′′u_{i}^{\prime\prime} be the three neighbours of xix_{i} in 𝒰{\mathcal{U}} that are vertices of three distinct connection gadgets. We direct the edge {xi,i}\{x_{i},\ell_{i}\} into i\ell_{i} and, depending on β(xi)\beta(x_{i}), we direct the three edges {xi,ui}\{x_{i},u_{i}\}, {xi,ui}\{x_{i},u_{i}^{\prime}\}, and {xi,ui′′}\{x_{i},u_{i}^{\prime\prime}\} in one of the following two ways:

  1. (i)

    (xi,ui),(xi,ui),(xi,ui′′)(x_{i},u_{i}),(x_{i},u_{i}^{\prime}),(x_{i},u_{i}^{\prime\prime}) if β(xi)=T\beta(x_{i})=T, and

  2. (ii)

    (ui,xi),(ui,xi),(ui′′,xi)(u_{i},x_{i}),(u_{i}^{\prime},x_{i}),(u_{i}^{\prime\prime},x_{i}) if β(xi)=F\beta(x_{i})=F.

At this point, each connection gadget in 𝒰{\mathcal{U}} has exactly one edge that is already assigned a direction and this edge is e={s,u}e=\{s,u\} (where uu is the vertex as shown in Figure 4). If ee is directed into uu in 𝒪{\mathcal{O}}, then direct the remaining edges of the connection gadget as shown in Figure 4(b) and, otherwise, as shown in Figure 4(c). This completes the assignment of directions to the edges of 𝒰{\mathcal{U}} and, therefore, the construction of 𝒪{\mathcal{O}}. Note that the set of vertices with out-degree 0 is XX and each such vertex has in-degree 1, and the unique vertex with in-degree 0 is ρ\rho. We next show that 𝒪{\mathcal{O}} is indeed a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}. In preparation for the upcoming arguments, let

S={r,x1,x2,,xn,c1,c2,,cm}.S=\{r,x_{1},x_{2},\ldots,x_{n},c_{1},c_{2},\ldots,c_{m}\}.

Intuitively, SS contains each vertex in 𝒪{\mathcal{O}} that is a connector leaf in one of the gadgets that are used in the construction of 𝒰{\mathcal{U}}. Furthermore, observe that each vertex xix_{i} in 𝒪{\mathcal{O}} with i{1,2,,n}i\in\{1,2,\ldots,n\} has either in-degree 1 and out-degree 4, or in-degree 4 and out-degree 1.

We start by establishing that 𝒪{\mathcal{O}} is acyclic. Since the orientations that are shown in Figures 35 are acyclic, it is sufficient to show that no vertex in SS lies on a directed cycle. Let vv be the vertex of 𝒢1r{\mathcal{G}}_{1}^{r} that is a neighbour of rr in 𝒪{\mathcal{O}}. Since there is no directed path from pnp_{n} to vv in 𝒪{\mathcal{O}}, it follows that rr does not lie on a directed cycle. Assume towards a contradiction that xix_{i} lies on a directed cycle for some i{1,2,,n}i\in\{1,2,\ldots,n\}. Then, by the observation at the end of the last paragraph, xix_{i} has in-degree 1 and out-degree 4 and there is a directed path from xix_{i} to pip_{i}. Hence, there is an arc (xi,pi)(x_{i^{\prime}},p_{i^{\prime}}) for some i{1,2,,n}i^{\prime}\in\{1,2,\ldots,n\}; a contradiction. Finally, if cjc_{j} lies on a directed cycle for some j{1,2,,m}j\in\{1,2,\ldots,m\}, then some xix_{i} lies on a directed cycle for some i{1,2,,n}i\in\{1,2,\ldots,n\}; again a contradiction. We conclude that 𝒪{\mathcal{O}} is acyclic and, thus, 𝒪{\mathcal{O}} is a rooted pseudo network on XX with root ρ\rho.

We next argue that 𝒪{\mathcal{O}} is funneled, i.e., each vertex in 𝒪{\mathcal{O}} with in-degree at least 2 has out-degree 1. It follows from the orientations shown in Figures 35 that each vertex of 𝒪{\mathcal{O}} that is not in SS satisfies this property. Turning to the vertices in SS, rr has in-degree 1 and out-degree 1. Moreover, for each i{1,2,,n}i\in\{1,2,\ldots,n\}, xix_{i} has in-degree 1 if β(xi)=T\beta(x_{i})=T and out-degree 1 if β(xi)=F\beta(x_{i})=F. Finally, recall that each clause in CC has either two or three distinct variables. Since β\beta nae-satisfies CC, it follows that cjc_{j} has either in-degree 1 and out-degree is 2, or in-degree 2 and out-degree 1 for each cjCc_{j}\in C. Hence, 𝒪{\mathcal{O}} is funneled.

We complete this part of the proof by showing that 𝒪{\mathcal{O}} is tree-child. Since rr has in-degree 1 and out-degree 1 in 𝒪{\mathcal{O}}, it follows from Lemmas 3.1(i), 3.3(i)–(ii), and 3.5(i) that each vertex of 𝒪{\mathcal{O}} that is not a leaf and not in SS has a child that is a leaf or a tree vertex. Furthermore, rr has a unique child pnp_{n} that is a tree vertex, and each xix_{i} with i{1,2,,n}i\in\{1,2,\ldots,n\} is adjacent to a leaf i\ell_{i}. Now consider a vertex cjc_{j} in 𝒪{\mathcal{O}} with j{1,2,,m}j\in\{1,2,\ldots,m\}. Each child of cjc_{j} is a vertex vv of a connection gadget. The edges of such a connection gadget are directed as shown in Figure 4(c). Importantly, vv is a tree vertex. We conclude that 𝒪{\mathcal{O}} is a pseudo network on XX that is funneled and tree-child.

Second, let 𝒪{\mathcal{O}} be a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}} with root ρ\rho. By Corollary 3.2, ρ\rho subdivides an edge of 𝒢1r{\mathcal{G}}_{1}^{r} that is not {r,v}\{r,v\}. Moreover, since 𝒪{\mathcal{O}} is funneled and there is a directed path from ρ\rho to each leaf in 𝒪{\mathcal{O}}, (v,r)(v,r) and (r,pn)(r,p_{n}) are arcs in 𝒪{\mathcal{O}}. By Lemma 3.5(ii), it now follows that (pi,xi)(p_{i},x_{i}) is an arc in 𝒪{\mathcal{O}} for each i{1,2,,n}i\in\{1,2,\ldots,n\}. Hence, each xix_{i} has a parent pip_{i} and a child i\ell_{i} in 𝒪{\mathcal{O}}. We now define a truth assignment β:V{T,F}\beta\colon V\rightarrow\{T,F\} as follows. For each i{1,2,,n}i\in\{1,2,\ldots,n\}, we set β(xi)=T\beta(x_{i})=T if xix_{i} is a tree vertex and β(xi)=F\beta(x_{i})=F if xix_{i} is a reticulation in 𝒪{\mathcal{O}}.

Towards a contradiction, assume that β\beta does not nae-satisfy each clause in CC. We distinguish two cases. First, suppose that there exists a clause cj=(xxx′′)c_{j}=(x\vee x^{\prime}\vee x^{\prime\prime}) or cj=(xx)c_{j}=(x\vee x^{\prime}) for some j{1,2,,m}j\in\{1,2,\ldots,m\} whose variables are all set to TT under β\beta. We continue by assuming that cj=(xxx′′)c_{j}=(x\vee x^{\prime}\vee x^{\prime\prime}) and note that the same argument applies for when cj=(xx)c_{j}=(x\vee x^{\prime}). As x,xx,x^{\prime} and x′′x^{\prime\prime} are tree vertices in 𝒪{\mathcal{O}}, each arc that joins a vertex in {x,x,x′′}\{x,x^{\prime},x^{\prime\prime}\} with a vertex of a connection gadget is directed towards the latter. Now consider a directed path PP in 𝒪{\mathcal{O}} from a vertex in {x,x,x′′}\{x,x^{\prime},x^{\prime\prime}\} to cjc_{j}, and let ee be the last arc on PP. Since 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}_{A}-orientation, the direction of the arcs restricted to a single connection gadget in 𝒪{\mathcal{O}} is 𝒯𝒞\mathcal{TC}-compatible. Hence, Lemma 3.3(iii) implies that ee is directed into cjc_{j}. It now follows that cjc_{j} has in-degree 3 and out-degree 0; a contradiction. Second, suppose that there exists a clause cjCc_{j^{\prime}}\in C whose variables are all set to FF under β\beta. Then, by an analogous argument, cjc_{j^{\prime}} has in-degree 0 and out-degree-3; also a contradiction. Thus, β\beta nae-satisfies each clause in CC, that is, \mathcal{I} is a yes-instance. ∎

Noting that 𝒰{\mathcal{U}} can be constructed in polynomial time and has a size that is polynomial in nn and mm, we conclude the proof of the theorem. ∎

3.3. Hardness remains for phylogenetic networks

Theorem 3.6 shows that the erroneous result by Bulteau et al. [1, Cor. 5] can be corrected by introducing additional gadgets. However, the construction results in unrooted and rooted pseudo networks. In this short subsection, we outline two modifications to the construction that avoid degree-2 vertices, thereby showing that the following decision problem is NP-complete for when 𝒞{\mathcal{C}} is the class of tree-child networks.

Funneled 𝒞A\mathcal{C}_{A}-Orientation (3,53,5)
Instance. An unrooted phylogenetic network 𝒰{\mathcal{U}} such that each non-leaf vertex has degree 3 or 5.
Question. Does there exist a funneled 𝒞A\mathcal{C}_{A}-orientation of 𝒰{\mathcal{U}}?

Let 𝒰{\mathcal{U}} be an unrooted pseudo network on XX that is constructed from an instance of Positive Not-All-Equal (2,3)-SAT as described in the proof of Theorem 3.6. By construction, each non-leaf vertex of 𝒰{\mathcal{U}} has degree 2, 3, or 5. Moreover, the degree-2 vertices in 𝒰{\mathcal{U}} are rr and cjc_{j} for each j{1,2,,m}j\in\{1,2,\ldots,m\} such that cjc_{j} is a clause with exactly two distinct variables. Consider a 2-clause cj=(xixi)c_{j}=(x_{i}\vee x_{i^{\prime}}) with i,i{1,2,n}i,i^{\prime}\in\{1,2\ldots,n\} and iii\neq i^{\prime}. Observe that cjc_{j} can be viewed as an XOR constraint on xix_{i} and xix_{i^{\prime}}, that is, exactly one of the two variables is set to TT in a truth assignment that nae-satisfies cjc_{j}. In 𝒰{\mathcal{U}}, this is simulated by the clause vertex cjc_{j} being adjacent to two vertices that are part of two distinct connection gadgets, say 𝒢21{\mathcal{G}}_{2}^{1} and 𝒢22{\mathcal{G}}_{2}^{2}. Without loss of generality, we assume that the vertex ss of 𝒢21{\mathcal{G}}_{2}^{1} has been identified with xix_{i} in the construction of 𝒰{\mathcal{U}}. We refer to the operation of deleting all vertices of 𝒢22{\mathcal{G}}_{2}^{2} except for cjc_{j} and xix_{i^{\prime}}, and then identifying cjc_{j} with xix_{i^{\prime}} as the 2-clause suppression of cjc_{j}. See Figure 7 for an example of a 22-clause suppression.

Refer to caption
Figure 7. Left: Partial construction of an unrooted pseudo network 𝒰{\mathcal{U}} from an instance of Positive Not-All-Equal (2,3)-SAT that contains a clause cj=(xixi)c_{j}=(x_{i}\vee x_{i^{\prime}}) as detailed in the proof of Theorem 3.6. Right: Unrooted pseudo network obtained from 𝒰{\mathcal{U}} by the 22-clause suppression of cjc_{j}. Connection gadgets are shown as grey shaded circles.
Lemma 3.7.

Let 𝒞{\mathcal{C}} be the class of rooted pseudo networks that are tree-child. Let 𝒰{\mathcal{U}} be an unrooted pseudo network that is constructed from an instance of Positive Not-All-Equal (2,3)-SAT as described in the proof of Theorem 3.6. Let 𝒰{\mathcal{U}}^{\prime} be the unrooted phylogenetic network obtained from 𝒰{\mathcal{U}} by suppressing rr and applying the 2-clause suppression to each clause vertex cjc_{j} that has degree 2 in 𝒰{\mathcal{U}}. Then 𝒰{\mathcal{U}} has a funneled 𝒞A{\mathcal{C}}_{A}-orientation if and only if 𝒰{\mathcal{U}}^{\prime} has a funneled 𝒞A{\mathcal{C}}_{A}-orientation.

Proof.

We first note that if an unrooted phylogenetic network has a 𝒞A{\mathcal{C}}_{A}-orientation, then this orientation is necessarily a tree-child network (without any vertex of in-degree 1 and out-degree 1). Furthermore, by construction, observe that, except for {v,pn}\{v,p_{n}\}, each edge in 𝒰{\mathcal{U}}^{\prime} corresponds to a unique edge in 𝒰{\mathcal{U}}. In particular, for each 22-clause cj=(xixi)c_{j}=(x_{i}\vee x_{i^{\prime}}), the edge {q,xi}\{q,x_{i^{\prime}}\} in 𝒰{\mathcal{U}}^{\prime}, where qq is a vertex of a connection gadget 𝒢21{\mathcal{G}}_{2}^{1} in 𝒰{\mathcal{U}} corresponds to the edge {q,cj}\{q,c_{j}\}. These two edges are indicated by thick lines in Figure 7.

Now, first, suppose that 𝒪{\mathcal{O}} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}. By Corollary 3.2, the root of 𝒪{\mathcal{O}} subdivides an edge eρe_{\rho} of the root gadget such that eρ{r,v}e_{\rho}\neq\{r,v\}. Let 𝒪{\mathcal{O}}^{\prime} be the orientation obtained from 𝒰{\mathcal{U}}^{\prime} by subdividing eρe_{\rho} in 𝒰{\mathcal{U}}^{\prime} with a root vertex, directing the edge {v,pn}\{v,p_{n}\} into pnp_{n}, and then directing each remaining edge as it is directed in 𝒪{\mathcal{O}}. Let cj=(xixi)c_{j}=(x_{i}\vee x_{i^{\prime}}) be a 22-clause. Then one of xix_{i} and xix_{i^{\prime}}, say xix_{i}, is a tree vertex and xix_{i^{\prime}} is a reticulation in 𝒪{\mathcal{O}} and 𝒪{\mathcal{O}}^{\prime}. It follows that each xix_{i} with i{1,2,,n}i\in\{1,2,\ldots,n\} is a tree vertex (resp. reticulation) in 𝒪{\mathcal{O}} if and only if xix_{i} is a tree vertex (resp. reticulation) in 𝒪{\mathcal{O}}^{\prime}, and the same applies to xix_{i^{\prime}}. Noting that the unique child pnp_{n} of rr in 𝒰{\mathcal{U}} is a tree vertex, it now follows that, as 𝒪{\mathcal{O}} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}, 𝒪{\mathcal{O}}^{\prime} is such an orientation of 𝒰{\mathcal{U}}^{\prime}.

Second, suppose that 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}. Let eρe_{\rho} be the edge in 𝒰{\mathcal{U}}^{\prime} that contains the root in 𝒪{\mathcal{O}}^{\prime}. By Corollary 3.2, eρe_{\rho} is an edge of the root gadget such that eρ{v,pn}e_{\rho}\neq\{v,p_{n}\}. Hence, eρe_{\rho} is also an edge of 𝒰{\mathcal{U}}. We now obtain an orientation 𝒪{\mathcal{O}} for 𝒰{\mathcal{U}} by subdividing eρe_{\rho} with ρ\rho, directing the two edges that are incident with ρ\rho away from ρ\rho and directing the remaining edges as follows. For each edge ee such that eeρe\neq e_{\rho} and ee is in 𝒰{\mathcal{U}} and 𝒰{\mathcal{U}}^{\prime}, ee has the same direction in 𝒪{\mathcal{O}} as in 𝒪{\mathcal{O}}^{\prime}. Furthermore, for each leaf \ell that is in 𝒰{\mathcal{U}} but not in 𝒰{\mathcal{U}}^{\prime}, the unique edge incident with \ell is directed towards \ell. Since there exists a directed path from ρ\rho to each leaf in 𝒪{\mathcal{O}}^{\prime}, it follows that 𝒪{\mathcal{O}}^{\prime} contains the arc (v,pn)(v,p_{n}). For the same reason, (v,r)(v,r) and, consequently, (r,pn)(r,p_{n}) are arcs in 𝒪{\mathcal{O}}. Now, the only edges in 𝒰{\mathcal{U}} that have not been assigned a direction yet are edges of the connection gadgets whose non-leaf vertices lie on paths between cjc_{j} and xix_{i^{\prime}} that do not contain xix_{i} for each 2-clause cj=(xixi)c_{j}=(x_{i}\vee x_{i^{\prime}}). By construction, xix_{i} is a tree vertex in 𝒪{\mathcal{O}}^{\prime} if and only if xix_{i^{\prime}} is a reticulation in 𝒪{\mathcal{O}}^{\prime}. Assume that xix_{i} is a tree vertex in 𝒪{\mathcal{O}}^{\prime}. Then (xi,q)(x_{i},q) is an arc in 𝒪{\mathcal{O}}^{\prime} and 𝒪{\mathcal{O}}, where qq is the neighbour of xix_{i} in a connection gadget 𝒢21{\mathcal{G}}_{2}^{1}. Furthermore, by Lemma 3.3(iii), (q,xi)(q^{\prime},x_{i^{\prime}}) is an arc in 𝒪{\mathcal{O}}^{\prime}, where qq^{\prime} is the neighbour of xix_{i^{\prime}} in 𝒢21{\mathcal{G}}_{2}^{1}. In 𝒰{\mathcal{U}}, there exists a connection gadget 𝒢22{\mathcal{G}}_{2}^{2} whose non-leaf vertices lie on paths from cjc_{j} to xix_{i^{\prime}} that do not contain xix_{i}. Since cjc_{j} is a degree-2 vertex in 𝒰{\mathcal{U}} and (q,cj)(q^{\prime},c_{j}) is an arc of 𝒪{\mathcal{O}}^{\prime} (recall that cj=xic_{j}=x_{i^{\prime}} in 𝒰{\mathcal{U}}^{\prime}), we direct the edges of 𝒢22{\mathcal{G}}_{2}^{2} as shown in Figure 4(c) which yields a strongly 𝒯𝒞\mathcal{TC}-compatible orientation for 𝒢22{\mathcal{G}}_{2}^{2}. Then cjc_{j} has in-degree 1 and out-degree 1 in 𝒪{\mathcal{O}}, and cjc_{j} has a child in 𝒢22{\mathcal{G}}_{2}^{2} that is a tree vertex. Replacing Figure 4(c) with Figure 4(b), the case for when xix_{i} is a reticulation is similar and omitted. As 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}, it now follows that we obtain a funneled 𝒞A{\mathcal{C}}_{A}-orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} by repeating the above steps for each 22-clause. ∎

The next theorem is an immediate consequence of Theorem 3.6 and Lemma 3.7.

Theorem 3.8.

Let 𝒞{\mathcal{C}} be the class of tree-child networks. Then Funneled 𝒞A\mathcal{C}_{A}-Orientation (3,53,5) is NP-complete if each non-leaf vertex has degree 3 or 5.

4. Hardness for Other Classes of Funneled Phylogenetic Networks

In this section, we establish NP-completeness for the following decision problem.

Funneled 𝒞R\mathcal{C}_{R}-Orientation (55)
Instance. An unrooted phylogenetic network 𝒰{\mathcal{U}} such that each non-leaf vertex has degree 5.
Question. Does there exist a funneled 𝒞R\mathcal{C}_{R}-orientation of 𝒰{\mathcal{U}}?

In particular, we consider the four classes of tree-child, tree-sibling, normal, and reticulation-visible networks and both rooting variants AA and BB.

Refer to caption
Figure 8. Left: Root gadget with a single connector leaf rr and whose internal vertices all have degree 5. Right: A funneled tree-child orientation for which the root is placed on the edge eρe_{\rho}.
Refer to caption
Figure 9. A funneled tree-child orientation of the degree-5 root gadget that is shown on the left-hand side of Figure 8 for which 5\ell_{5} is chosen as root ρ\rho. Note that the unique reticulation of this orientation is ss.

We start by establishing properties of the degree-55 root gadget that is shown on the left-hand side of Figure 8.

Lemma 4.1.

Let 𝒢{\mathcal{G}} be the degree-5 root gadget. Let 𝒞{\mathcal{C}} be the class of tree-child networks, and let 𝒞{\mathcal{C}}^{\prime} the class of funneled phylogenetic networks. Then

  1. (i)

    𝒢{\mathcal{G}} has a funneled 𝒞A\mathcal{C}_{A}-orientation such that ρ\rho subdivides eρ={u,u}e_{\rho}=\{u,u^{\prime}\},

  2. (ii)

    𝒢{\mathcal{G}} is 𝒞A{\mathcal{C}}^{\prime}_{A}-root-forcing,

  3. (iii)

    𝒢{\mathcal{G}} has a funneled 𝒞B\mathcal{C}_{B}-orientation such that ρ=5\rho=\ell_{5}, and

  4. (iv)

    𝒢{\mathcal{G}} is 𝒞B{\mathcal{C}}^{\prime}_{B}-root-forcing.

Proof.

The orientation that is shown in Figures 8 (right-hand side) and 9 is a funneled 𝒞A\mathcal{C}_{A}-orientation and a funneled 𝒞B\mathcal{C}_{B}-orientation, respectively, of the partner network of 𝒢{\mathcal{G}} which only differs from 𝒢{\mathcal{G}} by viewing rr as a non-connector leaf. This establishes (i) and (iii). Now, assume towards a contradiction that 𝒢{\mathcal{G}} does not satisfy (ii). Then there exists a 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation 𝒪{\mathcal{O}} of 𝒢{\mathcal{G}} such that ρ\rho subdivides {s,r}\{s,r\}. Then (ρ,s)(\rho,s) is an arc in 𝒪{\mathcal{O}}. Furthermore, each vertex in {u,u,v,v}\{u,u^{\prime},v,v^{\prime}\} has in-degree 1 and out-degree 4 since each such vertex has at least two adjacent leaves and 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation. We next distinguish two cases.

First, suppose that ss has in-degree 1 and out-degree 4 in 𝒪{\mathcal{O}}. Then (s,u)(s,u) and (s,v)(s,v) are arcs in 𝒪{\mathcal{O}}. Since either (u,v)(u,v) or (v,u)(v,u) is an arc in 𝒪{\mathcal{O}}, uu or vv has in-degree at least 22 and out-degree at least 22; thereby contradicting that 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒢{\mathcal{G}}. Second, suppose that ss has in-degree 4 and out-degree 1. By symmetry, we may assume that (u,u)(u,u^{\prime}) is an arc in 𝒪{\mathcal{O}}. Since uu^{\prime} and vv^{\prime} have in-degree 1 and out-degree 4, it follows that (u,v)(u^{\prime},v^{\prime}), (u,s)(u^{\prime},s) and (v,s)(v^{\prime},s) are arcs in 𝒪{\mathcal{O}}. Now, as vv has in-degree 1 and out-degree 4, we have that either (s,v)(s,v) and (v,u)(v,u), or (u,v)(u,v) and (v,s)(v,s) are arcs in 𝒪{\mathcal{O}}. In the former case, we get the directed cycle (s,v),(v,u),(u,u),(u,v),(v,s)(s,v),(v,u),(u,u^{\prime}),(u^{\prime},v^{\prime}),(v^{\prime},s). Hence, we may assume that (u,v)(u,v) and (v,s)(v,s) are arcs in 𝒪{\mathcal{O}}. As ss has in-degree 4 and out-degree 1, (s,u)(s,u) is an arc in 𝒪{\mathcal{O}}. This implies that 𝒪{\mathcal{O}} contains the directed cycle (u,v),(v,s),(s,u)(u,v),(v,s),(s,u); thereby again contradicting that 𝒪{\mathcal{O}} is a 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒢{\mathcal{G}}. Combining both cases establishes (ii).

Lastly, assume towards a contradiction that 𝒢{\mathcal{G}} does not satisfy (iv). Then there exists a 𝒞B{\mathcal{C}}^{\prime}_{B}-orientation 𝒪{\mathcal{O}} of 𝒢{\mathcal{G}} such that rr is chosen to be the root of 𝒪{\mathcal{O}}. Then (r,s)(r,s) is an arc in 𝒪{\mathcal{O}}. The proof of (iv) can now be established in the same way as the proof of (ii). ∎

To establish the next theorem, the construction from Positive Not-All-Equal (2,3)-SAT as established in the last section cannot be used. Instead, we reduce from Positive 1-in-3 SAT, replace the binary root gadget as shown in Figure 3 with a degree-5 root gadget, omit the connection gadget as shown in Figure 4 altogether, and use clause vertices that each have two adjacent leaves and three adjacent variable vertices. Roughly speaking, the construction results in a phylogenetic network such that any funneled orientation of the constructed unrooted phylogenetic network enforces all clause vertices to be tree vertices by Observation 3.4.

Theorem 4.2.

Let 𝒞{\mathcal{C}} be the class of rooted phylogenetic networks. Then Funneled 𝒞A\mathcal{C}_{A}-Orientation (5)(5) is NP-complete.

Proof.

Let 𝒰{\mathcal{U}} be an unrooted pseudo network, and let 𝒪{\mathcal{O}} be an orientation of 𝒰{\mathcal{U}} following Variant AA. Since it can be checked in polynomial time if 𝒪{\mathcal{O}} is a 𝒞A\mathcal{C}_{A}-orientation of 𝒰{\mathcal{U}}, it follows that Funneled 𝒞A\mathcal{C}_{A}-Orientation(5)(5) is in NP. We next establish NP-hardness using a polynomial-time reduction from the variant of Positive 1-in-3 SAT, where each variable appears exactly three times (see, e.g., Kratochvíl [10, Cor. 1] for a proof that the problem remains NP-complete under this restriction).

Let =(V,C)\mathcal{I}=(V,C) be an instance of Positive 1-in-3 SAT such that each variable appears in exactly three clauses. Let 𝒢n+1c{\mathcal{G}}_{n+1}^{c} be the caterpillar gadget with connector leaves r,x1,x2,,xnr,x_{1},x_{2},\ldots,x_{n}, and let 𝒢1r{\mathcal{G}}_{1}^{r} be the degree-5 root gadget with connector leaf rr. We next construct an unrooted phylogenetic network 𝒰{\mathcal{U}} in the following way.

  1. (1)

    Identify the connector leaf rr of 𝒢1r{\mathcal{G}}_{1}^{r} with the connector leaf rr of 𝒢n+1c{\mathcal{G}}_{n+1}^{c}, and suppress the resulting degree-2 vertex.

  2. (2)

    Attach a new leaf i\ell_{i} to each connector leaf xix_{i} with i{1,2,,n}i\in\{1,2,\ldots,n\} via a new edge.

  3. (3)

    Let c1,c2,,cmc_{1},c_{2},\ldots,c_{m} be new vertices, let i{1,2,,n}i\in\{1,2,\ldots,n\}, and let j{1,2,,m}j\in\{1,2,\ldots,m\}. For each xix_{i} and cjc_{j} such that xix_{i} is a variable of cjc_{j}, add the edge {xi,cj}\{x_{i},c_{j}\}. Furthermore, attach two new leaves to each cjc_{j}.

We may assume for the remainder of the proof that all leaves of 𝒰{\mathcal{U}} have a distinct label and that the leaf set of 𝒰{\mathcal{U}} is XX. By construction, each non-leaf vertex in 𝒰{\mathcal{U}} has degree 5. Figure 10 shows and example of this construction for the following Boolean formula

(x1x2x3)(x2x3x4)(x3x4x5)\displaystyle(x_{1}\vee x_{2}\vee x_{3})\wedge(x_{2}\vee x_{3}\vee x_{4})\wedge(x_{3}\vee x_{4}\vee x_{5})\wedge
(2) (x4x5x6)(x5x6x1)(x6x1x2).\displaystyle(x_{4}\vee x_{5}\vee x_{6})\wedge(x_{5}\vee x_{6}\vee x_{1})\wedge(x_{6}\vee x_{1}\vee x_{2}).
Refer to caption
Figure 10. The unrooted phylogenetic network that is constructed from the Boolean formula (4) of Positive 1-in-3 SAT. Details are given in the proof of Theorem 4.2.

To complete the proof, we follow a similar approach as in the proof of Theorem 3.6.

4.2.1.

{\mathcal{I}} is a yes-instance if and only if 𝒰{\mathcal{U}} has a funneled 𝒞A\mathcal{C}_{A}-orientation.

Proof.

First, let β:V{T,F}\beta\colon V\rightarrow\{T,F\} be a truth assignment that sets exactly one variable of each clause in CC to TT. We next construct a funneled 𝒞A{\mathcal{C}}_{A}-orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}}. As in the proof of Theorem 3.6, instead of referring to edges of 𝒰{\mathcal{U}}, we sometimes refer to edges of 𝒢1r{\mathcal{G}}_{1}^{r} and 𝒢n+1c{\mathcal{G}}_{n+1}^{c} when assigning a direction. We start by subdividing the edge {u,u}\{u,u^{\prime}\} of 𝒢1r{\mathcal{G}}_{1}^{r} with a new root vertex ρ\rho and direct the edges of the resulting root gadget as shown on the right-hand side of Figure 8. Noting that the edge {r,s}\{r,s\} in 𝒢1r{\mathcal{G}}_{1}^{r} corresponds to the edge {pn,s}\{p_{n},s\} in 𝒰{\mathcal{U}}, this assignment of directions to edges implies that the arc (s,pn)(s,p_{n}) is in 𝒪{\mathcal{O}}. We next direct the edges of 𝒢n+1c{\mathcal{G}}_{n+1}^{c} (except for the edge {pn,r}\{p_{n},r\} which also corresponds to the edge {pn,s}\{p_{n},s\} in 𝒰{\mathcal{U}} and has already been directed) as shown on the right-hand side of Figure 5. Now, for each i{1,2,,n}i\in\{1,2,\ldots,n\}, let cjc_{j}, cjc_{j^{\prime}}, and cj′′c_{j^{\prime\prime}} be the clause vertices that are neighbours of xix_{i} in 𝒰{\mathcal{U}}. Depending on β(xi)\beta(x_{i}) we direct the three edges {cj,xi}\{c_{j},x_{i}\}, {cj,xi}\{c_{j^{\prime}},x_{i}\}, and {cj′′,xi}\{c_{j^{\prime\prime}},x_{i}\} as follows.

  1. (i)

    (xi,cj)(x_{i},c_{j}), (xi,cj)(x_{i},c_{j^{\prime}}), and (xi,cj′′)(x_{i},c_{j^{\prime\prime}}) if β(xi)=T\beta(x_{i})=T, and

  2. (ii)

    (cj,xi)(c_{j},x_{i}), (cj,xi)(c_{j^{\prime}},x_{i}), and (cj′′,xi)(c_{j^{\prime\prime}},x_{i}) if β(xi)=F\beta(x_{i})=F.

At this point, the only edges of 𝒰{\mathcal{U}} that have not yet been assigned a direction are incident with a leaf and are directed towards that leaf in 𝒪{\mathcal{O}}. By construction of 𝒪{\mathcal{O}}, the only vertex with in-degree 0 is ρ\rho and the set of vertices with out-degree 0 is XX. Since β\beta sets exactly one variable of each clause to TT, each clause vertex cjc_{j} is a tree vertex in 𝒪{\mathcal{O}} with in-degree 1 and out-degree 4. Furthermore, each variable vertex xix_{i} is either a tree vertex with in-degree 1 and out-degree 4, or a reticulation with in-degree 4 and out-degree 1 in 𝒪{\mathcal{O}}. It now follows from Lemmas 3.5 and 4.1(i) that 𝒪{\mathcal{O}} is funneled. To see that 𝒪{\mathcal{O}} is also acyclic, we can show that no vertex in {x1,x2,,xn,c1,c2,,cm}\{x_{1},x_{2},\ldots,x_{n},c_{1},c_{2},\ldots,c_{m}\} lies on a directed cycle of 𝒪{\mathcal{O}} by using the same ‘acyclicity argument’ as that in the proof of 3.6.1. Hence, 𝒪{\mathcal{O}} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}.

Second, let 𝒪{\mathcal{O}} be a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}. Using the degree-5 root gadget instead of the root gadget that is shown in Figure 3, and Lemma 4.1(ii) instead of Lemma 3.1(ii) in the proof of Corollary 3.2, it is straightforward to show that ρ\rho subdivides an edge of 𝒰{\mathcal{U}} that is an edge of 𝒢1r{\mathcal{G}}_{1}^{r} and not {r,s}\{r,s\}. Hence, (s,pn)(s,p_{n}) is an arc in 𝒪{\mathcal{O}}. It then follows by Lemma 3.5(ii) that, for each i{1,2,,n}i\in\{1,2,\ldots,n\}, (pi,xi)(p_{i},x_{i}) is an arc in 𝒪{\mathcal{O}}. We now define a truth assignment β:V{T,F}\beta\colon V\rightarrow\{T,F\} as follows. For each i{1,2,,n}i\in\{1,2,\ldots,n\}, we set β(xi)=T\beta(x_{i})=T if xix_{i} is a tree vertex and β(xi)=F\beta(x_{i})=F if xix_{i} is a reticulation in 𝒪{\mathcal{O}}. Assume towards a contradiction that there exists a clause cjCc_{j}\in C with cj=(xxx′′)c_{j}=(x\vee x^{\prime}\vee x^{\prime\prime}) and j{1,2,,m}j\in\{1,2,\ldots,m\} such that β\beta does not set exactly one variable of cjc_{j} to TT. We distinguish three cases:

  1. (i)

    If β(x)=β(x)=β(x′′)=T\beta(x)=\beta(x^{\prime})=\beta(x^{\prime\prime})=T, then x,xx,x^{\prime} and x′′x^{\prime\prime} are tree vertices and, thus, (x,cj)(x,c_{j}), (x,cj)(x^{\prime},c_{j}) and (x′′,cj)(x^{\prime\prime},c_{j}) are arcs in 𝒪{\mathcal{O}}. Since the other two vertices adjacent to cjc_{j} are leaves, it follows that cjc_{j} has in-degree 3 and out-degree 2; a contradiction to 𝒪{\mathcal{O}} being a funneled orientation.

  2. (ii)

    If β(x)=β(x)=T\beta(x)=\beta(x^{\prime})=T and β(x′′)=F\beta(x^{\prime\prime})=F, then x,xx,x^{\prime} are tree vertices and x′′x^{\prime\prime} is a reticulation and, thus, (x,cj)(x,c_{j}), (x,cj)(x^{\prime},c_{j}) and (cj,x′′)(c_{j},x^{\prime\prime}) are arcs in 𝒪{\mathcal{O}}. But then, due to the two adjacent leaves of cjc_{j}, it follows that cjc_{j} has in-degree 2 and out-degree 3; again a contradiction.

  3. (iii)

    If β(x)=β(x)=β(x′′)=F\beta(x)=\beta(x^{\prime})=\beta(x^{\prime\prime})=F, then x,xx,x^{\prime} and x′′x^{\prime\prime} are reticulations and, thus, (cj,x)(c_{j},x), (cj,x)(c_{j},x^{\prime}) and (cj,x′′)(c_{j},x^{\prime\prime}) are arcs in 𝒪{\mathcal{O}}. But then, due to the two adjacent leaves of cjc_{j}, it follows that cjc_{j} has in-degree 0 and out-degree 5; a final contradiction.

Hence, β\beta sets exactly one variable of each clause in CC to TT. ∎

We conclude the proof of the theorem by noting that 𝒰{\mathcal{U}} can be constructed in polynomial time and has a size that is polynomial in nn and mm. ∎

We now turn to Funneled 𝒞B\mathcal{C}_{B}-Orientation (5)(5), where 𝒞{\mathcal{C}} is again the class of rooted phylogenetic networks and the root is placed following Variant BB. More precisely, given an unrooted phylogenetic network 𝒰{\mathcal{U}}, this variant chooses a vertex of 𝒰{\mathcal{U}} to be the root and assigns a direction to each edge. Consider the degree-5 root gadget 𝒢{\mathcal{G}} as shown on the left-hand side of Figure 8. Suppose that 𝒰{\mathcal{U}} contains 𝒢{\mathcal{G}} as a pending subgraph. First, by Lemma 4.1(iv), any funneled 𝒞B{\mathcal{C}}_{B}-orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} chooses a vertex of 𝒢{\mathcal{G}} that is not rr as the root. Second, Lemma 4.1(iii) shows that 𝒢{\mathcal{G}} has a funneled 𝒞B{\mathcal{C}}_{B}-orientation. The next corollary, which is a slight strengthening of a result by Bulteau et al. [1, Cor. 4], shows NP-hardness for graphs with maximum degree 5, now follows from an argument that is analogous to that in the proof of Theorem 4.2.

Corollary 4.3.

Let 𝒞{\mathcal{C}} be the class of rooted phylogenetic networks. Then Funneled 𝒞B\mathcal{C}_{B}-Orientation (5)(5) is NP-complete.

We remark that the root gadget as shown in Figure 3 is not 𝒞R{\mathcal{C}}_{R}-root-forcing for each R{A,B}R\in\{A,B\} and can therefore not be used to establish Theorem 4.2 or Corollary 4.3 for networks whose internal vertices have degree 3 or 5.

The following lemma is the key ingredient in showing that Funneled 𝒞R\mathcal{C}_{R}-Orientation (5)(5) is NP-complete for the class of tree-child networks as well as for any class of rooted phylogenetic networks that contains the class of tree-child networks.

Lemma 4.4.

Let 𝒞{\mathcal{C}} be the class of rooted phylogenetic networks, let 𝒞{\mathcal{C}}^{\prime} be the class of tree-child networks. Furthermore, let 𝒰{\mathcal{U}} be an unrooted phylogenetic network that is constructed from an instance of Positive 1-in-3 SAT as described in the proof of Theorem 4.2. Then, for each R{A,B}R\in\{A,B\}, 𝒰{\mathcal{U}} has a funneled 𝒞R{\mathcal{C}}_{R}-orientation if and only if 𝒰{\mathcal{U}} has a funneled 𝒞R{\mathcal{C}}^{\prime}_{R}-orientation.

Proof.

We establish the proof for R=AR=A. The proof for R=BR=B is similar and omitted. Clearly, any funneled 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒰{\mathcal{U}} is also a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}. So suppose that 𝒪{\mathcal{O}} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}. The only vertex of 𝒰{\mathcal{U}} that is not adjacent to a leaf is the vertex ss of the degree-5 root gadget. Since (s,pn)(s,p_{n}) is an arc of 𝒪{\mathcal{O}} as argued in the last paragraph of the proof of 4.2.1, it follows that pnp_{n} is a child of ss. Noting that pnp_{n} has two adjacent leaves, it now follows that pnp_{n} has in-degree 1 and out-degree 4 because 𝒪{\mathcal{O}} is funneled. Thus ss has a child that is a tree vertex and 𝒪{\mathcal{O}} is a funneled 𝒞R{\mathcal{C}}^{\prime}_{R}-orientation of 𝒰{\mathcal{U}}. ∎

Since the last lemma does not only hold for 𝒞{\mathcal{C}}^{\prime} being the class of tree-child networks but also for any class of rooted phylogenetic networks that contains the class of tree child networks, the next corollary is an immediate consequence of Theorem 4.2, Corollary 4.3, and Lemma 4.4.

Corollary 4.5.

Let 𝒞{\mathcal{C}} be the class of tree-child, tree-sibling, or reticulation-visible networks. Then Funneled 𝒞R\mathcal{C}_{R}-Orientation (5)(5) is NP-complete for each R{A,B}R\in\{A,B\}

Lastly, we establish NP-completeness of Funneled 𝒞R\mathcal{C}_{R}-Orientation (5)(5) for the class of normal networks. Let ee be an edge of an unrooted phylogenetic network 𝒰{\mathcal{U}} on XX. We refer to the operation of subdividing ee with a new vertex vv and adding the three edges {v,}\{v,\ell\}, {v,}\{v,\ell^{\prime}\}, and {v,′′}\{v,\ell^{\prime\prime}\} such that \ell, \ell^{\prime}, and ′′\ell^{\prime\prime} are leaves that are not contained in XX as attaching three leaves to ee. Now, let 𝒰{\mathcal{U}} be an unrooted phylogenetic network that is constructed from an instance {\mathcal{I}} of Positive 1-in-3 SAT as described in the proof of Theorem 4.2. Obtain an unrooted phylogenetic network 𝒰{\mathcal{U}}^{\prime} from 𝒰{\mathcal{U}} by attaching three leaves to each edge ee that is incident with a vertex in {x1,x2,,xn}\{x_{1},x_{2},\ldots,x_{n}\} and not incident with a leaf as well as to the edges {s,u}\{s,u\}, {s,u}\{s,u^{\prime}\}, and {s,pn}\{s,p_{n}\}, where ss, uu, and uu^{\prime} are the vertices as shown on the left-hand side of Figure 8. We refer to (𝒰,𝒰)({\mathcal{U}},{\mathcal{U}}^{\prime}) as the network pair associated with {\mathcal{I}}.

Lemma 4.6.

Let 𝒞{\mathcal{C}} be the class of rooted phylogenetic networks, and let 𝒞{\mathcal{C}}^{\prime} be the class of normal networks. Furthermore, let (𝒰,𝒰)({\mathcal{U}},{\mathcal{U}}^{\prime}) be the network pair associated with an instance of Positive 1-in-3 SAT. Then, for each R{A,B}R\in\{A,B\}, 𝒰{\mathcal{U}} has a funneled 𝒞R{\mathcal{C}}_{R}-orientation if and only if 𝒰{\mathcal{U}}^{\prime} has a funneled 𝒞R{\mathcal{C}}^{\prime}_{R}-orientation.

Proof.

Let EE be the subset of edges of 𝒰{\mathcal{U}} that precisely contains each edge that is subdivided in obtaining 𝒰{\mathcal{U}}^{\prime} from 𝒰{\mathcal{U}} by attaching three leaves. Clearly, each edge that is in 𝒰{\mathcal{U}} and not in EE is also an edge of 𝒰{\mathcal{U}}^{\prime}. Moreover, for each edge e={u,w}e=\{u,w\} in EE, there exist edges {u,v},{v,w},{v,},{v,}\{u,v\},\{v,w\},\{v,\ell\},\{v,\ell^{\prime}\}, and {v,′′}\{v,\ell^{\prime\prime}\} in 𝒰{\mathcal{U}}^{\prime}, where \ell \ell^{\prime}, and \ell^{\prime} are leaves. We refer to those five edges as five edges associated with ee in 𝒰{\mathcal{U}}^{\prime}. Let 𝒪{\mathcal{O}} be a 𝒞R{\mathcal{C}}_{R}-orientation of 𝒰{\mathcal{U}}. If R=AR=A, then it follows from the last paragraph of the proof of 4.2.1 that (s,pn)(s,p_{n}) is an arc of 𝒪{\mathcal{O}}. Similarly, if R=BR=B, then follows from the paragraph prior to Corollary 4.3 that (s,pn)(s,p_{n}) is again an arc in 𝒪{\mathcal{O}}. We freely use the existence of the arc (s,pn)(s,p_{n}) throughout the remainder of the proof.

First, suppose that 𝒰{\mathcal{U}} has a funneled 𝒞A{\mathcal{C}}_{A}-orientation 𝒪{\mathcal{O}}. Let eρe_{\rho} be the edge of 𝒰{\mathcal{U}} that is subdivided in obtaining 𝒪{\mathcal{O}} from 𝒰{\mathcal{U}}. Apply the following three steps to obtain an orientation 𝒪{\mathcal{O}}^{\prime} of 𝒰{\mathcal{U}}^{\prime}.

  1. (1)

    If eρEe_{\rho}\notin E, then subdivide eρe_{\rho} of 𝒰{\mathcal{U}}^{\prime} with a vertex ρ\rho and direct the two edges incident with ρ\rho so that ρ\rho has in-degree 0. On the other hand, if eρ={ue,we}e_{\rho}=\{u_{e},w_{e}\} is an edge in EE, recall that {ue,ve},{ve,we}\{u_{e},v_{e}\},\{v_{e},w_{e}\} is a path of length two in 𝒰{\mathcal{U}}^{\prime} such that vev_{e} is adjacent to three leaves e\ell_{e} e\ell^{\prime}_{e}, and e\ell^{\prime}_{e}. Then subdivide {ue,ve}\{u_{e},v_{e}\} with a vertex ρ\rho and let (ρ,ue),(ρ,ve),(ve,we),(ve,e),(ve,e),(\rho,u_{e}),(\rho,v_{e}),(v_{e},w_{e}),(v_{e},\ell_{e}),(v_{e},\ell_{e}^{\prime}), and (ve,e′′)(v_{e},\ell_{e}^{\prime\prime}) be arcs in 𝒪{\mathcal{O}}^{\prime}.

  2. (2)

    Direct each edge of 𝒰{\mathcal{U}}^{\prime} that is also an edge in 𝒰{\mathcal{U}} such that it has the same direction as in 𝒪{\mathcal{O}}.

  3. (3)

    For each edge e={u,w}e=\{u,w\} in E{eρ}E\setminus\{e_{\rho}\}, direct the five edges associated with ee in 𝒰{\mathcal{U}}^{\prime} in the following way. If (u,w)(u,w) is an arc in 𝒪{\mathcal{O}}, then (u,v),(v,w),(v,),(v,)(u,v),(v,w),(v,\ell),(v,\ell^{\prime}), and (v,′′)(v,\ell^{\prime\prime}) are arcs in 𝒪{\mathcal{O}}^{\prime} and, otherwise, (w,v),(v,u),(v,),(v,)(w,v),(v,u),(v,\ell),(v,\ell^{\prime}), and (v,′′)(v,\ell^{\prime\prime}) are arcs in 𝒪{\mathcal{O}}^{\prime}.

Since 𝒪{\mathcal{O}} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}, it follows that 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}. Moreover, as (s,pn)(s,p_{n}) is an arc of 𝒪{\mathcal{O}}, there is a directed path from ss to pnp_{n} of length two in 𝒪{\mathcal{O}}^{\prime} whose middle vertex is a tree vertex. Since 𝒪{\mathcal{O}}^{\prime} is funneled and, each vertex except for ss is adjacent to a leaf, it follows that 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation for 𝒰{\mathcal{U}}^{\prime} that is also tree-child. To see that 𝒪{\mathcal{O}}^{\prime} is in fact a funneled 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}, it remains to argue that 𝒪{\mathcal{O}}^{\prime} has no shortcut. Let vv be a reticulation of 𝒪{\mathcal{O}}^{\prime}, and let pp and pp^{\prime} be two distinct parents of vv. Since v{s,x1,x2,,xn}v\in\{s,x_{1},x_{2},\ldots,x_{n}\} it is straightforward to check that pp and pp^{\prime} are two vertices of 𝒪{\mathcal{O}}^{\prime} each with three adjacent leaves. Assume that there is a directed path from pp to pp^{\prime} in 𝒪{\mathcal{O}}^{\prime}. Then there exists an arc (p,u)(p,u) such that uu is not a leaf and uvu\neq v. This implies that pp has in-degree 0 and out-degree 4; a contradiction. It now follows that (p,v)(p,v) is not a shortcut and, by a symmetric argument, (p,v)(p^{\prime},v) is not a shortcut either. Thus, 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}.

Second, suppose that 𝒰{\mathcal{U}} has a funneled 𝒞B{\mathcal{C}}_{B}-orientation 𝒪{\mathcal{O}}. The root ρ\rho of 𝒪{\mathcal{O}} is a vertex of 𝒰{\mathcal{U}} and 𝒰{\mathcal{U}}^{\prime}. Obtain an orientation 𝒪{\mathcal{O}}^{\prime} of 𝒰{\mathcal{U}}^{\prime} by following Steps (2) and (3) so that Step (3) is applied to each edge in EE. Using the same argument as in the previous paragraph, it follows that 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}.

Third, suppose that 𝒰{\mathcal{U}}^{\prime} has a funneled 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation 𝒪{\mathcal{O}}^{\prime}. Let eρe_{\rho} be the edge of 𝒰{\mathcal{U}}^{\prime} that is subdivided in obtaining 𝒪{\mathcal{O}}^{\prime} from 𝒰{\mathcal{U}}^{\prime}. Obtain an orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} by reversing Steps (1)–(3) from above as follows.

  1. (1’)

    If eρe_{\rho} is an edge of 𝒰{\mathcal{U}}, subdivide eρe_{\rho} in 𝒰{\mathcal{U}} with a vertex ρ\rho and direct the two edges incident with ρ\rho such that ρ\rho has in-degree 0. On the other hand, if eρe_{\rho} subdivides one of the five edges associated with an edge eEe\in E, then subdivide ee in 𝒰{\mathcal{U}} with a vertex ρ\rho and direct the two edges incident with ρ\rho such that ρ\rho has in-degree 0.

  2. (2’)

    Direct each edge of 𝒰{\mathcal{U}} that is also an edge in 𝒰{\mathcal{U}}^{\prime} such that it has the same direction as in 𝒪{\mathcal{O}}^{\prime}.

  3. (3’)

    For each e={u,w}e=\{u,w\} in E{eρ}E\setminus\{e_{\rho}\}, consider the five edges associated with ee in 𝒰{\mathcal{U}}^{\prime}. If (u,v),(v,w),(v,),(v,)(u,v),(v,w),(v,\ell),(v,\ell^{\prime}), and (v,′′)(v,\ell^{\prime\prime}) are arcs in 𝒪{\mathcal{O}}^{\prime}, then (u,w)(u,w) is an arc in 𝒪{\mathcal{O}} and, if (w,v),(v,u),(v,),(v,)(w,v),(v,u),(v,\ell),(v,\ell^{\prime}), and (v,′′)(v,\ell^{\prime\prime}) are arcs in 𝒪{\mathcal{O}}^{\prime}, then (w,u)(w,u) is an arc in 𝒪{\mathcal{O}}.

Since 𝒪{\mathcal{O}}^{\prime} is funneled, one of the two cases described in Step (3’) applies. Furthermore, since 𝒪{\mathcal{O}}^{\prime} is a funneled 𝒞A{\mathcal{C}}^{\prime}_{A}-orientation of 𝒰{\mathcal{U}}^{\prime}, it follows by construction that 𝒪{\mathcal{O}} is a funneled 𝒞A{\mathcal{C}}_{A}-orientation of 𝒰{\mathcal{U}}.

Lastly, suppose that 𝒰{\mathcal{U}}^{\prime} has a funneled 𝒞B{\mathcal{C}}^{\prime}_{B}-orientation 𝒪{\mathcal{O}}^{\prime}. Let ρ\rho be the root of 𝒪{\mathcal{O}}^{\prime}. We first show that ρ\rho is a vertex of 𝒰{\mathcal{U}}. Assume towards a contradiction that ρ\rho is not a vertex of 𝒰{\mathcal{U}}. Let {s,s}\{s,s^{\prime}\} be the unique edge of 𝒰{\mathcal{U}}^{\prime}, where ss is as shown on the left hand-side of Figure 8 and ss^{\prime} is the unique internal vertex introduced by attaching three leaves to {r,s}\{r,s\}. By arguments similar to the ones used to establish Lemma 4.1(ii), it follows that (s,s)(s,s^{\prime}) is an arc in 𝒪{\mathcal{O}}^{\prime}. Hence ρ\rho is a vertex that is introduced in attaching three leaves to {u,s}\{u,s\} or {u,s}\{u^{\prime},s\}. By symmetry, we may assume that ρ\rho is one of the four vertices t,,t,\ell,\ell^{\prime} and ′′\ell^{\prime\prime}, where tt is the unique internal vertex introduced by attaching three leaves to {u,s}\{u,s\}. Regardless of which vertex is chosen as ρ\rho, (t,u)(t,u) and (t,s)(t,s) are arcs in 𝒪{\mathcal{O}}^{\prime} because 𝒪{\mathcal{O}}^{\prime} is funneled. Furthermore, as uu and vv have in-degree 1 and out-degree 4, it follows that (u,v)(u,v) and, in turn, (v,s)(v,s) are arcs in 𝒪{\mathcal{O}}^{\prime}. Thus, (t,s)(t,s) is a shortcut and therefore 𝒪{\mathcal{O}}^{\prime} is not a 𝒞B{\mathcal{C}}^{\prime}_{B}-orientation of 𝒰{\mathcal{U}}^{\prime}; a contradiction. Hence, ρ\rho is a vertex of 𝒰{\mathcal{U}}, and we obtain an orientation 𝒪{\mathcal{O}} of 𝒰{\mathcal{U}} by following Steps (2’) and (3’) such that Step (3’) is applied to each edge in EE. Then it follows again by construction that 𝒪{\mathcal{O}} is a funneled 𝒞B{\mathcal{C}}_{B}-orientation of 𝒰{\mathcal{U}}. ∎

The next corollary follows from Theorem 4.2, Corollary 4.3, and Lemma 4.6.

Corollary 4.7.

Let 𝒞{\mathcal{C}} be the class of normal networks. Then Funneled 𝒞R\mathcal{C}_{R}-Orientation (5)(5) is NP-complete for each R{A,B}R\in\{A,B\}.

References

  • [1] Bulteau, L., Weller, M., Zhang, L. (2023). On turning a graph into a phylogenetic network, hal-04085424.
  • [2] Cardona, G., Rosselló, F., Valiente, G. (2009). Comparison of tree-child phylogenetic networks, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 6, pp. 552–569.
  • [3] Cardona, G., Llabrés, M., Rosselló, F., Valiente, G. (2008). A distance metric for a class of tree-sibling phylogenetic networks, Bioinformatics, 24:1481–1488.
  • [4] Dehghan, A., Sadeghi, M., Ahadi, A. (2015). On the complexity of deciding whether the regular number is at most two. Graphs and Combinatorics, 31:1359–1365.
  • [5] Garvardt, J., Renken, M., Schestag, J., Weller, M. (2023). Finding degree-constrained acyclic orientations. In: 18th International Symposium on Parameterized and Exact Computation, pp. 19:1–19:14
  • [6] Huber, K. T., van Iersel, L., Janssen, R., Jones, M., Moulton, V., Murakami, Y., Semple, C. (2024). Orienting undirected phylogenetic networks. Journal of Computer and System Sciences 140:103480.
  • [7] van Iersel, L., Semple, C, Steel, M. (2010) Locating a tree in a phylogenetic network. Information Processing Letters 110:1037–1043.
  • [8] Janssen, R., Jones, M. and Erdős, P. L., van Iersel, L., Scornavacca, C. (2018). Exploring the tiers of rooted phylogenetic network space using tail moves. Bulletin of Mathematical Biology, 80:2177–2208.
  • [9] Kong, S., Pons, J. C., Kubatko, L., Wicke, K. (2022). Classes of explicit phylogenetic networks and their biological and mathematical significance. Journal of Mathematical Biology, 84:47.
  • [10] Kratochvíl, J. (2003). Complexity of hypergraph coloring and Seidel’s switching. In: WG 2003, Lecture Notes in Computer Science 2880, pp. 297–308.
  • [11] Maeda, S., Kaneko, Y., Muramatsu, H., Murakami, Y., Hayamizu, M. (2023). Orienting undirected phylogenetic networks to tree-child networks, arXiv:2305.10162.
  • [12] Willson, S. J. (2010). Properties of normal phylogenetic networks. Bulletin of Mathematical Biology, 72:340–358.