Self-Assembling DNA Complexes with a Wheel Graph Structure
Abstract
The Watson-Crick complementary properties of DNA make DNA a useful tool for the self-assembly of various target complexes. Concepts from graph theory can be used to model the self-assembling process in which the vertices of the graph represent -armed branched junction molecules, called tiles. We seek to determine the minimum number of tile and cohesive-end types necessary to create the desired self-assembled complex. Although results are known for a few infinite classes of graphs, many classes of graphs remain unsolved. We present results for the wheel graph within the restrictions of three different settings.
keywords:
graph theory , DNA self-assembly , flexible tile model , wheel graph1 Introduction
1.1 Self Assembling DNA Complexes
Advancements in nanotechnology have allowed biologists to construct various complexes from genetic material. This is accomplished by utilizing the complementary properties of DNA molecules so that the molecules bond in a specific way. These complexes, while proven to be incredibly useful, can also be costly to synthesize in a lab. One of our objectives is to make the creation of these structures more efficient.
There are several applications of DNA self-assembly including targeted drug delivery as a treatment for various types of cancer, gene therapy, the formation of fine screen filters, biomolecular computing, and biosensors (see [1, 3, 4, 5, 6]). Our focus has been on complexes formed from flexible -armed branched junction DNA molecules. While some general results are known, there are many open problems in finding the most optimal construction. See [2] for overview of problem and summary of main results. The notion of optimizing the construction of these complexes comes from the fact that creating a complex will require a certain number of bond-edge types, as well as a certain number of tile types. We use concepts from graph theory to model this design process. We then optimize the construction of a target graph by providing an algorithm that will construct the graph with as few tile and bond-edge types as possible under the constraints of one of the three following scenarios:
-
1.
Scenario 1: A pot of tile types may realize other graphs of smaller order than the target graph (that is, a graph on fewer vertices) or nonisomorphic graphs of the same size as the target graph.
-
2.
Scenario 2: A pot of tile types may realize graphs which are of the same order as the target graph, but are nonisomorphic. Graphs of smaller order than the target graph cannot be realized by the pot.
-
3.
Scenario 3: A pot of tile types may not realize graphs of smaller order nor any nonisomorphic graphs of the same order as the target graph.
We introduce the following notation for the minimum number of tile types and bond-edge types needed to construct a complex.
Definition.
Let be a target graph. We denote the minimum number of bond-edge types needed to construct in Scenario by and the minimum number of tile types needed to construct in Scenario by
1.2 Concepts from Graph Theory
A DNA complex can be modeled using a graph, a mathematical object consisting of a nonempty set of nodes, called vertices, and a set of edges which connect pairs of distinct vertices.
Take as example the DNA complex in Figure 1 which takes the shape of a cube. This structure can be modeled by the graph in Figure 2.


The structure of a DNA complex becomes more easily studied as a graph rather than a complex made from genetic material. The -armed DNA molecules that comprise the complex are modeled by objects called tiles, which consist of a single vertex incident to half-edges. Each half-edge represents one arm of the molecule, with some paired sequence of DNA bases at the end. Since the end of each branch is unpaired, then in a lab environment each branch of the molecule seeks its complementary bond type to complete the sequence. We refer to a collection of tiles used to construct a graph as a pot of tiles that realizes the target graph. See an example of a branched molecule along with its corresponding tile in Figure 3.


Half edges are determined by their respective bond-edge types, which we denote with some letter, with hatted and unhatted labels to denote pairs of complementary bond-edge types. For example, a half edge labeled must bond with a half edge labeled . In certain situations, it is useful to think of the half edges of each tile bonding together to form directed edges. For the purpose of this paper, the edge will be directed from the vertex with the unhatted half edge to the vertex with the hatted half edge.
Constructed using tiles rather than a plain graph, the cube complex in Figure 1 could be represented by the graph on the left in Figure 4. As a directed graph, the same complex may be constructed as seen on the right side of Figure 4.


The process of modeling a self-assembling DNA complex with a (directed) graph is at the core of the work presented here. We seek to find a labeling of a target graph with a minimum alphabet of bond-edge types and a minimum set of distinct tiles.
Shifting our focus from complexes to graphs, it is useful to have a few concepts from graph theory in mind. We use the following graph theory concepts throughout this paper.
Definition.
Let be a graph. We denote the number of different vertex degrees in by , the number of distinct even vertex degrees in by , and the number of distinct odd vertex degrees by
Definition.
A cycle in a graph that contains every vertex of is called a Hamilton cycle. Any graph which possesses a Hamilton cycle is called Hamiltonian.
1.3 The Construction Matrix
We are able to use tools from linear algebra to aid us in verifying that a given pot of tiles has the desired properties. Specifically, there is a linear system that governs the ratio of tile and bond-edge types needed when constructing a target graph. This is due to the fact that for every half edge on every tile, there must be a complementary half edge of the same bond-edge type somewhere else in the pot.
The following results are directly from [2] and are written here for later reference:
Definition.
Let be a pot of distinct tiles. We define to be the number of cohesive ends of type on tile , and to be the number of cohesive ends of type on tile . Let be the net number of cohesive ends of type on tile type , i.e. . Define to be the proportion of tile type to be used in the assembly process.
Proposition 1.
Let be a pot. Then the number of hatted cohesive ends of each bond type must equal the number of unhatted cohesive ends of the same type that appear in the construction of . That is, for all , .
The following system of equations captures the requirements outlined in Proposition 1.
The construction matrix of , denoted , is the corresponding augmented matrix:
(1) |
Definition.
Given a pot , we define to be the set of graphs that can be constructed from . The set of graphs of minimum order that may be constructed from is denoted . We write for the order of a smallest graph that may be constructed from .
Proposition 2.
Let Then:
-
1.
If a graph of order may be constructed from , using tiles of type , then is a solution of the construction matrix
-
2.
If is a solution of the construction matrix , and there is a positive integer such that for all , then there is a graph of order that may be constructed using tiles of type .
-
3.
where the minimum is taken over all solutions to such that and is in reduced form for all .
This paper describes results for target complexes that can be modeled with a wheel graph.
1.4 Wheel Graphs
A wheel graph on vertices, denoted , is a graph consisting of a cycle on vertices and an additional vertex (often called the hub) which is adjacent to every vertex on the cycle.

For the purpose of this paper, we refer to the cycle subgraph as the outer cycle. The edges that connect the hub to the vertices on the outer cycle are called spokes.
A property of wheel graphs that we use later on in Section 3.2 is that they are Hamiltonian. One possible Hamilton cycle is shown in Figure 6.

In the case of a wheel graph on 4 vertices, we note that is isomorphic to the complete graph on 4 vertices, All results presented in this paper for the case where are consistent with the results for in [2]. Unless otherwise stated, this paper assumes any wheel graph has order greater than 4; that is, we assume .
2 Methods
Previous authors have provided a number of useful theorems that we use to prove new results on the construction of wheel graphs. The following are results from [2]:
Theorem 1.
Theorem 1 tells us that in Scenario 1, the number of distinct tile types needed to construct a target graph is bounded below by the number of distinct vertex degrees that are present in the graph, and bounded above by the sum of the number of distinct even degrees and twice the number of distinct odd degrees present in the graph. The lower bound holds for and as well.
We also take advantage of the following preposition regarding the hierarchy of tile type and bond-edge type minima between scenarios.
Proposition 3.
and .
The next theorem gives the minimum number of tile and bond-edge types needed to construct the cycle graph on vertices, , in Scenario 3. This result is useful since the cycle graph is a subgraph of the wheel graph.
Proposition 4.
and .
The following result is another lemma that we use in the proof of one of our theorems.
Lemma 1.
In Scenario 3, if is a pot such that , and has no loops, then no tile type may be used for two adjacent vertices in .
3 Results
3.1 Wheel Graphs in Scenario 1 and Scenario 2
We have developed an algorithm to construct wheel graphs in Scenarios 1 and 2 in the most efficient way possible. We use a pot of two tile types and a single bond-edge type to construct a wheel graph of any order.
The pot we propose is given below:
(2) |
where
This leads us to our first theorem.
Theorem 2.
Let be a wheel graph on vertices. Then and .

Proof.
Corollary 1.
Let be a wheel graph on vertices. Then and .
Proof.
By Theorem 1 and Proposition 3, . This lower bound is achieved by the pot proposed above. We will now use the construction matrix to verify that no smaller graphs may be constructed from the same pot. The pot of tiles in (2) gives us the following construction matrix:
which yields the unique solution , thus by Proposition 2 a graph on vertices is the smallest graph realized by this pot. Therefore, and . ∎
It is worth noting that while other graphs on vertices nonisomorphic to the wheel graph are able to be constructed from this same pot of tiles, the conditions of Scenarios 1 and 2 do not prohibit this.

3.2 Wheel Graphs in Scenario 3
Wheel graphs in Scenario 3 require a different construction algorithm compared to the construction in Scenarios 1 and 2 due to the stricter condition that no nonisomorphic graphs of the same order may be realized by a pot of tiles. To begin, we introduce two lemmas that aid us in optimizing the construction of .
Lemma 2.
If is a pot such that , then no bond-edge type used in the construction of may be used both on the outer cycle and a nonincident spoke.
Proof.
We proceed by contradiction. Suppose there exists a pot such that and some bond-edge type, say , appears on an edge on the outer cycle as well as a nonincident spoke edge. See the graphs on the left side of Figure 9.

Without loss of generality, we may assume the edge on the outer cycle is oriented counterclockwise, call this edge . The spoke edge with the same bond-edge type may be oriented either towards the hub or towards the outer cycle, but in either case, it is possible that the half edges on the spoke may reattach with the half edges on the outer cycle. This will produce a multiple edge with the hub and vertex or , as seen on the right side in Figure 9. The resulting graph is nonisomorphic to , thus contradicting the assumption that . ∎
Lemma 3.
If is a pot such that , then a bond-edge type can be used at most twice on the outer cycle in the construction of .
Proof.
By way of contradiction, suppose and suppose in the construction of more than two edges along the outer cycle are labeled with the same bond-edge type. By the Pigeonhole Principle, at least two of these edges must be directed in the same orientation. Without loss of generality, suppose these two edges are oriented clockwise. See Figure 10 for reference.

This arrangement allows for the edges to detach and rejoin as in Figure 11 to form the complex . We will now show that this will necessarily yield a nonisomorphic graph.

Recall, has a Hamilton cycle, as seen in Figure 6. If graph is isomorphic to the original wheel graph, then it must have a Hamilton cycle.
However, refer to Figure 11 to see that the set of vertices excluding the hub, , may be partitioned into two sets so that no two vertices in different sets are adjacent. Suppose without loss of generality that we start a walk at vertex . From there, a walk of distinct vertices will include each vertex, for . The only way to include vertex in a cycle is for the walk to pass through vertex . Thus, no Hamilton cycle exists in the graph . Therefore, the pot realizes a graph nonisomorphic to , contradicting the assumption that . ∎
With these new lemmas, we have developed an algorithm to construct a wheel graph that builds upon the construction of the cycle graph in Scenario 3 (see Proposition 4). In particular, to construct the wheel graph , we begin with the construction of the cycle graph , pictured in Figure 12.

From the optimized construction of the cycle graph, we can append the hub vertex and all necessary edges to the vertices on the outer cycle with one new bond-edge type.

To prove that the construction described above uses the minimum number of tile types and bond-edge types needed to construct the wheel graph on vertices, it suffices to show that the proposed labeling constructs the wheel graph as efficiently as possible, i.e., that it is not possible to construct with fewer bond-edge types or tile types in Scenario 3.
Theorem 3.
.
Proof.
In the wheel graph, , there are edges on the outer cycle. By Lemma 3, no bond-edge type on the outer cycle may be used more than twice. Thus,
If any bond-edge type from the outer cycle is used on more than one spoke edge, then that bond-edge type will appear on both the outer cycle and some nonincident spoke. By Lemma 2, this is not permitted. Since there are spoke edges, then it is necessary to introduce at least one new bond-edge type. Thus,
The following pots of tiles realize using exactly bond-edge types. Note that and are the pots when is even or odd, respectively.
(3) | ||||
(4) | ||||
The pot of tiles where is even produces the following construction matrix:
(5) |
This construction matrix has the unique solution .
The pot of tiles where is even produces the following construction matrix:
(6) |
This construction matrix has the unique solution . By Proposition 2, the smallest graph that can be realized by this pot is on vertices.
We must now show that the proposed pots will not realize graphs nonisomorphic to . In both proposed pots, the solutions from (5) and (6) give the ratio of tile types to be used when constructing a graph of order . In both pots, only one copy of tile is used, and since every other tile has one bond-edge type of type , and is only tile containing a bond-edge of type , then must bond with every other tile from in order to form a complete complex. In both even and odd cases, tile type is used exactly once. The only tile type that is able to bond with the unattached edges of is , so the two copies of must bond with . Once the two copies of bond to and to , then each of these two copies has a free cohesive end that can only be complemented by a cohesive end from , so each copy of must bond with one of the copies of . In general, for tile can only bond with tiles , and . For cases in which is even, must bond with tiles and . For cases in which is odd, must bond with two copies of tile . The resulting complex is isomorphic to .
Thus, . ∎
The corresponding result for minimum number of tile types needed to construct follows from Theorem 3.
Theorem 4.
.
Proof.
Since the only vertex of degree is the hub, this vertex must have its own tile type.
The pots in (3) and (4) suggest that the remaining vertices, all of which are of degree 3, can be labeled with tile types.
If it were possible to construct the outer cycle of using exactly tile types, then every tile on the outer cycle would appear twice if is odd, and if is even, exactly one tile type on the outer cycle would appear once. Suppose for sake of contradiction that tile types may be used on the outer cycle. Label some pair of tiles on the outer cycle , where (where are not necessarily distinct). By Lemma 1, these two copies must have a distance between them of at least two.


By Lemma 2, a single bond type, say , must be used on both spoke edges. By the proof of Lemma 3, each of the remaining two bond-edge types of must be oriented opposingly along the outer cycle (one bond-edge type, say , is colored in blue in Figure 14). Furthermore, since each bond-edge type has appeared twice, then by Lemma 3, the two vertices adjacent to those of via the bond-edge type must be of the same tile type, say . By the nature of the wheel graph, there must exist two paths between the two copies of whose union is exactly the outer cycle.
If at least one such path has odd length, then we see on the right side of Figure 14 that applying Lemmas 2 and 3 to tile type and each subsequent tile type, eventually some tile type will be adjacent to itself, which violates Lemma 1. This contradicts the assumption that the outer cycle of could be constructed using tile types.
If both paths along the outer cycle are of even length, then must be odd. We see on the left side of Figure 14 that repeating the reasoning above, some tile type will be used exactly once, which contradicts the assumption that every tile could be used twice.
Alternatively, if the outer cycle of could be constructed in strictly less than tile types, then at least one tile type would appear three or more times on a vertex on the outer cycle. Suppose without loss of generality that the tile type is (where may not necessarily be distinct bond-edge types). By Lemma 1, tile type may not be adjacent to itself, so we know each instance of must be at least distance 2 from itself. To satisfy Lemma 2, some bond-edge type, say , must be used on all the spoke edges. This leaves three edges on the outer cycle to be labeled with the bond-edge types and , which violates Lemma 3. Thus, we may not use fewer than tile types to construct .
4 Conclusion
In this paper we have presented the minimum number of distinct tile types and bond-edge types needed to construct a self-assembled DNA complex with a wheel graph structure. There are still several other infinite classes of graphs beyond wheel graphs to study, and we anticipate that the results presented here may aid future authors in finding similar results for graphs which contain as a subgraph.
The minimum number of distinct tile and bond edge-types needed to construct the wheel graph in Scenarios 1 and 2 are not only equal, but also constant, i.e., they do not increase as a function of the number of vertices of the target graph. The minima in Scenario 3 do increase as a function of , which is to be expected, given the more stringent conditions that come with Scenario 3.
In Scenario 3, we established two lemmas that aided us in finding lower bounds for the minimum number of tile types and bond-edge types. We believe this method of finding restrictions on the construction of a given complex may be generalized or adapted to find more results for additional classes of graphs.
5 Acknowledgements
This work was supported in part by the CSU San Bernardino Office of Student Research.
References
References
- [1] Leonard M Adleman. Molecular computation of solutions to combinatorial problems. Science, 266(5187):1021–1024, 1994.
- [2] Joanna Ellis-Monaghan, Greta Pangborn, Laura Beaudin, David Miller, Nick Bruno, and Akie Hashimoto. Minimal tile and bond-edge types for self-assembling dna graphs. In Discrete and Topological Models in Molecular Biology, pages 241–270. Springer, 2014.
- [3] Hongzhou Gu, Jie Chao, Shou-Jun Xiao, and Nadrian C Seeman. A proximity-based programmable dna nanoscale assembly line. Nature, 465(7295):202, 2010.
- [4] Thom H LaBean and Hanying Li. Constructing novel materials with dna. Nano Today, 2(2):26–35, 2007.
- [5] Andre V. Pinheiro, Dongran Han, William M. Shih, and Hao Yan. Challenges and opportunities for structural dna nanotechnology. Nature Nanotechnology, 6(12):763–773, 2011.
- [6] Nadrian C Seeman. An overview of structural dna nanotechnology. Molecular biotechnology, 37(3):246, 2007.