This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Virus Assembly Pathways inside a Host Cell

Sanaz Panahandeh [email protected] Department of Physics and Astronomy, University of California, Riverside, CA 92521, USA    Siyu Li Department of Physics and Astronomy, University of California, Riverside, CA 92521, USA    Bogdan Dragnea Department of Chemistry, Indiana University, Bloomington, IN 47405, USA    Roya Zandi Department of Physics and Astronomy, University of California, Riverside, CA 92521, USA
Abstract

Simple RNA viruses self-assemble spontaneously and encapsulate their genome into a shell called the capsid. This process is mainly driven by the attractive electrostatics interaction between the positive charges on capsid proteins and the negative charges on the genome. Despite its importance and many decades of intense research, how the virus selects and packages its native RNA inside the crowded environment of a host cell cytoplasm in the presence of an abundance of non-viral RNA and other anionic polymers, has remained a mystery. In this paper, we perform a series of simulations to monitor the growth of viral shells and find the mechanism by which cargo-coat protein interactions can impact the structure and stability of the viral shells. We show that coat protein subunits can assemble around a globular nucleic acid core by forming non-icosahedral cages, which have been recently observed in assembly experiments involving small pieces of RNA. We find that the resulting cages are strained and can easily be split into fragments along stress lines. This suggests that such metastable non-icosahedral intermediates could be easily re-assembled into the stable native icosahedral shells if the larger wild-type genome becomes available, despite the presence of myriad of non-viral RNAs.

preprint: APS/123-QED

I Introduction

Positive-strand RNA viruses represent the largest genetic class of viruses, including many human, animal, and plant pathogens [1]. Their virions contain mRNA-sense single stranded (ss) RNA which is protected by a protein cage called the capsid [2]. The capsids of more than half of the RNA virus genera have icosahedral symmetry [3, 4, 5, 6]. Since the spontaneous assembly of molecular constituents into a functional virion is one of the key steps in the virus life cycle, its physical and chemical bases have been intensely studied on a number of model systems. A wealth of information has emerged from in vitro assembly studies and detailed models that explain several salient features of the virus self-assembly have been proposed [7, 8, 9, 10].

For all ssRNA icosahedral viruses, genome encapsidation happens at assembly, in cytosolic compartments where the opportunity of encapsidating other, non-cognate cargo also exists [3]. In vitro self-assembly studies suggest that the negatively-charged RNA recruits coat proteins (CPs) through multiple electrostatic interactions with their basic residues [11, 12, 13, 14, 15, 16]. The dominant role of electrostatics is supported by in vitro encapsidation experiments with a variety of polyanionic non-cognate cargo, ranging from other nucleic acids to nanoparticles [17, 18]. Despite a relatively good understanding of this process [19, 12, 7, 20, 21], the origin of selectivity of cognate RNA in vivo encapsidation has remained a mystery because of the presence of other non-genomic polyanionic species. One explanation involves certain packaging sequences which confer a higher coat protein binding affinity to cognate viral RNA [22, 23]. However, it has been pointed out that such packaging sequences might not be the sole source of specificity, with other factors being likely at play [24]. Recently, in discussing experiments that entailed a combination of charge detection mass spectrometry and cryo-electron microscopy, Bond et al. suggested capsid elastic stress at assembly as a pathway selection mechanism. In their experiments, brome mosaic virus (BMV) coat proteins were found to readily form capsids around multiple oligonucleotides. Those capsids were smaller, non-icosahedral, and less stable than the native ones. Based on these findings, Bond et al. speculated that shells do form around small RNAs and other polyanionic cargo but might easily fall apart, with their fragments re-assembling around cognate RNAs when they become available. This hypothesis suggests a narrowing of the possible virus assembly pathways, which would actually take advantage of the presence of non-cognate polyanions [25]. To test theoretically the possibility of this hypothesis, we have modeled capsid growth under the assumption of an elastic lattice formed of subunits having a preferred radius of curvature.

According to the quasi-equivalence principle, the number of proteins forming icosahedral shells is 60T60T where the TT number is a structural index for viral shells and is equal to T=h2+k2+khT=h^{2}+k^{2}+kh with h and k nonnegative integers [26]. Previous in vitro assembly studies of small icosahedral (T=3T=3) viruses found that capsid proteins can assemble in vitro around a variety of anionic cargoes to form isometric shells [27, 24]. In the absence of high-resolution structural or stoichiometric data, it was assumed that even if the size of the shells changes because of the size of their cargoes, the symmetry of the capsids is still the same as that of the native shell, i.e., there are usually 12 pentamers, sitting at the vertices of an icosahedron, in a shell built otherwise from hexamers. The experimental results of Bond et al. noted above were unexpected as they found that nucleic acid oligomers and capsids proteins form structures with D6hD_{6h} and D5hD_{5h} symmetries [25]. Figure 1 shows the cryo-EM images of two different structures obtained in Ref. [25].

In this paper, we study the range of elastic and geometric subunit parameters for which strained, lower-symmetry non-icosahedral structures similar to those reported by Bond et al. form. We calculate the elastic stress distribution across these shells and discover that they should be less stable than the wild type T=3T=3 shells, being strained. We also show that one of the low symmetry structures (Fig. 1C) constitutes the minimum energy structure when smaller size cargoes are encapsulated, while the other one (Fig. 1A) could be obtained only through non-equilibrium simulations [25]. Our theoretical study thus confirms the possibility of a pathway for the packaging process inside cells whereby capsid proteins may start assemble around non-cognate small RNAs in the cell during infection, forming shells that are strained and less stable.

Refer to caption
Figure 1: Cryo‐EM of EE and FF^{*} structures with D6hD_{6h} and D5hD_{5h} symmetries obtained in the experiments of Ref. [25]. Note that in [25] the structures EE and FF^{*} are called H8H8 and H15H15, respectively. A) Isosurface views of the EE structure at different angles. The axis of sixfold symmetry is marked. Scale bar is 10 nm\mathrm{nm}. B) Selected reference‐free 2D class averages indicate different orientations of the particle. C) Isosurface views of FF^{*} structure at different angles. The fivefold symmetry axis is marked. D) Selected reference‐free 2D class averages show different orientations of the particle. Scale bar is 10 nm\mathrm{nm}. Adapted with permission from [25].

Since there is no systematic study about the impact of RNA on the capsid size and morphology as observed in the above experiments, in this paper we also explore under what conditions RNA can change the size and structure of viral shells. We find that the interplay of the mechanical properties of capsid proteins and their interaction with genome is quite intriguing and that RNA can significantly modify the symmetry of viral shells giving rise to the formation of structures with different symmetries that were not found in the previous theoretical studies on empty capsids.

This work lays out the systematic comparison of theory and experiments, which will allow us to gain a better understanding of the role of RNA in the capsid assembly pathway, stability and structure, both in vivo and in vitro. A deeper understanding of the role of genome in virus assembly mechanisms could lead to the design principles for alternative antivirals and facilitate the fabrication and design of precise synthetic nano-structures.

II Results and Discussion

To explore the impact of RNA on the growth pathway, morphology and stability of viral shells, we construct a “structure” phase diagram as a function of the protein spontaneous curvature R0/b0R_{0}/b_{0} and the FvK number γ\gamma, for various genome size and the strength of RNA-protein interactions. We seek to determine how the phase diagram of genome-containing particles differs from the phase diagram of the empty shells obtained in Refs. [28, 29] in the absence of RNA. Figure 2 shows the results of the deterministic simulations as a two dimensional phase diagram as a function of R0/b0R_{0}/b_{0} and γ\gamma in the absence of genome. Each color in the figures corresponds to a specific symmetric structure, see Fig. 3.

Figure 2 clearly indicates that at higher γ\gammas, i.e., for the shells that are easier to bend than to stretch, we observe only a transition from T=1T=1 (structure AA) to T=3T=3 (structure GG) as a function of R0/b0R_{0}/b_{0}. However, at lower γ\gammas, in addition to T=1T=1 and T=3T=3 shells, several structures with other symmetries form as R0/b0R_{0}/b_{0} varies. All structures have 12 pentagons, even the aberrant ones assembled in the white region of the figure. It is notable that some of the symmetric structures (DD, EE, EE^{*} and GG) in Fig. 2 have been observed in other biological systems such as clathrin vesicles [30, 31].

Refer to caption
Figure 2: Phase diagram of the shells assembled for different values of γ\gamma and R0/b0R_{0}/b_{0} without the presence of core using the deterministic model. Each shaded area corresponds to a single type of structure that forms in that region. Shell structures with their corresponding symmetries are shown in Fig. 3. In the hashed regions, in addition to symmetric shells, a number of similarly sized non-symmetric structures form. The white areas correspond to regions in which different types of shells without any specific symmetry are grown. Adapted with permission from [28].
Refer to caption
Figure 3: The structures obtained in the deterministic simulations corresponding to the labeled regions of the phase diagram illustrated in Fig. 2. The shells from left to right and top to bottom have 20, 24, 26, 28, 36, 36, 40, 50, 60, 76, 80 and 80 subunits or 12, 14, 15, 16, 20, 20, 22, 27, 32, 40, 42 and 42 capsomers respectively. Symmetries are icosahedral (T=1T=1), D6D_{6}, D3D_{3}, tetrahedral, D6hD_{6h}, D2D_{2} (tennis ball), D2D_{2}, D3D_{3}, icosahedral (T=3T=3), tetrahedral, icosahedral (T=4T=4) and D5hD_{5h}, respectively. Red and white colors correspond to the location of pentamers and hexamers respectively. Adapted with permission from [28].

As discussed in the method section, to make our calculations more tractable while keeping the important physical features of the system, we model the genome as a spherical core that interacts attractively with the protein subunits through LJ potential. During the simulations, we have considered that the attractive interaction between protein subunits and the genome is uniform and does not depend on the specific sequence of RNA. However, at least for certain viruses, the binding affinity is thought to vary along the genome with segments containing the packaging signals showing higher affinity than the rest [32]. This is important for the initial stages, but it is not clear that that the packaging signals are still playing a role after the initial nucleo-protein complex has formed and the genome has assumed a globular shape. Since the focus of the simulations is on the assembly process after the genome has assumed the globular shape, and not on the kinetics, the simplifying assumption of a spherical core is reasonable as confirmed by the fact that we obtain the same structures as observed in the experiments. When examining the influence of the core on the particle phase diagram, it is particularly interesting to focus on particle sizes that are compatible with T=1T=1 and T=3T=3 structures. Note that T=1T=1 was observed in the in vitro experiments with Bromo Mosaic Virus (BMV), whose native morphology is a T=3T=3 structure [33].

We consider four different core radii Rc/b0=0.95,1.2,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.4,1.5 that commensurate with the size of T=1T=1, “EE and EE^{*}”, “FF and FF^{*}”, and T=3T=3 structures, respectively. Note that EE and EE^{*} have the same size, as it is the case for FF and FF^{*}. We also monitor the assembly of capsids around a core with the radius Rc/b0=1.3R_{c}/b_{0}=1.3 that is not compatible with the size of any of the symmetric shells obtained in Figs. 2 and 3. We emphasize that FF^{*} (see Fig. 4) is exactly the structure that has been discovered in the recent self-assembly experiments of Bond et al. [25] (see Fig. 1C) and is of particular interest to our investigation of the role of genome in the structure of capsids and the virus assembly pathway in the presence of non-native RNAs. One of the interesting results of this paper is that the presence of the core is essential for the formation of FF^{*}, which did not appear in the previous simulations of equilibrium [29] and non-equilibrium [28] structures for empty shells.

Refer to caption
Figure 4: Structure FF^{*} with D5hD_{5h} symmetry has 50 subunits and 27 capsomers. a) Side view of FF^{*}. It has five pairs of pentamers on the side, one single pentamer at the top and another one at the bottom. b) Top view of the FF^{*} structure.

Figure 5 shows the results of the simulations as two dimensional phase diagrams as a function of γ\gamma and R0/b0R_{0}/b_{0}. From the top to the bottom, the rows correspond to the core sizes of Rc/b0=0.95,1.2,1.3,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.3,1.4,1.5, respectively. The figure also illustrates the impact of the strength of genome-protein attractive interaction. The left column in Fig. 5 corresponds to the structures for which the strength of LJ is ϵ~lj=ϵlj/kb=0.005\tilde{\epsilon}_{lj}=\epsilon_{lj}/k_{b}=0.005, and the right column shows the results for stronger LJ interaction, ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1. Note that we divide ϵlj\epsilon_{lj} by kbk_{b} to make it dimensionless. Similar to Fig. 2, each color corresponds to a specific symmetric structure. In the dotted regions of the figure, the shells assemble without encapsulating the genome because the cost of its packaging is too high.

The phase diagrams on the left hand side of Fig. 5 are very similar to the phase diagram in Fig. 2, revealing that the elastic energy of the growing shell has the dominant contribution to the formation of the symmetric shells in the weak LJ regime (ϵ~=0.005\tilde{\epsilon}=0.005). The simulations are designed such that if the size of the core is smaller than the spontaneous curvature of proteins, the strength of LJ potential is such that the shell will always assemble around it. As the size of the core increases, the subunits whose spontaneous radius of curvature is smaller than the core radius will either assemble into an empty shell based on the phase diagram of Fig. 2 for the relevant γ\gamma and R0/b0R_{0}/b_{0} or into a filled shell but with a different symmetry than the empty one for the same γ\gamma and R0/b0R_{0}/b_{0}. Thus, the main difference between the phase diagrams on the left hand side of Fig. 5 (ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005) and the ones for the empty shells (Fig. 2) arises from the size of the core. At ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005, the elastic energy still controls the growth pathway, and as a result the transition lines between different structures move together without influencing the shape varieties. Similar to Fig. 2, we observe two main structures form (T=1T=1 and T=3T=3) at high γ\gamma values (Fig. 5), while different structures form as R0/b0R_{0}/b_{0} changes for lower γ\gammas.

By contrast, if the strength of subunit-genome interaction is strong (ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1), the presence of RNA can completely change the structure of the capsids as illustrated in the right column of Fig. 5. The tendency of trimers to aggregate on the core surface because of the LJ potential will compete with the bending energy of the shell, resulting in a considerable change in the phase diagram. Most structures in the phase diagram then will have the shapes instructed by the core size. For example, when the core size is Rc/b0=0.95R_{c}/b_{0}=0.95, the optimal size for a T=1T=1 shell, Fig. 5a shows that for the weak LJ potential, T=1T=1 covers a small region in the phase diagram. However, for the strong LJ potential the T=1T=1 structure covers a very wide region of the phase diagram (Fig. 5b).

Further, Fig. 5b reveals that for R0/b0>1.12R_{0}/b_{0}>1.12 at low γ\gammas, only irregular shells without any specific symmetry form, in contrast to Fig. 5a where various symmetric structures form. The irregular shells have larger number of subunits compared to a T=1T=1 structure in Fig. 5b. At low γ\gammas when the subunits are less rigid, they squeeze each other to make room for more subunits because of the strong attractive interaction between the proteins and the core. As we keep increasing γ\gamma at Rc/b0=0.95R_{c}/b_{0}=0.95, the structures DD, CC, BB and T=1T=1 start appearing, respectively. It is important to note that while these shells have different symmetries and number of subunits, they all have more or less the same size due to the strong attraction of the subunits to the core. Figure 5b indicates that the largest symmetric capsid that forms for Rc/b0=0.95R_{c}/b_{0}=0.95 is the structure DD with tetrahedral symmetry and 28 triangular subunits. As γ\gamma increases, the number of subunits decreases and T=1T=1 becomes the dominant structure, regardless of the value of R0/b0R_{0}/b_{0}. Without the core, the T=1T=1 structure would not form if R0/b0>1.28R_{0}/b_{0}>1.28.

Increasing the genome size to Rc/b0=1.2R_{c}/b_{0}=1.2, we observe empty T=1T=1 structures at weak ϵ~lj\tilde{\epsilon}_{lj} and R0/b0<1.2R_{0}/b_{0}<1.2 (Fig. 5c). However, for the strong ϵ~lj\tilde{\epsilon}_{lj} the most dominant symmetric structures are EE and EE^{*} (Fig. 5d) with hexagonal barrel and tennis ball symmetries, respectively. Both structures have 36 protein subunits. Figure 5f shows that while for Rc/b0=1.3R_{c}/b_{0}=1.3 at strong ϵ~lj\tilde{\epsilon}_{lj} the structures JJ, EE, EE^{*}, FF and FF^{*} form, for the weak ϵ~lj\tilde{\epsilon}_{lj} several additional symmetric structures form 5e. We note that the capsids with the number of subunits less than 36 assemble without packaging the genome because the encapsidation of genomes whose size is much larger than the capsid size is energetically forbidden due to the high cost of elastic energies.

Figure 5g shows the phase diagram for the core size Rc/b0=1.4R_{c}/b_{0}=1.4. For this case, all the capsids with less than 40 subunits are empty. In contrast, Fig. 5h shows that at Rc/b0=1.4R_{c}/b_{0}=1.4, if ϵ~lj\tilde{\epsilon}_{lj} is strong, all the capsids encapsidate the genome, and only symmetric structures FF and GG form. As the core size increases to Rc/b0=1.5R_{c}/b_{0}=1.5, Fig. 5j shows that for strong ϵ~lj\tilde{\epsilon}_{lj}, the structure GG with T=3T=3 symmetry covers a large area of the phase diagram. However, at weak ϵ~lj\tilde{\epsilon}_{lj}, structures with different symmetries form.

The results presented in Fig. 5 clearly show that the genome can completely modify the structure of the capsid. In particular, the structure FF^{*} in Fig. 5d that has not been previously observed in the empty shell simulations, is exactly the structure called H15 obtained in the self-assembly studies of BMV shown in Fig. 1C and D [25]. The structure EE^{*} also covers a broad region of the phase diagram (Fig. 5d), while this structure appears in a tiny region of the phase diagram for the empty shells (Fig. 2). We will discuss these structures and their relation with the experiments [25] in the conclusion section.

One of our other intriguing results corresponds to the structure DD, the so called mini-coat that appears in the assembly studies of clathrin shells too [30]. Quite interestingly, even when the size of cargo is optimal for the formation of T=1T=1 icosahedral structure, depending on the spontaneous curvature of proteins and their mechanical properties, the structure DD prevails a large region of phase space (Fig. 5b). This might explain to some extent why minicoats form in the case of clathrin shells, rather than an icosahedral structure.

As noted above, the white regions of the phase diagrams in Figs. 2 and 5 corresponds to the structures with no specific symmetry. In the next section, in which we present the equilibrium structures, we will show that the irregular shells in the white regions do never constitute the minimum energy structures. They basically correspond to the kinetically trapped capsids. In fact, all the small shaded regions in the deterministic phase diagrams surrounded by the white region will expand in the equilibrium phase diagrams (see Fig. 6) and the white regions will completely disappear.

Another interesting point of the phase diagrams is that the genome can move the transition point between two icosahedral structures T=1T=1 and T=3T=3 even when the genome-protein interaction strength is weak (ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005). For example, at γ=10\gamma=10 when there is no core, the transition point from T=1T=1 to T=3T=3 structure occurs at R0/b0=1.31R_{0}/b_{0}=1.31 (Fig. 2). However, even when the attraction is weak, for Rc/b0=1.5R_{c}/b_{0}=1.5, the empty T=1T=1 forms at very small R0/b0R_{0}/b_{0} without packaging the core since packaging of a big core will cost a lot of elastic energy. Nevertheless, increasing R0/b0R_{0}/b_{0} to 1.24 (Fig. 5i) results into the formation of T=3T=3 with the genome enclosed, see Fig. 10 in Appendix.

Refer to caption
Figure 5: Phase diagrams of the structures obtained in the deterministic simulations in the presence of genome with five various sizes and two different strengths of the protein-genome attractive interactions. From the top to the bottom row, the core sizes are Rc/b0=0.95,1.2,1.3,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.3,1.4,1.5. We note that the core sizes Rc/b0=0.95,1.2,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.4,1.5 are the optimal sizes to grow T=1T=1, EE and EE^{*}, FF and T=3T=3 structures, respectively. The core size Rc/b0=1.3R_{c}/b_{0}=1.3 does not correspond to any of the symmetric structures. The left column corresponds to the weak core-protein interaction ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005 and the right column to the strong one ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1 . Each color corresponds to a structure with specific symmetry (see Fig. 3). Dotted regions correspond to the phases in which the capsids form without packaging the core. A careful review of the phase diagrams reveals that the icosahedral structures prevail even if both the spontaneous curvature and core size are optimal for the formation of another type of symmetric structure.

In general, in the presence of weak LJ, the phase boundaries remain more or less vertical showing that the spontaneous curvature of proteins has the key role in defining the structure of capsids. However, the boundary phases become more horizontal as we increase LJ strength, indicating that the core determines the radius of capsid while FvK number modifies the shell symmetry.

II.1 Equilibrium structures

As noted previously, the deterministic simulations involve some irreversible moves, and as such the structures obtained in Fig. 2 and Fig. 5 do not necessarily constitute the free energy minimum structures. To obtain the equilibrium size and symmetry of empty capsids as a function of γ\gamma and R0/b0R_{0}/b_{0}, in a previous work, we carried out Monte Carlo (MC) simulations combined with the bond-flipping method [29]. Unexpectedly, the equilibrium structure phase diagram and the deterministic one presented in Fig. 2 were very similar. It was found that the striking similarities are the result of the strong affinity for the formation of pentamers at specific positions in the shell, which depends on the spontaneous radius R0/b0R_{0}/b_{0} and the mechanical properties (γ\gamma) of the capsid proteins [34, 29].

Since the previous works have already confirmed that similar structures appear in the deterministic and equilibrium phase diagrams albeit small differences in the boundary between different regions, to obtain the minimum energy structures in the presence of genome, we compared the total energy per subunit of all the structures formed in Fig. 5 including the irregular ones for various core sizes as a function of R0/b0R_{0}/b_{0} and γ\gamma.

To construct the equilibrium phase diagrams, we assumed that the genome is located in the center of mass of the capsid and is interacting with the triangular subunits through the LJ potential. The elastic energy of the capsid as before is the sum of the stretching EsE_{s} and bending EbE_{b} energies as presented in Eqs. 1 and 2. Keeping the genome at the center of mass, we let different capsids relax and then we calculate the total energy including the core-capsid, the stretching and bending energies per subunit to find the minimum energy structures. It is worth mentioning that instead of constructing the phase diagrams based on the minimum energy calculations, we could have performed simulations in the grand canonical ensemble framework and monitor the growth of the shells [35]. That approach would be useful if we were interested in the impact of the protein concentration, temperature and the strength of hydrophobic interaction between the proteins too. We note that all these parameters can modify the final assembly product, see Ref. [36]. However, this would have added many more degrees of freedom to the problem and make the interpretation of the results less transparent. To this end, we assumed that the protein concentration and the strength of the hydrophobic interactions are such that only the competition between the elastic energies, the core size and the strength of core-protein interactions define the final structure of capsids. Thus we constructed the equilibrium phase diagrams based on the strength of LJ interactions and elastic energies.

Figure 6 shows the equilibrium phase diagrams for the same core sizes (Rc/b0R_{c}/b_{0}) as the ones used in Fig. 5, from top to bottom Rc/b0=0.95,1.2,1.3,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.3,1.4,1.5. Similar to Fig. 5, the left and right columns correspond to the weak (ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005) and strong (ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1) LJ interactions, respectively. In the dotted regions, only empty shells can form. Comparing the equilibrium phase diagrams with the non-equilibrium ones, we find that several structures assembled in the deterministic simulations do not represent the equilibrium structures, such as BB, CC and EE. A quite interesting difference is that the structure FF^{*}, discovered in the experiments of Ref. [25], appears in wide regions of the phase diagrams in Fig. 6. However, the structure EE that was also obtained in the recent experiments, only appears in the deterministic simulations performed under non-equilibrium conditions. Further, all the regions related to irregular shapes in the non-equilibrium phase diagram disappear in the equilibrium ones, most of them are replaced by the structure JJ with a two fold symmetry (see Fig. 11 in the Appendix). The icosahedral structure T=3T=3 occupies a larger area in the equilibrium phase diagrams, in agreement with the observation that many small ssRNA viruses form T=3T=3 structures.

To verify the growth pathway suggested in Ref. [25] and described in the introduction, we closely study the assembly of T=3T=3 and FF^{*} in the presence of different core sizes. Figures 6h and j show that at high γ\gammas, for the core size Rc/b0=1.4R_{c}/b_{0}=1.4, FF^{*} is the equilibrium structure; however, increasing Rc/b0R_{c}/b_{0} to 1.5, T=3T=3 becomes the minimum energy structure in the same region of the phase diagram (for the same γ\gamma and R0/b0R_{0}/b_{0}). Figure 7a shows the growth pathways as the plots of the energy per subunit, for FF^{*} and T=3T=3 structures, as a function of number of subunits at Rc/b0=1.4R_{c}/b_{0}=1.4 and Rc/b0=1.5R_{c}/b_{0}=1.5, respectively. The other parameters in the figure are the spontaneous radius of curvature R0/b0=1.4R_{0}/b_{0}=1.4, γ=5\gamma=5 and ϵ~ij=0.1\tilde{\epsilon}_{ij}=0.1. The figure clearly shows that up to 25 subunits the energies of the two structures are very close to each other. Note that the total number of subunits in FF^{*} structure is 50 subunits, and thus 25 is when the half of the capsid is formed. From this point to the end of the growth though, the energy of FF^{*} increases sharply in comparison to T=3T=3.

We emphasize that as shown in Figs. 6h and j, the optimal spontaneous radius of curvature for the formation of FF^{*} structure is R0/b0=1.4R_{0}/b_{0}=1.4, explaining why if the core size is also Rc/b0=1.4R_{c}/b_{0}=1.4, FF^{*} is the minimum energy structure. However, if we set the radius of the core to Rc/b0=1.5R_{c}/b_{0}=1.5 while keeping the preferred spontaneous curvature of proteins the same as before (R0/b0=1.4R_{0}/b_{0}=1.4), then we obtain T=3T=3 structures even though R0/b0R_{0}/b_{0} is the optimal size for the formation of FF^{*}. The phase diagrams in Figs. 6h and j reveal the regions in which an increase in the core size modifies the optimal structure from FF^{*} to T=3T=3. Quite interestingly, this indicates that if two different genome sizes are in solution, the capsid proteins energetically prefer to assemble around the larger one to form T=3T=3 structure, and this happens, to some extent, regardless of the preferred curvature of proteins.

Figure 7b shows the stress distribution of some incomplete structures corresponding to the growth of FF^{*} and T=3T=3 shells in Fig. 7a. See Appendix for the details of the stress calculation [37]. The bar legend on the right side of Fig. 7b shows the color code corresponding to the different stress level. While the violet color corresponds to the maximum stretching tension, the red shows the maximum compression stress in the bonds. The first incomplete structure on the left of the figure with nt=25n_{t}=25 corresponds to both T=3T=3 and FF^{*}. Up to that point, the growth pathways of both structures overlap as is illustrated in Fig. 7a too. From nt=25n_{t}=25 onward, the stress distribution in the structures packaging Rc/b0=1.4R_{c}/b_{0}=1.4 and Rc/b0=1.5R_{c}/b_{0}=1.5 will become different although their growth pathways are still the same i.e., their incomplete structures are almost identical regardless of the stress distribution. Finally, at nt=30n_{t}=30, the incomplete structures forming around Rc/b0=1.4R_{c}/b_{0}=1.4 and Rc/b0=1.5R_{c}/b_{0}=1.5 start to deviate and follow two different pathways. The upper row shows the pathway corresponding to the formation of the FF^{*} structure with 50 subunits. The figure shows that the highest compression stress distribution is along the pairs of pentamers, where the axis of two fold symmetry are located. However, the bottom row corresponds to T=3T=3 with 60 subunits. The level of stress in the T=3T=3 structure is much lower than the FF^{*} structure, confirming that FF^{*} can easily break into large fragments to provide the intermediate structures for the formation of T=3T=3 with lower stress when cognate larger RNA becomes available.

Refer to caption
Figure 6: Phase diagrams of the equilibrium structures obtained in the presence of genome. The plots have the same color code as in Fig. 5. The dotted colors correspond to the regions in which the empty shells are the equilibrium structures. From the top to the bottom row, the core sizes are Rc/b0=0.95,1.2,1.3,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.3,1.4,1.5. We note that the core sizes Rc/b0=0.95,1.2,1.4,1.5R_{c}/b_{0}=0.95,1.2,1.4,1.5 are the optimal sizes to grow T=1T=1, EE and EE^{*}, FF and FF^{*}, and T=3T=3 structures, respectively. The core size Rc/b0=1.3R_{c}/b_{0}=1.3 does not correspond to any of the symmetric structures. The left column corresponds to the weak core-protein interactions (ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005) and the right column to the strong one (ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1). In the hashed regions, two equilibrium structures form. Each color corresponds to a structure with specific symmetry (see Fig. 3). Structure JJ is shown in Fig. 11 in the Appendix. Similar to the phase diagrams presented in Fig. 5, the icosahedral structures prevail even if both the spontaneous curvature and core sizes are optimal for the assembly of other symmetric structures.

Another puzzling question to address corresponds to as why the structure FF^{*} appears only in a small region in the deterministic simulations but covers large regions in the equilibrium phase diagrams. For example, at R0/b0=1.1R_{0}/b_{0}=1.1, γ=3\gamma=3, Rc/b0=1.4R_{c}/b_{0}=1.4 and the strong LJ potential (ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1), FF^{*} is the minimum energy structure, see Fig. 6h. However, in this region, we obtain the structure FF in our deterministic simulations as shown in Fig. 5h. A careful review of the step by step assembly of both structures (see Fig. 12) shows that both structures FF and FF^{*} are built from 50 subunits, and both follow exactly the same pathway till 25 subunits are assembled (the half of the capsid). After the assembly of the first 25 subunits (Fig. 12a), FF and FF^{*} follow two different pathways. For the structure FF, the five subunits sitting at the edge of the growing shell merge and form a pentamer (Fig. 12b). However, in the case of FF^{*}, one more subunit is added and the capsomer will close as a hexamer (Fig. 12c). Even though a complete FF^{*} structure has a lower energy than the FF one, the assembly of a hexamer is energetically more costly than a pentamer after the half of the capsids is assembled. To this end, there is an energy barrier to the formation of FF^{*} and thus in the deterministic simulations, the shells follow the pathway resulting into the formation of FF structures.

Refer to caption
Figure 7: a) Plots of the energy per subunit vs. number of subunits for the structures FF^{*} (the red dashed line) and T=3T=3 (the black solid line) at γ=5\gamma=5 and the strong LJ potential ϵ~ij=0.1\tilde{\epsilon}_{ij}=0.1. The spontaneous radius of curvature of proteins in both structures is R0/b0=1.4R_{0}/b_{0}=1.4, which is the preferred curvature for the formation of FF^{*}. If the size of the core is also Rc/b0=1.4R_{c}/b_{0}=1.4, FF^{*} constitutes the minimum energy structure, as expected. However, if the size of the core is Rc/b0=1.5R_{c}/b_{0}=1.5, T=3T=3 becomes the equilibrium structure despite the fact that the spontaneous radius of proteins is R0/b0=1.4R_{0}/b_{0}=1.4, which commensurates with an FF^{*} structure. b) Map of the stress distribution in several incomplete shells growing within the pathways shown in a). The top and bottom rows show some intermediate structures of the FF^{*} and T=3T=3 shells, respectively. Up to 30 subunits, the intermediate structures of both FF^{*} and T=3T=3 look exactly the same. However, their energies become different after 25 subunits are assembled. The color bar shows the different levels of stress. Violet color corresponds to the maximum of stretching stress and the red one to the maximum of compression. The FF^{*} structure is under much more stress compared to T=3T=3.
Refer to caption
Figure 8: Both structures FF and FF^{*} follow the same pathway until the half of the capsids (25 subunits) is assembled (a). Then, the two structures follow two different pathways. For the FF structure, the 5 subunits at the edge merge and form a pentamer (b). However, for the FF^{*} structure, one more subunit will be added and the capsomer will close as a hexamer (c). The energetic cost of the formation of a hexamer is higher at this stage, and as such, the capsid will follow the FF structure pathway while the energy of a complete FF^{*} structure is lower. Red and white colors correspond to the location of pentamers and hexamers respectively. See also Fig. 12 in Appendix for the plot of energy per subunit vs. the number of subunits.

III Conclusion

Very little is known as to why inside a cell, capsid proteins predominantly package their cognate RNA. Many factors contribute to this process, but our knowledge in this area is still rudimentary. In this paper, we have investigated the impact of genome on the structure and excess elastic stress of viral shells, which are tightly connected to the capsid stability and function. In particular we have focused on the recent self-assembly experiments of BMV capsid proteins around short oligonucleotides, using a combination of charge detection mass spectrometry and cryo-EM techniques [25]. Quite interestingly, these experiments have revealed that the capsid proteins can assemble to form symmetric structures other than icosahedral ones [25]. Some of the structures formed around the small pieces of oligonucleotides have the D6hD_{6h} hexagonal barrel symmetry (the EE structure in Fig. 1A and Fig. 3) and some others have D5hD_{5h} symmetry (the FF^{*} structure in Fig. 1C and Fig. 4).

To explore under what conditions the FF^{*} and EE structures could form, we have constructed several phase diagrams for various genome sizes as a function of spontaneous curvature and mechanical properties of protein subunits, see Figs. 5 for the results of the deterministic simulations and 6 for the equilibrium structures. We found that the structure FF^{*}, which was called H15H15 in Ref. [25], can only form in the presence of genome.

Our results also show that the size of genome plays an important role for the assembly of FF^{*}, and that the optimal core size for the formation of FF^{*} is smaller than that for T=3T=3. As illustrated in the plots of Fig. 6h, the shell FF^{*} constitutes the global minimum energy structure at the smaller core size (Rc/b0=1.4R_{c}/b_{0}=1.4) for a large region of the phase diagram. Figure 7 reveals that the formation of FF^{*} and T=3T=3 follow almost the same pathway until 30 triangular subunits are assembled. However, their energies and stress distributions start differing after the assembly of the first 25 subunits. Figure 7a shows that after the assembly of the first 30 triangular subunits, while FF^{*} is an equilibrium structure for Rc/b0=1.4R_{c}/b_{0}=1.4, if a larger RNA becomes available, the shell can easily switch the pathway and form a T=3T=3 structure because of the difference between the energy of a T=3T=3 and FF^{*} for larger RNAs.

We also calculated the stress distribution across FF^{*} and T=3T=3 shells as they grow (see Fig. 7b), which confirms our conclusion based on the energy of the systems. As can be seen in the figure, the stress between every two adjacent pentamers in FF^{*} structure is very high despite the fact that it is an equilibrium structure, Fig. 6h. The implication is that FF^{*}s are stabilized by encapsidating short RNAs, but they can become metastable and split into large fragments, which later grow to form T=3T=3 structure with much lower stress distribution if larger cognate RNAs are accessible. Thus, our work suggest an alternative pathway for the assembly of T=3T=3 shells inside the cell in the presence of many smaller non-viral RNAs.

Another interesting point of the in vitro experiments of Bond et al. [25] corresponds to the co-existence of EE and FF^{*} structures. Our work clarifies the conditions under which the two different structures can coexist. Since the capsids assembled in the experiments are built from the same types of proteins, we explored if it is possible to obtain both structures (EE and FF^{*}) for the same γ\gamma and R0/b0R_{0}/b_{0} (related to the mechanical properties of capsid proteins), albeit different core sizes. A careful review of Figs. 5 and 6 shows that we can obtain several structures simultaneously in various cases if different genome lengths are packaged inside the capsids. These results are consistent with the findings of the experiments of Ref. [25], which show that the number of packaged short oligonucleotides are not the same in each structure. Thus if the number of packaged oligomers are different, it is possible to obtain the structures EE and FF^{*} simultaneously in solution.

It is important to note that the EE structure can only be obtained in the deterministic simulations; it never appears in the equilibrium phase diagrams, and as such it constitutes a minimum energy structure neither for the filled nor for the empty capsids. The EEs are probably trapped in a local minimum free energy and are less stable than other structures in the experiments, see also the stress distribution across the EE structure in Fig. 13. These structures might form in the cell but can easily disassemble and reassemble to form a stable icosahedral structure. It is worth mentioning that our simulations were performed under the assumption that the genome assumes an isotropic shape inside the capsid. However, the structure EE is slightly elongated and an anisotropic genome distribution inside the capsid can render EE a minimum energy structure. Thus, more theoretical and simulations are needed to study the stability of EE structures.

In summary, our work shows that the interplay of protein mechanical properties, genome volume, and strength of the genome-capsid protein interaction can significantly modify the symmetry, structure, and stability of the assembly products, and hence kinetic pathways of assembly. Further, our work also supports the presence of an alternative pathway for the formation of T=3T=3 icosahedral structures around their cognate RNAs in the presence of oligonucleotides. More experiments to confirm the existence of this pathway would be required, at different salt concentrations, and genome lengths. Such studies are needed in order to understand ssRNA virus assembly in presence of polyanionic, non-cognate potential encapsulation competitors.

IV Method

To study the formation of capsids around different genome sizes, we use triangular subunits as our building blocks and associate a monomer to each triangle vertex (Fig. 9a). Since capsids are primarily built from pentamers and hexamers, triangular subunits (trimers) have been widely employed to study the assembly of viral shells and other protein nanocages [38, 28, 12, 34]. The elastic energy of a growing capsid, thus becomes, the sum of the stretching EsE_{s} and bending EbE_{b} energies as follows [39], the stretching energy results from the deformation of the triangles from their equilateral shape and sums over all bonds ii. Assuming that each bond is built from a linear spring with the equilibrium length b0b_{0} and the spring constant ksk_{s}, the stretching energy can be written as,

Es\displaystyle E_{s} =\displaystyle= i12ks(bib0)2\displaystyle\sum_{i}\frac{1}{2}k_{s}(b_{i}-b_{0})^{2} (1)

the length of the ith bond is denoted with bib_{i} and can be stretched or compressed as the shell grows.

The bending energy is due to the deviation of local curvature from the preferred one (to be defined as a parameter in this study) and is calculated by summing over all neighboring pairs of subunits with <m,n><m,n> indexing pairs of adjacent subunits and kbk_{b} being the bending stiffness as follows,

Eb\displaystyle E_{b} =\displaystyle= <m,n>kb[1cos(θ<m,n>θ0)].\displaystyle\sum_{<m,n>}k_{b}[1-\cos(\theta_{<m,n>}-\theta_{0})]. (2)

The spontaneous radius of curvature of proteins R0R_{0} is related to the preferred angle between two subunits θ0\theta_{0} through the relation sinθ0/2=(12R02/b023)(1/2)\sin\theta_{0}/2=(12R_{0}^{2}/b_{0}^{2}-3)^{(-1/2)}.

Refer to caption
Figure 9: (a) A protein subunit built from three monomers illustrated as an equilateral triangle in its equilibrium shape. As triangular subunits are added to the growing shells, because of the curvature of the shell, the subunits can be stretched or compressed. (b) The subunits bind together edge to edge, and the growth of the shell proceeds by adding a subunit to the location with the smallest opening angle, α\alpha. As the shell grows, the two unbound edges can either bind together to form a pentamer (c), or a new subunit can be added to form a hexamer (d). The choice between forming a pentamer and a hexamer is based on which leads to a lower energy per subunit in the growing shell. If the energy per subunit becomes lower for the formation of a hexamer (EHE_{H}) compared to that of a pentamer (EPE_{P}), then a hexamer forms; otherwise a pentamer assembles. Adapted with permission from [28].

The total elastic energy of the growing shell then depends on two dimensionless parameters, γ=ksb02/kb\gamma=k_{s}b_{0}^{2}/k_{b} and R0/b0R_{0}/b_{0}. The Foppl von Karman (FvK) number γ\gamma indicates the relative difficulty of deforming a subunit from its equilateral triangular shape vs. the difficulty of bending it away from the preferred radius of curvature. Both γ\gamma and R0/b0R_{0}/b_{0} depend on the intrinsic property of proteins (shape and resistance to deformation), and also on the solution circumstances such as pH and ionic strength, and for closed cages also on differentials of osmolyte concentration[21, 40].

To grow a capsid, we consider that the assembly follows the locally minimum energy path, which, under many circumstances, is reasonably close to the most probable growth pathways and thus can yield a meaningful representation of the final assembly products. During the simulations at each growth step a new subunit is added to the growing edge of the incomplete capsid at a position in which the number of neighbors at the vertices of the newly added subunit will be maximized. Hence, the new subunit will be added to a position with the smallest opening angle α\alpha, see Fig. 9b. A defining step in the assembly process corresponds to the formation of pentamers illustrated in Fig. 9. If there are already 5 triangles attached to a vertex on the rim of an incomplete capsid, then the growth can proceed in two different pathways: (i) a pentamer can be formed by attaching the two neighboring edges, see Fig. 9c, or (ii) a hexamer can be assembled by inserting a new subunit, Fig. 9d. Depending which structure has a lower energy per subunit in the growing shell, a pentamer or a hexamer forms [41].

Note that the algorithm discussed above for the virus growth corresponds to the physical situation in which the line tension of incomplete capsid is high or subunit-subunit interaction is weak enough such that the proteins can easily move around to explore the energy landscape and then attach themselves to a position where their number of neighbors are maximized [42]. Furthermore, we allow the shell to relax between the addition of every subunit assuming that the relaxation time of the elastic stresses in the incomplete capsid is shorter than the time it takes for the shell to grow. This is compatible with the assumption of the high line-tension or weak protein-protein interaction [34, 43]. This approach is also consistent with the results of Ref. [10], in which the self-assembly simulations of T=1T=1 structures at different protein concentrations show that slowing the adsorption of free subunits into the core decreases frustration [10].

We emphasize that our simulations are deterministic and thus irreversible, mimicking the situation in which the strength of protein-protein interaction is such that once a pentamer or hexamer forms, it can no longer dissociate [41, 42]. Quite interestingly, in our simulations, we do not see aberrant particles as observed in the assembly of T=1T=1 structures in Ref. [19]. This might be due to the fact that we follow the most probable pathway [42].

To find the locations of high stress points across the capsid, we calculate the elastic stress distribution in the shell through the trace of the stress tensor as follows

σvα\displaystyle\sigma_{v_{\alpha}} =\displaystyle= vβ(ks2A)r<vα,vβ>(r<vα,vβ>b0)\displaystyle\sum_{v_{\beta}}(\frac{k_{s}}{2A})r_{<v_{\alpha},v_{\beta}>}(r_{<v_{\alpha},v_{\beta}>}-b_{0}) (3)

where the summation is over all vertices sharing a bond with the vertex vαv_{\alpha}. The r<vα,vβ>r_{<v_{\alpha},v_{\beta}>} is the distance between vertices α\alpha and β\beta and A=Z(13)(34)b02A=Z(\frac{1}{3})(\frac{\sqrt{3}}{4})b_{0}^{2} is the average area of each vertex with ZZ the coordination number.

To model the genome, we consider that viral RNA in solution is believed to be folded in such a way that its radius of gyration only changes by a factor of 3 inside the capsid [44]. Moreover, several in vitro virus self-assembly experiments indicate that RNA collapses into its final size right at the beginning of the assembly after a small amount of proteins are adsorbed to it, suggesting that the time scale associated with genome condensation is much faster than the overall time scale for the growth process [45, 46, 47]. Therefore, we assume that capsid growth occurs in presence of a globular RNA modeled by a spherical core, which reduces considerably the many degrees of freedoms involved in the problem. This will make the assembly process more tractable, allowing us to explore how the shell size and structure depends on the genome size and the subunit-genome interaction. Thus, we consider a spherical core interacting through Lennard-Jones (LJ) with the capsid proteins

Elj\displaystyle E_{lj} =\displaystyle= vα4ϵlj[(σr<vα,g>)12(σr<vα,g>)6]\displaystyle\sum_{v_{\alpha}}4\epsilon_{lj}[(\frac{\sigma}{r_{<v_{\alpha},g>}})^{12}-(\frac{\sigma}{r_{<v_{\alpha},g>}})^{6}] (4)

where ϵlj\epsilon_{lj} is the potential strength, σ\sigma is the distance at which the LJ potential between the core and triangle vertex (monomer) is zero and r<vα,g>r_{<v_{\alpha},g>} is the distance between the core center and triangle vertex vαv_{\alpha}.

We start the growth process by adding one subunit to the core and then we monitor the growth of the shell. We assume that the solution condition is always such that the proteins prefer to assemble rather than to stay free in the solution. We consider that the concentration of free proteins in solution is such that the capsid proteins nucleate only in one location along the genome, and the rest of capsid grows around the nucleating site.

In the results and discussion section, we will discuss the conditions under which if the spontaneous radius of proteins is smaller than the size of the core, the shell might not grow around the genome but will form an empty shell.

Acknowledgements.
This work was supported by the National Science Foundation through Grant Numbers DMR-1719550 and 2034794(to R.Z.). B.D. acknowledges support from the Army Research Office, under awards W911NF2010072 and W911NF2010071, and from the National Science Foundation, under award CBET-1803440.

Appendix A

Refer to caption
Figure 10: At γ=10\gamma=10 when there is no core, the transition point from T=1T=1 to T=3T=3 is at R0/b0=1.31R_{0}/b_{0}=1.31. However, in the presence of a core whose size is Rc/b0=1.5R_{c}/b_{0}=1.5 (perfect for a T=3T=3 shell), the boundary shifts towards smaller R0/b0=1.24R_{0}/b_{0}=1.24. When the core-protein attraction is weak (ϵ~lj=0.005\tilde{\epsilon}_{lj}=0.005), T=1T=1 forms for small values of R0/b0<1.24R_{0}/b_{0}<1.24 without packaging the core as it will cost a lot of elastic energy. However, increasing R0/b0R_{0}/b_{0} to 1.24, T=3T=3 structures form with the genome inside.
Refer to caption
Figure 11: Structure JJ with 44 subunits and D2D_{2} syemmetry. Reproduced from Ref.[29] with permission from the Royal Society of Chemistry.
Refer to caption
Figure 12: Plots of the energy per subunit for the structures FF (the black solid line) obtained from the deterministic simulations and FF^{*} (the red dashed line) as a function of the number of subunits at R0/b0=1.1R_{0}/b_{0}=1.1, γ=3\gamma=3, Rc/b0=1.4R_{c}/b_{0}=1.4 and the strong LJ potential ϵ~ij=0.1\tilde{\epsilon}_{ij}=0.1. Both structures FF and FF^{*} follow the same pathway until one half of the capsids (25 subunits) are assembled. Then, the two structures follow two different pathways. For the FF structure, the 5 subunits at the edge merge and form a pentamer. However, for the FF^{*} structure, one more subunit will be added and the capsomer will close as a hexamer with higher energy (see Figure 8 in the paper). The energetic cost of the formation of a hexamer is higher at this stage, and as such, the capsid follows the black line pathway and forms the FF structure while the energy of complete FF^{*} structure is lower.
Refer to caption
Figure 13: Map of the stress distribution of the structures EE and T=3T=3 (a) Shell with EE symmetry forms in the deterministic simulations for Rc/b0=1.2R_{c}/b_{0}=1.2, ϵ~lj=0.1\tilde{\epsilon}_{lj}=0.1, R0/b0=1.04R_{0}/b_{0}=1.04 and γ=3\gamma=3. The highest positive stress is along the hexamer belt in the middle of the shell. (b) When we increase the size of the core to Rc/b0=1.5R_{c}/b_{0}=1.5 keeping all the other parameters as in part (a), the structure T=3T=3 forms which has much lower stress compared to the structure EE.

A.1 stress

In this section, we provide the details of the stress calculation used in Fig. 7 in the paper and Fig. 12 in Appendix. Employing the virial formula, we can calculate the stress component for vertex vjv_{j} as follows,

σvαij=12Avβ(xvβixvαi)f<vα,vβ>j,\sigma^{ij}_{v_{\alpha}}=\frac{1}{2A}\sum_{v_{\beta}}(x^{i}_{v_{\beta}}-x^{i}_{v_{\alpha}})f^{j}_{<v_{\alpha},v_{\beta}>}, (5)

where A=Z1334b02A=Z\frac{1}{3}\frac{\sqrt{3}}{4}b_{0}^{2} is the area of vertex vαv_{\alpha} with ZZ the coordination number and xvαix^{i}_{v_{\alpha}}(xvβix^{i}_{v_{\beta}}) is the ithi^{th} component of the position of vertex vαv_{\alpha}(vβv_{\beta}). The summation is over the nearest neighbor vertices. The quantity f<vα,vβ>jf^{j}_{<v_{\alpha},v_{\beta}>} is the jthj^{th} component of the force on vertex vαv_{\alpha} in the <vα,vβ><v_{\alpha},v_{\beta}> bond and is equal to

f<vα,vβ>j=ks(r<vα,vβ>b0)xvβjxvαjr<vα,vβ>,f^{j}_{<v_{\alpha},v_{\beta}>}=k_{s}(r_{<v_{\alpha},v_{\beta}>}-b_{0})\frac{x^{j}_{v_{\beta}}-x^{j}_{v_{\alpha}}}{r_{<v_{\alpha},v_{\beta}>}}, (6)

where ksk_{s} denotes the stretching modulus, r<vα,vβ>r_{<v_{\alpha},v_{\beta}>} is the distance between vαv_{\alpha} and vβv_{\beta}, and b0b_{0} is the equilibrium length of the bond. Combining Eqs. 5 and 6, we obtain

σvαij=ks2Avβ(r<vα,vβ>b0)(xvβixvαi)(xvβjxvαj)r<vα,vβ>.\sigma^{ij}_{v_{\alpha}}=\frac{k_{s}}{2A}\sum_{v_{\beta}}(r_{<v_{\alpha},v_{\beta}>}-b_{0})\frac{(x^{i}_{v_{\beta}}-x^{i}_{v_{\alpha}})(x^{j}_{v_{\beta}}-x^{j}_{v_{\alpha}})}{r_{<v_{\alpha},v_{\beta}>}}. (7)

The trace of the stress tensor thus becomes,

σvαkk=ks2Avβr<vα,vβ>(r<vα,vβ>b0).\sigma^{kk}_{v_{\alpha}}=\frac{k_{s}}{2A}\sum_{v_{\beta}}r_{<v_{\alpha},v_{\beta}>}(r_{<v_{\alpha},v_{\beta}>}-b_{0}). (8)

References

  • King et al. [2011] A. M. King, E. Lefkowitz, M. J. Adams, and E. B. Carstens, Virus Taxonomy: Ninth Report of the International Committee on Taxonomy of Viruses, Vol. 9 (Elsevier, San Diego, 2011).
  • Flint et al. [2000] S. Flint, L. Enquist, R. Krug, V. Racaniello, and A. Skalka, Principles of Virology: Molecular Biology, Pathogenesis, and Control (ASM Press: Washington D.C., 2000) pp. 66–83.
  • Rao [2006] A. L. N. Rao, Genome Packaging by Spherical Plant RNA Viruses, Annu. Rev. Phytopathol. 44, 61 (2006).
  • Weiss [2005] P. Weiss, Armor-plated puzzle: Deciphering the code of viral geometry, Sci. News 168, 152 (2005).
  • Johnson and Speir [1997] J. E. Johnson and J. A. Speir, Quasi-equivalent viruses: A paradigm for protein assemblies, J. Mol. Biol. 269, 665 (1997).
  • Twarock and Luque [2019] R. Twarock and A. Luque, Structural puzzles in virology solved with an overarching icosahedral design principle, Nat. Commun. 10, 1 (2019).
  • Zandi et al. [2020] R. Zandi, B. Dragnea, A. Travesset, and R. Podgornik, On Virus Growth and Form, Phys. Rep. 847, 1 (2020).
  • Hagan and Grason [2021] M. F. Hagan and G. M. Grason, Equilibrium Mechanisms of Self-Limiting Assembly, Rev. Mod. Phys. 93, 025008 (2021)2007.01927 .
  • Chen et al. [2007] T. Chen, Z. Zhang, and S. C. Glotzer, A precise packing sequence for self-assembled convex structures, PNAS 104, 717 (2007).
  • Elrad and Hagan [2008] O. M. Elrad and M. F. Hagan, Mechanisms of size control and polymorphism in viral capsid assembly, Nano lett. 8, 3850 (2008).
  • Hu et al. [2008] Y. Hu, R. Zandi, A. Anavitarte, C. M. Knobler, and W. M. Gelbart, Packaging of a Polymer by a Viral Capsid: The Interplay between Polymer Length and Capsid Size, Biophys. J. 94, 1428 (2008).
  • Hagan [2014] M. F. Hagan, Modeling Viral Capsid Assembly, Adv. Chem. Phys. 155, 1 (2014).
  • Dong et al. [2020] Y. Dong, S. Li, and R. Zandi, Effect of the charge distribution of virus coat proteins on the length of packaged rnas, Phys. Rev. E 102, 062423 (2020).
  • Li et al. [2017] S. Li, G. Erdemci-Tandogan, P. van der Schoot, and R. Zandi, The effect of rna stiffness on the self-assembly of virus particles, J. Condens. Matter Phys. 30, 044002 (2017).
  • Erdemci-Tandogan et al. [2016] G. Erdemci-Tandogan, J. Wagner, P. Van Der Schoot, R. Podgornik, and R. Zandi, Effects of rna branching on the electrostatic stabilization of viruses, Phys. Rev. E 94, 022408 (2016).
  • Zhang and Linse [2014] R. Zhang and P. Linse, Topological effects on capsomer-polyion co-assembly, J. Chem. Phys. 140, 244903 (2014).
  • Sun et al. [2007] J. Sun, C. DuFort, M.-C. Daniel, A. Murali, C. Chen, K. Gopinath, B. Stein, M. De, V. M. Rotello, A. Holzenburg, C. C. Kao, and B. Dragnea, Core-controlled polymorphism in virus-like particles, PNAS 104, 1354 (2007).
  • Garmann et al. [2014] R. F. Garmann, M. Comas-Garcia, A. Gopal, C. M. Knobler, and W. M. Gelbart, The assembly pathway of an icosahedral single-stranded rna virus depends on the strength of inter-subunit attractions, J. Mol. Biol. 426, 1050 (2014).
  • Elrad and Hagan [2010] O. M. Elrad and M. F. Hagan, Encapsulation of a polymer by an icosahedral virus, Phys Biol 7, 045003 (2010).
  • Bruinsma et al. [2016] R. F. Bruinsma, M. Comas-Garcia, R. F. Garmann, and A. Y. Grosberg, Equilibrium self-assembly of small rna viruses, Phys. Rev. E 93, 032405 (2016).
  • Fejer et al. [2010] S. N. Fejer, D. Chakrabarti, and D. J. Wales, Emergent complexity from simple anisotropic building blocks: Shells, tubes, and spirals, ACS nano 4, 219 (2010).
  • Stockley et al. [2013a] P. G. Stockley, R. Twarock, S. E. Bakker, A. M. Barker, A. Borodavka, E. Dykeman, R. J. Ford, A. R. Pearson, S. E. Phillips, N. A. Ranson, and R. Tuma, Packaging signals in single-stranded rna viruses: Nature’s alternative to a purely electrostatic assembly mechanism, J. Biol. Phys. 39, 277 (2013a).
  • Twarock and Stockley [2019] R. Twarock and P. G. Stockley, Rna-mediated virus assembly: Mechanisms and consequences for viral evolution and therapy, Annu. Rev. Biophys 48, 495 (2019).
  • Comas-Garcia [2019] M. Comas-Garcia, Packaging of genomic rna in positive-sense single-stranded rna viruses: A complex story, Viruses 11, 253 (2019).
  • Bond et al. [2020] K. Bond, I. B. Tsvetkova, J. C.-Y. Wang, M. F. Jarrold, and B. Dragnea, Virus assembly pathways: Straying away but not too far, Small 16, 2004475 (2020).
  • Caspar and Klug [1962] D. L. Caspar and A. Klug, Physical principles in the construction of regular viruses, Cold Spring Harbor Symp. Quant. Biol. 27, 1 (1962).
  • Garmann et al. [2015] R. F. Garmann, M. Comas-Garcia, C. M. Knobler, and W. M. Gelbart, Physical principles in the self-assembly of a simple spherical virus, Acc. Chem. Res. 49, 48 (2015).
  • Wagner and Zandi [2015] J. Wagner and R. Zandi, The robust assembly of small symmetric nanoshells, Biophys. J. 109, 956 (2015).
  • Panahandeh et al. [2018] S. Panahandeh, S. Li, and R. Zandi, The equilibrium structure of self-assembled protein nano-cages, Nanoscale 10, 22802 (2018).
  • Fotin et al. [2004] A. Fotin, Y. Cheng, P. Sliz, N. Grigorieff, S. C. Harrison, T. Kirchhausen, and T. Walz, Molecular model for a complete clathrin lattice from electron cryomicroscopy, Nature 432, 573 (2004).
  • Cheng et al. [2007] Y. Cheng, W. Boll, T. Kirchhausen, S. C. Harrison, and T. Walz, Cryo-electron tomography of clathrin-coated vesicles: Structural implications for coat assembly, J. Mol. Biol. 365, 892 (2007).
  • Twarock et al. [2018] R. Twarock, R. J. Bingham, E. C. Dykeman, and P. G. Stockley, A modelling paradigm for rna virus assembly, Curr. Opin. Virol. 31, 74 (2018).
  • Lucas et al. [2001] R. W. Lucas, Y. G. Kuznetsov, S. B. Larson, and A. McPherson, Crystallization of Brome Mosaic Virus and T = 1 Brome Mosaic Virus Particles Following a Structural Transition., Virology 286, 290 (2001).
  • Li et al. [2018] S. Li, P. Roy, A. Travesset, and R. Zandi, Why large icosahedral viruses need scaffolding proteins, PNAS 115, 10971 (2018).
  • Panahandeh et al. [2020] S. Panahandeh, S. Li, L. Marichal, R. Leite Rubim, G. Tresset, and R. Zandi, How a virus circumvents energy barriers to form symmetric shells, ACS nano 14, 3170 (2020).
  • Paquay et al. [2016] S. Paquay, H. Kusumaatmaja, D. J. Wales, R. Zandi, and P. van der Schoot, Energetically favoured defects in dense packings of particles on spherical surfaces, Soft Matter 12, 5708 (2016).
  • Li et al. [2019] S. Li, R. Zandi, A. Travesset, and G. M. Grason, Ground states of crystalline caps: Generalized jellium on curved space, Phys. Rev. Lett. 123, 145501 (2019).
  • Vernizzi et al. [2011] G. Vernizzi, R. Sknepnek, and M. O. De La Cruz, Platonic and archimedean geometries in multicomponent elastic membranes, PNAS 108, 4292 (2011).
  • Hicks and Henley [2006] S. D. Hicks and C. Henley, Irreversible growth model for virus capsid assembly, Phys. Rev. E 74, 031912 (2006).
  • Zeng et al. [2021] C. Zeng, L. Scott, A. Malyutin, R. Zandi, P. Van der Schoot, and B. Dragnea, Virus mechanics under molecular crowding, J. Phys. Chem 125, 1790 (2021).
  • Ning et al. [2016] J. Ning, G. Erdemci-Tandogan, E. L. Yufenyuy, J. Wagner, B. A. Himes, G. Zhao, C. Aiken, R. Zandi, and P. Zhang, In Vitro protease cleavage and computer simulations reveal the hiv-1 capsid maturation pathway, Nat. Commun. 7, 13689 (2016).
  • Levandovsky and Zandi [2009] A. Levandovsky and R. Zandi, Nonequilibirum assembly, retroviruses, and conical structures, Phys. Rev. Lett 102, 198102 (2009).
  • Zandi et al. [2006] R. Zandi, P. van der Schoot, D. Reguera, W. Kegel, and H. Reiss, Classical nucleation theory of virus capsids, Biophys. J. 90, 1939 (2006).
  • Yoffe et al. [2011] A. M. Yoffe, P. Prinsen, W. M. Gelbart, and A. Ben-Shaul, The ends of a large rna molecule are necessarily close, Nucleic Acids Res. 39, 292 (2011).
  • Borodavka et al. [2012] A. Borodavka, R. Tuma, and P. G. Stockley, Evidence that viral rnas have evolved for efficient, two-stage packaging, PNAS 109, 15769 (2012).
  • Chevreuil et al. [2018] M. Chevreuil, D. Law-Hine, J. Chen, S. Bressanelli, S. Combet, D. Constantin, J. Degrouard, J. Möller, M. Zeghal, and G. Tresset, Nonequilibrium self-assembly dynamics of icosahedral viral capsids packaging genome or polyelectrolyte, Nat. Commun. 9, 3071 (2018).
  • Daniel et al. [2010] M. C. Daniel, I. B. Tsvetkova, Z. T. Quinkert, A. Murali, M. De, V. M. Rotello, C. C. Kao, and B. Dragnea, Role of Surface Charge Density in Nanoparticle-Templated Assembly of Bromovirus Protein Cages, ACS Nano 4, 3853 (2010).
  • Stockley et al. [2013b] P. G. Stockley, R. Twarock, S. E. Bakker, A. M. Barker, A. Borodavka, E. Dykeman, R. J. Ford, A. R. Pearson, S. E. V. Phillips, N. A. Ranson, and R. Tuma, Packaging Signals in Single-Stranded RNA Viruses: Nature’s Alternative to a Purely Electrostatic Assembly Mechanism, J. Biol. Phys. 39, 277 (2013b).