This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The Effect of a Knot on the Thermal Stability of Protein MJ0366 : Insights from Molecular Dynamics and Monte Carlo Simulations

A. M. Begun Pacific Quantum Center, Far Eastern Federal University, 690922, Vladivostok, Russia Nordita, Stockholm University, Stockholm, Sweden    A. A. Korneev Pacific Quantum Center, Far Eastern Federal University, 690922, Vladivostok, Russia    Anastasiia Zorina Pacific Quantum Center, Far Eastern Federal University, 690922, Vladivostok, Russia
Abstract

Protein MJ0366 is a hypothetical protein from Methanocaldococcus jannaschii that has a rare and complex knot in its structure. The knot is a right-handed trefoil knot that involves about half of the protein’s residues. In this article, we investigate the thermal stability of protein MJ0366 using numerical simulations based on molecular dynamics and Monte Carlo methods. We compare the results with those of a similar unknotted protein and analyze the effects of the knot on the folding and unfolding processes. We show that the knot in protein MJ0366 increases its thermal stability by creating a topological barrier that prevents the protein from unfolding at high temperatures. We also discuss the possible biological implications of the knot for the function and evolution of protein MJ0366 .

I Introduction

About 1 % of protein molecule structures presented in the PDB bank have a knotted structure Jamroz et al. (2014); Dabrowski-Tumanski et al. (2018a). Their backbone can be tied into different types of the simplest knots (see examples in Fig. 1) according to the list of tabulating prime knots Hoste et al. (1998). In a protein, these knots are interpreted as topological barriers that dramatically affect the protein folding processes. In particular, the presence of such a topological barrier affects the temperature stability of the molecule Sulkowska (2020); Hsu (2023). Moreover, knotted methyltransferases exhibit a trefoil configuration even in the denatured state Mallam et al. (2010); Wang et al. (2013), which also indicates a non-trivial role for the knot in protein molecules.

The hypothetical protein MJ0366 from Methanocaldococcus jannaschii (PDB ID - 2EFV) has a knot structure of a left-handed trefoil and is the shortest knot protein available in the PDB bank Thiruselvam et al. (2017). Due to its small size (82 amino acids) and single-domain structure, it is the subject of many theoretical Noel et al. (2010); a Beccara et al. (2013); Noel et al. (2013); Chwastyk and Cieplak (2015); Dabrowski-Tumanski et al. (2015); Soler et al. (2016); Niewieczerzal and Sulkowska (2017); Paissoni et al. (2021); Especial and Faísca (2021); Wang and Peng (2022) and experimental studies Wang et al. (2015); Ramírez et al. (2017); Rivera et al. (2020, 2023). One of the first experiments was the study of the folding dynamics and kinetic pathways of MJ0366 under chemical denaturation Wang et al. (2015). Then single molecule force spectroscopy experiment was showed that Immunoglobulin Binding Protein affects on the folding of MJ0366 Ramírez et al. (2017). Using optical tweezers, it was shown that the mechanical folding process exhibits a two-state mechanism Rivera et al. (2020). Moreover, the results of experiments on the study of the temperature stability of this protein were recently presented, where a non-trivial temperature dependence was observed Rivera et al. (2023).

At the same time, the conducted theoretical studies lead to different results. In this connection, the question arises about the correct description of folding/unfolding of this MJ0366 protein.

One of the first paper on in silico study of MJ0366 used all-atom structure-based model Noel et al. (2010). Knot formation was shown to occur in three stages (with an intermediate slipknot conformation) begins with the CC-end of the chain being threaded through the twisted loop. Later, a full-atom model , but with realistic interatomic forces based on DPR approach, showed qualitatively the same results a Beccara et al. (2013). It was demonstrated that open region of that twisted loop created by native β\beta-sheet. In Noel et al. (2013) analogous spontaneous knotting has been demonstrated using unbiased all-atom explicit-solvent simulations. Subsequently, in a coarse-grained model, it was shown that such a single-loop formation is not dominant for MJ0366, and a more frequent number of folding trajectories fall on the two-loop mechanism Chwastyk and Cieplak (2015). Simulations in different approaches were also carried out on the influence of chaperones on knotting processes Soler et al. (2016); Niewieczerzal and Sulkowska (2017).

Determination of the minimum number of amino acid contacts required to form a knot Dabrowski-Tumanski et al. (2015) and a method for identifying key residues in a knot Wang and Peng (2022) were given using the MJ0366 as an example. Folding stability of this protein has also been theoretically studied by by mutating individual residues Paissoni et al. (2021). The set of heterogeneous native residues required for efficient knotting was also investigated Especial and Faísca (2021).

In order to study the dynamic properties of MJ0366, we use the Landau-Ginzburg-Wilson approach, which is based on the principles of universality and gauge symmetry Chernodub et al. (2010). This approach has already been used to study biophysical properties of proteins. A phase diagram of myoglobin was constructed Begun et al. (2019). Investigation of the folding of a protein that has a slipknot type of a knot in its native state Begun et al. (2021). The temperature stability of α\alpha-synuclein, which is one of the components of Lewy bodies and a pathogenic criterion for various neuro diseases, in particular, Parkinson’s disease, was also assessed Korneev et al. (2022). In this paper we study the folding pathways of MJ0366 by using the Landau-Ginzburg-Wilson approach. We obtain tertiary structure of the protein in terms of kinks Chernodub et al. (2010) and analyze how the knot affects on the temperature stability of the protein MJ0366.

Refer to caption
Figure 1: Possible knot types. Panel a) stands for trefoil (313_{1}) knot, panel b) is figure eight (414_{1}) knot, panel c) three-twist (525_{2}) knot, d) Stevedor knot (61)6_{1})

II Materials and methods

II.1 The model

We use the approach of topological solitons developed by Antti J. Niemi Niemi (2014). It is based on representing the protein as a partially linear chain consisting of CαC_{\alpha} atoms and links that connect them (backbone). The positions of all CαC_{\alpha} atoms in the protein determine the positions of all other atoms due to rigidity of the peptide group.

This curve is considered in terms of discrete Frenet coordinates. In each residue, the Frenet frame (Fig. 2) consisting of three vectors (t,b,n)(\vec{t},\vec{b},\vec{n}) is built:

{dgroup}
ti=ri+1ri|ri+1ri|,\vec{t}_{i}=\frac{\vec{r}_{i+1}-\vec{r}_{i}}{|\vec{r}_{i+1}-\vec{r}_{i}|}, (1)
bi=[ti1,ti]|[ti1,ti]|,\vec{b}_{i}=\frac{[\vec{t}_{i-1},\vec{t}_{i}]}{|[\vec{t}_{i-1},\vec{t}_{i}]|}, (2)
ni=[bi,ti],\vec{n}_{i}=[\vec{b}_{i},\vec{t}_{i}], (3)

where ri\vec{r}_{i} are coordinate vectors of CαC_{\alpha} atoms.

The angles of curvature κi\kappa_{i} and torsion τi\tau_{i} (Fig. 2) act as new coordinates. They can be calculated as follows: {dgroup}[noalign]

κi=arccos(ti+1,ti),\kappa_{i}=\arccos(\vec{t}_{i+1},\vec{t}_{i}), (4)
τi=sign([bi1,bi],ti)arccos(bi+1,bi).\tau_{i}=\mathrm{sign}([\vec{b}_{i-1},\vec{b}_{i}],\vec{t}_{i})\arccos(\vec{b}_{i+1},\vec{b}_{i}). (5)
Refer to caption
Figure 2: Backbone and Frenet frame (t,b,n)(\vec{t},\vec{b},\vec{n})

The free energy function is based on the Abelian Higgs model.

F=i=1N1(κi+1κi)2+i=1N(λ(κi2m2)2+dκi2τi2bκi2τiaτi+cτi2)+i>jNV(|rirj|),F=\sum_{i=1}^{N-1}\left(\kappa_{i+1}-\kappa_{i}\right)^{2}+\sum_{i=1}^{N}\left(\lambda(\kappa_{i}^{2}-m^{2})^{2}+d\kappa_{i}^{2}\tau_{i}^{2}\\ -b\kappa_{i}^{2}\tau_{i}-a\tau_{i}+c\tau_{i}^{2}\right)+\sum_{i>j}^{N}V(|\vec{r}_{i}-\vec{r}_{j}|), (6)

where the terms with coefficients aa and bb stand for conservation of charges in discrete non-linear Schrodinger equation hierarchy called helicity and momentum, this terms provide parity breaking and make protein right-handed chiral. Terms with coeficients cc and dd together constitutes the Kirchhoff energy of an elastic rod. The last term is hard-core Pauli repulsion to avoid self-crossings.

This energy function possesses solutions in the form of topological solitons, and the protein model is built as a composition of solitons (multisoliton). Each soliton describes a region of the protein consisting of parts of regular structures (α\alpha-helices and β\beta-strands) and an intermediate structure between them (loop).

The sets of coefficients (λ,m,a,b,c,d)k(\lambda,m,a,b,c,d)_{k} are individual and uniform for each soliton. Based on these coefficients, the native state can be calculated as a configuration satisfying to the minimum of (6):

{dgroup}
Fκi=2(2κiκi1κi+1)+4κi(κi2m2)+2dκiτi22bκiτi=0,\frac{\partial F}{\partial\kappa_{i}}=2(2\kappa_{i}-\kappa_{i-1}-\kappa_{i+1})+4\kappa_{i}(\kappa_{i}^{2}-m^{2})+2d\kappa_{i}\tau_{i}^{2}{-2b\kappa_{i}\tau_{i}=0,} (7)
Fτ=2dκi2τibκi2a+2cτi=0.\frac{\partial F}{\partial\tau}={2d\kappa_{i}^{2}\tau_{i}-b\kappa_{i}^{2}-a+2c\tau_{i}=0.} (8)

Taking into account that solitons usually aggregate several amino acid residues, the number of parameters needed to describe the protein structure is often less than the amount of amino acids that make up the protein.

As a starting point for modeling the protein structure, we use a structure taken from RCSB Berman (2000). Using a simulated annealing algorithm Pincus (1970), the soliton parameters are chosen such that the configuration corresponding to the minimum free energy corresponds to the initial structure taken from the RCSB data base with a minimum RMSD value.

II.2 Unfolding-folding simulation

The obtained structure can be used for simulation of the unfolding-folding process using Monte Carlo methods. To do this we use Glauber algorithm Glauber (1963); Bortz et al. (1975). In comparison to Metropolis-Hastings algorithm it allows us to choose perturbing site randomly regardless if it was chosen before.

At each Monte Carlo step nn we independently change either the torsion angle or the curvature angle at a randomly chosen site ii:

{dgroup}
τiτi+rστ,\tau_{i}\rightarrow\tau_{i}+r\sigma_{\tau}, (9)
κiκi+rσκ,\kappa_{i}\rightarrow\kappa_{i}+r\sigma_{\kappa}, (10)

where rr is a normally distributed around r=0r=0 random number with dispersion Δr=1\Delta r=1 and στ=σκ=0.01\sigma_{\tau}=\sigma_{\kappa}=0.01 is proportionality factor. And then compute the corresponding change ΔF(κi,τi)\Delta F(\kappa_{i},\tau_{i}) in the free energy. The obtained configuration is being accepted with probability

P=11+eΔF/TG,P=\frac{1}{1+e^{\Delta F/T_{G}}}, (11)

where TGT_{G} is the Glauber temperature factor that allows to simulate temperature changing. It is important to note that it is Monte Carlo temperature and it can not be concided with the physical temperature measured in Kelvin directly. The relation can be found by the methods of renormalization Krokhotin et al. (2013); Peng et al. (2016).

It has been demonstrated Martinelli and Olivieri (1994a, b) that Monte Carlo simulation using equation (11) converges exponentially to the Gibbsian distribution. Therefore, this approach should offer a statistically significant representation of protein folding dynamics near equilibrium during adiabatic temperature changing.

The simulation process consists of three stages: heating (unfolding), thermalization, cooling (folding). During heating and cooling temperature is increasing (decreasing) by exponential law with growth factor defined by required amount of steps and temperature range:

TG=TGb,T_{G}^{\prime}=T_{G}b, (12)
b=lnTmaxTminN1,b=ln\frac{T_{max}}{T_{min}}N^{-1}, (13)

where NN is number of steps required for each stage. This choice allows to guarantee adiabatic nature of the process.

II.3 Knot detection

Knots as topological objects are defined only on closed loops while proteins mostly are not closed. To avoid this different techniques can be applied.

To detect whether protein is knotted or not we used Knot_pull—python package Jarmolinska et al. (2019) by Aleksandra Jarmolińska. Knot_pull refines and streamlines the given structure with topological awareness, ensuring that the inherent chain does not intersect with itself (Fig. 3). Smoothing continues until no further modifications are possible (or the current arrangement has been previously examined), any unnecessary beads are eliminated, retaining only the essential topological points. Based on this structure, the corresponding Alexander-Briggs Alexander and Briggs (1926) notation is calculated.

Refer to caption
Figure 3: Sequential states during the Knot_pull calculations of 2EFV knot type.

This approach is more suitable than others based on the calculation of polynomial invariants for the determination of knot type Lua (2012); Tubiana et al. (2018); Dabrowski-Tumanski et al. (2018b, 2020). To detect knot type these approaches close the loop randomly on infinite radius sphere (Fig. 4), such closure repeats many times and gives only the probability of observing given knot type.

Refer to caption
Figure 4: Closure the protein structure on infinite sphere. By red color additional bounds are marked.

II.4 Studied protein

The protein MJ0366 was selected for investigation due to its status as the smallest known knotted protein. Its relatively small size and knotted topology make it a particularly interesting subject for studying the structural and functional implications of protein knots, providing a simpler model to understand the biological significance and stability of these complex protein structures.

The MJ0366 protein is a hypothetical protein from the archaeon Methanocaldococcus jannaschii Thiruselvam et al. (2017), which is a hyperthermophilic, methane-producing microorganism found in extreme environments such as hydrothermal vents.

M. jannaschii is a member of the Archaea domain, a group of prokaryotes that often thrive in extreme environments. It grows optimally at temperatures above 85\celsius85\celsius and can reduce carbon dioxide to methane using hydrogen, a process known as methanogenesis. This species is used as a model organism to study extremophilic life and early evolutionary processes due to its ancient evolutionary lineage and unique biochemical pathways.

The KnotProt 2.0 package Dabrowski-Tumanski et al. (2018b) detects the knot of type 313_{1} whose core consists of 71 residues, from 6 to 77 amino acids (Figure 6). The knot loop in MJ0366 consists mostly of helices, that probably makes it more tough and thus more stable.

Although the exact biological function of MJ0366 remains hypothetical, knotted proteins, in general, are known for their mechanical and thermal stability. Some knotted proteins have been implicated in various biological processes, including those related to diseases like Parkinson’s disease.

The study of MJ0366 sheds light on the structural complexity of knotted proteins and their potential stability in extreme conditions.

For calculations model 2EFV from PDB database was used.

III Results

III.1 Soliton model of MJ0366

Knotted protein 2EFV consists of 82 amino acids which form 2 β\beta-strands and 5 α\alpha-helices. It means that there are 8 loops and the structure must be describe by at least 8 individual solitons. But β\beta-strands because of their complicated geometry require additional solitons in bending positions. Finally, we construct the native structure of the protein using 12 solitons. It allows us to reach resolution of 0.52 Å RMSD that is less than reported resolution 1.90 Å of experimental data.

The results of constructing are summarized in Figure 5. In panel a) overlay of 3d models experimental (green) and soliton (purple) structures is represented. Panel b) shows how the angles of curvature and torsion behave in experimental data and in the soliton model.

Refer to caption Refer to caption
Figure 5: Overlaying of experimental model taken from PDB (green) and soliton model (purple). RMSD of them is 0.52 Å. Right pannel represents comparison of curvature (a) and torsion (b) of soliton model and PDB data.
Refer to caption Refer to caption
Figure 6: 3d model and knottines Dabrowski-Tumanski et al. (2018b) of 2EFV. Knot tails are denoted with gray color and not core with blue color.

III.2 Unfolding-folding simulation results

We performed simulations of the unfolding-folding process and inspected 3d models of the protein.

We found that 2EFV does not fold back to its native state once knot was untied after slow enough heating to the temperature of self-avoiding random walk (i.e. denaturation). It becomes clear from the graphs of radius of gyration RGR_{G} (Figure 7). In the panel c) enlarged area around TG=1012T_{G}=10^{-12} preciding to unfolding process is shown. The radius of gyration undergoes a shrinkage that is connected to unknotting of the molecule Lundgren and Niemi (2009). Meanwhile in panel d) which corresponds to the same area, but during the cooling process, shrinkage is totally absent.

Refer to caption
Figure 7: Radius of gyration in heat-cool simulation.

We studied the temperature range in which protein can be folded back to the native state. To do that, we heated the protein to the required temperature (NstepsN_{steps}), did a short thermalization (Nsteps/10N_{steps}/10) and then cooled it back to TG=1017T_{G}=10^{-17}. We present probability distributions of RMSD after heating and cooling to a certain temperature, the results are represented in Figure 8.

Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 8: Probability distribution of RMSD after Nsteps=1N_{steps}=1 million steps cooling process from certain temperature TGT_{G} to temperature of native state TG0=1017T_{G_{0}}=10^{-17}.

We checked all obtained configurations by Knot_pull and found that all and only configurations which belong to the first peak must be considered as knotted ones. Other peaks are products of misfolding to local energy minimums.

Refer to caption
Figure 9: Percent of unknotted configurations

The series of simulation was provided. We studied the dependences of the percantage of knotted configurations after heating and cooling from the maximum temperature in the heating-cooling process and the amount of Monte Carlo steps in the process. As follows from trajectories analysis the unknotted configuration can not be tied back, and after the cooling if the configuration is still knotted, it means that it was not untied. So, we can interpret this graph as percent of untied configurations.

The results obtained were fitted by the sigmoid function

f(TG)=a(1+eb(logTGlogTcrit)).f(T_{G})=\frac{a}{(1+e^{-b(\log T_{G}-\log T_{\mathrm{crit}})})}. (14)

The parameter TcritT_{\mathrm{crit}} in (14) can be interpreted as as a critical temperature for particular heating velocity at which protein denatures irreversibly. We plot extracted from fit values of critical temperature and fit it with linear regression. The critical temperature at infinitely slow heating turns out to be

log10Tcrit=11.18±0.05.\log_{10}T_{\mathrm{crit}}=-11.18\pm 0.05.
Refer to caption
Figure 10: Critical temperature dependence of heating velocity.

Thus, we see that 2EFV is relatively thermally stable, especially at high heating velocities. We propose that the high thermal stability of 2EFV is connected to the knot in its structure. To prove that we start simulations not from native state but from unknotted deeply cooled configuration. The unknotted configurations we chose from a huge variety of configurations cooled after unknotting. We made several calculations starting from different configurations closest to the native state having RMSD in a range from 3 to 5 Å  and radius of gyration from 12.20 to 12.30 Å.

Refer to caption Refer to caption
Figure 11: Radius of gyration in heat-cool simulation

It is clear from the graphs that the unknotted configurations starts unfolding earlier for two orders of magnitude of Glauber temperature than the native knotted configuration. Also, it does not shrink, and we can conclude that shrinking before unfolding is connected with the unknotting of the structure.

IV Conclusions

In conclusion, our study demonstrates that topology plays a crucial role in the dynamics of proteins, particularly in their thermal stability and folding behavior. Through the investigation of the protein MJ0366, which contains a trefoil knot in its structure, we have elucidated several key aspects of how this topological feature influences protein behavior.

Our findings indicate that the knot within the MJ0366 protein contributes significantly to its thermal stability. This enhanced stability is intricately linked to the speed of heating, suggesting that the rate of thermal application is a critical factor in maintaining the structural integrity of knotted proteins. During the unfolding process, we observed that the protein undergoes a noticeable shrinking, a phenomenon that correlates with the unknotting of the protein structure. This connection between shrinking and unknotting highlights the mechanical and structural impact of the knot on the protein’s response to thermal stress.

Furthermore, our research has shown that once the MJ0366 protein is unknotted, it cannot refold back into its original knotted configuration. This irreversibility underscores the unique and critical role of the knot in maintaining the protein’s functional state and stability.

The implications of our study extend beyond the specific case of the MJ0366 protein. Understanding the topology of knots and other forms of self-entanglement in proteins is essential for comprehending their stability, folding mechanisms, and functional properties. As such, future research into the topological characteristics of proteins promises to unveil new insights into protein dynamics, with potential applications in the design of more stable and functional proteins for various biotechnological and medical purposes.

In summary, our work emphasizes the importance of considering topological features in protein studies and opens new avenues for exploring how these features can be harnessed to control and enhance protein stability and functionality.

V Acknowlegemnts

The authors are grateful to Antti Niemi and Alexander Molochkov for useful discussions. A.B. is supported by the Carl Trygger Foundation Grant No. CTS 18:276. Nordita is supported in part by Nordforsk. AK and AZ are supported by the state assignment of the Ministry of Science and Higher Education of Russia (Project No. FZNS-2024-0002).

References