
Learning Markovian Dynamics with Spectral Maps

Jakub Rydzewski [email protected] and Tuğçe Gökdemir
Institute of Physics, Faculty of Physics, Astronomy and Informatics, Nicolaus Copernicus University, Grudziadzka 5, 87-100 Toruń, Poland
Abstract

The long-time behavior of many complex molecular systems can often be described by Markovian dynamics in a slow subspace spanned by a few reaction coordinates referred to as collective variables (CVs). However, determining CVs poses a fundamental challenge in chemical physics. Relying on intuition or trial and error to construct CVs can lead to non-Markovian dynamics with long memory effects, hindering analysis. To address this problem, we continue to develop a recently introduced deep-learning technique called spectral map [Rydzewski, J. Phys. Chem. Lett. 2023, 14, 22, 5216–5220]. Spectral map learns slow CVs by maximizing the spectral gap of a Markov transition matrix describing anisotropic diffusion. Here, to represent heterogeneous and multiscale free-energy landscapes with spectral map, we implement an adaptive algorithm to estimate transition probabilities. Through a Markov state model analysis, we validate that spectral map learns slow CVs related to the dominant relaxation timescales and distinguishes between long-lived metastable states.

I Introduction

Understanding the long-timescale dynamics of complex molecular systems constitutes a fundamental problem in chemical physics [1]. This dynamics is often governed by rare transitions between multiple long-lived metastable states that involve overcoming barriers much higher than the thermal energy ($\gg k_{\mathrm{B}}T$) [2, 3]. Although such systems span many spatial and temporal scales, their metastable character frequently gives rise to slow relaxation dynamics that can be captured by a few reaction coordinates, also called collective variables (CVs), which demarcate the physical states [4, 5]. The long-timescale dynamics is often considered effectively Markovian and modeled as diffusion in a free-energy landscape spanned by CVs [6, 7, 8]. Under this view, the dynamics mainly depends on slowly varying variables, while fast degrees of freedom are treated as uncoupled thermal noise, leading to adiabatic timescale separation [9].

Relying merely on physical intuition or trial and error to identify slow CVs can be unsystematic and result in highly non-Markovian dynamics with long memory effects, which can hinder our understanding of the underlying physical process [10, 11]. The problem of timescale separation has been addressed in the Mori–Zwanzig projection operator formalism by constructing a generalized Langevin equation that describes the dynamics in a slow subspace. However, as noted by Zwanzig, “no a priori choice is specifically indicated by theory” [12] to select this slow subspace. Recently, there has been considerable interest in constructing reduced representations of complex systems using data-driven techniques [13, 14, 15, 16]. Many such techniques are developed to learn CVs by separating slow and fast kinetics, which is particularly useful for metastable systems with multiple temporal scales [17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34].

In this Communication, we further improve spectral map [34], an unsupervised statistical learning technique inspired by diffusion map [35, 36, 37] and parametric dimensionality reduction [38, 39, 40, 41, 42]. Spectral map constructs slow CVs that describe Markovian dynamics by maximizing the spectral gap between slow and fast eigenvalues of a Markov transition matrix, thereby increasing timescale separation and suppressing memory effects. We improve the learning process of spectral map by implementing an adaptive algorithm for estimating transition probabilities, enabling multiscale and heterogeneous free-energy landscapes with long-lived metastable states to be represented according to their physical characteristics. Finally, through a standard Markov state model analysis, we validate that CVs constructed by spectral map encode the dominant slow relaxation timescales.

II Collective Variables

Consider a high-dimensional system described by $n$ configuration variables $\mathbf{x}=(x_{1},\dots,x_{n})$ whose dynamics at temperature $T$ under a potential energy function $U(\mathbf{x})$ is sampled from an unknown equilibrium distribution. If we represent the system by its microscopic coordinates, the dynamics follows a canonical equilibrium distribution given by the Boltzmann density $p(\mathbf{x})=\mathrm{e}^{-\beta U(\mathbf{x})}/Z$, where $\beta=1/(k_{\mathrm{B}}T)$ is the inverse temperature, $k_{\mathrm{B}}$ is the Boltzmann constant, and $Z=\int\mathrm{d}\mathbf{x}\,\mathrm{e}^{-\beta U(\mathbf{x})}$ is the partition function of the system.

We simplify the high-dimensional configuration space by mapping it into a reduced space $\mathbf{z}=(z_{1},\dots,z_{d})$ given by a set of $d$ functions of the configuration variables, commonly referred to as CVs, where $d\ll n$. We encapsulate these functions in a target mapping [41, 42, 16]:

$\mathbf{z}=\xi_{w}(\mathbf{x})\equiv\big\{\xi_{k}(\mathbf{x},w)\big\}_{k=1}^{d}$, (1)

where $w$ are learnable parameters ensuring that the target mapping describes slow dynamics and minimizes memory effects. By sampling the system in the CV space, its dynamics follows a marginal equilibrium density $p(\mathbf{z})\propto\mathrm{e}^{-\beta F(\mathbf{z})}$, where $F(\mathbf{z})$ is a free-energy landscape:

$F(\mathbf{z})=-\frac{1}{\beta}\log\int\mathrm{d}\mathbf{x}\,\delta\left(\mathbf{z}-\xi_{w}(\mathbf{x})\right)\mathrm{e}^{-\beta U(\mathbf{x})}$ (2)

defined modulo an irrelevant constant.
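To make the construction concrete, below is a minimal sketch (not the authors' reference implementation) of a parametric target mapping and a histogram estimate of the free-energy landscape of Eq. 2. The network sizes, reduced units, and placeholder data are illustrative assumptions.

```python
import numpy as np
import torch
import torch.nn as nn

n, d = 45, 2              # numbers of configuration variables and CVs (assumed)
beta = 1.0                # inverse temperature 1/(kB*T) in reduced units

# Target mapping z = xi_w(x) of Eq. 1, modeled as a small MLP with parameters w.
xi_w = nn.Sequential(
    nn.Linear(n, 50), nn.ELU(),
    nn.Linear(50, d),
)

x = torch.randn(10_000, n)            # placeholder for sampled configurations
z = xi_w(x).detach().numpy()          # CV samples

# F(z) = -(1/beta) log p(z) from a histogram of the CV samples (Eq. 2);
# the irrelevant additive constant is fixed by setting min F = 0.
p, *_ = np.histogram2d(z[:, 0], z[:, 1], bins=60, density=True)
with np.errstate(divide="ignore"):
    F = -np.log(p) / beta
F -= F[np.isfinite(F)].min()
```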

Figure 1: Training the target mapping $\xi_{w}(\mathbf{x})=\mathbf{z}$ for the three-state Müller–Brown potential from Ref. 43. (a) Schematic outline of a neural network used to model the target mapping. The reduction from the high-dimensional variables $\mathbf{x}$ to low-dimensional slow CVs $\mathbf{z}$ is performed by maximizing the spectral gap $\sigma=\lambda_{k-1}-\lambda_{k}$, where $k$ is the number of metastable states in the CV space and $\{\lambda_{k}\}$ are eigenvalues from the spectral decomposition of the Markov transition matrix $M$ (Eq. 6). (b) Free-energy landscape of a particle moving in the two-dimensional space $\mathbf{x}=(x_{1},x_{2})$ with barriers between the states of around $20\,k_{\mathrm{B}}T$ and a kinetic bottleneck between states B and C. (c-f) Examples of training the target mappings with different network architectures, e.g., $\mathbf{x}^{[a,\dots,b]}\mapsto z$ denotes mapping the $\mathbf{x}$ variables through a network consisting of $[a,\dots,b]$ layers to the $z$ CV. Spectral gap values $\sigma$ are given in the top-left corners. The following variables are used as CVs: (c) the $x_{1}$ variable, (d) the $x_{2}$ variable, (e) a linear combination of $x_{1}$ and $x_{2}$, and (f) a nonlinear combination of $x_{1}$ and $x_{2}$. For additional details and results, see the Supplementary Material (Sec. S3).

III Spectral Map

To estimate the separation between effective timescales characteristic of the system, we model its reduced dynamics as a Markov chain using kernel functions. To measure the similarity between CV samples $\mathbf{z}_{k}$ and $\mathbf{z}_{l}$, we introduce a Gaussian kernel for the reduced representation:

$G(\mathbf{z}_{k},\mathbf{z}_{l})=\exp\left(-\frac{\lVert\mathbf{z}_{k}-\mathbf{z}_{l}\rVert^{2}}{\varepsilon}\right)$ (3)

where $\lVert\mathbf{z}_{k}-\mathbf{z}_{l}\rVert$ denotes the Euclidean distance between CV samples $k,l=1,\dots,N$, and $N$ is the number of samples. The Gaussian kernel exhibits a notion of locality by defining a neighborhood of radius $\varepsilon$ around each sample [17].
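As a sketch, the Gaussian kernel of Eq. 3 with a constant scale factor takes only a few lines; here torch.cdist computes the pairwise Euclidean distances.

```python
import torch

def gaussian_kernel(z: torch.Tensor, eps: float) -> torch.Tensor:
    """G_kl = exp(-||z_k - z_l||^2 / eps), Eq. 3, for all pairs of CV samples."""
    dist = torch.cdist(z, z)              # (N, N) pairwise Euclidean distances
    return torch.exp(-dist.pow(2) / eps)
```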

In our previous proof-of-concept work that proposed spectral map [34], $\varepsilon$ was kept constant throughout the learning process. However, metastable states often have multimodal characteristics, resulting in spatially heterogeneous free-energy landscapes. Therefore, to improve the ability of spectral map to adjust to metastable states, we estimate sample-dependent scale factors by adaptively balancing local and global scales:

$\varepsilon_{kl}(r)=\lVert\mathbf{z}_{k}-\eta_{r}(\mathbf{z}_{k})\rVert\cdot\lVert\mathbf{z}_{l}-\eta_{r}(\mathbf{z}_{l})\rVert$, (4)

where each norm defines a ball centered at $\mathbf{z}$ whose positive radius is set by the neighbor $\eta_{r}(\mathbf{z})$. For convenience, we define the radius by the fraction of the neighborhood size $r\in[0,1]$, allowing us to decide which scale is more relevant. Specifically, for values of $r$ close to 0 (i.e., the nearest neighbors), the Gaussian kernel describes a local neighborhood around each sample, corresponding to deep and narrow states. For values of $r$ around 1 (the farthest neighbors), it incorporates more global information, corresponding to shallow and wide states. Intermediate values of $r$ maintain a balance between spatial scales. For additional details about estimating $\varepsilon_{kl}$, see the Supplementary Material (Sec. S2).
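One plausible implementation of the adaptive scale factors is sketched below, assuming $\eta_{r}(\mathbf{z})$ is the neighbor of $\mathbf{z}$ at fraction $r$ of the sorted neighborhood; the exact estimator is described in Sec. S2 of the Supplementary Material.

```python
import torch

def scale_factors(z: torch.Tensor, r: float) -> torch.Tensor:
    """eps_kl(r) = ||z_k - eta_r(z_k)|| * ||z_l - eta_r(z_l)||, Eq. 4."""
    dist = torch.cdist(z, z)                        # (N, N) pairwise distances
    idx = int(r * (z.shape[0] - 1))                 # neighbor at fraction r
    radii = dist.sort(dim=1).values[:, idx]         # distance to that neighbor
    return radii.unsqueeze(1) * radii.unsqueeze(0)  # (N, N) outer product

# The adaptive kernel then replaces the constant eps of Eq. 3:
# G = torch.exp(-torch.cdist(z, z).pow(2) / scale_factors(z, r))
```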

As the marginal equilibrium density in the CV space is often far from uniform for dynamical systems with complex free-energy landscapes, we need a density-preserving kernel for data sampled from any underlying probability distribution. For this, we employ an anisotropic diffusion kernel as introduced for diffusion maps [35]:

$K(\mathbf{z}_{k},\mathbf{z}_{l})=\frac{G(\mathbf{z}_{k},\mathbf{z}_{l})}{\sqrt{\varrho(\mathbf{z}_{k})\,\varrho(\mathbf{z}_{l})}}$, (5)

where $\varrho(\mathbf{z}_{k})=\sum_{l}G(\mathbf{z}_{k},\mathbf{z}_{l})$ is a kernel density estimate. Next, we build a Markov transition matrix by row-normalizing $K$:

$m_{kl}\sim M(\mathbf{z}_{k},\mathbf{z}_{l})=\frac{K(\mathbf{z}_{k},\mathbf{z}_{l})}{\sum_{n}K(\mathbf{z}_{k},\mathbf{z}_{n})}$ (6)

which models a discrete Markov chain in the CV space, with $m_{kl}=\mathrm{Pr}\{\mathbf{z}_{\tau+1}=\mathbf{z}_{l}\,|\,\mathbf{z}_{\tau}=\mathbf{z}_{k}\}$ expressing the probability of a transition between CV samples $\mathbf{z}_{k}$ and $\mathbf{z}_{l}$ in an auxiliary (non-physical) time $\tau$. The Markov chain approximates the long-time asymptotics of the system by describing its dynamics as Fokker–Planck anisotropic diffusion [35].
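In code, Eqs. 5 and 6 amount to two normalizations of the Gaussian kernel matrix; the sketch below builds on the kernel G from the snippets above.

```python
import torch

def markov_matrix(G: torch.Tensor) -> torch.Tensor:
    """Anisotropic kernel (Eq. 5) and row-normalized transition matrix (Eq. 6)."""
    rho = G.sum(dim=1)                                       # kernel density estimate
    K = G / torch.sqrt(rho.unsqueeze(1) * rho.unsqueeze(0))  # Eq. 5
    return K / K.sum(dim=1, keepdim=True)                    # Eq. 6: rows sum to 1
```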

Figure 2: Learning slow CVs for the reversible folding of CLN025 in solvent sampled through a 100-μs molecular dynamics simulation. The dataset is obtained from Ref. 44. (a) Spectral map and the corresponding free-energy landscape $F$ of CLN025 showing two distinct free-energy basins: the main, well-defined folded (F) state and the more loosely structured unfolded (U) metastable state, separated by a free-energy barrier of around 20 kJ/mol. Representative conformations from the metastable states are shown. The aspect ratio of the axes is preserved. (b) Free-energy profile shown as a function of the $z_{1}$ CV. (c) Time series of the $z_{1}$ CV showing transitions between the states.

Finally, to estimate the dominant timescales encoded in the system, we perform a spectral decomposition of the Markov transition matrix:

$M\Psi_{k}=\lambda_{k}\Psi_{k}$, (7)

where $\Psi_{k}$ and $\lambda_{k}$ are the $k$-th right eigenfunction and eigenvalue of $M$, respectively. The real-valued eigenvalues of $M$ are sorted in non-ascending order:

$\lambda_{0}=1>\lambda_{1}\geq\dots\geq\lambda_{N}$, (8)

where the eigenvalue $\lambda_{0}$ corresponds to the equilibrium distribution of the Markov chain (Eq. 6), given by the eigenfunction $\Psi_{0}$. The dominant eigenvalues, related to the slowest relaxation timescales in the system, can be found by associating each eigenvalue with an effective timescale [45]:

$t_{k}=-\frac{1}{\log\lambda_{k}}$. (9)

The largest gap between neighboring eigenvalues is called the spectral gap and determines the degree of the timescale separation between the slow and fast processes:

$\sigma=\lambda_{k-1}-\lambda_{k}$, (10)

where $k>0$ indicates the number of metastable states in the CV space [46].

The theory of the spectral characterization of metastable states (see works by Gaveau and Schulman [47, 46] and references therein) explains that the spectral gap and the degree of degeneracy in the eigenvalue spectrum are related to timescale separation and Markovian dynamics. If the eigenvalue of the Markov transition matrix is nearly degenerate $k+1$ times, it indicates that the equilibrium distribution breaks into $k$ metastable states with infrequent transitions between them. The converse is also true: if the equilibrium density breaks into metastable states separated by free-energy barriers much larger than the thermal energy $k_{\mathrm{B}}T$, the eigenvalue spectrum is degenerate.

To achieve reduced dynamics that is effectively Markovian, it is crucial to have a spectral gap between neighboring eigenvalues, along with near degeneracy of the dominant eigenvalue. This condition is essential for the maximal spectral gap at $k$ to lead to a separation into $k$ metastable states, which helps reduce memory effects in the dynamics. Therefore, we use the spectral gap as the scoring function in spectral map.
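The scoring function can be sketched as follows. Since the anisotropic kernel $K$ of Eq. 5 is symmetric, $M$ shares its eigenvalues with the symmetric conjugate $D^{-1/2}KD^{-1/2}$, where $D$ is the diagonal matrix of row sums; computing eigenvalues of this conjugate is our illustrative choice here, as it yields real eigenvalues and keeps the score differentiable for gradient-based training.

```python
import torch

def spectral_gap(K: torch.Tensor, k: int) -> torch.Tensor:
    """sigma = lambda_{k-1} - lambda_k (Eq. 10) from the anisotropic kernel K."""
    d = K.sum(dim=1)
    Ms = K / torch.sqrt(d.unsqueeze(1) * d.unsqueeze(0))  # symmetric conjugate of M
    lam = torch.linalg.eigvalsh(Ms).flip(0)               # non-ascending order, Eq. 8
    return lam[k - 1] - lam[k]

# Effective timescales follow from Eq. 9, t_k = -1/log(lambda_k); for instance,
# lambda_k = 0.99 corresponds to t_k ~ 99.5 steps of the auxiliary time tau.
```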

IV Applications

IV.1 Müller–Brown Potential

As a simple demonstration, we apply spectral map to a dataset consisting of trajectories of a particle moving in a modified three-state Müller–Brown potential [Fig. 1(b)]. Following Ref. 43, we modify the standard Müller–Brown potential by adding a kinetic bottleneck between states B and C that is mainly responsible for the slowest timescale in the dynamics (see Sec. S3 in the Supplementary Material). The input variables for the target mappings are the $\mathbf{x}=(x_{1},x_{2})$ coordinates of the particle. We use the Adam optimizer with a learning rate of $10^{-3}$. The dataset, consisting of 80 trajectories initiated at random from the three metastable states, is divided into batches of size 2000. The training for $k=3$ is carried out over 30 epochs. The trajectories are generated by a Langevin integrator as implemented in PLUMED [48, 49], using a temperature of 1, a friction coefficient of 10, and a time step of 0.005. We set the fraction of the neighborhood size used to estimate the sample-dependent scale factors to $r=0.5$ (Eq. 4).
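Combining the sketches above, a condensed and purely illustrative training loop under the stated hyperparameters might look as follows; the layer sizes and the synthetic placeholder data are assumptions, and scale_factors and spectral_gap refer to the earlier sketches.

```python
import torch

net = torch.nn.Sequential(                     # target mapping x -> z (sizes assumed)
    torch.nn.Linear(2, 10), torch.nn.ELU(),
    torch.nn.Linear(10, 1),
)
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
X = torch.randn(160_000, 2)                    # placeholder for the Langevin data

for epoch in range(30):
    for batch in X.split(2000):
        z = net(batch)
        G = torch.exp(-torch.cdist(z, z).pow(2) / scale_factors(z, r=0.5))  # Eqs. 3-4
        rho = G.sum(dim=1)
        K = G / torch.sqrt(rho.unsqueeze(1) * rho.unsqueeze(0))             # Eq. 5
        loss = -spectral_gap(K, k=3)           # maximize sigma = lambda_2 - lambda_3
        opt.zero_grad()
        loss.backward()
        opt.step()
```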

In Fig. 1(c), we can see that if the $x_{1}$ variable is used as a CV, the spectral gap is only around $\sigma=0.48$. Treating $x_{2}$ as a CV results in an improved spectral gap of 0.62, as $x_{2}$ is slower than $x_{1}$ and better separates the three states [Fig. 1(d)]. Note that the CVs shown in Fig. 1(c-d) are identity mappings and cannot be optimized; they are used as baselines to calculate reference values of the spectral gap. Using a linear combination of $x_{1}$ and $x_{2}$, we can slightly increase the separation between the states [Fig. 1(e)]. By adding a hidden layer with the ELU activation function to the target mapping [Fig. 1(f)], we obtain a separation in which the main states A and B lie closer to each other, while state C is pushed farther away, as a particle reaching this state must overcome the kinetic bottleneck. This clearly shows that the nonlinear CV captures the slowest transition in the dynamics. As the transitions between states A and B are faster, these states are relatively closer to each other in the reduced representation. For additional details and results, see the Supplementary Material (Sec. S3).

IV.2 Chignolin

As the next application of our method, we consider the folding process of chignolin (CLN025), a ten-residue protein, in solvent. A 100-μs unbiased molecular dynamics simulation of CLN025 at its melting temperature of 340 K is obtained from Ref. 44. The folding (also termed “zipping”) of CLN025 is of interest due to its β-hairpin structure and can serve as a building block for understanding more complex processes.

As a high-dimensional representation, we use pairwise Euclidean distances between Cα atoms, amounting to $n=45$ configuration variables. The dataset consists of 10,000 samples extracted from the simulation every 10 ns and randomly split into training and validation sets of sizes 6000 and 4000, respectively. The training of the target mapping is carried out using $k=2$ for 100 epochs with data batches of 2000 samples. Once the target mapping is trained, we evaluate all samples from the trajectory (every 200 ps) to construct a free-energy landscape. To model the target mapping, we use a neural network of size [45, 200, 100, 50, 2], with each hidden layer employing the ELU activation function. We use the Adam optimizer with a learning rate of $10^{-3}$. We set the fraction of the neighborhood size used to estimate the sample-dependent scale factors to $r=0.65$ by identifying the value of $r$ that yields the largest spectral gap (see Sec. S3 in the Supplementary Material for more details).
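For illustration, the $n=45$ pairwise Cα distances can be computed from the coordinates with plain NumPy; the random array below is a placeholder for coordinates extracted from the trajectory.

```python
import itertools
import numpy as np

ca = np.random.rand(10_000, 10, 3)                   # placeholder Ca coordinates
pairs = list(itertools.combinations(range(10), 2))   # 45 atom pairs
x = np.stack(
    [np.linalg.norm(ca[:, i] - ca[:, j], axis=-1) for i, j in pairs],
    axis=1,
)                                                    # (10000, 45) input variables
```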

We present our results in Fig. 2. We can see that the free-energy landscape spanned by the slow CVs identified by spectral map depicts the folded and unfolded metastable states of CLN025 and the rare transitions between them, separating the states with a barrier of around 20 kJ/mol. An interesting feature of these CVs is that the shape of the metastable states is preserved by the adaptive algorithm for computing sample-dependent scale factors. Namely, the folded state is well-defined, as shown by its small size and the deepest free-energy minimum. In contrast, as the unfolded conformations are more loosely constrained, they occupy a wider and shallower free-energy basin. Remarkably, the shape and depths of the metastable states and the height of the free-energy barrier are almost indistinguishable from the results presented in Ref. 44.

Compared to the CVs calculated for CLN025 in our previous proof-of-concept work, where a constant scale factor was used to construct the Markov transition matrix (see Ref. 34), the mapping of the unfolded state is markedly improved. Here, it more accurately reflects the physical characteristics of an ensemble comprising many degenerate configurations with a limited number of native contacts [50]. Therefore, we can see that estimating a neighborhood adaptively, instead of providing a constant value, enables a physically meaningful reconstruction of the free-energy landscape.

Furthermore, we use the spectral gap and the sum of eigenvalues to evaluate the quality of two-dimensional projections of the CLN025 dataset computed using time-lagged independent component analysis [22] for several lag times. Comparing these results to the scores calculated for the CV space constructed with spectral map shows that the CVs found with spectral map are characterized by both a larger spectral gap and a larger sum of eigenvalues. For more details, we refer to Sec. S3 in the Supplementary Material.

To validate that the learned CVs correspond to the slowest relaxation process in the dynamics of CLN025 and do not mix the folded and unfolded states, we perform a standard Markov state model analysis [51]. The quality of a Markov state model and the estimation of relaxation times can depend heavily on the reaction coordinate describing the studied physical process [52]. For example, using non-Markovian CVs that do not distinguish between long-lived states to build a Markov state model may lead to an accumulation of errors in the subsequent modeling steps and, ultimately, in the estimated kinetics. Conversely, a reaction coordinate that accurately represents the slow kinetics can significantly improve the accuracy of a Markov state model. Therefore, to validate that the CVs provided by spectral map accurately describe the slow modes of CLN025, we need to obtain relaxation timescale estimates that match the reference values [44].

Table 1: Folding (F) and unfolding (U) times of CLN025. Mean times and standard deviations are estimated by constructing a Markov state model from the slow CVs learned by spectral map. Reference values are taken from Ref. 44.
Process      Mean [μs]   St. Dev. [μs]   Ref. [μs] [44]
Folding      0.58        0.01            0.61
Unfolding    2.08        0.02            2.24

Using the standard pipeline for constructing a Markov state model (see Sec. S4 in the Supplementary Material for details), we find that there is a single dominant relaxation process encoded in the CV learned by spectral map, corresponding to the reversible folding of CLN025. Other processes occur on timescales below a lag time $\tau$ of 100 ns. We observe a plateau in the implied timescale convergence analysis for lag times above 100 ns, showing that the model is Markovian. Our analysis shows that the model is not sensitive to changes in its parameters, as indicated by the Chapman–Kolmogorov test (performed up to 1 μs). Having verified that the resulting Markov state model is accurate, we estimate the folding and unfolding times. The means and standard deviations of the folding and unfolding times of CLN025 are shown in Tab. 1. Our estimates agree remarkably well with the times reported by Lindorff-Larsen et al., who used a native contact-based definition of the folded and unfolded states [44]. Overall, our results indicate that spectral map uncovers slow CVs correctly by ensuring that the selected number of long-lived metastable states and the rare transitions between them are represented in the reduced representation.
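For orientation, a bare-bones version of such an analysis is sketched below (the actual pipeline is described in Sec. S4 of the Supplementary Material); the bin count, lag time, crude symmetrization, and placeholder time series are illustrative assumptions.

```python
import numpy as np

z1 = np.random.randn(500_000)           # placeholder for the z1 series (200 ps/frame)
lag = 500                               # lag time tau = 100 ns at 200 ps per frame
edges = np.histogram_bin_edges(z1, bins=50)
dtraj = np.digitize(z1, edges[1:-1])    # microstate indices 0..49

C = np.zeros((50, 50))
np.add.at(C, (dtraj[:-lag], dtraj[lag:]), 1.0)    # sliding-window transition counts
C += C.T                                          # crude detailed-balance symmetrization
rows = C.sum(axis=1, keepdims=True)
T = C / np.where(rows == 0, 1.0, rows)            # row-stochastic transition matrix

lam = np.sort(np.linalg.eigvals(T).real)[::-1]
its = -lag / np.log(lam[1:4])           # implied timescales in frames (x 0.2 ns/frame)
```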

Overall, we have shown that the learned slow CVs separate the folded and unfolded states of CLN025 according to the slowest relaxation timescale, with faster processes being negligible. Furthermore, we have validated the learned CVs by performing a standard Markov state model analysis. The slow CVs, learned by spectral map without any data preprocessing or landmark subsampling [16], have allowed us to build a high-quality Markov state model and, thus, to estimate the relaxation timescales of CLN025 accurately. Moreover, supplementing the Markov state model with the slow reaction coordinates has simplified the parameter search in the model, as shown by its weak dependence on the selected parameters (see Sec. S4 in the Supplementary Material). This result is especially promising, as selecting parameters in Markov state models is known to be difficult in some cases [53, 54, 55, 56].

V Conclusions

Many techniques for learning slow CVs rely partly on identifying lag times. In contrast, spectral map learns CVs describing the slowest modes and dominant relaxation times without requiring a specific lag time. This is possible as spectral map estimates transition probabilities between samples, unlike other techniques that rely on counting configurations. Selecting an incorrect lag time can lead to non-Markovian dynamics and result in non-negligible memory effects, which can mix slow and fast timescales, causing significant errors in estimated kinetics [57, 58]. However, it is important to note that spectral map exchanges temporal parameters (lag time) for spatial parameters (neighborhood size). Estimating scale factors is also known to affect the reduced representation [59, 60]. Although methods for identifying slow CVs using a lag time [22, 26, 61] are well established and commonly used, spectral map is a recently developed technique. Therefore, it is yet to be determined if estimating scale factors is more practical than selecting lag times.

Spectral map employs the anisotropic diffusion kernel to estimate transition probabilities and maximize the timescale separation. This kernel was first introduced for diffusion maps [17], and it is therefore instructive to compare the two techniques. In diffusion maps, the reduced space is approximated using eigenfunctions of a Fokker–Planck operator (diffusion coordinates) constructed from samples in the configuration space. Because of that, the validity of the eigenfunctions cannot be self-consistently verified, and their quality cannot be improved. Additionally, from a technical perspective, eigendecompositions of large transition matrices can be computationally expensive. In contrast to diffusion maps, spectral map performs the spectral decomposition of a Markov transition matrix in the reduced space, which is perhaps the main difference between the two techniques. Spectral map does not rely on eigenfunctions as approximations of the reduced representation and only requires computing eigenvalues. These eigenvalues are then used to maximize the spectral gap, iteratively improving the CVs corresponding to slow kinetics throughout the learning process. Initially, the reduced dynamics can have non-Markovian characteristics; as the spectral gap is maximized, memory effects are reduced. Moreover, the eigendecompositions are computed from data batches, which reduces computational costs.

In this Communication, we have further improved spectral map, an unsupervised machine-learning technique for learning slow CVs describing Markovian dynamics of complex metastable systems. By implementing an adaptive algorithm for estimating transition probabilities, we have added to spectral map the ability to represent heterogeneous and multiscale free-energy landscapes. We have verified with a standard Markov state model analysis that spectral map can be successfully used to construct CVs corresponding to the slowest modes. Our technique is applicable to a variety of physical processes with multiple temporal scales. Although spectral map is still in its preliminary version, it has shown potential in computing slow reaction coordinates and deserves further development.

Supplementary Material

Computational details and results, including information about simulation datasets, learning, and Markov state models, are available in the Supplementary Material.

Acknowledgments

J. R. acknowledges funding from the Polish Science Foundation (START), the Ministry of Science and Higher Education in Poland, and the Japan Society for the Promotion of Science (JSPS). J. R. and T. G. are supported by the National Science Center in Poland (Sonata 2021/43/D/ST4/00920, “Statistical Learning of Slow Collective Variables from Atomistic Simulations”). D. E. Shaw Research is acknowledged for providing the CLN025 dataset.

Author Declarations

Conflict of Interest

The authors have no conflicts to disclose.

Author Contributions

Jakub Rydzewski: Conceptualization (leading); Investigation (equal); Methodology (leading); Software (leading); Supervision (leading); Visualization (leading); Writing – original draft (leading); Writing – review & editing (equal). Tuğçe Gökdemir: Conceptualization (supporting); Investigation (equal); Methodology (supporting); Software (supporting); Supervision (supporting); Visualization (supporting); Writing – original draft (supporting); Writing – review & editing (equal).

Data Availability

Data and a reference implementation of spectral map are available on PLUMED-NEST [49] (www.plumed-nest.org), the public repository of the PLUMED consortium, as plumID:24.005.

References


  • Frenkel and Smit [2023] D. Frenkel and B. Smit, Understanding Molecular Simulation: From Algorithms to Applications, 3rd ed. (Academic Press, 2023).
  • Valsson, Tiwary, and Parrinello [2016] O. Valsson, P. Tiwary, and M. Parrinello, “Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint,” Annu. Rev. Phys. Chem. 67, 159–184 (2016).
  • Yang et al. [2019] Y. I. Yang, Q. Shao, J. Zhang, L. Yang, and Y. Q. Gao, “Enhanced Sampling in Molecular Dynamics,” J. Chem. Phys. 151, 070902 (2019).
  • Rohrdanz, Zheng, and Clementi [2013] M. A. Rohrdanz, W. Zheng, and C. Clementi, “Discovering Mountain Passes via Torchlight: Methods for the Definition of Reaction Coordinates and Pathways in Complex Macromolecular Reactions,” Annu. Rev. Phys. Chem. 64, 295–316 (2013).
  • Noé and Clementi [2017] F. Noé and C. Clementi, “Collective Variables for the Study of Long-Time Kinetics from Molecular Trajectories: Theory and Methods,” Curr. Opin. Struct. Biol. 43, 141–147 (2017).
  • Berezhkovskii and Szabo [2005] A. Berezhkovskii and A. Szabo, “One-Dimensional Reaction Coordinates for Diffusive Activated Rate Processes in Many Dimensions,” J. Chem. Phys. 122, 014503 (2005).
  • Berezhkovskii and Szabo [2011] A. Berezhkovskii and A. Szabo, “Time Scale Separation Leads to Position-Dependent Diffusion along a Slow Coordinate,” J. Chem. Phys. 135, 074108 (2011).
  • Maragliano et al. [2006] L. Maragliano, A. Fischer, E. Vanden-Eijnden, and G. Ciccotti, “String Method in Collective Variables: Minimum Free Energy Paths and Isocommittor Surfaces,” J. Chem. Phys. 125, 024106 (2006).
  • Zwanzig [2001] R. Zwanzig, Nonequilibrium Statistical Mechanics, 1st ed. (Oxford University Press, 2001).
  • Lange and Grubmüller [2006] O. F. Lange and H. Grubmüller, “Collective Langevin Dynamics of Conformational Motions in Proteins,” J. Chem. Phys. 124, 214903 (2006).
  • Micheletti, Bussi, and Laio [2008] C. Micheletti, G. Bussi, and A. Laio, “Optimal Langevin Modeling of Out-of-Equilibrium Molecular Dynamics Simulations,” J. Chem. Phys. 129, 074105 (2008).
  • Zwanzig [1961] R. Zwanzig, “Memory Effects in Irreversible Thermodynamics,” Phys. Rev. 124, 983 (1961).
  • Ceriotti [2019] M. Ceriotti, “Unsupervised Machine Learning in Atomistic Simulations, between Predictions and Understanding,” J. Chem. Phys. 150, 150901 (2019).
  • Wang, Ribeiro, and Tiwary [2020] Y. Wang, J. M. L. Ribeiro, and P. Tiwary, “Machine Learning Approaches for Analyzing and Enhancing Molecular Dynamics Simulations,” Curr. Opin. Struct. Biol. 61, 139–145 (2020).
  • Chen and Chipot [2023] H. Chen and C. Chipot, “Chasing Collective Variables using Temporal Data-Driven Strategies,” QRB Discovery 4, e2 (2023).
  • Rydzewski, Chen, and Valsson [2023] J. Rydzewski, M. Chen, and O. Valsson, “Manifold Learning in Atomistic Simulations: A Conceptual Review,” Mach. Learn.: Sci. Technol. 4, 031001 (2023).
  • Coifman et al. [2005] R. R. Coifman, S. Lafon, A. B. Lee, M. Maggioni, B. Nadler, F. Warner, and S. W. Zucker, “Geometric Diffusions as a Tool for Harmonic Analysis and Structure Definition of Data: Diffusion Maps,” Proc. Natl. Acad. Sci. U.S.A. 102, 7426–7431 (2005).
  • Singer et al. [2009] A. Singer, R. Erban, I. G. Kevrekidis, and R. R. Coifman, “Detecting Intrinsic Slow Variables in Stochastic Dynamical Systems by Anisotropic Diffusion Maps,” Proc. Natl. Acad. Sci. U.S.A. 106, 16090–16095 (2009).
  • Naritomi and Fuchigami [2011] Y. Naritomi and S. Fuchigami, “Slow Dynamics in Protein Fluctuations Revealed by Time-Structure based Independent Component Analysis: the Case of Domain Motions,” J. Chem. Phys. 134, 065101 (2011).
  • Ferguson et al. [2011] A. L. Ferguson, A. Z. Panagiotopoulos, P. G. Debenedetti, and I. G. Kevrekidis, “Integrating Diffusion Maps with Umbrella Sampling: Application to Alanine Dipeptide,” J. Chem. Phys. 134, 04B606 (2011).
  • Rohrdanz et al. [2011] M. A. Rohrdanz, W. Zheng, M. Maggioni, and C. Clementi, “Determination of Reaction Coordinates via Locally Scaled Diffusion Map,” J. Chem. Phys. 134, 03B624 (2011).
  • Pérez-Hernández et al. [2013] G. Pérez-Hernández, F. Paul, T. Giorgino, G. De Fabritiis, and F. Noé, “Identification of Slow Molecular Order Parameters for Markov Model Construction,” J. Chem. Phys. 139, 015102 (2013).
  • Tiwary and Berne [2016] P. Tiwary and B. J. Berne, “Spectral Gap Optimization of Order Parameters for Sampling Complex Molecular Systems,” Proc. Natl. Acad. Sci. U.S.A. 113, 2839 (2016).
  • Wu et al. [2017] H. Wu, F. Nüske, F. Paul, S. Klus, P. Koltai, and F. Noé, “Variational Koopman Models: Slow Collective Variables and Molecular Kinetics from Short Off-Equilibrium Simulations,” J. Chem. Phys. 146, 154104 (2017).
  • Chiavazzo et al. [2017] E. Chiavazzo, R. Covino, R. R. Coifman, C. W. Gear, A. S. Georgiou, G. Hummer, and I. G. Kevrekidis, “Intrinsic Map Dynamics Exploration for Uncharted Effective Free-Energy Landscapes,” Proc. Natl. Acad. Sci. U.S.A. 114, E5494–E5503 (2017).
  • Wehmeyer and Noé [2018] C. Wehmeyer and F. Noé, “Time-Lagged Autoencoders: Deep Learning of Slow Collective Variables for Molecular Kinetics,” J. Chem. Phys. 148, 241703 (2018).
  • Thiede et al. [2019] E. H. Thiede, D. Giannakis, A. R. Dinner, and J. Weare, “Galerkin Approximation of Dynamical Quantities using Trajectory Data,” J. Chem. Phys. 150, 244111 (2019).
  • Chen, Sidky, and Ferguson [2019] W. Chen, H. Sidky, and A. Ferguson, “Nonlinear Discovery of Slow Molecular Modes using State-Free Reversible VAMPnets,” J. Chem. Phys. 150, 214114 (2019).
  • Bonati, Piccini, and Parrinello [2021] L. Bonati, G. Piccini, and M. Parrinello, “Deep Learning the Slow Modes for Rare Events Sampling,” Proc. Natl. Acad. Sci. U.S.A. 118, e2113533118 (2021).
  • Morishita [2021] T. Morishita, “Time-Dependent Principal Component Analysis: A Unified Approach to High-Dimensional Data Reduction using Adiabatic Dynamics,” J. Chem. Phys. 155, 134114 (2021).
  • Evans, Cameron, and Tiwary [2022] L. Evans, M. K. Cameron, and P. Tiwary, “Computing Committors via Mahalanobis Diffusion Maps with Enhanced Sampling Data,” J. Chem. Phys. 157, 214107 (2022).
  • Chen, Roux, and Chipot [2023] H. Chen, B. Roux, and C. Chipot, “Discovering Reaction Pathways, Slow Variables, and Committor Probabilities with Machine Learning,” J. Chem. Theory Comput. 19, 4414–4426 (2023).
  • Rydzewski [2023a] J. Rydzewski, “Selecting High-Dimensional Representations of Physical Systems by Reweighted Diffusion Maps,” J. Phys. Chem. Lett. 14, 2778–2783 (2023a).
  • Rydzewski [2023b] J. Rydzewski, “Spectral Map: Embedding Slow Kinetics in Collective Variables,” J. Phys. Chem. Lett. 14, 5216–5220 (2023b).
  • Nadler et al. [2006] B. Nadler, S. Lafon, R. R. Coifman, and I. G. Kevrekidis, “Diffusion Maps, Spectral Clustering and Reaction Coordinates of Dynamical Systems,” Appl. Comput. Harmon. Anal. 21, 113–127 (2006).
  • Coifman et al. [2008] R. R. Coifman, I. G. Kevrekidis, S. Lafon, M. Maggioni, and B. Nadler, “Diffusion Maps, Reduction Coordinates, and Low Dimensional Representation of Stochastic Systems,” Multiscale Model. Simul. 7, 842–864 (2008).
  • Singer and Coifman [2008] A. Singer and R. R. Coifman, “Non-Linear Independent Component Analysis with Diffusion Maps,” Appl. Comput. Harmon. Anal. 25, 226–239 (2008).
  • Hinton and Salakhutdinov [2006] G. E. Hinton and R. R. Salakhutdinov, “Reducing the Dimensionality of Data with Neural Networks,” Science 313, 504–507 (2006).
  • van der Maaten [2009] L. van der Maaten, “Learning a Parametric Embedding by Preserving Local Structure,” J. Mach. Learn. Res. 5, 384–391 (2009).
  • Zhang and Chen [2018] J. Zhang and M. Chen, “Unfolding Hidden Barriers by Active Enhanced Sampling,” Phys. Rev. Lett. 121, 010601 (2018).
  • Rydzewski and Valsson [2021] J. Rydzewski and O. Valsson, “Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling,” J. Phys. Chem. A 125, 6286–6302 (2021).
  • Rydzewski et al. [2022] J. Rydzewski, M. Chen, T. K. Ghosh, and O. Valsson, “Reweighted Manifold Learning of Collective Variables from Enhanced Sampling Simulations,” J. Chem. Theory Comput. 18, 7179–7192 (2022).
  • Bonati et al. [2023] L. Bonati, E. Trizio, A. Rizzi, and M. Parrinello, “A Unified Framework for Machine Learning Collective Variables for Enhanced Sampling Simulations: mlcolvar,” J. Chem. Phys. 159, 014801 (2023).
  • Lindorff-Larsen et al. [2011] K. Lindorff-Larsen, S. Piana, R. O. Dror, and D. E. Shaw, “How Fast-Folding Proteins Fold,” Science 334, 517–520 (2011).
  • Bovier et al. [2002] A. Bovier, M. Eckhoff, V. Gayrard, and M. Klein, “Metastability and Low Lying Spectra in Reversible Markov Chains,” Commun. Math. Phys. 228, 219–255 (2002).
  • Gaveau and Schulman [1998] B. Gaveau and L. S. Schulman, “Theory of Nonequilibrium First-Order Phase Transitions for Stochastic Dynamics,” J. Math. Phys. 39, 1517–1533 (1998).
  • Gaveau and Schulman [1996] B. Gaveau and L. S. Schulman, “Master Equation based Formulation of Nonequilibrium Statistical Mechanics,” J. Math. Phys. 37, 3897–3932 (1996).
  • Tribello et al. [2014] G. A. Tribello, M. Bonomi, D. Branduardi, C. Camilloni, and G. Bussi, “PLUMED 2: New Feathers for an Old Bird,” Comput. Phys. Commun. 185, 604–613 (2014).
  • The PLUMED Consortium [2019] The PLUMED Consortium, “Promoting Transparency and Reproducibility in Enhanced Molecular Simulations,” Nat. Methods 16, 670–673 (2019).
  • Best, Hummer, and Eaton [2013] R. B. Best, G. Hummer, and W. A. Eaton, “Native Contacts Determine Protein Folding Mechanisms in Atomistic Simulations,” Proc. Natl. Acad. Sci. U.S.A. 110, 17874–17879 (2013).
  • Prinz et al. [2011] J.-H. Prinz, H. Wu, M. Sarich, B. Keller, M. Senne, M. Held, J. D. Chodera, C. Schütte, and F. Noé, “Markov Models of Molecular Kinetics: Generation and Validation,” J. Chem. Phys. 134, 174105 (2011).
  • McKiernan, Husic, and Pande [2017] K. A. McKiernan, B. E. Husic, and V. S. Pande, “Modeling the Mechanism of CLN025 β-Hairpin Formation,” J. Chem. Phys. 147, 104107 (2017).
  • Nedialkova et al. [2014] L. V. Nedialkova, M. A. Amat, I. G. Kevrekidis, and G. Hummer, “Diffusion Maps, Clustering and Fuzzy Markov Modeling in Peptide Folding Transitions,” J. Chem. Phys. 141, 114102 (2014).
  • McGibbon, Husic, and Pande [2017] R. T. McGibbon, B. E. Husic, and V. S. Pande, “Identification of Simple Reaction Coordinates from Complex Dynamics,” J. Chem. Phys. 146, 044109 (2017).
  • Husic and Pande [2017] B. E. Husic and V. S. Pande, “Note: MSM Lag Time Cannot be Used for Variational Model Selection,” J. Chem. Phys. 147, 176101 (2017).
  • Suárez et al. [2021] E. Suárez, R. P. Wiewiora, C. Wehmeyer, F. Noé, J. D. Chodera, and D. M. Zuckerman, “What Markov State Models Can and Cannot Do: Correlation Versus Path-Based Observables in Protein-Folding Models,” J. Chem. Theory Comput. 17, 3119–3133 (2021).
  • Schultze and Grubmüller [2021] S. Schultze and H. Grubmüller, “Time-Lagged Independent Component Analysis of Random Walks and Protein Dynamics,” J. Chem. Theory Comput. 17, 5766–5776 (2021).
  • Kozlowski and Grubmüller [2023] N. Kozlowski and H. Grubmüller, “Uncertainties in Markov State Models of Small Proteins,” J. Chem. Theory Comput. 19, 5516–5524 (2023).
  • Berry, Giannakis, and Harlim [2015] T. Berry, D. Giannakis, and J. Harlim, “Nonparametric Forecasting of Low-Dimensional Dynamical Systems,” Phys. Rev. E 91, 032915 (2015).
  • Berry and Harlim [2016] T. Berry and J. Harlim, “Variable Bandwidth Diffusion Kernels,” Appl. Comput. Harmon. Anal. 40, 68–96 (2016).
  • Mardt et al. [2018] A. Mardt, L. Pasquali, H. Wu, and F. Noé, “VAMPnets for Deep Learning of Molecular Kinetics,” Nat. Commun. 9, 5 (2018).