This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Energy Decomposition along Reaction Coordinate and Energy Weighted Reactive Flux: I. Theory

Wenjin Li
Institute for Advanced Study, Shenzhen University, Shenzhen, China
[email protected]
Abstract

A theoretical framework is proposed for an energy decomposition scheme along the reaction coordinate, in which the ensemble average of the potential energy weighted with reactive flux intensity is decomposed into energy components at the per-coordinate level. The decomposed energy quantity is demonstrated to be closely related to the free energy along the reaction coordinate and its connection to the emergent potential energy is provided. We also explore the property of the reactive flux weighted by the potential energy in the subspace of collective variables. A free energy analogue is then proposed in the subspace of collective variables and the directional derivative of this free energy analogue in the direction of the reactive flux is shown to be useful in the identification of the reaction coordinate and the minimum free energy path.

I. INTRODUCTION

Complex biomolecular systems, such as protein folding, large conformational changes, and protein-ligand interactions, have been intensively investigated by molecular dynamics (MD) simulations in the past several decades 1, 2. In spite of the great success in unveiling the detailed mechanisms of biomolecular systems, biomolecular simulations face mainly the challenges in several aspects such as the force field, the sampling problem, and the extraction of insightful information from high-dimensional trajectories. In the endeavour of overcoming the latter two challenges, a complex system from a physical viewpoint is usually reduced to a prototypical system consisting of two metastable states that are separated by a high energy barrier 3. In a straight-forward simulation, the system will spend most of the time in the metastable states and the transitions from one metastable state to the other are rarely sampled. To tackle the sampling issue, one common solution among many others resorts to a large ensemble of short trajectories obtained via various path sampling strategies 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15. These strategies usually possess the following properties: (1) Avoid redundant sampling of metastable basins and enhance the sampling of transition or barrier regions; (2) Easy to be parallelized and thus best utilize supercomputers; (3) The sampled trajectories are natural and form a non-equilibrium ensemble (no bias potential is introduced but the weight of a trajectory is different to the one in an equilibrium ensemble). Examples of such an ensemble of trajectories are swarms of trajectories 4, 16 and transition path ensemble (TPE) 11, 12, 13, 14, 15.

Since a non-equilibrium ensemble of short trajectories is generally more affordable for biomolecular systems than a long equilibrium ensemble, it is of great interest to extract from it useful kinetic and dynamic information, which includes for example (1) the metastable states and the matrix of transition probabilities among these metastable states; (2) the reaction coordinate and the free energy along it; (3) the contributions of different parts of the biomolecule (a group of atoms or residues) to the rare transition. Here, we will pay a special attention to the latter two cases. For the identification of reaction coordinate, the committor function of a configuration, that is the probability of the trajectories initiated from the configuration will reach the product state first before visiting the starting state, is generally viewed as the ‘ideal’ reaction coordinate 17. For a diffusive process with quadratic energy barrier, the gradient of the committor was demonstrated to be parallel to the eigenvector of the only negative eigenvalue of the matrix \vectorsymHD\vectorsym{HD} 18, 19, where \vectorsymH\vectorsym{H} is the Hessian matrix of the potential energy and \vectorsymD\vectorsym{D} is the diffusion tensor. Using the committor as the reference, several approaches such as genetic neural network method 20 and likelihood maximization method 21, 22, 23 have been proposed to select the reaction coordinate out of a pool of candidate coordinates. However, the evaluation of the committor is computationally expensive, although an efficient method to estimate the committor via a fitting procedure is available 24, approaches were then developed to extract the reaction coordinate out of a non-equilibrium ensemble of trajectories without extra simulations 25, 26, 27. Li and co-workers suggested that the emergent potential energy along a coordinate is useful in appraising the relevance of the coordinate to the reaction coordinate 25. Later on, the equipartition terms in the TPE along a coordinate were maximized to obtain a one-dimensional representation of the reaction coordinate 26. Recently, a density-weighted average of the flux along a coordinate is shown to be a promising quantity to select coordinates most relevant to the reaction coordinate 27, and the theoretical framework for this quantity and its connection to the transition path theory is established 28.

Existing methods that estimate the contributions from different parts of the system in the transition from one metastable state to the other largely rely on free energy decomposition. Typical examples are free energy perturbation 29, thermodynamic integration 30, and the MM/PB(GB)SA approach 31, 32. However, the decomposition of free energy was carried out either along an artificial pathway or in concern of only the two end states. On the other hand, the energy decomposition depends on the very path of integration 33. Thus, the reaction coordinate is considered to be the natural choice of the path along which the energy decomposition should be performed. Following this thinking path, the potential energy was then decomposed along the committor at the single-coordinate level in the TPE . This so-called emergent potential energy approach was extended by Li into a residue-residue mutual work analysis approach 34, in which the energy contribution is quantified by a so-called ensemble averaged work (EAW) and the EWA on a residue is simply the summation of the per-coordinate EAW over all the coordinates of the residue. Importantly, the EWA on a residue due to its interaction with another residue is provided by decomposing the atomic force on an atom into pairwise forces (the force on one atom from its interaction with another atom). The energy contribution on a residue from another one was shown to be different to the one on the latter residue due to the former one. We would like to emphasize that the MM-GBSA method assumes the residue-residue mutual works from each other are the same and the interaction energy between two residues are evenly divided in the per-residue decomposition. In the residue-residue mutual work approach 34, the energy contribution of a residue can be further decomposed into contributions due to translational, rotational, and internal motions of the residue.

Here, inspired by the principal curve defined in the transition path theory 35, a theoretical framework for the TPE, we proposed an ensemble average of any quantity weighted with reactive flux or current intensity over a non-equilibrium ensemble in which the reactive current is non-zero. By considering such a weighted ensemble average of the potential energy, an energy decomposition scheme along the reaction coordinate is derived and the properties of the energy components on various coordinates are discussed. Later, we define a reactive flux weighted by the potential energy in the subspace of collective variables (CVs), from which a multi-dimensional free energy analogue of CVs is then derived. In addition, we will show that the directional derivative of this free energy analogue in the direction of the reactive flux can be used in identifying the reaction coordinate and the minimum free energy path.

II. THEORY

A. Flux and current

Let \vectorsymx(t)\vectorsym{x}(t) be an infinitely long trajectory of a system that consists of two stable states A and B, where \vectorsymx\vectorsym{x} represents the position coordinates of the system. In this long trajectory, the trajectory pieces that initiate from configurations in the state A and commit to the state B without returning to the state A are known to be reactive trajectories from A to B, which form the so-called TPE. We denote UU as an arbitrary ensemble, which could be the TPE or other non-equilibrium ensembles taken from the long trajectory and ξ(\vectorsymx)\xi(\vectorsym{x}) as a generalized coordinate. From a theoretical framework proposed in Ref. [28] (here the notation and nomenclature in this reference are largely followed), the flux at a fixed location ξ\xi^{\prime} of ξ(\vectorsymx)\xi(\vectorsym{x}) in the ensemble UU is defined as

FU(ξ)=0Tξ˙(\vectorsymx(t))δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑tF^{U}(\xi^{\prime})=\int_{0}^{T}\dot{\xi}(\vectorsym{x}(t))\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt (1)

where hU(t)h_{U}(t) is an indicator function for the ensemble UU, that is hU(t)=1h_{U}(t)=1 if \vectorsymx(t)\vectorsym{x}(t) belongs to UU and hU(t)=0h_{U}(t)=0 otherwise.

If \vectorsymy{y1,y2,,yn}\vectorsym{y}\equiv\{y_{1},y_{2},\cdots,y_{n}\} is a set of CVs, which are functions of \vectorsymx(t)\vectorsym{x}(t), the current (from now on we use current when referring to a vector field and flux when a scalar is referred) in the space of CVs in the ensemble UU can then be defined as

\vectorsymJU(\vectorsymy)=0T\vectorsymy˙(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\vectorsym{J}^{U}(\vectorsym{y})=\int_{0}^{T}\dot{\vectorsym{y}}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (2)

B. Ensemble average weighted with current intensity

Given f(\vectorsymx)f(\vectorsym{x}) being an arbitrary function of \vectorsymx\vectorsym{x}, we consider a particular ensemble average of the quantity f(\vectorsymx)f(\vectorsym{x}) over the surface ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime} in the ensemble UU,

f(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux0Tf(\vectorsymx(t))ξ˙(\vectorsymx(t))δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0Tξ˙(\vectorsymx(t))δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}\equiv\frac{\int_{0}^{T}f(\vectorsym{x}(t))\dot{\xi}(\vectorsym{x}(t))\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}{\int_{0}^{T}\dot{\xi}(\vectorsym{x}(t))\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt} (3)

It is inspired by the principal curve proposed in the transition path theory 35 and is actually the average over the surface ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime} of f(\vectorsymx)f(\vectorsym{x}) that is weighted with the current intensity passing through the point \vectorsymx\vectorsym{x} on the surface ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime}, that is,

f(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux=f(\vectorsymx)\vectorsymJU(\vectorsymx)ξ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymx\vectorsymJU(\vectorsymx)ξ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymx\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}=\frac{\int f(\vectorsym{x})\vectorsym{J}^{U}(\vectorsym{x})\cdot\nabla\xi(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}}{\int\vectorsym{J}^{U}(\vectorsym{x})\cdot\nabla\xi(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}} (4)

Since a point on the principal curve given in Eq. (63) in Ref. [ 35] can be viewed as a conditional canonical expectation of points in an isosurface along the principal curve itself under the assumptions: (1) the committor can be approximated by a function of the principal curve, (2) the curvature effects of the isosurface is negligible, and (3) the overdamped dynamics is assumed. In the following, we will show that f(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} is a conditional canonical expectation of f(\vectorsymx)f(\vectorsym{x}) under similar assumptions. For a diffusion process where the Smoluchowski equation is applicable, the reactive current is given by 36

\vectorsymJR(\vectorsymx)=ρ(\vectorsymx)\vectorsymD(\vectorsymx)q(\vectorsymx)\vectorsym{J}_{R}(\vectorsym{x})=\rho(\vectorsym{x})\vectorsym{D}(\vectorsym{x})\nabla q(\vectorsym{x}) (5)

where ρ(\vectorsymx)\rho(\vectorsym{x}) is the equilibrium probability density in canonical ensemble, \vectorsymD(\vectorsymx)\vectorsym{D}(\vectorsym{x}) is the position-dependent diffusion matrix, and q(\vectorsymx)q(\vectorsym{x}) is the committor function. If ξ(\vectorsymx)\xi(\vectorsym{x}) is a monotonically increasing function of the committor, i.e., q(\vectorsymx)=g(ξ(\vectorsymx))q(\vectorsym{x})=g(\xi(\vectorsym{x})) and thus q=g(ξ)ξ\nabla q=g^{\prime}(\xi)\nabla\xi, and UU is chosen to be the TPE (then \vectorsymJU(\vectorsymx)\vectorsym{J}^{U}(\vectorsym{x}) is proportional to \vectorsymJR(\vectorsymx)\vectorsym{J}_{R}(\vectorsym{x})), we have,

f(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux=\displaystyle\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}= f(\vectorsymx)ρ(\vectorsymx)g(ξ(\vectorsymx))ξ(\vectorsymx)\vectorsymD(\vectorsymx)ξ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymxρ(\vectorsymx)g(ξ(\vectorsymx))ξ(\vectorsymx)\vectorsymD(\vectorsymx)ξ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymx\displaystyle\frac{\int f(\vectorsym{x})\rho(\vectorsym{x})g^{\prime}(\xi(\vectorsym{x}))\nabla\xi(\vectorsym{x})\cdot\vectorsym{D}(\vectorsym{x})\nabla\xi(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}}{\int\rho(\vectorsym{x})g^{\prime}(\xi(\vectorsym{x}))\nabla\xi(\vectorsym{x})\cdot\vectorsym{D}(\vectorsym{x})\nabla\xi(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}} (6a)
=\displaystyle= f(\vectorsymx)ρ(\vectorsymx)ξ(\vectorsymx)\vectorsymD(\vectorsymx)ξ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymxρ(\vectorsymx)ξ(\vectorsymx)\vectorsymD(\vectorsymx)ξ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymx\displaystyle\frac{\int f(\vectorsym{x})\rho(\vectorsym{x})\nabla\xi(\vectorsym{x})\cdot\vectorsym{D}(\vectorsym{x})\nabla\xi(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}}{\int\rho(\vectorsym{x})\nabla\xi(\vectorsym{x})\cdot\vectorsym{D}(\vectorsym{x})\nabla\xi(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}} (6b)

Under the assumption of a local transition tube, in which most of the reactive current are located, the curvature of the surface ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime} in the transition tube could be negligible and the diffusion on it may be approximately position independent (not necessarily isotropic), then ξ(\vectorsymx)\vectorsymD(\vectorsymx)ξ(\vectorsymx)\nabla\xi(\vectorsym{x})\cdot\vectorsym{D}(\vectorsym{x})\nabla\xi(\vectorsym{x}) is approximately a constant and can be cancelled out in the above fraction. We thus arrive,

f(\vectorsymx)ξ(\vectorsymx)=ξ,UFluxf(\vectorsymx)ρ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymxρ(\vectorsymx)δ(ξ(\vectorsymx)ξ)𝑑\vectorsymx\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}\approx\frac{\int f(\vectorsym{x})\rho(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}}{\int\rho(\vectorsym{x})\delta(\xi(\vectorsym{x})-\xi^{\prime})d\vectorsym{x}} (7)

Although we assume f(\vectorsymx)f(\vectorsym{x}) being a function of \vectorsymx\vectorsym{x}, the weighted ensemble average can be generalized to any time series f(t)f(t) extracted from trajectories and Eq. (3) is reformulated into

f(t)ξ(\vectorsymx)=ξ,UFlux0Tf(t)ξ˙(\vectorsymx(t))δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0Tξ˙(\vectorsymx(t))δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\langle f(t)\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}\equiv\frac{\int_{0}^{T}f(t)\dot{\xi}(\vectorsym{x}(t))\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}{\int_{0}^{T}\dot{\xi}(\vectorsym{x}(t))\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt} (8)

In case that ξ(\vectorsymx)\xi(\vectorsym{x}) is a good one-dimensional representation of the reaction coordinate and the flux along it (FU(ξ)F^{U}(\xi)) presents a plateau region, in which FU(ξ)F^{U}(\xi) is a constant and equals the number of transition paths NTN_{\rm T}. We then have (Appendix A)

df(\vectorsymx)ξ(\vectorsymx)=ξ,UFluxdξ=df(\vectorsymx(t))dt1ξ˙(\vectorsymx(t))ξ(\vectorsymx)=ξ,UFluxfor ξ in the plateau region\frac{d\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}}{d\xi^{\prime}}=\langle\frac{df(\vectorsym{x}(t))}{dt}\frac{1}{\dot{\xi}(\vectorsym{x}(t))}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}\quad\text{for $\xi^{\prime}$ in the plateau region} (9)

In case that ξ(\vectorsymx)\xi(\vectorsym{x}) is the committor function or a monotone of the committor, Eq. (9) holds everywhere. We will show the usefulness of Eqs. (7) and (9) in the next subsection.

C. Energy decomposition along reaction coordinate

Given an independent and complete set of configurational coordinates \vectorsymz{z1,z2,,zn}\vectorsym{z}\equiv\{z_{\rm 1},z_{\rm 2},\cdots,z_{\rm n}\}, we consider a special case that f(\vectorsymz)f(\vectorsym{z}) is the potential energy V(\vectorsymz)V(\vectorsym{z}) and then

dV(\vectorsymz(t))dt1ξ˙(t)ξ(\vectorsymx)=ξ,UFlux=iV(\vectorsymz(t))ziz˙i(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFlux\langle\frac{dV(\vectorsym{z}(t))}{dt}\frac{1}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}=\sum_{i}\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} (10)

Here, we avoid the explicit expressions of \vectorsymz\vectorsym{z} as functions of \vectorsymx\vectorsym{x} to simply the notations. Note that the dimension of V(\vectorsymz(t))ziz˙i(t)ξ˙(t)\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}(t)}{\dot{\xi}(t)} is the same as the one of the generalized force corresponding to the generalized coordinate ξ\xi, its integral over ξ\xi gives terms of the same dimension as energy, we thus formally defined the energy decomposed onto a coordinate ziz_{\rm i} along the coordinate ξ\xi as

Vzi,ξ=V(\vectorsymz(t))ziz˙i(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFlux𝑑ξV_{z_{\rm i},\xi}=\int\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}d\xi^{\prime} (11)

In the following, we will discuss the properties of the energy component in Eq. (11) in various cases under the conditions: (1) The ensemble UU is chosen to be the TPE; (2) The system can be described as a reaction coordinate in interaction with a large number of bath modes 37; (3) The behaviours of bath modes in the TPE resemble the ones in the equilibrium ensemble.

1. When ξ\xi is the reaction coordinate

We first consider an ideal case that ξ(\vectorsymx)\xi(\vectorsym{x}) is the reaction coordinate and it belongs to \vectorsymz\vectorsym{z}, then the energy component on the reaction coordinate ξ\xi along itself is

Vξ,ξ=V(\vectorsymz(t))ξξ(\vectorsymx)=ξ,UFlux𝑑ξV(\vectorsymz)ξξ(\vectorsymx)=ξ𝑑ξV_{\xi,\xi}=\int\langle\frac{\partial V(\vectorsym{z}(t))}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}d\xi^{\prime}\approx\int\langle\frac{\partial V(\vectorsym{z})}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime}}d\xi^{\prime} (12)

Here ξ(\vectorsymx)=ξ\langle\cdots\rangle_{\xi(\vectorsym{x})=\xi^{\prime}} stands for the conditional canonical average over the surface ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime} and Eq. (7) is used. V(\vectorsymz)ξξ(\vectorsymx)=ξ\langle\frac{\partial V(\vectorsym{z})}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime}} is closely related to the potential of mean force AA along ξ\xi as 38

dA(ξ)dξ|ξ(\vectorsymx)=ξ=Vξξ(\vectorsymx)=ξkBTln|\vectorsymJ|ξξ(\vectorsymx)=ξ\frac{dA(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}=\langle\frac{\partial V}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime}}-\langle k_{\rm B}T\frac{\partial\ln|\vectorsym{J}|}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime}} (13)

with \vectorsymJ\vectorsym{J} being the Jacobian matrix upon transforming from Cartesian to generalized coordinates, kBk_{\rm B} being the Boltzmann’s constant and TT the temperature. If the entropic contribution due to the change of volume element associated with the generalized coordinates (kBTln|\vectorsymJ|ξξ(\vectorsymx)=ξ-\langle k_{\rm B}T\frac{\partial\ln|\vectorsym{J}|}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime}}) is negligible, the energy component Vξ,ξV_{\xi,\xi} will approximates the potential of mean force along ξ\xi.

The energy component on a coordinate other than the reaction coordinate (i.e., a bath mode) along the reaction coordinate ξ\xi is

dVzi,ξdξ|ξ(\vectorsymx)=ξ=1FU(ξ)0TV(\vectorsymz(t))ziz˙i(t)δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0for ziξ\frac{dV_{z_{\rm i},\xi}}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}=\frac{1}{F^{U}(\xi^{\prime})}\int_{0}^{T}\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\dot{z}_{\rm i}(t)\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt\approx 0\quad\text{for $z_{\rm i}\neq\xi$} (14)

Thus, the energy component on a bath mode along the reaction coordinate vanishes as the distribution of the generalized velocity of a bath mode resembles the equilibrium one.

What if none of the coordinates in \vectorsymz\vectorsym{z} is the true reaction coordinate? Let us assume without loss of generality that the reaction coordinate ξ\xi is a function of z1z_{\rm 1} and z2z_{\rm 2} and η\eta is the bath mode that is orthogonal to ξ\xi in the subspace of z1z_{\rm 1} and z2z_{\rm 2}, then the energy component on zjz_{\rm j} (j{1,2}j\in\{1,2\}) along ξ\xi is

dVzj,ξdξ|ξ(\vectorsymx)=ξ\displaystyle\frac{dV_{z_{\rm j},\xi}}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}} =1FU(ξ)0TV(\vectorsymz(t))zj[zjξξ˙(t)+zjηη˙(t)]δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\displaystyle=\frac{1}{F^{U}(\xi^{\prime})}\int_{0}^{T}\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm j}}[\frac{\partial z_{\rm j}}{\partial\xi}\dot{\xi}(t)+\frac{\partial z_{\rm j}}{\partial\eta}\dot{\eta}(t)]\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt (15a)
V(\vectorsymz(t))zjzjξξ(\vectorsymx)=ξ,UFlux\displaystyle\approx\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm j}}\frac{\partial z_{\rm j}}{\partial\xi}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} (15b)

Again, the energy term related to the bath mode η\eta vanishes by assumption. Thus, the energy component on a mixed coordinate (a coordinate that is a combination of the reaction coordinate and a bath mode) along the reaction coordinate does not vanish. Obviously, the summation of the energy components on all the mixed coordinates equals the energy component on the reaction coordinate along itself, which approximates the free energy along it. Thus, the energy decomposition scheme in Eq. (11) is an approximate decomposition of the free energy along the reaction coordinate into components along each coordinate in a set of configurational coordinates of the system.

2. When ξ\xi is a mixed coordinate

In practice, the one-dimensional coordinate obtained can be just an approximation of the reaction coordinate, we thus assume that ξ\xi is a function of the reaction coordinate qsq_{\rm s} and a bath mode η1\eta_{\rm 1} without loss of generality. It is easy to show that the energy component on a bath mode along ξ\xi vanishes. Now, we consider the energy component on ziz_{\rm i} that is a combination of the reaction coordinate qsq_{\rm s} and a bath mode η2\eta_{\rm 2}, which can be the same as η1\eta_{\rm 1} or be different to it, along ξ\xi

dVzi,ξdξ|ξ(\vectorsymx)=ξ\displaystyle\frac{dV_{z_{\rm i},\xi}}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}} =1FU(ξ)0TV(\vectorsymz(t))zi[ziqsq˙s(t)+ziη2η˙2(t)]δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\displaystyle=\frac{1}{F^{U}(\xi^{\prime})}\int_{0}^{T}\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}[\frac{\partial z_{\rm i}}{\partial q_{\rm s}}\dot{q}_{\rm s}(t)+\frac{\partial z_{\rm i}}{\partial\eta_{\rm 2}}\dot{\eta}_{\rm 2}(t)]\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt (16a)
V(\vectorsymz(t))ziziqsq˙s(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFlux\displaystyle\approx\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{\partial z_{\rm i}}{\partial q_{\rm s}}\frac{\dot{q}_{\rm s}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} (16b)

Summing over the energy components on all the mixed coordinates, we have

dV(\vectorsymz(t))dt1ξ˙(t)ξ(\vectorsymx)=ξ,UFluxV(\vectorsymz(t))qsq˙s(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFlux\langle\frac{dV(\vectorsym{z}(t))}{dt}\frac{1}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}\approx\langle\frac{\partial V(\vectorsym{z}(t))}{\partial q_{\rm s}}\frac{\dot{q}_{\rm s}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} (17)

Thus, the summation of the energy components on all the coordinates along ξ\xi does not equal the energy component on ξ\xi along itself when ξ\xi is a mixed coordinate as there exists at least another mixed coordinate on which the energy component along ξ\xi does not vanish. Actually, the energy decomposition scheme given in Eq. (11) aims to decompose the free energy along the true reaction coordinate rather than the one along ξ\xi. When ξ\xi is very close to the reaction coordinate, the energy components should be quite good approximations to the ones along the reaction coordinate.

3. Energy along ξ\xi in the plateau region of the flux

When ξ\xi is a good one-dimensional reaction coordinate and there exits a plateau region in the flux along it. Let us define that ξC\xi_{\rm C} and ξD\xi_{\rm D} are the starting point and end point of the plateau region, respectively. From Eqs. (9) and (17) with several transformations, the weighted average energy as a function of ξ\xi in the plateau region is

V(\vectorsymx)ξ(\vectorsymx)=ξD,UFluxV(\vectorsymx)ξ(\vectorsymx)=ξC,UFlux\displaystyle\langle V(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi_{\rm D},U}^{Flux}-\langle V(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi_{\rm C},U}^{Flux} ξCξDV(\vectorsymz(t))qsq˙s(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFlux𝑑ξ\displaystyle\approx\int_{\xi_{\rm C}}^{\xi_{\rm D}}\langle\frac{\partial V(\vectorsym{z}(t))}{\partial q_{\rm s}}\frac{\dot{q}_{\rm s}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}d\xi^{\prime} (18a)
=1NT0TV(\vectorsymz(t))qsq˙s(t)ξCξDδ(ξ(t)ξ)𝑑ξ𝑑t\displaystyle=\frac{1}{N_{\rm T}}\int_{0}^{T}\frac{\partial V(\vectorsym{z}(t))}{\partial q_{\rm s}}\dot{q}_{\rm s}(t)\int_{\xi_{\rm C}}^{\xi_{\rm D}}\delta(\xi(t)-\xi^{\prime})d\xi^{\prime}dt (18b)

From the above equation, it is now clear that the weighted average energy along ξ\xi is related to the energy component along the reaction coordinate rather than the one along ξ\xi.

Now we formally define two free energy analogues A˘(ξ)\breve{A}(\xi) and A~(ξ)\tilde{A}(\xi) as follows,

A˘(ξ)\displaystyle\breve{A}(\xi^{\prime}) V(\vectorsymz)ξ(\vectorsymz)=ξ,UFlux\displaystyle\equiv\langle V(\vectorsym{z})\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}^{Flux} (19a)
dA~(ξ)dξ|ξ(\vectorsymz)=ξ\displaystyle\frac{d\tilde{A}(\xi)}{d\xi}|_{\xi(\vectorsym{z})=\xi^{\prime}} dV(\vectorsymz(t))dt1ξ˙(\vectorsymz(t))ξ(\vectorsymz)=ξ,UFlux\displaystyle\equiv\langle\frac{dV(\vectorsym{z}(t))}{dt}\frac{1}{\dot{\xi}(\vectorsym{z}(t))}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}^{Flux} (19b)

The two free energy analogues are equivalent in the plateau region of the flux along ξ\xi and can be significantly different in other regions. Actually, Eq. (9) provides a convenient way in practice to estimate the quantities in Eq. (17). The calculation of V(\vectorsymz(t))qs\frac{\partial V(\vectorsym{z}(t))}{\partial q_{\rm s}} is non-trivial and the estimation of dV(\vectorsymz(t))dt\frac{dV(\vectorsym{z}(t))}{dt} using a finite differences method requires the trajectory to be saved at almost every integration step in MD simulations, while A˘(ξ)\breve{A}(\xi) can be computed from typical MD trajectories, in which the position and velocity informations are stored with a large time interval to save disk space in practice.

D. Connection with emergent potential energy

The energy component in Eq. (11) can be related to the emergent potential energy proposed in Ref. [25]. The emergent potential energy on a coordinate ziz_{\rm i} along ξ\xi is defined as

ΔVzi(\vectorsymz;ξ,Δξ)Δξ=V(\vectorsymz)ziz˙iξ˙sgn(ξ˙)ξ(\vectorsymz)=ξ,U=V(\vectorsymz)ziz˙i|ξ˙|ξ(\vectorsymz)=ξ,U\frac{\langle\Delta V_{z_{\rm i}}(\vectorsym{z};\xi^{\prime},\Delta\xi)\rangle}{\Delta\xi}=\langle\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}}{\dot{\xi}}\text{sgn}(\dot{\xi})\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}=\langle\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}}{|\dot{\xi}|}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U} (20)

where sgn(x)(x) is the sign function and ξ(\vectorsymz)=ξ,U\langle\cdots\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U} stands for an ensemble average over the ensemble UU at ξ(\vectorsymz)=ξ\xi(\vectorsym{z})=\xi^{\prime}, which is defined as

f(t)ξ(\vectorsymx)=ξ,U0Tf(t)δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0Tδ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\langle f(t)\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}\equiv\frac{\int_{0}^{T}f(t)\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}{\int_{0}^{T}\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt} (21)

On the other hand, the energy component on ziz_{\rm i} along ξ\xi in this work can be rewritten with similar notations as

V(\vectorsymz(t))ziz˙i(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFlux=V(\vectorsymz)ziz˙iξ(\vectorsymz)=ξ,Uξ˙ξ(\vectorsymz)=ξ,U\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}=\frac{\langle\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\dot{z}_{\rm i}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}}{\langle\dot{\xi}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}} (22)

Recall the definition of a transmission coefficient analogue suggested in Ref. [28],

κˇU(ξ)=0Tξ˙(\vectorsymx(t))δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0T|ξ˙(\vectorsymx(t))|δ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t=ξ˙ξ(\vectorsymz)=ξ,U|ξ˙|ξ(\vectorsymz)=ξ,U\check{\kappa}^{U}(\xi^{\prime})=\frac{\int_{0}^{T}\dot{\xi}(\vectorsym{x}(t))\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}{\int_{0}^{T}|\dot{\xi}(\vectorsym{x}(t))|\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}=\frac{\langle\dot{\xi}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}}{\langle|\dot{\xi}|\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}} (23)

The multiplication of the energy component in Eq. (22) and the transition coefficient analogue in Eq. (23) gives

V(\vectorsymz(t))ziz˙i(t)ξ˙(t)ξ(\vectorsymx)=ξ,UFluxκˇU(ξ)=V(\vectorsymz)ziz˙iξ(\vectorsymz)=ξ,U|ξ˙|ξ(\vectorsymz)=ξ,U\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}(t)}{\dot{\xi}(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}*\check{\kappa}^{U}(\xi^{\prime})=\frac{\langle\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\dot{z}_{\rm i}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}}{\langle|\dot{\xi}|\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}} (24)

When V(\vectorsymz)ziz˙i|ξ˙|\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}}{|\dot{\xi}|} and |ξ˙||\dot{\xi}| are independent to each other in the ensemble UU, that is,

V(\vectorsymz)ziz˙iξ(\vectorsymz)=ξ,U=V(\vectorsymz)ziz˙i|ξ˙|ξ(\vectorsymz)=ξ,U|ξ˙|ξ(\vectorsymz)=ξ,U\langle\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\dot{z}_{\rm i}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}=\langle\frac{\partial V(\vectorsym{z})}{\partial z_{\rm i}}\frac{\dot{z}_{\rm i}}{|\dot{\xi}|}\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U}\langle|\dot{\xi}|\rangle_{\xi(\vectorsym{z})=\xi^{\prime},U} (25)

then the multiplication of the energy component and the transition coefficient analogue gives the emergent potential energy.

E. New quantity for reaction coordinate identification

Although the emergent potential energy provide a rigorous way to quantify from an energetic viewpoint the relative importance of a coordinate for a rare transition when the TPE is available, the estimation of it suffers from numerical instabilities in the region where the probability density in the TPE is very low. Here, we proposed the following quantity to appraise the relevance of a coordinate to the reaction coordinate

dA^U(ξ)dξ|ξ(\vectorsymx)=ξdA~U(ξ)dξ|ξ(\vectorsymx)=ξFU(ξ)NT=1NT0TdV(\vectorsymx(t))dtδ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\displaystyle\frac{d\hat{A}^{U}(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}\equiv\frac{d\tilde{A}^{U}(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}*\frac{F^{U}(\xi^{\prime})}{N_{\rm T}}=\frac{1}{N_{\rm T}}\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt (26a)
\displaystyle\Rightarrow A^U(ξmax)A^U(ξmin)=ξminξmaxdA~U(ξ)dξ|ξ(\vectorsymx)=ξdξ=1NT0TdV(\vectorsymx(t))dthU(t)𝑑t\displaystyle\hat{A}^{U}(\xi_{\rm max})-\hat{A}^{U}(\xi_{\rm min})=\int^{\xi_{\rm max}}_{\xi_{\rm min}}\frac{d\tilde{A}^{U}(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}d\xi^{\prime}=\frac{1}{N_{\rm T}}\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}h_{U}(t)dt (26b)

where, ξmin\xi_{\rm min} and ξmax\xi_{\rm max} are the lower and upper bounds of ξ\xi in the ensemble UU, respectively. When the flux FU(ξ)F^{U}(\xi) along ξ\xi is vanishingly small, the term FU(ξ)NT\frac{F^{U}(\xi^{\prime})}{N_{\rm T}} will render dAˇU(ξ)dξ|ξ(\vectorsymx)=ξ\frac{d\check{A}^{U}(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}} to be close to zero and this quantity is unlikely to suffer from numerical instability. Since the difference of A^U(ξ)\hat{A}^{U}(\xi) at the two boundaries of any coordinate are the same, it will be convenient via visual inspection to compare A^U(ξ)\hat{A}^{U}(\xi) along different coordinates. When ξ\xi is the reaction coordinate, A~U(ξ)\tilde{A}^{U}(\xi) is reproduced. When ξ\xi is a bath mode,

dA^U(ξ)dξ|ξ(\vectorsymx)=ξTUρU(ξ)NTdV(\vectorsymx(t))dqsq˙sξ(\vectorsymx)=ξ,U\displaystyle\frac{d\hat{A}^{U}(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}\approx\frac{T^{U}\rho^{U}(\xi^{\prime})}{N_{\rm T}}\langle\frac{dV(\vectorsym{x}(t))}{dq_{\rm s}}\dot{q}_{\rm s}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U} (27a)
withTU0ThU(t)𝑑t,ρU(ξ)0Tδ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0ThU(t)𝑑t\displaystyle\text{with}\quad T^{U}\equiv\int_{0}^{T}h_{U}(t)dt,\quad\rho^{U}(\xi^{\prime})\equiv\frac{\int_{0}^{T}\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}{\int_{0}^{T}h_{U}(t)dt} (27b)

where TUT^{U} is the accumulated time in the ensemble UU and ρU(ξ)\rho^{U}(\xi^{\prime}) is the probability density at ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime} in the ensemble UU. It should be reasonable to assume that dV(\vectorsymx(t))dqsq˙sξ(\vectorsymx)=ξ,U\langle\frac{dV(\vectorsym{x}(t))}{dq_{\rm s}}\dot{q}_{\rm s}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U} is a constant and then one immediately has

A^U(ξ)ΔVU¯ξminξρU(ξ)𝑑ξ\displaystyle\hat{A}^{U}(\xi)\approx\overline{\Delta V^{U}}\int_{\xi_{\rm min}}^{\xi}\rho^{U}(\xi^{\prime})d\xi^{\prime} (28a)
withΔVU¯1NT0TdV(\vectorsymx(t))dthU(t)𝑑t\displaystyle\text{with}\quad\overline{\Delta V^{U}}\equiv\frac{1}{N_{\rm T}}\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}h_{U}(t)dt (28b)

F. Energy weighted current and multidimensional energy surface in the space of collective variables

It is interesting to note that the average energy weighted with current intensity along the reaction coordinate is directly related to the free energy along it, as indicated in Eqs. (9) and (12). While the conditional canonical average of potential energy gives the internal energy, the entropic component of the free energy seems mainly arises from the curvature of the transition tube or from the shape change of transition tube intersecting with the isosurfaces of the reaction coordinate. It could be insightful in the configurational space to look at an energy weighted current, which is defined as

\vectorsymJV(\vectorsymx)0TV(\vectorsymx(t))\vectorsymx˙(t)δ(\vectorsymx(t)\vectorsymx)𝑑t=V(\vectorsymx)\vectorsymJ(\vectorsymx)\vectorsym{J}_{V}(\vectorsym{x})\equiv\int_{0}^{T}V(\vectorsym{x}(t))\dot{\vectorsym{x}}(t)\delta(\vectorsym{x}(t)-\vectorsym{x})dt=V(\vectorsym{x})\vectorsym{J}(\vectorsym{x}) (29)

One immediately has,

\vectorsymJV(\vectorsymx)=V(\vectorsymx)\vectorsymJ(\vectorsymx)+V(\vectorsymx)\vectorsymJ(\vectorsymx)\nabla\vectorsym{J}_{V}(\vectorsym{x})=\nabla V(\vectorsym{x})\cdot\vectorsym{J}(\vectorsym{x})+V(\vectorsym{x})\nabla\cdot\vectorsym{J}(\vectorsym{x}) (30)

In case of a steady state, i.e., \vectorsymJ(\vectorsymx)=0\nabla\cdot\vectorsym{J}(\vectorsym{x})=0, then

\vectorsymJV(\vectorsymx)=\vectorsymJ(\vectorsymx)V(\vectorsymx)=0TdV(\vectorsymx(t))dtδ(\vectorsymx(t)\vectorsymx)𝑑t\nabla\vectorsym{J}_{V}(\vectorsym{x})=\vectorsym{J}(\vectorsym{x})\cdot\nabla V(\vectorsym{x})=\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}\delta(\vectorsym{x}(t)-\vectorsym{x})dt (31)

Thus, the divergence of the energy weighted current equals the directional derivative of the potential energy in the direction of the reactive current for a steady state process. \vectorsymJ(\vectorsymx)V(\vectorsymx)\vectorsym{J}(\vectorsym{x})\cdot\nabla V(\vectorsym{x}) can also be viewed as the flux intensity along the gradient of the potential energy and is weighted by the magnitude of the gradient.

The energy weighted current associated with an ensemble UU is then defined as,

\vectorsymJVU(\vectorsymx)0TV(\vectorsymx(t))\vectorsymx˙(t)δ(\vectorsymx(t)\vectorsymx)hU(t)𝑑t\vectorsym{J}^{U}_{V}(\vectorsym{x})\equiv\int_{0}^{T}V(\vectorsym{x}(t))\dot{\vectorsym{x}}(t)\delta(\vectorsym{x}(t)-\vectorsym{x})h_{U}(t)dt (32)

The projection of \vectorsymJVU(\vectorsymx)\vectorsym{J}^{U}_{V}(\vectorsym{x}) to the subspace of CVs gives the energy weighted current in the CVs subspace (Appendix B),

\vectorsymJVU(\vectorsymy)=0TV(\vectorsymx(t))\vectorsymy˙(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\vectorsym{J}^{U}_{V}(\vectorsym{y})=\int_{0}^{T}V(\vectorsym{x}(t))\dot{\vectorsym{y}}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (33)

Inspired by Eq. (29), we thus seek for an energy function A˘VU(\vectorsymy)\breve{A}^{U}_{V}(\vectorsym{y}) of \vectorsymy\vectorsym{y} (thus a multidimensional free energy analogue in the subspace of CVs) such that \vectorsymJVU(\vectorsymy)=A˘VU(\vectorsymy)\vectorsymJU(\vectorsymy)\vectorsym{J}^{U}_{V}(\vectorsym{y})=\breve{A}^{U}_{V}(\vectorsym{y})\vectorsym{J}^{U}(\vectorsym{y}). One immediately finds that

A˘VU(\vectorsymy)=\vectorsymJVU(\vectorsymy)\vectorsymJU(\vectorsymy)\vectorsymJU(\vectorsymy)\vectorsymJU(\vectorsymy)\breve{A}^{U}_{V}(\vectorsym{y})=\frac{\vectorsym{J}^{U}_{V}(\vectorsym{y})\cdot\vectorsym{J}^{U}(\vectorsym{y})}{\vectorsym{J}^{U}(\vectorsym{y})\cdot\vectorsym{J}^{U}(\vectorsym{y})} (34)

and that in the one-dimensional case A˘VU(ξ)=V(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux\breve{A}^{U}_{V}(\xi^{\prime})=\langle V(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}.

In analogue to Eq. (31), the directional derivative of the potential energy associated to an ensemble UU is defined as,

jU(\vectorsymx)\vectorsymJU(\vectorsymx)V(\vectorsymx)=0TdV(\vectorsymx(t))dtδ(\vectorsymx(t)\vectorsymx)hU(t)𝑑tj_{\nabla}^{U}(\vectorsym{x})\equiv\vectorsym{J}^{U}(\vectorsym{x})\cdot\nabla V(\vectorsym{x})=\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}\delta(\vectorsym{x}(t)-\vectorsym{x})h_{U}(t)dt (35)

The projection of jU(\vectorsymx)j_{\nabla}^{U}(\vectorsym{x}) to the subspace of CVs gives,

jU(\vectorsymy)=jU(\vectorsymx)δ(\vectorsymy(\vectorsymx)\vectorsymy)𝑑\vectorsymx=0TdV(\vectorsymx(t))dtδ(\vectorsymy(t)\vectorsymy)hU(t)𝑑tj_{\nabla}^{U}(\vectorsym{y})=\int j_{\nabla}^{U}(\vectorsym{x})\delta(\vectorsym{y}(\vectorsym{x})-\vectorsym{y})d\vectorsym{x}=\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (36)

As suggested in Eq. (31), we seek for an free energy analogue A~VU(\vectorsymy)\tilde{A}^{U}_{V}(\vectorsym{y}) such that jU(\vectorsymy)=A~U(\vectorsymy)\vectorsymJU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y})=\nabla\tilde{A}^{U}(\vectorsym{y})\cdot\vectorsym{J}^{U}(\vectorsym{y}). Let us define \vectorsymg~VU(\vectorsymy)\tilde{\vectorsym{g}}^{U}_{V}(\vectorsym{y}) as the gradient of the free energy analogue A~(\vectorsymy)\tilde{A}(\vectorsym{y}), i.e., \vectorsymg~VU(\vectorsymy)A~U(\vectorsymy)\tilde{\vectorsym{g}}^{U}_{V}(\vectorsym{y})\equiv\nabla\tilde{A}^{U}(\vectorsym{y}). It is easy to show that in the one-dimensional case \vectorsymg~VU(ξ)=V(\vectorsymx)dt1ξ˙(\vectorsymx(t))ξ(\vectorsymx)=ξ,UFlux=dA~U(ξ)dξ|ξ(\vectorsymx)=ξ\tilde{\vectorsym{g}}^{U}_{V}(\xi^{\prime})=\langle\frac{V(\vectorsym{x})}{dt}\frac{1}{\dot{\xi}(\vectorsym{x}(t))}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}=\frac{d\tilde{A}^{U}(\xi)}{d\xi}|_{\xi(\vectorsym{x})=\xi^{\prime}}.

In the region of the subspace of the CVs where the divergence of the reactive current vanishes, that is to say that there is no source or sink in this region for the ensemble UU, we find that (Appendix C)

jU(\vectorsymy)=\vectorsymJVU(\vectorsymy)at \vectorsymy where \vectorsymJVU(\vectorsymy) is divergence-freej_{\nabla}^{U}(\vectorsym{y})=\nabla\cdot\vectorsym{J}^{U}_{V}(\vectorsym{y})\quad\text{at $\vectorsym{y}$ where $\vectorsym{J}^{U}_{V}(\vectorsym{y})$ is divergence-free} (37)

1. The choice of subsystem

For complex systems, it is not practical to explore the energetics of the whole system at the per-coordinate level. Usually, a proper subsystem is selected to reduce the dimension and the noisiness due to the large fluctuations in the environment or bath modes. For example, it is shown to be feasible and useful when only the residues close to the retinal were included in the analysis of residue-residue mutual works in rhodopsin 34. Let us define that the subsystem is described by \vectorsymy\vectorsym{y}, and the rest of the system is described by \vectorsymy\vectorsym{y}^{\prime} (i.e., {\vectorsymy,\vectorsymy}\{\vectorsym{y},\vectorsym{y}^{\prime}\} is another complete and independent set of configurational coordinates). In terms of {\vectorsymy,\vectorsymy}\{\vectorsym{y},\vectorsym{y}^{\prime}\}, the potential energy V(\vectorsymy,\vectorsymy)=V\vectorsymy(\vectorsymy)+V\vectorsymy(\vectorsymy)+Vcoupl(\vectorsymy,\vectorsymy)V(\vectorsym{y},\vectorsym{y}^{\prime})=V_{\vectorsym{y}}(\vectorsym{y})+V_{\vectorsym{y}^{\prime}}(\vectorsym{y}^{\prime})+V_{\rm coupl}(\vectorsym{y},\vectorsym{y}^{\prime}). V\vectorsymy(\vectorsymy)V_{\vectorsym{y}}(\vectorsym{y}) and V\vectorsymy(\vectorsymy)V_{\vectorsym{y}^{\prime}}(\vectorsym{y}^{\prime}) are the potential energy for the subsystem and its environment, respectively. Vcoupl(\vectorsymy,\vectorsymy)V_{\rm coupl}(\vectorsym{y},\vectorsym{y}^{\prime}) is the interaction energy between the subsystem and its environment. We thus also assume that the reaction coordinate is a function of \vectorsymy\vectorsym{y} and \vectorsymy\vectorsym{y}^{\prime} are bath modes, whose behaviours resemble the ones in an equilibrium ensemble.

The energy weighted current in the subspace of \vectorsymy\vectorsym{y} can then be expressed as

\vectorsymJVU(\vectorsymy)\displaystyle\vectorsym{J}^{U}_{V}(\vectorsym{y}) =0T[V\vectorsymy(\vectorsymy(t))+V\vectorsymy(\vectorsymy(t))+Vcoupl(\vectorsymy(t),\vectorsymy(t))]\vectorsymy˙(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\displaystyle=\int_{0}^{T}[V_{\vectorsym{y}}(\vectorsym{y}(t))+V_{\vectorsym{y}^{\prime}}(\vectorsym{y}^{\prime}(t))+V_{\rm coupl}(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))]\dot{\vectorsym{y}}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (38a)
=[V\vectorsymy(\vectorsymy)+V\vectorsymy¯]\vectorsymJU(\vectorsymy)+0TVcoupl(\vectorsymy(t),\vectorsymy(t))\vectorsymy˙(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\displaystyle=[V_{\vectorsym{y}}(\vectorsym{y})+\overline{V_{\vectorsym{y}^{\prime}}}]\vectorsym{J}^{U}(\vectorsym{y})+\int_{0}^{T}V_{\rm coupl}(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))\dot{\vectorsym{y}}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (38b)

where, V\vectorsymy¯\overline{V_{\vectorsym{y}^{\prime}}} is the average of V\vectorsymy(\vectorsymy)V_{\vectorsym{y}^{\prime}}(\vectorsym{y}^{\prime}) in the equilibrium ensemble and thus a constant, which can be ignored. Thus, \vectorsymJVU(\vectorsymy)\vectorsym{J}^{U}_{V}(\vectorsym{y}) can be calculated by ignoring the term V\vectorsymy(\vectorsymy)V_{\vectorsym{y}^{\prime}}(\vectorsym{y}^{\prime}), which is generally large and very noisy.

In addition, we can write jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) as

jU(\vectorsymy)\displaystyle j_{\nabla}^{U}(\vectorsym{y}) =0T[iV(\vectorsymy(t),\vectorsymy(t))yiy˙i(t)+jV(\vectorsymy(t),\vectorsymy(t))yjy˙j(t)]δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\displaystyle=\int_{0}^{T}[\sum_{i}\frac{\partial V(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))}{\partial y_{\rm i}}\dot{y}_{\rm i}(t)+\sum_{j}\frac{\partial V(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))}{\partial y^{\prime}_{\rm j}}\dot{y}^{\prime}_{\rm j}(t)]\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (39a)
=0TiV(\vectorsymy(t),\vectorsymy(t))yiy˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)dt\displaystyle=\int_{0}^{T}\sum_{i}\frac{\partial V(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))}{\partial y_{\rm i}}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (39b)
=i[0TV(\vectorsymy(t),\vectorsymy(t))yiy˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t0Ty˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t0Ty˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t]\displaystyle=\sum_{i}[\frac{\int_{0}^{T}\frac{\partial V(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))}{\partial y_{\rm i}}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt}{\int_{0}^{T}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt}\int_{0}^{T}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt] (39c)

Thus, for the calculation of jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}), only the generalized forces on \vectorsymy\vectorsym{y} are required. Note that jU(\vectorsymy)=\vectorsymg~VU(\vectorsymy)\vectorsymJU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y})=\tilde{\vectorsym{g}}^{U}_{V}(\vectorsym{y})\cdot\vectorsym{J}^{U}(\vectorsym{y}), one natural solution for \vectorsymg~VU(\vectorsymy)\tilde{\vectorsym{g}}^{U}_{V}(\vectorsym{y}) is given below with the ii-th element of \vectorsymg~VU(\vectorsymy)\tilde{\vectorsym{g}}^{U}_{V}(\vectorsym{y}) being provided by

\vectorsymg~V,iU(\vectorsymy)=0TV(\vectorsymy(t),\vectorsymy(t))yiy˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t0Ty˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\tilde{\vectorsym{g}}^{U}_{V,\rm i}(\vectorsym{y})=\frac{\int_{0}^{T}\frac{\partial V(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))}{\partial y_{\rm i}}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt}{\int_{0}^{T}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt} (40)

Substituting V(\vectorsymy,\vectorsymy)V(\vectorsym{y},\vectorsym{y}^{\prime}) with V\vectorsymy(\vectorsymy)+V\vectorsymy(\vectorsymy)+Vcoupl(\vectorsymy,\vectorsymy)V_{\vectorsym{y}}(\vectorsym{y})+V_{\vectorsym{y}^{\prime}}(\vectorsym{y}^{\prime})+V_{\rm coupl}(\vectorsym{y},\vectorsym{y}^{\prime}), we find that

\vectorsymg~V,iU(\vectorsymy)=V\vectorsymy(\vectorsymy)yi+0TVcoupl(\vectorsymy(t),\vectorsymy(t))yiy˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t0Ty˙i(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\tilde{\vectorsym{g}}^{U}_{V,\rm i}(\vectorsym{y})=\frac{\partial V_{\vectorsym{y}}(\vectorsym{y})}{\partial y_{\rm i}}+\frac{\int_{0}^{T}\frac{\partial V_{\rm coupl}(\vectorsym{y}(t),\vectorsym{y}^{\prime}(t))}{\partial y_{\rm i}}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt}{\int_{0}^{T}\dot{y}_{\rm i}(t)\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt} (41)

Now, we assume that Vcoupl(\vectorsymy,\vectorsymy)=g(\vectorsymy)m(\vectorsymy)V_{\rm coupl}(\vectorsym{y},\vectorsym{y}^{\prime})=g(\vectorsym{y})m(\vectorsym{y}^{\prime}), \vectorsymg~V,iU(\vectorsymy)\tilde{\vectorsym{g}}^{U}_{V,\rm i}(\vectorsym{y}) can then be write as

\vectorsymg~V,iU(\vectorsymy)=[V\vectorsymy(\vectorsymy)+g(\vectorsymy)m(\vectorsymy)¯]yi\tilde{\vectorsym{g}}^{U}_{V,\rm i}(\vectorsym{y})=\frac{\partial[V_{\vectorsym{y}}(\vectorsym{y})+g(\vectorsym{y})\overline{m(\vectorsym{y}^{\prime})}]}{\partial y_{\rm i}} (42)

where, m(\vectorsymy)¯\overline{m(\vectorsym{y}^{\prime})} is the average of m(\vectorsymy)m(\vectorsym{y}^{\prime}) in the equilibrium ensemble and thus a constant. Similarly, we find that from Eq. (38b)

\vectorsymJVU(\vectorsymy)=[V\vectorsymy(\vectorsymy)+g(\vectorsymy)m(\vectorsymy)¯+V\vectorsymy¯]\vectorsymJU(\vectorsymy)\vectorsym{J}^{U}_{V}(\vectorsym{y})=[V_{\vectorsym{y}}(\vectorsym{y})+g(\vectorsym{y})\overline{m(\vectorsym{y}^{\prime})}+\overline{V_{\vectorsym{y}^{\prime}}}]\vectorsym{J}^{U}(\vectorsym{y}) (43)

From Eqs. (34) and (43), we have

A˘VU(\vectorsymy)=V\vectorsymy(\vectorsymy)+g(\vectorsymy)m(\vectorsymy)¯+V\vectorsymy¯\breve{A}^{U}_{V}(\vectorsym{y})=V_{\vectorsym{y}}(\vectorsym{y})+g(\vectorsym{y})\overline{m(\vectorsym{y}^{\prime})}+\overline{V_{\vectorsym{y}^{\prime}}} (44)

Obviously, when Vcoupl(\vectorsymy,\vectorsymy)V_{\rm coupl}(\vectorsym{y},\vectorsym{y}^{\prime}) can be expressed as the summation of functions with the same form of g(\vectorsymy)m(\vectorsymy)g(\vectorsym{y})m(\vectorsym{y}^{\prime}), Eq. (44) holds. Thus, under the assumption that the distribution of \vectorsymy\vectorsym{y}^{\prime} at any position of \vectorsymy\vectorsym{y} resemble the one in the equilibrium ensemble, A~U(\vectorsymy)\tilde{A}^{U}(\vectorsym{y}) is equivalent to A˘U(\vectorsymy)\breve{A}^{U}(\vectorsym{y}) (i.e., differ by a constant).

If the coupling between the subsystem and its environment is weak, that is Vcoupl(\vectorsymy,\vectorsymy)V\vectorsymy(\vectorsymy)V_{\rm coupl}(\vectorsym{y},\vectorsym{y}^{\prime})\ll V_{\vectorsym{y}}(\vectorsym{y}), \vectorsymg~V,iU(\vectorsymy)\tilde{\vectorsym{g}}^{U}_{V,\rm i}(\vectorsym{y}) and A˘U(\vectorsymy)\breve{A}^{U}(\vectorsym{y}) can be further approximated as

\vectorsymg~V,iU(\vectorsymy)\displaystyle\tilde{\vectorsym{g}}^{U}_{V,\rm i}(\vectorsym{y}) V\vectorsymy(\vectorsymy)yi\displaystyle\approx\frac{\partial V_{\vectorsym{y}}(\vectorsym{y})}{\partial y_{\rm i}} (45a)
A˘U(\vectorsymy)\displaystyle\breve{A}^{U}(\vectorsym{y}) V\vectorsymy(\vectorsymy)\displaystyle\approx V_{\vectorsym{y}}(\vectorsym{y}) (45b)

Thus, the above derivations rationalize that the free energy and the energy components can be obtained from a proper subsystem in close approximation. If the A~U(\vectorsymy)\nabla\tilde{A}^{U}(\vectorsym{y}) obtained with Eq. (45a) is close to the one with Eq. (41), the subsystem could be considered a proper one that preserves the free energy of the system. The energy components obtained from this subsystem are thus good approximations of the ones from the entire system.

2. Overdamped dynamics and parabolic energy barrier

Assuming an overdamped dynamics and a parabolic barrier of the potential of mean force A(\vectorsymy)A(\vectorsym{y}) with the barrier top at \vectorsymy=0\vectorsym{y}=0, that is βA(\vectorsymy)=\vectorsymyt\vectorsymH\vectorsymy/2\beta A(\vectorsym{y})=\vectorsym{y}^{t}\vectorsym{H}\vectorsym{y}/2, where β=1/(kBT)\beta=1/(k_{\rm B}T) and HH is the Hessian matrix. From Roux 16 in case of a position-independent diffusion matrix \vectorsymD\vectorsym{D}, the committor near the saddle point can be approximated by q(\vectorsymy)q(\vectorsymy\vectorsyme)q(\vectorsym{y})\approx q(\vectorsym{y}\cdot\vectorsym{e}) with \vectorsyme\vectorsym{e} being the unit vector parallel to q(\vectorsymy)\nabla q(\vectorsym{y}), and \vectorsyme\vectorsym{e} is the eigenvector of the only negative eigenvalue λ\lambda of \vectorsymH\vectorsymD\vectorsym{H}\vectorsym{D}, i.e., \vectorsymH\vectorsymD\vectorsyme=λ\vectorsyme\vectorsym{H}\vectorsym{D}\vectorsym{e}=-\lambda\vectorsym{e}, with Eq. (5) we can write jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) as

jU(\vectorsymy)\displaystyle j_{\nabla}^{U}(\vectorsym{y}) eβA(\vectorsymy)\vectorsymDq(\vectorsymy)A(\vectorsymy)\displaystyle\propto e^{-\beta A(\vectorsym{y})}\vectorsym{D}\nabla q(\vectorsym{y})\cdot\nabla A(\vectorsym{y}) (46a)
eβA(\vectorsymy)\vectorsymyt\vectorsymH\vectorsymD\vectorsymeq(\vectorsymy\vectorsyme)\displaystyle\propto e^{-\beta A(\vectorsym{y})}\vectorsym{y}^{t}\vectorsym{H}\vectorsym{D}\vectorsym{e}q^{\prime}(\vectorsym{y}\cdot\vectorsym{e}) (46b)
eβA(\vectorsymy)q(qs)qs\displaystyle\propto e^{-\beta A(\vectorsym{y})}q^{\prime}(q_{\rm s})q_{\rm s} (46c)

here, qs=\vectorsymy\vectorsymeq_{\rm s}=\vectorsym{y}\cdot\vectorsym{e} is the reaction coordinate. The gradient of jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) can be expressed as

jU(\vectorsymy)\displaystyle\nabla j_{\nabla}^{U}(\vectorsym{y}) eβA(\vectorsymy)q(qs)\vectorsyme+qs[eβA(\vectorsymy)q′′(qs)\vectorsyme+eβA(\vectorsymy)q(qs)(βA(\vectorsymy))]\displaystyle\propto e^{-\beta A(\vectorsym{y})}q^{\prime}(q_{\rm s})\vectorsym{e}+q_{\rm s}[e^{-\beta A(\vectorsym{y})}q^{\prime\prime}(q_{\rm s})\vectorsym{e}+e^{-\beta A(\vectorsym{y})}q^{\prime}(q_{\rm s})\nabla(-\beta A(\vectorsym{y}))] (47a)
eβA(\vectorsymy)q(qs)[\vectorsymeλDqs2\vectorsymeqs(βA(\vectorsymy))]\displaystyle\propto e^{-\beta A(\vectorsym{y})}q^{\prime}(q_{\rm s})[\vectorsym{e}-\frac{\lambda}{D}q_{\rm s}^{2}\vectorsym{e}-q_{\rm s}\nabla(\beta A(\vectorsym{y}))] (47b)

where, the conclusion that q′′(qs)=λqsq(qs)/Dq^{\prime\prime}(q_{\rm s})=-\lambda q_{\rm s}q^{\prime}(q_{\rm s})/D from Ref. [16] is used with D\vectorsymet\vectorsymD\vectorsymeD\equiv\vectorsym{e}^{t}\vectorsym{D}\vectorsym{e}. In the vicinity of the saddle point, i.e., q0q\approx 0, the vector jU(\vectorsymy)\nabla j_{\nabla}^{U}(\vectorsym{y}) is parallel to \vectorsyme\vectorsym{e}. Thus, the analysis of jU(\vectorsymy)\nabla j_{\nabla}^{U}(\vectorsym{y}) provides a way to find \vectorsyme\vectorsym{e}, the direction of the gradient of the committor near the saddle point.

3. A slow reaction coordinate coupled with fast bath modes

Now we consider a system that consists of a slow coordinate qsq_{\rm s}, which is also the only reaction coordinate, and a pool of fast bath modes \vectorsymqb\vectorsym{q}_{\rm b}. It is assumed that the dynamics along bath modes in a non-equilibrium ensemble UU resembles the one in equilibrium and the coupling between the bath modes and the reaction coordinate qsq_{\rm s} is weak. Since the components in the bath modes are vanishingly small, we can approximate jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) with

jU(qs,\vectorsymqb)\displaystyle j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b}) 0TA(qs(t),\vectorsymqb(t))qsq˙s(t)δ(qs(t)qs)δ(\vectorsymqb(t)\vectorsymqb)hU(t)𝑑t\displaystyle\approx\int_{0}^{T}\frac{\partial A(q_{\rm s}(t),\vectorsym{q}_{\rm b}(t))}{\partial q_{\rm s}}\dot{q}_{\rm s}(t)\delta(q_{\rm s}(t)-q_{\rm s})\delta(\vectorsym{q}_{\rm b}(t)-\vectorsym{q}_{\rm b})h_{U}(t)dt (48a)
=K(qs;\vectorsymqb)¯ρU(\vectorsymqb)TU\displaystyle=\overline{K(q_{\rm s};\vectorsym{q}_{\rm b})}\rho^{U}(\vectorsym{q}_{\rm b})T^{U} (48b)
withK(qs;\vectorsymqb)¯0TA(qs(t),\vectorsymqb(t))qsq˙s(t)δ(qs(t)qs)δ(\vectorsymqb(t)\vectorsymqb)hU(t)𝑑t0Tδ(\vectorsymqb(t)\vectorsymqb)hU(t)𝑑t\displaystyle\text{with}\quad\overline{K(q_{\rm s};\vectorsym{q}_{\rm b})}\equiv\frac{\int_{0}^{T}\frac{\partial A(q_{\rm s}(t),\vectorsym{q}_{\rm b}(t))}{\partial q_{\rm s}}\dot{q}_{\rm s}(t)\delta(q_{\rm s}(t)-q_{\rm s})\delta(\vectorsym{q}_{\rm b}(t)-\vectorsym{q}_{\rm b})h_{U}(t)dt}{\int_{0}^{T}\delta(\vectorsym{q}_{\rm b}(t)-\vectorsym{q}_{\rm b})h_{U}(t)dt} (48c)
andρU(\vectorsymqb)=0Tδ(\vectorsymqb(t)\vectorsymqb)hU(t)𝑑t0ThU(t)𝑑t\displaystyle\text{and}\quad\rho^{U}(\vectorsym{q}_{\rm b})=\frac{\int_{0}^{T}\delta(\vectorsym{q}_{\rm b}(t)-\vectorsym{q}_{\rm b})h_{U}(t)dt}{\int_{0}^{T}h_{U}(t)dt} (48d)

If K(qs;\vectorsymqb)¯\overline{K(q_{\rm s};\vectorsym{q}_{\rm b})} is independent to \vectorsymqb\vectorsym{q}_{\rm b}, then

jU(qs,\vectorsymqb)𝑑\vectorsymqbK(qs;\vectorsymqb)¯ρU(\vectorsymqb)TU𝑑\vectorsymqb=K(qs;\vectorsymqb)¯TU\int j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b})d\vectorsym{q}_{\rm b}\approx\int\overline{K(q_{\rm s};\vectorsym{q}_{\rm b})}\rho^{U}(\vectorsym{q}_{\rm b})T^{U}d\vectorsym{q}_{\rm b}=\overline{K(q_{\rm s};\vectorsym{q}_{\rm b})}T^{U} (49)

On the other hand,

jU(qs,\vectorsymqb)𝑑\vectorsymqb\displaystyle\int j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b})d\vectorsym{q}_{\rm b} =0TdA(qs(t),\vectorsymqb(t))dtδ(qs(t)qs)hU(t)𝑑t\displaystyle=\int_{0}^{T}\frac{dA(q_{\rm s}(t),\vectorsym{q}_{\rm b}(t))}{dt}\delta(q_{\rm s}(t)-q_{\rm s})h_{U}(t)dt (50a)
=dA(qs(t),\vectorsymqb(t))dt1q˙s(t)qs(t)=qsFluxFU(qs)=dA~(qs)dqsFU(qs)\displaystyle=\langle\frac{dA(q_{\rm s}(t),\vectorsym{q}_{\rm b}(t))}{dt}\frac{1}{\dot{q}_{\rm s}(t)}\rangle_{q_{\rm s}(t)=q_{\rm s}}^{Flux}F^{U}(q_{\rm s})=\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}F^{U}(q_{\rm s}) (50b)

Thus, K(qs;\vectorsymqb)¯TUdA~(qs)dqsFU(qs)\overline{K(q_{\rm s};\vectorsym{q}_{\rm b})}T^{U}\approx\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}F^{U}(q_{\rm s}) and

jU(qs,\vectorsymqb)dA~(qs)dqsFU(qs)ρU(\vectorsymqb)j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b})\approx\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}F^{U}(q_{\rm s})\rho^{U}(\vectorsym{q}_{\rm b}) (51)

The gradient of jU(qs,\vectorsymqb)j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b}) is

jU(qs,\vectorsymqb)qs\displaystyle\frac{\partial j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b})}{\partial q_{\rm s}} [d2A~(qs)dqs2FU(qs)+dA~(qs)dqsdFU(qs)dqs]ρU(\vectorsymqb)\displaystyle\approx[\frac{d^{2}\tilde{A}(q_{\rm s})}{dq^{2}_{\rm s}}F^{U}(q_{\rm s})+\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}\frac{dF^{U}(q_{\rm s})}{dq_{\rm s}}]\rho^{U}(\vectorsym{q}_{\rm b}) (52a)
\vectorsymqbjU(qs,\vectorsymqb)\displaystyle\nabla_{\vectorsym{q}_{\rm b}}j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b}) dA~(qs)dqsFU(qs)\vectorsymqbρU(\vectorsymqb)\displaystyle\approx\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}F^{U}(q_{\rm s})\nabla_{\vectorsym{q}_{\rm b}}\rho^{U}(\vectorsym{q}_{\rm b}) (52b)

\vectorsymqb\nabla_{\vectorsym{q}_{\rm b}} denotes the derivatives with respect to \vectorsymqb\vectorsym{q}_{\rm b}. Eq. (52) suggests that the gradient of jU(qs,\vectorsymqb)j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b}) points the direction of the reaction coordinate when (1) dA~(qs)dqs=0\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}=0 or (2) \vectorsymqbρU(\vectorsymqb)=\vectorsym0\nabla_{\vectorsym{q}_{\rm b}}\rho^{U}(\vectorsym{q}_{\rm b})=\vectorsym{0}, a zero vector. One particular case when dA~(qs)dqs=0\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}=0 is at the barrier top of the free energy analogue (A~(qs)\tilde{A}(q_{\rm s})) along the reaction coordinate. It is the surface qs=qsq_{\rm s}=q_{\rm s}^{*} with dA~(qs)dqs|qs=qs=0\frac{d\tilde{A}(q_{\rm s})}{dq_{\rm s}}|_{q_{\rm s}=q_{\rm s}^{*}}=0, which could be the stochastic separatrix. At the surface qs=qsq_{\rm s}=q_{\rm s}^{*}, the directions of all jU(qs,\vectorsymqb)\nabla j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b}) are parallel to the direction of the reaction coordinate qsq_{\rm s} and the magnitude of jU(qs,\vectorsymqb))\nabla j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b})) is proportional to ρU(\vectorsymqb)\rho^{U}(\vectorsym{q}_{\rm b}). The point at which the magnitude of jU(qs,\vectorsymqb)\nabla j_{\nabla}^{U}(q_{\rm s},\vectorsym{q}_{\rm b}) reaches its maximum corresponds to the saddle point. A special case when \vectorsymqbρU(\vectorsymqb)=\vectorsym0\nabla_{\vectorsym{q}_{\rm b}}\rho^{U}(\vectorsym{q}_{\rm b})=\vectorsym{0} is a curve where ρU(\vectorsymqb)\rho^{U}(\vectorsym{q}_{\rm b}) reaches its maximum, which corresponds to the minimum free energy path and passes the saddle point. Thus, the direction of the reaction coordinate along the minimum free energy path can be obtained via visual inspection of the vector field jU(\vectorsymy)\nabla j_{\nabla}^{U}(\vectorsym{y}) in the subspace of CVs.

F. Time-lagged quantities

Analogue to the time-lagged flux and current proposed in Ref. [28], the extension of the above quantities to time-lagged ones is rather straightforward, for instance, by replace the generalized velocity \vectorsymy˙(t)\dot{\vectorsym{y}}(t) with \vectorsymy(t+τ)\vectorsymy(t)τ\frac{\vectorsym{y}(t+\tau)-\vectorsym{y}(t)}{\tau}. Here, τ\tau is the lag time. For example, the ensemble average weighted with current intensity given in Eq. [3] can be extended to the ensemble average weighted by time-lagged current intensity, which is defined as

f(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux,τ0Tf(\vectorsymx(t))ξ(\vectorsymx(t+τ))ξ(\vectorsymx(t))τδ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t0Tξ(\vectorsymx(t+τ))ξ(\vectorsymx(t))τδ(ξ(\vectorsymx(t))ξ)hU(t)𝑑t\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux,\tau}\equiv\frac{\int_{0}^{T}f(\vectorsym{x}(t))\frac{\xi(\vectorsym{x}(t+\tau))-\xi(\vectorsym{x}(t))}{\tau}\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt}{\int_{0}^{T}\frac{\xi(\vectorsym{x}(t+\tau))-\xi(\vectorsym{x}(t))}{\tau}\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})h_{U}(t)dt} (53)

The time-lagged version of energy decomposed onto a coordinate ziz_{\rm i} along the coordinate ξ\xi in Eq. [11] is

Vzi,ξτ=V(\vectorsymz(t))zizi(t+τ)zi(t)ξ(t+τ)ξ(t)ξ(\vectorsymx)=ξ,UFlux,τ𝑑ξV_{z_{\rm i},\xi}^{\tau}=\int\langle\frac{\partial V(\vectorsym{z}(t))}{\partial z_{\rm i}}\frac{z_{\rm i}(t+\tau)-z_{\rm i}(t)}{\xi(t+\tau)-\xi(t)}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux,\tau}d\xi^{\prime} (54)

The time-lagged counterpart of jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) in Eq. [36] is,

jU(\vectorsymy,τ)0TV(\vectorsymx(t+τ))V(\vectorsymx(t))τδ(\vectorsymy(t)\vectorsymy)hU(t)𝑑tj_{\nabla}^{U}(\vectorsym{y},\tau)\equiv\int_{0}^{T}\frac{V(\vectorsym{x}(t+\tau))-V(\vectorsym{x}(t))}{\tau}\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (55)

Time-lagged backward quantities and time-lagged mean quantities can be analogously defined. It will be interesting to explore the properties and usefulness of these time-lagged quantities in analysing non-equilibrium ensembles of MD trajectories.

III. CONCLUDING REMARKS

In a non-equilibrium ensemble, an important characteristics is that the reactive current in it is non-zero. Following this thinking path, we here propose an ensemble average of an arbitrary time series weighted with the reactive current intensity, which can be related to the conditional canonical average under proper approximations. When the quantity to be averaged is the potential energy, such a weighted ensemble average is demonstrated directly related to the free energy along the reaction coordinate. By decomposing the infinitesimal change of the potential energy into components on the coordinates that form an independent and complete set of configurational coordinates, a free energy decomposition scheme is established in Eq. (11) and is named as the energy decomposition along reaction coordinate. Although the energy decomposition is better to be done along the exact reaction coordinate, the energy decomposition along a good one-dimensional reaction coordinate gives quantitatively equivalent results, which are proportional to the ones along the true reaction coordinate as indicated in Eq. (18b). In addition, the energy component on a coordinate along the reaction coordinate is suggested to be able to appraise the relevance of the coordinate to the reaction coordinate and thus can be applied in the selection and optimization of a one-dimensional reaction coordinate.

Since the ensemble average in Eq. (3) gives a free energy analogue as a function of ξ\xi, one may ask whether a multi-dimensional free energy analogue can be defined based on the reactive current in the subspace of CVs? We thus proposed the potential energy weighted current and the directional derivative of the potential energy along the reactive current, from which free energy analogues A~(\vectorsymy)\tilde{A}(\vectorsym{y}) and A˘(\vectorsymy)\breve{A}(\vectorsym{y}) are defined, respectively. These free energy analogues can be considered as the extension of the one-dimensional free energy analogue. Although A~(\vectorsymy)\tilde{A}(\vectorsym{y}) and A˘(\vectorsymy)\breve{A}(\vectorsym{y}) are equivalent at regions where the reactive current is divergence-free, A~(\vectorsymy)\tilde{A}(\vectorsym{y}) should be in a closer relation than A˘(\vectorsymy)\breve{A}(\vectorsym{y}) with the free energy at regions where the divergence of the reactive current does not vanish. From a practical aspect, we would like to emphasize that Eq. (34) provides a convenient way to obtain a reasonable estimation of the multidimensional free energy in the region where the reactive current is divergence-free, while Eq. (40) can be generally applied to calculate the free energy surface from a non-equilibrium ensemble in which the reactive current is non-vanishing.

Interestingly, jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}), the directional derivative of the free energy analogue A~(\vectorsymy)\tilde{A}(\vectorsym{y}) as defined in Eq. (36), could be more useful than it appears. The gradient of jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) gives a vector field from which the direction of the reaction coordinate can be easily identified along the minimum free energy path in the subspace of CVs (not the tangent of the minimum free energy path). We suggest that a few relevant coordinates can be first identified with Eq. (26a) without the calculation of partial derivative with respect to coordinates (an independent and complete set) and then a one-dimensional reaction coordinate as a (possibly curvilinear) function of these relevant coordinates can be constructed by visual inspection of the vector field obtained from jU(\vectorsymy)j_{\nabla}^{U}(\vectorsym{y}) with Eq. (39b).

We would like to point out that the potential energy in Eqs. (33) and (36) can be replaced by an arbitrary function of \vectorsymx\vectorsym{x} or more generally an arbitrary time series. Quantities analogous to A~(\vectorsymy)\tilde{A}(\vectorsym{y}) and A˘(\vectorsymy)\breve{A}(\vectorsym{y}) can then be defined as well and a relation similar to the one in Eq. (37) can also be obtained.

About the choice of the non-equilibrium ensemble, a natural choice is the TPE, in which the reactive flux along the reaction coordinate is known to be equal to the number of the transition paths. Another interesting ensemble could be the swarms of short trajectories, for example, the short trajectories initiated from configurations in a chain of replicas as in the string method 4 or a known transition path. One can imaging that the flux along the reaction coordinate in the transition state region will be close to zero as the forward (go towards the product state) and backward (go towards the reactant state) trajectories are equally probable. In the barrier region close to the reactant state, there will be more backward trajectory than the forward trajectory and thus the net flux will be negative, while the flux will be positive in the barrier region close to the product state. Thus, the quantities proposed here can generally be estimated for such an ensemble of trajectories. Alternatively, one can reverse the backward trajectories and thus convert them into forward trajectories and reweight these trajectories (although it is not trivial), then there will be a net positive flux along the reaction coordinate (including the transition state region) and the above quantities can be generally evaluated to gain insightful information about the transition process.

APPENDIX

(A) Proof of Eq. (9)

In the plateau region of the flux (FU(ξ)F^{U}(\xi)) along ξ\xi, we have that FU(ξ)F^{U}(\xi) equals the number of transition paths NTN_{\rm T} and dFU(ξ)dξ|ξ=ξ=0\frac{dF^{U}(\xi)}{d\xi}|_{\xi=\xi^{\prime}}=0. Note that f(\vectorsymx)ξ(\vectorsymx)=ξ,UFlux\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} in Eq. (3) is a function of ξ\xi^{\prime}, we can consider its derivative with respect to ξ\xi^{\prime} for ξ\xi^{\prime} in the plateau region,

df(\vectorsymx)ξ(\vectorsymx)=ξ,UFluxdξ=\displaystyle\frac{d\langle f(\vectorsym{x})\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux}}{d\xi^{\prime}}= 1NT0Tf(\vectorsymx(t))ξ˙(\vectorsymx(t))dδ(ξ(\vectorsymx(t))ξ)dξhU(t)𝑑t\displaystyle\frac{1}{N_{\rm T}}\int_{0}^{T}f(\vectorsym{x}(t))\dot{\xi}(\vectorsym{x}(t))\frac{d\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})}{d\xi^{\prime}}h_{U}(t)dt (56a)
=\displaystyle= 1NT0Tf(\vectorsymx(t))dδ(ξ(\vectorsymx(t))ξ)dthU(t)𝑑t\displaystyle\frac{1}{N_{\rm T}}\int_{0}^{T}f(\vectorsym{x}(t))\frac{-d\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})}{dt}h_{U}(t)dt (56b)
=\displaystyle= 1NT0Td[f(\vectorsymx(t))hU(t)]dtδ(ξ(\vectorsymx(t))ξ)𝑑t\displaystyle\frac{1}{N_{\rm T}}\int_{0}^{T}\frac{d[f(\vectorsym{x}(t))h_{U}(t)]}{dt}\delta(\xi(\vectorsym{x}(t))-\xi^{\prime})dt (56c)
=\displaystyle= df(\vectorsymx(t))dt1ξ˙(\vectorsymx(t))ξ(\vectorsymx)=ξ,UFlux\displaystyle\langle\frac{df(\vectorsym{x}(t))}{dt}\frac{1}{\dot{\xi}(\vectorsym{x}(t))}\rangle_{\xi(\vectorsym{x})=\xi^{\prime},U}^{Flux} (56d)

From Eq. (56b) to Eq. (56c), integration by parts is used and the fact is noted that there is no source or sink in the plateau region as indicated in dFU(ξ)dξ|ξ=ξ=0\frac{dF^{U}(\xi)}{d\xi}|_{\xi=\xi^{\prime}}=0. From Eq. (56c) to Eq. (56d), we note that dhU(t)/dt=0dh_{U}(t)/dt=0 for any time when the trajectory hits the surface ξ(\vectorsymx)=ξ\xi(\vectorsym{x})=\xi^{\prime}.

(B) The energy weighted current in the subspace of collective variables

The ii-th element of \vectorsymJVU(\vectorsymy)\vectorsym{J}^{U}_{V}(\vectorsym{y}) is the projection of \vectorsymJVU(\vectorsymx)\vectorsym{J}^{U}_{V}(\vectorsym{x}) to yiy_{\rm i} in the subspace of CVs, that is

\vectorsymJV,iU(\vectorsymy)\displaystyle\vectorsym{J}^{U}_{V,\rm i}(\vectorsym{y}) =\vectorsymJVU(\vectorsymx)yi(\vectorsymx)δ(\vectorsymy(\vectorsymx)\vectorsymy)𝑑\vectorsymx\displaystyle=\int\vectorsym{J}^{U}_{V}(\vectorsym{x})\nabla y_{\rm i}(\vectorsym{x})\delta(\vectorsym{y}(\vectorsym{x})-\vectorsym{y})d\vectorsym{x} (57a)
=0TV(\vectorsymx(t))\vectorsymx˙(t)yi(\vectorsymx)δ(\vectorsymx(t)\vectorsymx)δ(\vectorsymy(\vectorsymx)\vectorsymy)𝑑\vectorsymxhU(t)𝑑t\displaystyle=\int_{0}^{T}V(\vectorsym{x}(t))\dot{\vectorsym{x}}(t)\cdot\int\nabla y_{\rm i}(\vectorsym{x})\delta(\vectorsym{x}(t)-\vectorsym{x})\delta(\vectorsym{y}(\vectorsym{x})-\vectorsym{y})d\vectorsym{x}h_{U}(t)dt (57b)
=0TV(\vectorsymx(t))yi(\vectorsymx(t))\vectorsymx˙(t)δ(\vectorsymy(\vectorsymx(t))\vectorsymy)hU(t)𝑑t\displaystyle=\int_{0}^{T}V(\vectorsym{x}(t))\nabla y_{\rm i}(\vectorsym{x}(t))\cdot\dot{\vectorsym{x}}(t)\delta(\vectorsym{y}(\vectorsym{x}(t))-\vectorsym{y})h_{U}(t)dt (57c)
=0TV(\vectorsymx(t))y˙i(\vectorsymx(t))δ(\vectorsymy(\vectorsymx(t))\vectorsymy)hU(t)𝑑t\displaystyle=\int_{0}^{T}V(\vectorsym{x}(t))\dot{y}_{\rm i}(\vectorsym{x}(t))\delta(\vectorsym{y}(\vectorsym{x}(t))-\vectorsym{y})h_{U}(t)dt (57d)

(C) Proof of Eq. (37)

In the region of the subspace of the CVs where the divergence of the reactive current vanishes, i.e., \vectorsymJU(\vectorsymy)=0\nabla\cdot\vectorsym{J}^{U}(\vectorsym{y})=0, which indicates that there is no source or sink in this region for the ensemble UU, we find that

jU(\vectorsymy)\displaystyle j_{\nabla}^{U}(\vectorsym{y}) =0TdV(\vectorsymx(t))dtδ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\displaystyle=\int_{0}^{T}\frac{dV(\vectorsym{x}(t))}{dt}\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (58a)
=0Td[δ(\vectorsymy(t)\vectorsymy)hU(t)]dtV(\vectorsymx(t))𝑑t\displaystyle=-\int_{0}^{T}\frac{d[\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)]}{dt}V(\vectorsym{x}(t))dt (58b)
=0Tdδ(\vectorsymy(t)\vectorsymy)dtV(\vectorsymx(t))hU(t)𝑑t\displaystyle=-\int_{0}^{T}\frac{d\delta(\vectorsym{y}(t)-\vectorsym{y})}{dt}V(\vectorsym{x}(t))h_{U}(t)dt (58c)
=0TV(\vectorsymx(t))\vectorsymy˙(t)δ(\vectorsymy(t)\vectorsymy)hU(t)𝑑t\displaystyle=\int_{0}^{T}V(\vectorsym{x}(t))\dot{\vectorsym{y}}(t)\cdot\nabla\delta(\vectorsym{y}(t)-\vectorsym{y})h_{U}(t)dt (58d)
=\vectorsymJVU(\vectorsymy)\displaystyle=\nabla\cdot\vectorsym{J}^{U}_{V}(\vectorsym{y}) (58e)

From Eq. (58a) to Eq. (58b), integration by parts is applied. From Eq. (58b) to Eq. (58c), we note that dhU(t)/dt=0dh_{U}(t)/dt=0 for any time when the trajectory hits the surface \vectorsymy(\vectorsymx)=\vectorsymy\vectorsym{y}(\vectorsym{x})=\vectorsym{y}.

ACKNOWLEDGMENTS

This work was supported by Natural Science Foundation of Guangdong Province, China (Grant No. 2020A1515010984) and the Start-up Grant for Young Scientists (860-000002110384), Shenzhen University.

References

  • Karplus and McCammon 2002 Karplus, M.; McCammon, J. A. Nature structural biology 2002, 9, 646–652
  • Wang et al. 2021 Wang, X.; Singh, N.; Li, W. Systems Medicine; Elsevier, 2021; pp 182–189
  • Chandler 1978 Chandler, D. The Journal of Chemical Physics 1978, 68, 2959–2970
  • Pan et al. 2008 Pan, A. C.; Sezer, D.; Roux, B. The journal of physical chemistry B 2008, 112, 3432–3440
  • Allen et al. 2009 Allen, R. J.; Valeriani, C.; ten Wolde, P. R. Journal of Physics: Condensed Matter 2009, 21, 463102
  • Borrero and Escobedo 2007 Borrero, E. E.; Escobedo, F. A. The Journal of chemical physics 2007, 127, 164101
  • Faradjian and Elber 2004 Faradjian, A. K.; Elber, R. The Journal of chemical physics 2004, 120, 10880–10889
  • Elber 2017 Elber, R. Quarterly reviews of biophysics 2017, 50
  • Berezhkovskii and Szabo 2019 Berezhkovskii, A. M.; Szabo, A. The Journal of chemical physics 2019, 150, 054106
  • Zuckerman and Chong 2017 Zuckerman, D. M.; Chong, L. T. Annual review of biophysics 2017, 46, 43–57
  • Dellago et al. 1998 Dellago, C.; Bolhuis, P. G.; Csajka, F. S.; Chandler, D. The Journal of Chemical Physics 1998, 108, 1964
  • Bolhuis et al. 1998 Bolhuis, P. G.; Dellago, C.; Chandler, D. Faraday Discussions 1998, 110, 421–436
  • Dellago et al. 1999 Dellago, C.; Bolhuis, P. G.; Chandler, D. The Journal of Chemical Physics 1999, 110, 6617
  • Bolhuis et al. 2002 Bolhuis, P. G.; Chandler, D.; Dellago, C.; Geissler, P. L. Annual Review of Physical Chemistry 2002, 53, 291–318
  • Dellago et al. 2002 Dellago, C.; Bolhuis, P. G.; Geissler, P. L. Adv. in Chem. Phys. 2002, 123, 1–78
  • Roux 2021 Roux, B. The Journal of Physical Chemistry A 2021, 125, 7558–7571
  • Li and Ma 2014 Li, W.; Ma, A. Molecular simulation 2014, 40, 784–793
  • Berezhkovskii and Szabo 2005 Berezhkovskii, A.; Szabo, A. The Journal of chemical physics 2005, 122, 014503
  • Rhee and Pande 2005 Rhee, Y. M.; Pande, V. S. The Journal of Physical Chemistry B 2005, 109, 6780–6786
  • Ma and Dinner 2005 Ma, A.; Dinner, A. R. J Phys Chem B 2005, 109, 6769–79
  • Peters and Trout 2006 Peters, B.; Trout, B. L. The Journal of Chemical Physics 2006, 125, 054108
  • Peters et al. 2007 Peters, B.; Beckham, G. T.; Trout, B. L. The Journal of chemical physics 2007, 127, 034109–034109
  • Peters et al. 2013 Peters, B.; Bolhuis, P. G.; Mullen, R. G.; Shea, J.-E. The Journal of chemical physics 2013, 138, 054106
  • Li and Ma 2015 Li, W.; Ma, A. The Journal of chemical physics 2015, 143, 11B603_1
  • Li and Ma 2016 Li, W.; Ma, A. The Journal of chemical physics 2016, 144, 134104
  • Li 2018 Li, W. The Journal of chemical physics 2018, 148, 084105
  • Li 2022 Li, W. The Journal of Chemical Physics 2022, 156, 054117
  • Li 2022 Li, W. bioRxiv 2022, doi.org/10.1101/2022.02.23.481712
  • Boresch et al. 1994 Boresch, S.; Archontis, G.; Karplus, M. Proteins: Structure, Function, and Bioinformatics 1994, 20, 25–33
  • Brady and Sharp 1995 Brady, G. P.; Sharp, K. A. Journal of molecular biology 1995, 254, 77–85
  • Srinivasan et al. 1998 Srinivasan, J.; Cheatham, T. E.; Cieplak, P.; Kollman, P. A.; Case, D. A. Journal of the American Chemical Society 1998, 120, 9401–9409
  • Gohlke and Case 2004 Gohlke, H.; Case, D. A. Journal of computational chemistry 2004, 25, 238–250
  • Smith and van Gunsteren 1994 Smith, P. E.; van Gunsteren, W. F. The Journal of Physical Chemistry 1994, 98, 13735–13740
  • Li 2020 Li, W. Journal of Chemical Theory and Computation 2020, 16, 1834–1842
  • Weinan and Vanden-Eijnden 2010 Weinan, E.; Vanden-Eijnden, E. Annual review of physical chemistry 2010, 61, 391–420
  • Johnson and Hummer 2012 Johnson, M. E.; Hummer, G. The Journal of Physical Chemistry B 2012, 116, 8573–8583
  • Zwanzig 1973 Zwanzig, R. Journal of Statistical Physics 1973, 9, 215–220
  • Darve 2007 Darve, E. Free Energy Calculations; Springer, 2007; pp 119–170