
Gentlest Ascent Dynamics on Manifolds Defined by Adaptively Sampled Point-Clouds

Juan M. Bello-Rivas, Anastasia Georgiou, Hannes Vandecasteele, and Ioannis G. Kevrekidis
Abstract

Finding saddle points of dynamical systems is an important problem in practical applications such as the study of rare events in molecular systems. Gentlest ascent dynamics (GAD) 1 is one of several existing algorithms that attempt to find saddle points of dynamical systems. It works by deriving a new dynamical system in which the saddle points of the original system become stable equilibria. GAD has recently been generalized to the study of dynamical systems on manifolds (differential algebraic equations) described by equality constraints 2 and given in an extrinsic formulation. In this paper, we present an extension of GAD to manifolds defined by point-clouds, formulated using the intrinsic viewpoint. These point-clouds are adaptively sampled during an iterative process that drives the system from the initial conformation (typically in the neighborhood of a stable equilibrium) to a saddle point. Our method requires only the reactant (the initial conformation), does not require explicit constraint equations, and is purely data-driven.

Department of Chemical and Biomolecular Engineering, Whiting School of Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA
Department of Computer Science, KU Leuven, Celestijnenlaan 200A, 3001 Leuven, Belgium
Department of Applied Mathematics and Statistics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA


1 Introduction

The problem of finding saddle points of dynamical systems has one of its most notable applications in the search for transition states of chemical systems described at the atomistic level, since saddle points coincide with transition states 3 in the zero-temperature limit. While finding local stable equilibria (sinks) is a relatively straightforward matter, finding saddle points is a more complicated endeavor for which a number of algorithms have been proposed in the literature 4.

Saddle point search methods can be classified according to whether they require a single input state (usually the reactant, located at the minimum of a free energy well) or two states (reactant and product). Gentlest Ascent Dynamics (GAD) 1 belongs to the class of methods requiring a single reactant state as input, and its applicability has been demonstrated on atomistic chemical systems 5, 6. GAD can be regarded as a variant of the dimer method, formulated as a continuous dynamical system whose integral curves, with initial condition at the reactant state, can lead to saddle points. Variants of GAD such as high-index saddle dynamics (HiSD) have been the subject of recent research efforts and applications 7, 8, 9, 2, 10, 11, 12, 13.

While many search schemes attempt to find an optimal path between reactant and product (or between reactant state and transition state), it is interesting that there exist continuous curves joining the desired states in a variety of ways: following isoclines 14, 15, gradient extremals 16, 17, and the GAD studied in this paper. In most cases, the study of these curves has been carried out in the Euclidean setting, with some exceptions on the manifold of internal coordinates 18, 6 and on manifolds defined by the zeros of smooth maps 2 (in these cases, the algorithms are formulated extrinsically in the ambient space). Our contribution is formulated intrinsically and is valid on arbitrary manifolds, not necessarily defined explicitly by an atlas or by the zeros of maps. Importantly, algorithms like the one presented here or in our previous work 19 do not rely on a priori knowledge of good collective coordinates, but rather use manifold learning to find them on the fly. In our method, a feedback loop of data collection drives progress towards a saddle point.

In this paper, we study an application of GAD to manifolds defined by point-clouds. The manifold does not need to be characterized in advance either by the zeros of a smooth function or by an atlas; it is only assumed that the user is capable of sampling the vicinity of arbitrary points on the manifold (e.g., by umbrella sampling based on reduced local coordinates). The algorithm uses dimensionality reduction (namely, diffusion map coordinates 20) to define a dynamical system intrinsically on the reduced coordinates that can lead to a saddle point. Since the saddle point is, in general, not expected to lie in the vicinity of the reactant, our algorithm works by iteratively sampling the manifold on the fly, resolving the path on the local chart, and repeatedly switching charts until convergence. Our approach shares algorithmic elements with our previous work 19 which, however, followed isoclines on manifolds instead of GAD dynamics.

2 Gentlest Ascent Dynamics and Idealized Saddle Dynamics

Gentlest Ascent Dynamics (GAD) 1 is an algorithm for finding saddle points of dynamical systems. We propose an extension of GAD to manifolds defined by point-clouds that finds saddle points by combining nonlinear dimensionality reduction and adaptive sampling.

Let $U\colon\mathbb{R}^{n}\to\mathbb{R}$ be a smooth potential energy function and consider the associated gradient vector field $X$ in $\mathbb{R}^{n}$ given by $\dot{x}=X(x)$, where $X=-DU$ and $D$ denotes the gradient (or, equivalently, the Jacobian matrix). We restrict ourselves here, for the sake of simplicity, to the case of gradient systems and index-1 saddle points. The GAD algorithm consists of integrating the equations of motion of the related dynamical system $\hat{X}$ on an extended phase space $\mathbb{R}^{2n}$ given by

$\left\{\begin{aligned} &\dot{x}=-H(v)\,DU(x),\\ &\dot{v}=-D^{2}U(x)\,v+r(x,v)\,v,\end{aligned}\right.$  (1)

where $x,v\in\mathbb{R}^{n}$, $H(v)w=w-2\,(v\cdot w)\,v$ is the Householder reflection 21 of $w\in\mathbb{R}^{n}$ across the hyperplane $\langle v\rangle^{\perp}=\{z\in\mathbb{R}^{n}\mid v\cdot z=0\}$, and $r(x,v)=\|v\|^{-2}\,v\cdot D^{2}U(x)\,v$ is the Rayleigh quotient of the Hessian matrix $D^{2}U(x)$ at the vector $v\in\mathbb{R}^{n}$.

Remark 1.

The right-hand side of the ordinary differential equation for $\dot{v}$ is the term

$-D^{2}U(x)\,v+r(x,v)\,v.$

If $v$ is an eigenvector of $D^{2}U(x)$ with eigenvalue $\lambda$, then $r(x,v)=\lambda$. Now note that the constrained optimization problem consisting of finding the extrema of $Z(x)=\frac{1}{2}\|-DU(x)\|^{2}$ along the level sets $C=\{x\in\mathbb{R}^{n}\mid U(x)=c\}$ is precisely given by the Lagrange equation

$D^{2}U(x)\,DU(x)-\lambda\,DU(x)=0,$

which says that the extrema of the magnitude of the gradient, $Z$, along $C$ are attained wherever the gradient field $X=-DU$ happens to be an eigenvector of the Hessian $D^{2}U$.

The intuition behind GAD is that the force $-DU$ can be written in a basis of eigenvectors of the Hessian $D^{2}U$, which gives us the stable and unstable directions in the vicinity of an index-1 saddle point. The second differential equation in (1) acts as a (continuous) eigensolver yielding the unstable direction. We can then flip the sign of the component of the force along the unstable direction so that the resulting vector points towards the saddle point (see Figure 1).

Figure 1: Vector field corresponding to the first equation of GAD for the potential $U(x^{1},x^{2})=(x^{1})^{2}-(x^{2})^{2}$ with $v=(0,1)\in\mathbb{R}^{2}$ being the unstable direction. Note how the steepest descent direction is reflected across the orthogonal complement $\langle v\rangle^{\perp}$ to obtain $\dot{x}$.
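To make the preceding construction concrete, here is a minimal JAX sketch (our own illustration, not the code released with this paper) that integrates system (1) for the potential of Figure 1 using automatic differentiation; the initial condition, step size, and iteration count are arbitrary choices.

```python
import jax
import jax.numpy as jnp

def U(x):
    return x[0] ** 2 - x[1] ** 2  # potential of Figure 1

grad_U = jax.grad(U)
hess_U = jax.hessian(U)

def gad_rhs(z):
    # z packs the extended state (x, v) of equation (1).
    x, v = z[:2], z[2:]
    v = v / jnp.linalg.norm(v)               # keep v on the unit sphere
    g, H = grad_U(x), hess_U(x)
    xdot = -(g - 2.0 * jnp.dot(v, g) * v)    # -H(v) DU(x): reflected descent direction
    r = jnp.dot(v, H @ v)                    # Rayleigh quotient (||v|| = 1)
    vdot = -H @ v + r * v                    # continuous eigensolver for the unstable direction
    return jnp.concatenate([xdot, vdot])

# Explicit Euler iteration starting near the saddle at the origin.
z = jnp.array([0.5, 0.3, 0.0, 1.0])
for _ in range(1000):
    z = z + 1e-2 * gad_rhs(z)
print(z[:2])  # approaches the saddle point (0, 0)
```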

2.1 Idealized saddle-point dynamics

We consider a variant of the GAD algorithm named Idealized Saddle Dynamics (ISD) 22, given by

$\dot{x}=-H(v)\,DU(x),$

where $v\in\mathbb{R}^{n}$ is an eigenvector of $D^{2}U(x)$ corresponding to the smallest eigenvalue $\lambda$.
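In the Euclidean setting, ISD can be realized directly by diagonalizing the Hessian at every step; the following short sketch (ours, for illustration) computes the right-hand side:

```python
import jax
import jax.numpy as jnp

def isd_rhs(U, x):
    g = jax.grad(U)(x)
    H = jax.hessian(U)(x)
    w, Q = jnp.linalg.eigh(H)                # eigenvalues in ascending order
    v = Q[:, 0]                              # unit eigenvector of the smallest eigenvalue
    return -(g - 2.0 * jnp.dot(v, g) * v)    # -H(v) DU(x)
```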

We now leave the Euclidean setting behind and study the Riemannian setting (see the appendix for a summary of the topic and the notation). Let $M$ be a $d$-dimensional smooth manifold with Riemannian metric $g$ and let $U\in C^{\infty}(M)$ be a potential energy function. We consider the gradient field $X=-\operatorname{grad}U\in\mathfrak{X}(M)$ and write the following equation for the ISD vector field on $M$,

$\hat{X}=X-2\,g(V,X)\,V\in\mathfrak{X}(M),$  (2)

where $V$ is the vector field on $M$ defined by choosing the eigenvector (normalized so that $g(V,V)=1$) corresponding to the smallest eigenvalue of the Hessian. In this case, however, the Hessian must be defined in terms of the covariant derivative on $(M,g)$ with respect to the Levi-Civita connection 23, 24. To be precise, given two vector fields $S,T\in\mathfrak{X}(M)$, we have

$\nabla^{2}U(S,T)=\nabla_{T}(\nabla_{S}U)-\nabla_{\nabla_{T}S}U,$

which is a tensor of type $(0,2)$, where $\nabla$ denotes the covariant derivative induced by the Levi-Civita connection (the notation $\nabla_{T}S$ represents the covariant derivative of $S$ in the direction of $T$). We apply the sharp ($\sharp$) isomorphism (raising an index) to turn $\nabla^{2}U$ into a $(1,1)$-tensor. Therefore,

$(\operatorname{Hess}U)\,T=\nabla_{T}\operatorname{grad}U.$

At each point $p\in M$, the eigenvector of $\operatorname{Hess}U$ corresponding to the lowest eigenvalue induces a vector field $V$, and the integral curves of the resulting ISD field (2) may join an initial point with a saddle point.

The ISD formulation of GAD is particularly amenable to coupling with dimensionality reduction approaches because the resulting eigenproblem is often of much lower dimensionality than the ambient space in which the original dynamical system is defined.

Example 1 (Exact solution of a model system).

Let us compute a concrete case of ISD, first with known exact formulas and later on with approximations using diffusion maps and Gaussian processes. Consider the sphere

$\mathbb{S}^{2}=\{(x^{1},x^{2},x^{3})\in\mathbb{R}^{3}\mid(x^{1})^{2}+(x^{2})^{2}+(x^{3})^{2}=1\}$

and the stereographic projection from the North pole onto the tangent plane at the South pole. The system of coordinates is given by

$\phi(x^{1},x^{2},x^{3})=\left(\frac{x^{1}}{1-x^{3}},\frac{x^{2}}{1-x^{3}}\right)\in\mathbb{R}^{2}.$

Let $(u^{1},u^{2})\in\mathbb{R}^{2}$. The corresponding parameterization, $\psi=\phi^{-1}$, is given by

$\psi(u^{1},u^{2})=\frac{1}{1+(u^{1})^{2}+(u^{2})^{2}}\left(2u^{1},\,2u^{2},\,(u^{1})^{2}+(u^{2})^{2}-1\right).$

The pullback of the Euclidean metric $h=\mathrm{d}x^{1}\otimes\mathrm{d}x^{1}+\mathrm{d}x^{2}\otimes\mathrm{d}x^{2}+\mathrm{d}x^{3}\otimes\mathrm{d}x^{3}$ by $\psi$ gives us the metric $g$,

$g=\psi^{\star}h=\frac{4\,\mathrm{d}u^{1}\otimes\mathrm{d}u^{1}+4\,\mathrm{d}u^{2}\otimes\mathrm{d}u^{2}}{\left(1+(u^{1})^{2}+(u^{2})^{2}\right)^{2}}.$
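The pullback metric can also be obtained numerically from $\psi$ alone. The sketch below (an illustration, assuming $\psi$ is available as code) computes $g=D\psi^{\top}D\psi$ with JAX and checks it against the closed form above:

```python
import jax
import jax.numpy as jnp

def psi(u):
    s = 1.0 + u[0] ** 2 + u[1] ** 2
    return jnp.array([2 * u[0], 2 * u[1], u[0] ** 2 + u[1] ** 2 - 1]) / s

def metric(u):
    J = jax.jacfwd(psi)(u)   # 3x2 Jacobian of the parameterization
    return J.T @ J           # g = (Dpsi)^T Dpsi

u = jnp.array([0.3, -0.7])
print(metric(u))
print(4.0 / (1.0 + u[0] ** 2 + u[1] ** 2) ** 2 * jnp.eye(2))  # closed-form metric
```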

The non-redundant Christoffel symbols $\Gamma_{ij}^{k}$ that characterize the Levi-Civita connection $\nabla$ are

$\Gamma_{1,1}^{1}=\frac{-2u^{1}}{1+(u^{1})^{2}+(u^{2})^{2}},\qquad\Gamma_{1,2}^{1}=\frac{-2u^{2}}{1+(u^{1})^{2}+(u^{2})^{2}},$
$\Gamma_{2,2}^{1}=-\Gamma_{1,1}^{1},\quad\Gamma_{1,1}^{2}=-\Gamma_{1,2}^{1},\quad\Gamma_{1,2}^{2}=\Gamma_{1,1}^{1},\quad\text{and}\quad\Gamma_{2,2}^{2}=\Gamma_{1,2}^{1}.$

The energy $E(x^{1},x^{2},x^{3})=x^{1}x^{2}x^{3}$, constrained to $\mathbb{S}^{2}\subset\mathbb{R}^{3}$, is transformed to $U=\psi^{\star}E$ in the local coordinates,

$U(u^{1},u^{2})=\frac{4u^{1}u^{2}\left((u^{1})^{2}+(u^{2})^{2}-1\right)}{\left((u^{1})^{2}+(u^{2})^{2}+1\right)^{3}}.$

The force on $\mathbb{S}^{2}$ is the negative of the gradient,

$\operatorname{grad}U=\left(\frac{(u^{2})^{5}-2(u^{1})^{2}(u^{2})^{3}+\left(-3(u^{1})^{4}+8(u^{1})^{2}-1\right)u^{2}}{(u^{2})^{4}+\left(2(u^{1})^{2}+2\right)(u^{2})^{2}+(u^{1})^{4}+2(u^{1})^{2}+1}\right)\frac{\partial}{\partial u^{1}}+\left(-\frac{3u^{1}(u^{2})^{4}+\left(2(u^{1})^{3}-8u^{1}\right)(u^{2})^{2}-(u^{1})^{5}+u^{1}}{(u^{2})^{4}+\left(2(u^{1})^{2}+2\right)(u^{2})^{2}+(u^{1})^{4}+2(u^{1})^{2}+1}\right)\frac{\partial}{\partial u^{2}}.$

The potential energy and the gradient field are shown in Figure 2(a).

(a) Vector field corresponding to the potential energy function considered in the text.
(b) Idealized saddle dynamics vector field corresponding to the original vector field. Notice how saddle points of the original vector field become sinks (stable equilibria) of the ISD vector field.
Figure 2: Vector fields on the sphere, seen in the stereographic projection from the North pole onto the tangent plane at the South pole. The equilibria of the original vector field are represented as $\bullet$ for sinks, $\blacksquare$ for saddle points, and $\blacktriangle$ for sources.

The Hessian is then given by the $(1,1)$-tensor

$\nabla\operatorname{grad}U=A\,\frac{\partial}{\partial u^{1}}\otimes\mathrm{d}u^{1}+B\,\frac{\partial}{\partial u^{2}}\otimes\mathrm{d}u^{1}+B\,\frac{\partial}{\partial u^{1}}\otimes\mathrm{d}u^{2}+D\,\frac{\partial}{\partial u^{2}}\otimes\mathrm{d}u^{2},$

where

$A=-\frac{4u^{1}u^{2}\left((u^{2})^{4}+(u^{2})^{2}-(u^{1})^{4}+11(u^{1})^{2}-6\right)}{\left((u^{2})^{2}+(u^{1})^{2}+1\right)^{3}},$
$B=-\frac{(u^{2})^{6}-5(u^{1})^{2}(u^{2})^{4}-5(u^{2})^{4}-5(u^{1})^{4}(u^{2})^{2}+30(u^{1})^{2}(u^{2})^{2}-5(u^{2})^{2}+(u^{1})^{6}-5(u^{1})^{4}-5(u^{1})^{2}+1}{\left((u^{2})^{2}+(u^{1})^{2}+1\right)^{3}},$
$D=\frac{4u^{1}u^{2}\left((u^{2})^{4}-11(u^{2})^{2}-(u^{1})^{4}-(u^{1})^{2}+6\right)}{\left((u^{2})^{2}+(u^{1})^{2}+1\right)^{3}}.$

The eigenvector of $\nabla\operatorname{grad}U$ corresponding to the smallest eigenvalue determines the vector field $V\in\mathfrak{X}(\mathbb{S}^{2})$ that is used in the formulation of the ISD vector field (2), shown in Figure 2(b).

Example 1 required knowledge of a particular system of coordinates (namely, the stereographic projection from the North pole onto the tangent plane at the South pole) mapping three-dimensional points on the sphere to two-dimensional coordinates; here this allows us to work with a single chart. In some settings, the system of coordinates of the underlying manifold is either unknown or difficult to obtain. Under those circumstances, it is possible to replicate the steps of Example 1 without a closed-form system of coordinates by resorting to manifold learning / dimensionality reduction techniques. In our case, as we discuss next, we use diffusion maps on points sampled from a local neighborhood of the manifold to extract a suitable system of coordinates. Fitting a Gaussian process to the diffusion map coordinates of the point-cloud yields a local system of coordinates $\phi$ that can be evaluated at arbitrary points (not necessarily those in the sampled point-cloud). Once we have a system of coordinates, we can again estimate the Riemannian metric as well as the Levi-Civita connection, and compute the flow of the ISD vector field $\hat{X}$ given in (2) to find saddle points.

Revisiting Example 1 and approaching it with the aforementioned procedure allows us to obtain trajectories that lead to saddle points, as shown in Figures 3 to 5.

The choice of diffusion map coordinates for dimensionality reduction and Gaussian processes for nonlinear regression is inessential, and both methods could be replaced by, for instance, neural networks (e.g., a variational autoencoder and a graph neural network, respectively). An important aspect of our approach is that it operates on local neighborhoods of points, as opposed to collecting data from long-time simulations and only then applying dimensionality reduction techniques to extract global collective variables. From a geometric viewpoint, this is motivated by the fact that a sufficiently small neighborhood of a point on a manifold can always be transformed onto a subset of its tangent space by a smooth invertible map. From a physical viewpoint, different sets of collective variables govern different stages of a reaction (e.g., the distance between a ligand and a receptor is important when the two molecules are far apart, whereas their relative orientation may matter more for binding once they come close).

One interesting aspect of diffusion maps is their relation to the infinitesimal generator of a diffusion process 25, which in turn has connections to the committor function, an optimal reaction coordinate 26, 27, 28. We conclude this discussion on collective variables by pointing out that a priori knowledge of good reaction coordinates (some recent examples can be found in 29, 30) can often be put in one-to-one correspondence with diffusion map coordinates 31, 32.

2.2 Mean force

In computational statistical mechanics, we are often interested in dynamics on Riemannian manifolds endowed with a probability distribution. For instance, for simulations at constant temperature, we use the Boltzmann distribution with probability density proportional to $\exp\{-\beta U(x)\}$, where $\beta>0$ is the inverse temperature. The relevant vector field at $u$ in the local chart is the mean force 33, 34, 35, given by

$X_{u}=-\left\langle\operatorname{grad}\left(U\circ\psi-\tfrac{1}{2}\beta^{-1}\log\det(D\psi^{\top}D\psi)\right)\right\rangle_{u},$

where $\langle\cdot\rangle_{u}$ denotes the ensemble average with respect to the Boltzmann distribution, conditioned on $\{x\in\mathbb{R}^{n}\mid\phi(x)=u\}$. A number of numerical methods estimate the mean force as a step in their calculations; adaptive biasing force 36 and the string method 37 are examples.

3 Algorithm

Consider the manifold $M\subset\mathbb{R}^{n}$ and the gradient dynamical system $X\in\mathfrak{X}(M)$. We begin by drawing a total of $N$ samples from the manifold $M$ in the neighborhood of an initial point $p\in M$. This can be done in a variety of ways depending on the application. If $M$ is the inertial manifold of a dynamical system $X$, then a reasonable way to approach this problem is to generate $N$ distinct perturbations $\{p_{(i)}\in\mathbb{R}^{n}\mid i=1,\dotsc,N\}$ of $p$ and propagate them according to the flow of $X$ over a (short) time horizon $\tau>0$. Doing so, we obtain a data set $\mathscr{D}=\{q_{(i)}=\exp_{\tau}p_{(i)}\in M\mid i=1,\dotsc,N\}$ approximately on the manifold. Alternatively, one may numerically solve a stochastic differential equation such as the Brownian dynamics equation,

$\mathrm{d}q_{t}=X(q_{t})\,\mathrm{d}t+\sigma\,\mathrm{d}B_{t},$  (3)

where the drift is the vector field $X$, $\sigma>0$ is a constant, and $B_{t}$ is a standard $n$-dimensional Brownian motion. Solving (3) (possibly with an added RMSD-based restraint around the initial conformation) up to a certain time $\tau>0$ and extracting an uncorrelated subset of the states at different time steps yields a data set $\mathscr{D}=\{q_{(i)}=q_{t_{i}}\mid i=1,\dotsc,N,\;0\leq t_{1}\leq\cdots\leq t_{N}\leq\tau\}$.
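As an illustration of this sampling step, the sketch below (a simplified stand-in; sigma, dt, and the thinning interval are arbitrary hyperparameters) integrates equation (3) with the Euler-Maruyama scheme and thins the resulting trajectory to reduce correlations:

```python
import jax
import jax.numpy as jnp

def sample_cloud(key, drift, p, n_samples, sigma=0.1, dt=1e-3, n_thin=100):
    """Euler-Maruyama integration of dq = X(q) dt + sigma dB, keeping
    one state every n_thin steps."""
    q, cloud = p, []
    for i in range(n_samples * n_thin):
        key, sub = jax.random.split(key)
        q = q + drift(q) * dt + sigma * jnp.sqrt(dt) * jax.random.normal(sub, q.shape)
        if (i + 1) % n_thin == 0:
            cloud.append(q)
    return jnp.stack(cloud)
```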

Next, we apply a dimensionality reduction algorithm to the data set $\mathscr{D}$ to obtain a set of reduced coordinates $\phi$. In our case, we use diffusion maps 20, 25 to obtain a set of vectors $\phi_{(i)}\in\mathbb{R}^{d}$ with $d\leq n$, but other methods, such as local tangent space alignment 38, may be used as well. It is important to note that our dimensionality reduction method is applied to a local neighborhood of an initial point and is therefore expected to yield a reasonable approximation to a chart on that neighborhood.
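A bare-bones version of the diffusion maps computation might look as follows; this sketch omits the density normalization often used in practice and simply keeps the leading nontrivial eigenvectors of the Markov matrix built from a Gaussian kernel:

```python
import jax.numpy as jnp

def diffusion_maps(Q, d, epsilon):
    D2 = jnp.sum((Q[:, None, :] - Q[None, :, :]) ** 2, axis=-1)
    K = jnp.exp(-D2 / (2.0 * epsilon))          # Gaussian kernel matrix
    P = K / jnp.sum(K, axis=1, keepdims=True)   # row-stochastic Markov matrix
    w, V = jnp.linalg.eig(P)
    order = jnp.argsort(-w.real)                # sort eigenvalues in decreasing order
    return V.real[:, order[1:d + 1]]            # skip the trivial constant eigenvector
```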

We fit a Gaussian process regressor $\phi$ to the pairs of points $(q_{(i)},\phi_{(i)})\in\mathbb{R}^{n}\times\mathbb{R}^{d}$ to obtain a smooth map $\phi\colon M\subset\mathbb{R}^{n}\to\mathbb{R}^{d}$ that acts as a system of coordinates (in particular, $\phi(q_{(i)})\approx\phi_{(i)}$). Proceeding in an analogous fashion, we compute the inverse mapping $\psi=\phi^{-1}$.
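For illustration, a minimal Gaussian process interpolant with a squared exponential kernel (a simplified stand-in for the regressors $\phi$ and $\psi$; the regularization constant is an arbitrary choice) can be written as:

```python
import jax.numpy as jnp

def fit_gp(X, Y, epsilon, reg=1e-8):
    """Return a smooth map x -> y interpolating the pairs (X[i], Y[i])."""
    def k(A, B):
        D2 = jnp.sum((A[:, None, :] - B[None, :, :]) ** 2, axis=-1)
        return jnp.exp(-D2 / (2.0 * epsilon))
    alpha = jnp.linalg.solve(k(X, X) + reg * jnp.eye(len(X)), Y)
    return lambda x: k(x[None, :], X)[0] @ alpha
```

Because the returned map consists of jnp operations, it can be differentiated with JAX, which is what the geometric computations below require.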

Note that one possible way of estimating the dimension $d$ is to compute the average approximate rank of the Jacobian matrix of $\phi$ at (a subset of) the data points and retain the components that yield a local chart.

Remark 2.

We can reduce the computational expense of the Gaussian process regression by reusing the kernel matrix with entries $\mathrm{e}^{-\|q_{(i)}-q_{(j)}\|^{2}/2\varepsilon}$ (for some $\varepsilon>0$), calculated during the computation of the diffusion map coordinates, as the covariance matrix for the Gaussian process (assuming that the latter is formulated using the squared exponential kernel).

Remark 3.

It is not always possible to obtain a suitable Gaussian process regressor for $\psi=\phi^{-1}$. An alternative is to add an Ornstein-Uhlenbeck restraining term to the stochastic differential equation (3) in order to obtain

$\mathrm{d}q_{t}=\left(X(q_{t})-\kappa\,(\phi(q_{t})-\phi_{0})\right)\mathrm{d}t+\sigma\,\mathrm{d}B_{t},$

where $\kappa>0$ is a hyperparameter and $\phi_{0}$ is a prescribed point not necessarily in $\phi(\mathscr{D})$. Computing the ensemble average $\langle q_{t}\rangle$ of the solution of the above equation yields a point $q_{(0)}$ such that $\phi(q_{(0)})\approx\phi_{0}$ or, equivalently, $q_{(0)}=\phi^{-1}(\phi_{0})$. This works because the new term added to the drift nudges the system towards a point in ambient space whose image under $\phi$ is the prescribed point $\phi_{0}$.

We consider the values $X_{q_{(i)}}\in TM$ of the vector field $X$ at the points of the data set $\mathscr{D}$ and map them via the system of coordinates to obtain the vector field $\phi_{\star}X_{q_{(i)}}=D\phi(q_{(i)})\,X_{q_{(i)}}$ in the new coordinates, where $D\phi$ is the Jacobian matrix of $\phi$ (note that the Jacobian-vector product can be computed either from a closed-form formula or efficiently using automatic differentiation; e.g., using jvp in JAX 39). We fit another Gaussian process to the pushforward vector field $\phi_{\star}X$ so that it can be evaluated at arbitrary points in the new coordinates, and we abuse notation in what follows by also referring to $\phi_{\star}X$ as $X$.
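Concretely, the Jacobian-vector product can be evaluated without ever forming $D\phi$; a one-function sketch:

```python
import jax

def pushforward(phi, q, Xq):
    # Computes D(phi)(q) @ Xq by forward-mode automatic differentiation.
    _, JXq = jax.jvp(phi, (q,), (Xq,))
    return JXq
```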

At this stage, we can readily compute the Riemannian metric gg as

$g=\sum_{i,j=1}^{d}g_{ij}\,\mathrm{d}\phi^{i}\otimes\mathrm{d}\phi^{j},$  (4)

where $g_{ij}$ are the entries of the matrix $D\psi^{\top}D\psi$ for $i,j=1,\dotsc,d$. The metric induces an inner product, denoted by the bracket $\langle\cdot,\cdot\rangle$, between tangent vectors: if $S=\sum_{i=1}^{d}S^{i}\frac{\partial}{\partial\phi^{i}}$ and $T=\sum_{i=1}^{d}T^{i}\frac{\partial}{\partial\phi^{i}}$ are the expressions in local coordinates of two tangent vectors $S,T\in T_{u}M$ at a point $u\in M$, then $\langle S,T\rangle=\sum_{i,j=1}^{d}g_{ij}S^{i}T^{j}$.

Using (4), we obtain the coefficients of the Levi-Civita connection 23,

$\Gamma^{\ell}_{jk}=\frac{1}{2}\sum_{i=1}^{d}g^{\ell i}\left(\frac{\partial g_{ij}}{\partial\phi^{k}}+\frac{\partial g_{ik}}{\partial\phi^{j}}-\frac{\partial g_{jk}}{\partial\phi^{i}}\right)$

for $j,k,\ell\in\{1,\dotsc,d\}$, where $g^{ij}$ denotes the entries of the inverse of the matrix with components $g_{ij}$. This, in turn, allows us to compute the covariant derivative and the Hessian, defined by

$\operatorname{Hess}U(S,T)=\langle\nabla_{T}\operatorname{grad}U,S\rangle$

for arbitrary tangent vectors $S,T$. Observe that the Hessian is defined on the local chart, not on the ambient space. The eigenvector $V$ of the Hessian corresponding to the smallest eigenvalue, normalized with respect to $g$, then yields a vector field

$\hat{X}=X-2\,g(V,X)\,V$

such that the index-1 saddle points of $X$ become stable equilibria of $\hat{X}$ 1. Consequently, an integral curve of $\hat{X}$ starting in the vicinity of a saddle point leads to the saddle point.
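Putting the pieces of this section together, the following sketch (a simplified stand-in for the full pipeline, with psi and U denoting the fitted interpolants expressed in chart coordinates) computes the metric, the Christoffel symbols, the covariant Hessian, the eigenvector $V$, and the ISD field (2):

```python
import jax
import jax.numpy as jnp

def metric(psi, u):
    J = jax.jacfwd(psi)(u)
    return J.T @ J                                # g_ij = (Dpsi)^T Dpsi

def christoffel(psi, u):
    dg = jax.jacfwd(lambda v: metric(psi, v))(u)  # dg[i, j, k] = d g_ij / d u^k
    sym = (dg + jnp.transpose(dg, (0, 2, 1))      # d_k g_ij + d_j g_ik ...
           - jnp.transpose(dg, (2, 0, 1)))        # ... - d_i g_jk
    return 0.5 * jnp.einsum("li,ijk->ljk", jnp.linalg.inv(metric(psi, u)), sym)

def isd_field(psi, U, u):
    g = metric(psi, u)
    dU = jax.grad(U)(u)
    # Covariant Hessian (0,2): Hess_jk = d_j d_k U - Gamma^i_jk d_i U.
    hess = jax.hessian(U)(u) - jnp.einsum("ijk,i->jk", christoffel(psi, u), dU)
    # Solve the generalized eigenproblem Hess v = lambda g v via Cholesky.
    L = jnp.linalg.cholesky(g)
    A = jnp.linalg.solve(L, jnp.linalg.solve(L, hess).T).T
    w, Y = jnp.linalg.eigh(A)
    V = jnp.linalg.solve(L.T, Y[:, 0])            # smallest eigenvalue, g(V, V) = 1
    X = -jnp.linalg.solve(g, dU)                  # gradient field in the chart
    return X - 2.0 * (V @ g @ X) * V              # equation (2)
```

In the actual algorithm, $X$ would be the Gaussian process fitted to the pushforward vector field rather than the chart gradient used here for brevity.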

In general, in order to carry out the computation until convergence, we must frequently switch charts. This is because, at each step, we sample a point-cloud $\mathscr{D}$ in a small neighborhood of a given point, and the Gaussian process regressor $\phi$ yields an approximation to $X$ on the chart that cannot extrapolate far away from the sampled points. Therefore, when we reach the confines of $\mathscr{D}$ (which can be determined from the density of points), we map the latest point of our integral curve of $\hat{X}$ from the chart back to the ambient space using $\psi=\phi^{-1}$, as discussed earlier. After that, we start the whole procedure again: sample a new data set $\mathscr{D}$, compute $\phi$, trace an integral curve of $\hat{X}$, and so on.

Remark 4.

An alternative approach to the one presented here could consist of exploiting the fact that GAD trajectories are geodesics 40 of a Finsler metric 41, 42 and numerically computing said geodesics in each learned local chart in a manner similar to what we propose.

The factor that impacts the algorithm the most is the deviation of the sampled points in the data set from the manifold $M$. In other words, the farther away from $M$ our data points are, the noisier the estimate of $\hat{X}$ becomes and, consequently, the harder it is to find the saddle points.

3.1 Numerical examples

The computations that follow were carried out using the JAX 39 and Diffrax 43 libraries, and the code to reproduce our results is available at https://github.com/jmbr/gentlest_ascent_dynamics_on_manifolds.

In our two examples below, we automatically set the bandwidth parameter (usually denoted by $\varepsilon$) in diffusion maps equal to the squared median of the pairwise distances between points. The regularization parameters in the Gaussian processes $\phi$ and $\psi$ are chosen so that their scores are sufficiently close to 1 on a test set.
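The bandwidth heuristic amounts to a few lines; a sketch:

```python
import jax.numpy as jnp

def median_bandwidth(Q):
    # Squared median of the pairwise distances between the points in Q.
    D = jnp.sqrt(jnp.sum((Q[:, None, :] - Q[None, :, :]) ** 2, axis=-1))
    return jnp.median(D[jnp.triu_indices(len(Q), k=1)]) ** 2
```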

3.1.1 Sphere

The preceding algorithm, applied to the vector field on the sphere $\mathbb{S}^{2}$ introduced in Example 1, produces the iterations shown in Figures 3, 4, and 5. These figures depict the integral curves (highlighted) in the local neighborhoods obtained by sampling and integrating $\hat{X}$. The full integral curve joining the initial point to a saddle point at the equator of the sphere is shown in Figure 6.

We sampled $10^{3}$ points per iteration and integrated the ISD vector field using an explicit Euler integrator for a total of $10^{3}$ steps with a time-step length of $10^{-4}$. The algorithm converges to a saddle point in ten iterations.

Figure 3: Iteration #1 of ISD on the sphere by sampling point-clouds and using diffusion maps. The points and the vector field are drawn from the neighborhood of a stable equilibrium. The solid curve represents the GAD/ISD path.
Figure 4: Iteration #5 of ISD on the sphere by sampling point-clouds and using diffusion maps.
Figure 5: Iteration #10 of ISD on the sphere by sampling point-clouds and using diffusion maps. The algorithm has reached a saddle point of the original vector field (notice how it becomes a sink of the ISD vector field).
Figure 6: Resulting trajectory (in orange) of GAD/ISD on the sphere (color indicates the value of the potential energy function) using diffusion maps to navigate from the sink (bottom right) to the saddle (center). The insets depict the corresponding vector fields in local coordinates at different iterations together with the GAD curve (orange). See also Figures 3, 4, and 5.

3.1.2 Regular surface

Our second example is the Müller-Brown (MB) potential 44 mapped onto a regular surface. Namely, the manifold $M$ is the graph of the function (see Figure 7)

$f(x^{1},x^{2})=\sum_{k^{1}=0}^{K}\sum_{k^{2}=0}^{K}a_{k^{1},k^{2}}\cos(k^{1}x^{1}+k^{2}x^{2}+b_{k^{1},k^{2}}),$

where $K=3$ and the coefficients are given in Table 1.

$k^{1}$  $k^{2}$  $a_{k^{1},k^{2}}$  $b_{k^{1},k^{2}}$
0 1 0.9490 0.8838
0 2 0.4575 0.6564
1 0 0.4152 0.7449
1 2 0.2911 0.3619
2 0 0.4121 0.5469
3 2 0.2817 0.4719
Table 1: Non-zero coefficients characterizing the regular surface.
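For reference, a short sketch evaluating the surface height $f$ directly from the nonzero coefficients of Table 1:

```python
import jax.numpy as jnp

COEFFS = [  # (k1, k2, a, b) from Table 1
    (0, 1, 0.9490, 0.8838), (0, 2, 0.4575, 0.6564),
    (1, 0, 0.4152, 0.7449), (1, 2, 0.2911, 0.3619),
    (2, 0, 0.4121, 0.5469), (3, 2, 0.2817, 0.4719),
]

def f(x1, x2):
    return sum(a * jnp.cos(k1 * x1 + k2 * x2 + b) for k1, k2, a, b in COEFFS)
```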

After eight iterations, sampling $5\times 10^{3}$ points and integrating for $10^{3}$ steps per iteration with a time-step length of $10^{-4}$, our algorithm successfully constructs a path joining the initial point, located at a minimum (sink) of the MB potential, to the nearest saddle point. The relative errors between the points in the constructed path and the known coordinates of the saddle point are shown in Figure 8.

Figure 7: Path (in pink) from the sink (right endpoint) to the saddle point (upper endpoint) obtained by eight iterations of GAD on the Müller-Brown potential (blue heat map) defined on a regular surface.
Figure 8: Relative error between each point $\gamma_{n}$ in the GAD path and the saddle point $x_{\star}$. Each iteration comprises 1000 points.

4 Conclusions

We have presented a formulation of GAD on manifolds defined by point-clouds that are meant to be sampled on-demand. Our formulation is intrinsic and does not require the specification of the manifolds either by a given atlas or by the zeros of a smooth map. The required charts are discovered through a data-driven, iterative process that only requires knowledge of an initial conformation of the reactant. We illustrated our approach with two simple examples and we expect the results to transfer to the high-dimensional dynamical systems of interest in computational statistical mechanics.

Acknowledgements

This work was supported by the US Air Force Office of Scientific Research (AFOSR) and the US Department of Energy DOE with IIT: SA22-0052-S001 and AFOSR-MURI: FA9550-21-1-0317.

Appendix A

In this section we review basic notions of Riemannian geometry relevant to this paper. Our aim is to emphasize intuition over formalism as much as possible. We refer the reader to general treatises on the topic such as 23, 45 for a deeper presentation and we especially recommend the relevant material in 46, 47 for the working physical chemist.

A.1 Tensor spaces

Let $V$ and $W$ be two vector spaces over the real numbers and denote by $\mathscr{V}(V\times W)$ the vector space generated by all finite linear combinations of elements of the Cartesian product $V\times W$. The tensor product $V\otimes W$ is a vector subspace of $\mathscr{V}(V\times W)$ with elements of the form $v\otimes w$, where $v\in V$ and $w\in W$, such that:

1. $(v_{1}+v_{2})\otimes w=v_{1}\otimes w+v_{2}\otimes w$, where $v_{1},v_{2}\in V$.
2. $v\otimes(w_{1}+w_{2})=v\otimes w_{1}+v\otimes w_{2}$, where $w_{1},w_{2}\in W$.
3. $(\alpha v)\otimes w=\alpha\,v\otimes w$, where $\alpha\in\mathbb{R}$.
4. $v\otimes(\alpha w)=\alpha\,v\otimes w$.

The above construction can be extended recursively to tensor spaces with an arbitrary number of factors (i.e., given vector spaces $V_{1},\dotsc,V_{n}$, with $n\in\mathbb{N}$, we construct $V_{1}\otimes V_{2}\otimes\dotsb\otimes V_{n}$ as $V_{1}\otimes(V_{2}\otimes\dotsb\otimes V_{n})$, and so on).

Given a vector space $V$, its dual space, denoted by $V^{\star}$, is the vector space formed by all linear maps $f\colon V\to\mathbb{R}$. A tensor space of type $(p,q)$ is a tensor product

$T_{q}^{p}(V)=\left(\bigotimes_{i=1}^{p}V\right)\otimes\left(\bigotimes_{i=1}^{q}V^{\star}\right).$  (5)

Elements of $T_{q}^{p}(V)$ are called tensors of type $(p,q)$.

Similarly to how all real vector spaces of a fixed dimension are linearly isomorphic to each other, all tensor spaces of type $(p,q)$ over the reals are linearly isomorphic to each other and, therefore, we can assume that the factors are arranged as in (5).

Example 2.

The set of linear maps from $V=\mathbb{R}^{n}$ to $W=\mathbb{R}^{m}$ is a vector space. If $\{e_{1},\dotsc,e_{n}\}$ is an orthonormal basis of $V$ and $\{f_{1},\dotsc,f_{m}\}$ is an orthonormal basis of $W$, then $\{e^{i}\colon V\to\mathbb{R}\mid i=1,\dotsc,n\}$ is the basis of the dual space $V^{\star}$ determined by

$e^{i}(e_{j})=\begin{cases}1,&\text{if }i=j,\\ 0,&\text{otherwise}.\end{cases}$

Consequently, the linear map $A\colon V\to W$ can be written as

$A=\sum_{i=1}^{n}\sum_{j=1}^{m}a_{i}^{j}\,f_{j}\otimes e^{i},$

where $a_{i}^{j}\in\mathbb{R}$. Additionally, if $\{e_{1},\dotsc,e_{n}\}$ and $\{f_{1},\dotsc,f_{m}\}$ are, respectively, the canonical bases of $V=\mathbb{R}^{n}$ and $W=\mathbb{R}^{m}$, then $f_{j}\otimes e^{i}\in W\otimes V^{\star}$ coincides with the outer product $f_{j}e_{i}^{\top}\in\mathbb{R}^{m\times n}$ and $A$ coincides with the $m\times n$ real matrix

$A=\begin{bmatrix}a_{1}^{1}&\cdots&a_{n}^{1}\\ \vdots&\ddots&\vdots\\ a_{1}^{m}&\cdots&a_{n}^{m}\end{bmatrix}.$

The wedge product is defined as

$v_{1}\wedge\dotsb\wedge v_{k}=\frac{1}{k!}\sum_{\sigma\in S_{k}}\operatorname{sgn}(\sigma)\,v_{\sigma^{-1}(1)}\otimes\dotsb\otimes v_{\sigma^{-1}(k)},$

where $S_{k}$ is the group of permutations of $\{1,\dotsc,k\}$ and $\operatorname{sgn}(\sigma)$ is equal to $-1$ if the permutation $\sigma$ is odd and $+1$ if it is even. This is equivalent to the projection of the tensor $v_{1}\otimes\dotsb\otimes v_{k}$ onto the subspace $\Lambda^{k}V$ of anti-symmetric tensors of $\bigotimes_{i=1}^{k}V$.

Remark 5.

The signature $\operatorname{sgn}(\sigma)$ coincides with the Levi-Civita symbol. The latter should not be confused with the Levi-Civita connection, to be introduced later.

A.2 Smooth manifolds and smooth curves

In what follows, we consider a smooth map to be a map that has sufficiently many derivatives and whose derivatives are continuous functions.

A chart is a tuple $(U,\phi)$ such that $U$ is an open set in a topological space and $\phi\colon U\to\mathbb{R}^{m}$ is a diffeomorphism onto its image (i.e., a smooth mapping with a smooth inverse $\phi^{-1}$). A smooth manifold $M$ is a topological space together with a family of charts (i.e., an atlas),

$\left\{(U_{\alpha},\phi_{\alpha})\mid\phi_{\alpha}\colon U_{\alpha}\subset M\to\mathbb{R}^{m}\right\},$

such that for every $p\in M$ there exists a chart $(U,\phi)$ with $p\in U$. Moreover, for any other chart $(V,\chi)$ such that $p\in V$, the function $\phi\circ\chi^{-1}$ is smooth (see Figure 9).

Figure 9: Smooth $2$-dimensional manifold (a torus) with two overlapping charts $U$ and $V$, their corresponding systems of coordinates $\phi(U)$ and $\chi(V)$ in $\mathbb{R}^{m}$, and the transition map $\phi\circ\chi^{-1}$.
Example 3.

The sets

$N=\left\{(x^{1},x^{2},x^{3})\in\mathbb{R}^{3}\mid x^{3}=f(x^{1},x^{2})=\sqrt{1-(x^{1})^{2}-(x^{2})^{2}}\right\}$

and

$\mathbb{S}^{2}=\left\{(x^{1},x^{2},x^{3})\in\mathbb{R}^{3}\mid F(x^{1},x^{2},x^{3})=0\right\},$

where $F(x^{1},x^{2},x^{3})=(x^{1})^{2}+(x^{2})^{2}+(x^{3})^{2}-1$, are both smooth manifolds. The former is the graph of the function $f$ and the latter is implicitly defined as the zero set of the equation of a 2-sphere (i.e., the points of $\mathbb{R}^{3}$ at which the function $F$ vanishes).

Differential Geometry is often explained following the intrinsic point of view, in which nothing outside of the manifold is considered. Nevertheless, for didactic purposes it is sometimes useful to think of a manifold as being embedded in a sufficiently high-dimensional Euclidean space. This can always be done due to the celebrated embedding theorem of Whitney 48, which guarantees that we can embed any smooth $m$-dimensional manifold $M$ in $\mathbb{R}^{2m}$.

A smooth curve on a manifold is a smooth mapping $\gamma\colon I\subset\mathbb{R}\to M$, where $I$ is an interval of $\mathbb{R}$.

A.3 Tangent spaces and tangent bundles

Again, let $M$ be a smooth $m$-dimensional manifold and let $p\in M$ be an arbitrary point on it. A tangent vector $v$ at $p$ is a map that assigns a real number to each smooth, real-valued function on $M$ such that

1. $v(f+g)=v(f)+v(g)$,
2. $v(\lambda f)=\lambda\,v(f)$,
3. $v(fg)=v(f)\,g+f\,v(g)$,

for any smooth $f,g\colon M\to\mathbb{R}$ and $\lambda\in\mathbb{R}$.

The set of tangent vectors at $p\in M$ is a vector space called the tangent space, denoted by $T_{p}M$. Indeed, $T_{p}M$ is a vector space because, if $v,w$ are tangent vectors at $p$ and $\lambda\in\mathbb{R}$, then the axioms

1. $(v+w)(f)=v(f)+w(f)$,
2. $(\lambda\,v)(f)=\lambda\,v(f)$,

are satisfied.

In a local chart, the vectors

$\frac{\partial}{\partial x^{1}}\bigg|_{p},\dotsc,\frac{\partial}{\partial x^{m}}\bigg|_{p}$

constitute a basis of $T_{p}M$.

Example 4.

Let $M=\mathbb{R}^{m}$. In this case, the directional derivative of $f\colon M\to\mathbb{R}$ at $p\in M$ in the direction $v=(v^{1},\dotsc,v^{m})\in\mathbb{R}^{m}$ is

$\nabla f(p)\cdot v=\sum_{i}v^{i}\,\frac{\partial f}{\partial x^{i}}(p).$

In our notation, we would instead say that $v=\sum_{i}v^{i}\,\frac{\partial}{\partial x^{i}}\big|_{p}$ is a tangent vector of $M$ at $p$.

We can view each tangent vector at $p$ as the velocity vector of a smooth curve $\gamma\colon[0,1]\subset\mathbb{R}\to M$ such that $\gamma(0)=p$. In that case,

$v(f)=\frac{\mathrm{d}}{\mathrm{d}t}f(\gamma(t))=\sum_{i}\dot{\gamma}^{i}(t)\,\frac{\partial f}{\partial x^{i}}(\gamma(t))=\left(\sum_{i}\underbrace{\dot{\gamma}^{i}(t)}_{=v^{i}}\,\frac{\partial}{\partial x^{i}}\bigg|_{\gamma(t)}\right)(f).$

The geometric interpretation of tangent vectors and tangent spaces is shown in Figure 10.

Figure 10: A tangent vector can be regarded intuitively as an "arrow" based at a point $p\in M$. We use tangent vectors as a tool to evaluate directional derivatives of real-valued functions defined on $M$. The tangent space $T_{p}M$ of a manifold at $p$ can be viewed as the span of all the tangent vectors at $p$.

The collection

$TM=\bigcup_{p\in M}T_{p}M$

of the tangent spaces at each point of $M$ is called the tangent bundle, and it is a smooth manifold in its own right.

Example 5 (The tangent bundle in classical mechanics).

If we denote the position of a mechanical system by $q\in M$, where $M$ is the configurational manifold of the system, and we consider an arbitrary trajectory $t\in[0,1]\mapsto\gamma(t)\in M$ starting at the point $\gamma(0)=q$ with initial velocity $\dot{\gamma}(0)=v$, then we see that the tangent bundle is the collection of all the positions and velocities of the mechanical system under consideration. Moreover, the Lagrangian is a real-valued function on the tangent bundle,

$L\colon TM\to\mathbb{R},\qquad (q,v)\mapsto L(q,v)=\tfrac{1}{2}\,v\cdot mv-U(q),$

where $m$ is the mass matrix of the system.

The momenta $p$ are obtained from the velocities $v$ by duality (via the Legendre transform 49). Because of that, the phase space of a mechanical system is a manifold that is closely related to the tangent bundle of the configurational manifold $M$. Indeed, the phase space is the co-tangent bundle $TM^{\star}$ of the configurational manifold $M$.

The tangent bundle of a manifold is the prototype of a more general construction called a vector bundle, which we introduce next (albeit not fully rigorously). Informally, a vector bundle is a collection of vector spaces, one for each point of $M$. Slightly more formally, we say that a vector bundle over $M$ is a tuple $(E,\pi,M)$ where:

1. The total space $E$ and the base space $M$ are smooth manifolds.
2. The projection $\pi\colon E\to M$ is a smooth map whose preimages (known as fibers) $E_{p}=\pi^{-1}(p)$ for $p\in M$ are vector spaces (there are additional conditions that must be satisfied, but they are beyond the scope of this appendix).

A trivial example of a vector bundle is shown in Figure 11.

Figure 11: An example of a vector bundle. The base space is the circle $M$ and the total space is the cylinder $E$ shown on top of $M$; $\pi$ projects $E$ onto $M$. The vertical lines on the cylinder represent the fibers (i.e., the one-dimensional vector spaces that are the preimages by $\pi$ of each point of $M$).

In the case of the tangent bundle, the fiber $E_{p}$ is the tangent space $T_{p}M$ of $M$ at $p$. More generally, we can take the fibers to be tensor spaces.

A.4 Vector fields and sections

A vector field is a correspondence assigning a tangent vector $v_{p}$ to each point $p\in M$. The set of all vector fields on a manifold $M$ is denoted by $\mathfrak{X}(M)$. An example of a vector field is shown in Figure 12.

Figure 12: A vector field on the 2-sphere $\mathbb{S}^{2}$. A vector field is a (smooth) correspondence between each point $p$ of the manifold $M$ (the base of a black arrow) and a vector of $T_{p}M$ (the black arrow itself).

In local coordinates,

$v_{p}=\sum_{i}v^{i}(p)\,\frac{\partial}{\partial x^{i}}\bigg|_{p}.$

From now on, we adopt the convention

$\partial_{i}=\frac{\partial}{\partial x^{i}}.$

To any smooth vector field $v\in\mathfrak{X}(M)$ and any $t\in\mathbb{R}$, we can associate the flow map $\Phi_{t}\colon M\to M$ that satisfies

$\frac{\mathrm{d}}{\mathrm{d}t}\Phi_{t}(p)=v_{\Phi_{t}(p)}.$

The smooth curve $t\mapsto\Phi_{t}(p)$ is also called an integral curve of $v$.

In local coordinates, if $v=\sum_{i}v^{i}\,\partial_{i}$, then $t\mapsto\Phi_{t}(p)=(\gamma^{1}(t),\dotsc,\gamma^{m}(t))$ solves the initial value problem

$\frac{\mathrm{d}}{\mathrm{d}t}\gamma^{i}(t)=v^{i}(\gamma(t)),\quad\text{for each }i=1,\dotsc,m,$

with the initial condition $\Phi_{0}(p)=\gamma(0)=p$. We often denote $\frac{\mathrm{d}\Phi}{\mathrm{d}t}$ by $\dot{\Phi}$.

Let $\pi\colon E\to M$ be a vector bundle. A section is a smooth map

$s\colon M\to E,\qquad p\mapsto s(p)\in E_{p},$

such that the composition of $s$ followed by the projection $\pi$ is the identity on $M$ (or, equivalently, $\pi\circ s=1_{M}$); that is, the composition $M\xrightarrow{\;s\;}E\xrightarrow{\;\pi\;}M$ is the identity.

The set of sections of a vector bundle $E$, denoted by $\Gamma(E)$, is a vector space with the operations of addition,

$(s+s^{\prime})(p)=s(p)+s^{\prime}(p),$

and scalar multiplication,

$(\lambda\,s)(p)=\lambda\,s(p).$

In local coordinates $(x^{1},\dotsc,x^{m})$ around a point $p$, with a basis $e_{1},\dotsc,e_{m}\in E_{p}$, a section $s$ can be written as

$s=\sum_{i}s^{i}e_{i}.$

A.5 Pullbacks and pushforwards

Consider a smooth map $\phi\colon M\to N$ between manifolds $M$ and $N$ with respective local coordinates $(x^{1},\dotsc,x^{m})$ and $(y^{1},\dotsc,y^{n})$. Let $j_{1},\dotsc,j_{k}\in\{1,\dotsc,n\}$ and let $f\colon N\to\mathbb{R}$ be a smooth function. The pullback of the expression $f(y^{1},\dotsc,y^{n})\,\mathrm{d}y^{j_{1}}\otimes\dotsb\otimes\mathrm{d}y^{j_{k}}$ is defined by

$\phi^{\star}\left(f(y^{1},\dotsc,y^{n})\,\mathrm{d}y^{j_{1}}\otimes\dotsb\otimes\mathrm{d}y^{j_{k}}\right)=f(\phi^{1}(x^{1},\dotsc,x^{m}),\dotsc,\phi^{n}(x^{1},\dotsc,x^{m}))\sum_{i_{1},\dotsc,i_{k}=1}^{m}\frac{\partial\phi^{j_{1}}}{\partial x^{i_{1}}}\dotsb\frac{\partial\phi^{j_{k}}}{\partial x^{i_{k}}}\,\mathrm{d}x^{i_{1}}\otimes\dotsb\otimes\mathrm{d}x^{i_{k}}.$

Given a vector field $X\in\mathfrak{X}(M)$, the pushforward of $X$ by $\phi$ is a vector field in $\mathfrak{X}(N)$ defined by

$(\phi_{\star}X_{p})(h)=X_{p}(h\circ\phi)$

for all smooth functions $h\in C^{\infty}(N)$ and all points $p\in M$.

Remark 6.

The pullback generalizes the transformation of the integrand in the change of variables formula known from integral calculus. The pushforward of a vector field is an expression of the chain rule from differential calculus.

A.6 Riemannian metrics

Let $M$ be an $m$-dimensional smooth manifold. A Riemannian metric is a smooth map $g$ that assigns to each point $p\in M$ a symmetric, positive definite bilinear form $g(p)\colon T_{p}M\times T_{p}M\to\mathbb{R}$. If $\partial_{i},\partial_{j}\in T_{p}M$ are vectors of the coordinate frame in the local chart $U$, the components of the Riemannian metric can be written as

$g_{ij}(p)=g_{p}(\partial_{i},\partial_{j})=\partial_{i}\cdot g(p)\,\partial_{j}=\langle\partial_{i},\partial_{j}\rangle,$

for all $p\in U$ and $i,j=1,\dotsc,m$. The inner product $g_{p}(v,w)$ between $v=\sum_{i}v^{i}\partial_{i}\in T_{p}M$ and $w=\sum_{i}w^{i}\partial_{i}\in T_{p}M$ is defined by linearity. The norm of an arbitrary vector $v=\sum_{i}v^{i}\,\partial_{i}\in T_{p}M$ is $\|v\|=\sqrt{g_{p}(v,v)}$.

Remark 7.

If we take $V=T_{p}M$, then its dual is $V^{\star}=T_{p}M^{\star}$ with dual basis $\{e^{i}\colon V\to\mathbb{R}\}$. When $V=T_{p}M$, as in this case, it is customary to write $\partial_{i}=e_{i}$ and $\mathrm{d}x^{i}=e^{i}$. From the previous discussion, we see that the Riemannian metric $g$ is a section in $\Gamma(TM^{\star}\otimes TM^{\star})$ because $g(p)\colon T_{p}M\times T_{p}M\to\mathbb{R}$ is bilinear and we can write it as

$g(p)=\sum_{ij}g_{ij}(p)\,\mathrm{d}x^{i}_{p}\otimes\mathrm{d}x^{j}_{p}.$
Example 6 (Euclidean metric in $\mathbb{R}^{n}$).

The Euclidean metric in $\mathbb{R}^{n}$ is

$g=\sum_{i=1}^{n}\mathrm{d}x^{i}\otimes\mathrm{d}x^{i}.$  (6)

Consider the polar coordinates in $\mathbb{R}^{2}$ given by

$\left\{\begin{aligned} x^{1}(r,\theta)&=r\cos\theta,\\ x^{2}(r,\theta)&=r\sin\theta,\end{aligned}\right.$

where $r\geq 0$ and $0<\theta<2\pi$. We have

$\mathrm{d}x^{1}=\cos\theta\,\mathrm{d}r-r\sin\theta\,\mathrm{d}\theta\quad\text{and}\quad\mathrm{d}x^{2}=\sin\theta\,\mathrm{d}r+r\cos\theta\,\mathrm{d}\theta.$

Therefore, the Euclidean metric (6) is pulled back as:

$g=\mathrm{d}x^{1}\otimes\mathrm{d}x^{1}+\mathrm{d}x^{2}\otimes\mathrm{d}x^{2}=\mathrm{d}r\otimes\mathrm{d}r+r^{2}\,\mathrm{d}\theta\otimes\mathrm{d}\theta.$

While in Euclidean coordinates the components of the metric tensor are $g_{ij}=\delta_{ij}$, in polar coordinates they are $g_{11}=1$, $g_{22}=r^{2}$, and $g_{12}=g_{21}=0$.

Let $I=[0,1]\subset\mathbb{R}$ and let $\gamma\colon I\to M$ be a smooth curve $t\mapsto\gamma(t)=(\gamma^{1}(t),\dotsc,\gamma^{m}(t))$. The arc-length of the curve $\gamma$ is

$\ell(\gamma)=\int_{0}^{1}\sqrt{\dot{\gamma}(t)\cdot g(\gamma(t))\,\dot{\gamma}(t)}\,\mathrm{d}t=\int_{0}^{1}\|\dot{\gamma}(t)\|\,\mathrm{d}t.$

Given an arbitrary smooth manifold $M$, we can regard it as the configurational space of a (constrained) mechanical system. Let $\gamma\colon[0,1]\to M$ be a curve on $M$. If we set the potential energy to be constant ($U(q)=0$, for simplicity) and we think of the Riemannian metric $g$ as a (position-dependent) mass matrix, then it turns out that the action integral is

$\int_{0}^{1}L(\gamma(t),\dot{\gamma}(t))\,\mathrm{d}t,$  (7)

where the Lagrangian $L$ contains only a kinetic energy term,

$L(\gamma(t),\dot{\gamma}(t))=\tfrac{1}{2}\,\dot{\gamma}(t)\cdot g(\gamma(t))\,\dot{\gamma}(t).$

A curve $\gamma$ that locally minimizes the action $\int_{0}^{1}L(\gamma,\dot{\gamma})\,\mathrm{d}t$ is called a geodesic curve of $M$. By Jensen's inequality 50, curves that minimize (7) also minimize the arc-length.

A.7 Connections

Consider a vector bundle $E$ on the manifold $M$ and let $\Gamma(E)$ denote the sections of $E$. We are interested in studying how a section $s\in\Gamma(E)$ changes along a given direction $v\in T_{p}M$. In essence, we want to define what it means to take the limit

$\lim_{h\to 0}\frac{\tau_{h}^{-1}s(\gamma(h))-s(\gamma(0))}{h},$  (8)

where $\tau_{h}\colon E_{p}\to E_{\gamma(h)}$ is a linear isomorphism between fibers and $\gamma\colon[0,1]\to M$ is a smooth curve on $M$ such that $\gamma(0)=p$ and $\dot{\gamma}(0)=v$. If $M=\mathbb{R}^{n}$, then we can take $\tau_{h}$ to be the identity map, but in general we need a non-trivial $\tau_{h}$ to account for the fact that the fibers $E_{\gamma(h)}$ and $E_{\gamma(0)}$ are different vector spaces. To formalize this notion, we define the concept of a connection.

Let $v,w\in\mathfrak{X}(M)$, let $f\colon M\to\mathbb{R}$ be smooth, and let $\lambda\in\mathbb{R}$. A connection $\nabla\colon\mathfrak{X}(M)\times\Gamma(E)\to\Gamma(E)$ acts on sections $s,t\in\Gamma(E)$ as follows:

1. $\nabla_{v}(\lambda s)=\lambda\,\nabla_{v}s$.
2. $\nabla_{v}(s+t)=\nabla_{v}s+\nabla_{v}t$.
3. $\nabla_{v}(f\,s)=v(f)\,s+f\,\nabla_{v}s$.
4. $\nabla_{v+w}s=\nabla_{v}s+\nabla_{w}s$.
5. $\nabla_{fv}s=f\,\nabla_{v}s$.

In short, the connection $\nabla$ is linear in its first argument (a vector field) and acts as a derivation (i.e., via Leibniz's rule) on its second argument (a section). The covariant derivative of the section $s$ in the direction of $v$ is defined to be $\nabla_{v}s$.

We can characterize the covariant derivative by writing it as

$\nabla_{v}s=\bar{\nabla}_{v}s+A_{v}s,$

where $\bar{\nabla}_{v}$ is known as the flat connection. If we introduce local coordinates $(x^{1},\dotsc,x^{m})$ around a point in the open set $U\subset M$, and consider the coordinate vector fields $\partial_{k}$ and a basis $e_{i}$ of $\Gamma(E)$, then $\bar{\nabla}_{k}e_{i}=0$, where $\nabla_{k}=\nabla_{\partial_{k}}$. Therefore,

$\nabla_{k}e_{i}=\sum_{j}A^{j}_{ki}\,e_{j},$

where $A^{j}_{ki}$ are the components of the vector potential. In full generality, we have

$\begin{aligned}\nabla_{v}s&=\nabla_{\sum_{k}v^{k}\partial_{k}}\left(\sum_{i}s^{i}e_{i}\right)=\sum_{k}v^{k}\,\nabla_{k}\left(\sum_{i}s^{i}e_{i}\right)\\ &=\sum_{k}\sum_{i}v^{k}\left(\left(\partial_{k}s^{i}\right)e_{i}+s^{i}\,\nabla_{k}e_{i}\right)\\ &=\sum_{k}\sum_{i}v^{k}\left(\left(\partial_{k}s^{i}\right)e_{i}+s^{i}\sum_{j}A^{j}_{ki}e_{j}\right)\\ &=\sum_{i}\sum_{k}v^{k}\left(\partial_{k}s^{i}+\sum_{j}s^{j}A^{i}_{kj}\right)e_{i}.\end{aligned}$

Consequently, each component of the covariant derivative of the section $s$ along the vector $v$ is of the form

$(\nabla_{v}s)^{i}=\sum_{k}v^{k}\left(\partial_{k}s^{i}+\sum_{j}s^{j}A^{i}_{kj}\right).$
Remark 8.

The definition of the covariant derivative $\nabla_{v}s$ is equivalent to (8) (see 51, Chapter 6). In other words, the flat connection and the vector potential fully characterize all the possible ways in which we could define a derivative as a limit of the form (8). We will see next that there is a special type of connection that is determined by the Riemannian metric.

A.8 The Levi-Civita connection

A Riemannian metric induces a unique connection called the Levi-Civita connection. The components of the vector potential of the Levi-Civita connection are called the Christoffel symbols and we shall derive them next using classical mechanics.

Consider a free particle with unit mass moving on the manifold $M$ with position $q$ and velocity $\dot{q}$. Its Lagrangian is

$L(q,\dot{q})=\frac{1}{2}\sum_{i=1}^{m}\sum_{j=1}^{m}g_{ij}(q)\,\dot{q}^{i}\dot{q}^{j}.$

The Euler-Lagrange equations for this mechanical system are

$\frac{\mathrm{d}}{\mathrm{d}t}\frac{\partial L}{\partial\dot{q}^{i}}-\frac{\partial L}{\partial q^{i}}=0,\quad\text{for}\quad i=1,\dotsc,m.$

This implies that

$\begin{aligned}0&=\sum_{j=1}^{m}\frac{\mathrm{d}}{\mathrm{d}t}\left(g_{ij}(q)\,\dot{q}^{j}\right)-\frac{1}{2}\sum_{j=1}^{m}\sum_{k=1}^{m}\frac{\partial g_{jk}}{\partial q^{i}}(q)\,\dot{q}^{j}\dot{q}^{k}\\ &=\sum_{j=1}^{m}g_{ij}(q)\,\ddot{q}^{j}+\sum_{j=1}^{m}\sum_{k=1}^{m}\left(\frac{\partial g_{ij}}{\partial q^{k}}(q)-\frac{1}{2}\frac{\partial g_{jk}}{\partial q^{i}}(q)\right)\dot{q}^{j}\dot{q}^{k},\end{aligned}$

for $i=1,\dotsc,m$. Using the identity $\sum_{j=1}^{m}\sum_{k=1}^{m}\frac{\partial g_{ij}}{\partial q^{k}}\,\dot{q}^{j}\dot{q}^{k}=\sum_{j=1}^{m}\sum_{k=1}^{m}\frac{\partial g_{ik}}{\partial q^{j}}\,\dot{q}^{j}\dot{q}^{k}$, we find

$0=\sum_{j=1}^{m}g_{ij}(q)\,\ddot{q}^{j}+\frac{1}{2}\sum_{j=1}^{m}\sum_{k=1}^{m}\left(\frac{\partial g_{ij}}{\partial q^{k}}(q)+\frac{\partial g_{ik}}{\partial q^{j}}(q)-\frac{\partial g_{jk}}{\partial q^{i}}(q)\right)\dot{q}^{j}\dot{q}^{k},$

for $i=1,\dotsc,m$. Finally, multiplying both sides by $g^{\ell i}$ and summing over $i$, where $g^{\ell i}$ are the components of the inverse metric tensor (i.e., $\sum_{i=1}^{m}g^{\ell i}g_{ij}=\delta_{\ell j}$, where $\delta_{\ell j}$ is the Kronecker delta), we arrive at the geodesic equation

$0=\ddot{q}^{\ell}+\sum_{j=1}^{m}\sum_{k=1}^{m}\Gamma^{\ell}_{jk}(q)\,\dot{q}^{j}\dot{q}^{k},$  (9)

where

Γjk(q)=12i=1mgi(q)(gijqk(q)+gikqj(q)gjkqi(q))\Gamma^{\ell}_{jk}(q)=\frac{1}{2}\sum_{i=1}^{m}g^{\ell i}(q)\left(\frac{\partial g_{ij}}{\partial q^{k}}(q)+\frac{\partial g_{ik}}{\partial q^{j}}(q)-\frac{\partial g_{jk}}{\partial q^{i}}(q)\right)

is the expression of the Christoffel symbols for $j,k,\ell\in\{1,\dotsc,m\}$.

Remark 9.

Equation (9) is known as the geodesic equation and can be written more succinctly as $\nabla_{\dot{q}}\dot{q}=0$. The phase flow of this equation when the initial velocity has unit length is known as the exponential map, $\exp_{t}$, for $t\geq 0$. Therefore, $\exp_{0}p=p$, $\frac{\mathrm{d}}{\mathrm{d}t}\exp_{t}p=\dot{q}(t)$ with $\|\dot{q}(0)\|=1$, and $q(t)=\exp_{t}p$ solves the second-order ODE (9).
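The exponential map can likewise be approximated by integrating (9) numerically from an initial point and velocity. A minimal explicit-Euler sketch, reusing christoffel from the previous snippet (the step count and the first-order scheme are arbitrary illustrative choices; a higher-order integrator would be preferable in practice):

def exponential_map(metric, q0, v0, t=1.0, n_steps=1000):
    # Integrate the geodesic equation (9) with q(0) = q0 and
    # velocity q'(0) = v0; returns the approximate end point q(t).
    dt = t / n_steps
    q, v = q0, v0
    for _ in range(n_steps):
        a = -jnp.einsum("ljk,j,k->l", christoffel(metric, q), v, v)
        q, v = q + dt * v, v + dt * a
    return q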

Remark 10.

In the Euclidean case, $M=\mathbb{R}^{m}$ and $g_{ij}=\delta_{ij}$, leading to the usual expression of the kinetic energy and to the free particle moving in a straight line. When $M$ is an arbitrary Riemannian manifold and the free particle moves along a geodesic, we can interpret the Christoffel symbols as the terms giving rise to the corrections to an otherwise straight trajectory so that it remains on the manifold $M$.

A.9 Raising and lowering indices

The inner product $g\colon TM\times TM\to\mathbb{R}$ between two vectors $X,Y\in T_{p}M$ determined by the Riemannian metric gives rise to a linear isomorphism (called flat) between $T_{p}M$ and $T_{p}^{\star}M$ by mapping

X\in T_{p}M\mapsto X^{\flat}=g(X,\cdot)\in T_{p}^{\star}M.

The inverse linear map (called sharp) is

\omega\in T_{p}^{\star}M\mapsto\omega^{\sharp}\in T_{p}M

such that $\omega=g(\omega^{\sharp},\cdot)$. The isomorphisms introduced above are called the musical isomorphisms.

Denoting by $(g^{ij})$ the inverse of the metric tensor $g$, we write the components of $\omega^{\sharp}$ and $X^{\flat}$ as

(\omega^{\sharp})^{i}=\sum_{j}g^{ij}\omega_{j}\quad\text{and}\quad(X^{\flat})_{i}=\sum_{j}g_{ij}X^{j}.
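In code, lowering an index is a matrix-vector product with the metric, and raising one is a linear solve against it. A small sketch (flat and sharp are our names; metric is as in the previous snippets):

def flat(metric, q, X):
    # (X^flat)_i = sum_j g_{ij}(q) X^j
    return metric(q) @ X

def sharp(metric, q, omega):
    # (omega^sharp)^i = sum_j g^{ij}(q) omega_j, computed by solving
    # g x = omega rather than forming the inverse metric explicitly.
    return jnp.linalg.solve(metric(q), omega)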
Example 7.

The gradient of a function $f\in C^{\infty}(M)$ is defined as $\operatorname{grad}f=(\mathrm{d}f)^{\sharp}$, where $\mathrm{d}f=\sum_{i=1}^{m}\frac{\partial f}{\partial x^{i}}\mathrm{d}x^{i}$. We have seen that in the case of the plane $\mathbb{R}^{2}$ with polar coordinates, $g=\mathrm{d}r\otimes\mathrm{d}r+r^{2}\,\mathrm{d}\theta\otimes\mathrm{d}\theta$. Consequently, $\operatorname{grad}f=\frac{\partial f}{\partial r}\frac{\partial}{\partial r}+\frac{1}{r^{2}}\frac{\partial f}{\partial\theta}\frac{\partial}{\partial\theta}$.
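Example 7 can be checked numerically: in coordinates, jax.grad supplies the components of $\mathrm{d}f$, and raising the index with sharp from the sketch above yields the Riemannian gradient (riemannian_grad is our illustrative name):

def riemannian_grad(metric, f, q):
    # grad f = (df)^sharp: raise the index of the coordinate differential.
    return sharp(metric, q, jax.grad(f)(q))

f = lambda q: q[0] ** 2 * jnp.sin(q[1])  # f(r, theta) = r^2 sin(theta)
q = jnp.array([2.0, 0.3])
# Matches (df/dr, (1/r^2) df/dtheta) = (2 r sin(theta), cos(theta)).
print(riemannian_grad(polar_metric, f, q))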

The musical isomorphisms can be used with individual factors of a tensor product to map between the primal and dual vector spaces.

References

  • E and Zhou 2011 E, W.; Zhou, X. The Gentlest Ascent Dynamics. Nonlinearity 2011, 24, 1831–1842
  • Yin et al. 2022 Yin, J.; Huang, Z.; Zhang, L. Constrained High-Index Saddle Dynamics for the Solution Landscape with Equality Constraints. Journal of Scientific Computing 2022, 91, 62
  • Hänggi et al. 1990 Hänggi, P.; Talkner, P.; Borkovec, M. Reaction-Rate Theory: Fifty Years after Kramers. Reviews of Modern Physics 1990, 62, 251–341
  • Olsen et al. 2004 Olsen, R. A.; Kroes, G. J.; Henkelman, G.; Arnaldsson, A.; Jónsson, H. Comparison of Methods for Finding Saddle Points without Knowledge of the Final States. The Journal of Chemical Physics 2004, 121, 9776–9792
  • Samanta et al. 2014 Samanta, A.; Chen, M.; Yu, T.-Q.; Tuckerman, M.; E, W. Sampling saddle points on a free energy surface. The Journal of Chemical Physics 2014, 140, 164109
  • Quapp and Bofill 2014 Quapp, W.; Bofill, J. M. Locating saddle points of any index on potential energy surfaces by the generalized gentlest ascent dynamics. Theoretical Chemistry Accounts 2014, 133, 1510
  • Gu and Zhou 2017 Gu, S.; Zhou, X. Multiscale gentlest ascent dynamics for saddle point in effective dynamics of slow-fast system. Communications in Mathematical Sciences 2017, 15, 2279–2302
  • Gu and Zhou 2018 Gu, S.; Zhou, X. Simplified gentlest ascent dynamics for saddle points in non-gradient systems. Chaos: An Interdisciplinary Journal of Nonlinear Science 2018, 28, 123106
  • Yin et al. 2019 Yin, J.; Zhang, L.; Zhang, P. High-Index Optimization-Based Shrinking Dimer Method for Finding High-Index Saddle Points. SIAM Journal on Scientific Computing 2019, 41, A3576–A3595
  • Yin et al. 2022 Yin, J.; Zhang, L.; Zhang, P. Solution Landscape of the Onsager Model Identifies Non-Axisymmetric Critical Points. Physica D: Nonlinear Phenomena 2022, 430, 133081
  • Gu et al. 2022 Gu, S.; Wang, H.; Zhou, X. Active Learning for Saddle Point Calculation. Journal of Scientific Computing 2022, 93, 78
  • Luo et al. 2022 Luo, Y.; Zheng, X.; Cheng, X.; Zhang, L. Convergence Analysis of Discrete High-Index Saddle Dynamics. 2022; arXiv:2204.00171 [cs, math]
  • Zhang et al. 2022 Zhang, L.; Zhang, P.; Zheng, X. Error Estimates for Euler Discretization of High-Index Saddle Dynamics. SIAM Journal on Numerical Analysis 2022, 60, 2925–2944
  • Quapp et al. 1998 Quapp, W.; Hirsch, M.; Imig, O.; Heidrich, D. Searching for saddle points of potential energy surfaces by following a reduced gradient. Journal of Computational Chemistry 1998, 19, 1087–1100
  • Quapp et al. 1998 Quapp, W.; Hirsch, M.; Heidrich, D. Bifurcation of reaction pathways: the set of valley ridge inflection points of a simple three-dimensional potential energy surface. Theoretical Chemistry Accounts 1998, 100, 285–299
  • Basilevsky 1982 Basilevsky, M. V. The topography of potential energy surfaces. Chemical Physics 1982, 67, 337–346
  • Hoffman et al. 1986 Hoffman, D. K.; Nord, R. S.; Ruedenberg, K. Gradient extremals. Theoretica chimica acta 1986, 69, 265–279
  • Quapp 2004 Quapp, W. Newton Trajectories in the Curvilinear Metric of Internal Coordinates. Journal of Mathematical Chemistry 2004, 36, 365–379
  • Bello-Rivas et al. 2022 Bello-Rivas, J. M.; Georgiou, A.; Guckenheimer, J.; Kevrekidis, I. G. Staying the Course: iteratively locating equilibria of dynamical systems on Riemannian manifolds defined by point-clouds. Journal of Mathematical Chemistry 2022,
  • Coifman and Lafon 2006 Coifman, R. R.; Lafon, S. Diffusion Maps. Applied and Computational Harmonic Analysis 2006, 21, 5–30
  • Golub and Van Loan 2013 Golub, G. H.; Van Loan, C. F. Matrix computations, 4th ed.; Johns Hopkins Studies in the Mathematical Sciences; Johns Hopkins University Press, 2013
  • Levitt and Ortner 2017 Levitt, A.; Ortner, C. Convergence and Cycling in Walker-type Saddle Search Algorithms. SIAM Journal on Numerical Analysis 2017, 55, 2204–2227
  • do Carmo 1992 do Carmo, M. P. Riemannian geometry; Mathematics: Theory & Applications; Birkhäuser Boston, Inc., Boston, MA, 1992
  • Absil et al. 2008 Absil, P.-A.; Mahony, R.; Sepulchre, R. Optimization Algorithms on Matrix Manifolds; Princeton University Press, 2008
  • Nadler et al. 2006 Nadler, B.; Lafon, S.; Coifman, R. R.; Kevrekidis, I. G. Diffusion Maps, Spectral Clustering and Reaction Coordinates of Dynamical Systems. Applied and Computational Harmonic Analysis 2006, 21, 113–127
  • Peters 2016 Peters, B. Reaction Coordinates and Mechanistic Hypothesis Tests. Annual Review of Physical Chemistry 2016, 67, 669–690
  • Berezhkovskii and Szabo 2019 Berezhkovskii, A. M.; Szabo, A. Committors, First-Passage Times, Fluxes, Markov States, Milestones, and All That. The Journal of Chemical Physics 2019, 150, 054106
  • Roux 2022 Roux, B. Transition Rate Theory, Spectral Analysis, and Reactive Paths. The Journal of Chemical Physics 2022, 156, 134111
  • Wu et al. 2022 Wu, S.; Li, H.; Ma, A. Exact Reaction Coordinates for Flap Opening in HIV-1 Protease. Proceedings of the National Academy of Sciences 2022, 119, e2214906119
  • Manuchehrfar et al. 2022 Manuchehrfar, F.; Li, H.; Ma, A.; Liang, J. Reactive Vortexes in a Naturally Activated Process: Non-Diffusive Rotational Fluxes at Transition State Uncovered by Persistent Homology. The Journal of Physical Chemistry B 2022, 126, 9297–9308
  • Chiavazzo et al. 2017 Chiavazzo, E.; Covino, R.; Coifman, R. R.; Gear, C. W.; Georgiou, A. S.; Hummer, G.; Kevrekidis, I. G. Intrinsic Map Dynamics Exploration for Uncharted Effective Free-Energy Landscapes. Proceedings of the National Academy of Sciences 2017, 201621481
  • Fujisaki et al. 2018 Fujisaki, H.; Moritsugu, K.; Mitsutake, A.; Suetani, H. Conformational Change of a Biomolecule Studied by the Weighted Ensemble Method: Use of the Diffusion Map Method to Extract Reaction Coordinates. The Journal of Chemical Physics 2018, 149, 134112
  • Fixman 1974 Fixman, M. Classical Statistical Mechanics of Constraints: A Theorem and Application to Polymers. Proceedings of the National Academy of Sciences 1974, 71, 3050–3053
  • Carter et al. 1989 Carter, E. A.; Ciccotti, G.; Hynes, J. T.; Kapral, R. Constrained Reaction Coordinate Dynamics for the Simulation of Rare Events. Chemical Physics Letters 1989, 156, 472–477
  • den Otter 2000 den Otter, W. K. Thermodynamic Integration of the Free Energy along a Reaction Coordinate in Cartesian Coordinates. The Journal of Chemical Physics 2000, 112, 7283–7292
  • Darve et al. 2008 Darve, E.; Rodríguez-Gómez, D.; Pohorille, A. Adaptive Biasing Force Method for Scalar and Vector Free Energy Calculations. The Journal of Chemical Physics 2008, 128, 144120–144120
  • Maragliano et al. 2006 Maragliano, L.; Fischer, A.; Vanden-Eijnden, E.; Ciccotti, G. String Method in Collective Variables: Minimum Free Energy Paths and Isocommittor Surfaces. The Journal of Chemical Physics 2006, 125, 024106
  • Zhang and Zha 2004 Zhang, Z.; Zha, H. Principal Manifolds and Nonlinear Dimensionality Reduction via Tangent Space Alignment. SIAM Journal on Scientific Computing 2004, 26, 313–338
  • Bradbury et al. 2018 Bradbury, J.; Frostig, R.; Hawkins, P.; Johnson, M. J.; Leary, C.; Maclaurin, D.; Necula, G.; Paszke, A.; VanderPlas, J.; Wanderman-Milne, S.; Zhang, Q. JAX: composable transformations of Python+NumPy programs. http://github.com/google/jax (Last accessed 2023-04-20), 2018
  • Bofill and Quapp 2016 Bofill, J. M.; Quapp, W. The variational nature of the gentlest ascent dynamics and the relation of a variational minimum of a curve and the minimum energy path. Theoretical Chemistry Accounts 2016, 135, 1–14
  • Randers 1941 Randers, G. On an Asymmetrical Metric in the Four-Space of General Relativity. Physical Review 1941, 59, 195–199
  • Chern and Shen 2005 Chern, S.-S.; Shen, Z. Riemann-Finsler Geometry; Nankai Tracts in Mathematics; World Scientific, 2005; Vol. 6
  • Kidger 2021 Kidger, P. On Neural Differential Equations. Ph.D. thesis, University of Oxford, 2021
  • Müller and Brown 1979 Müller, K.; Brown, L. D. Location of Saddle Points and Minimum Energy Paths by a Constrained Simplex Optimization procedure. Theoretica chimica acta 1979, 53, 75–93
  • Frankel 2011 Frankel, T. The Geometry of Physics: An Introduction, 3rd ed.; Cambridge University Press, 2011
  • Mezey 1987 Mezey, P. G. Potential Energy Hypersurfaces; Studies in physical and theoretical chemistry; Elsevier Scientific Publishing Co., Amsterdam, 1987; Vol. 53
  • Wales 2001 Wales, D. Energy Landscapes: Applications to Clusters, Biomolecules and Glasses, 1st ed.; Cambridge University Press, 2001
  • Whitney 1944 Whitney, H. The Self-Intersections of a Smooth $n$-Manifold in $2n$-Space. The Annals of Mathematics 1944, 45, 220–246
  • Arnold 1989 Arnold, V. I. Mathematical Methods of Classical Mechanics; Graduate Texts in Mathematics; Springer New York, 1989; Vol. 60
  • Rudin 1987 Rudin, W. Real and Complex Analysis, 3rd ed.; McGraw-Hill Book Co., 1987
  • Spivak 1999 Spivak, M. A Comprehensive Introduction to Differential Geometry. Vol. II, 3rd ed.; Publish or Perish, Inc., 1999
  • 52 Ref. 51, Chapter 6