Linear operator theory of phase mixing

Keir Darling and Lawrence M. Widrow
Department of Physics, Engineering Physics & Astronomy, Queen’s University, Stirling Hall, Kingston ON K7L 3N6, Canada
E-mail: [email protected]

(Accepted XXX. Received YYY; in original form ZZZ)

Abstract

We study solutions of the collisionless Boltzmann equation (CBE) in a functional Koopman representation. This facilitates the use of linear spectral techniques characteristic of the analysis of Schrödinger-type equations. For illustrative purposes, we consider the classical phase mixing of a non-interacting distribution function in a quartic potential. Solutions are determined perturbatively relative to a harmonic oscillator. We impose a form of coarse-graining by choosing a finite dimensional basis to represent the distribution function and time evolution operators, which sets a minimum length scale on phase space structure. We observe a relationship between the dimension of the representation and the multiplicity of the harmonic oscillator eigenvalues. System dynamics are understood in terms of degenerate subspaces of the linear operator spectra. Each subspace is associated with a mode of the harmonic oscillator, the first two being bending and breathing structures. The quartic potential splits the degenerate eigenvalues within each subspace. This facilitates the formation of spiral structure as deformations from the harmonic oscillator modes. We ultimately argue that this construction provides a promising avenue for study of self-interacting systems experiencing phase mixing, which is an outstanding problem in the context of the Gaia DR2 vertical phase space spirals.

keywords:

Galaxy: disc – Galaxy: kinematics and dynamics – Galaxy: structure.

^†^†pubyear: 2024^†^†pagerange: Linear operator theory of phase mixing–B.2

1 Introduction

In this paper we investigate the relaxation of collisionless systems through phase mixing. In a statistical description, the macrostate of a system is specified by the phase space distribution function, $f$ . This quantifies the probability that a particle exists within an infinitesimal volume of phase space (Sethna, 2006). Liouville’s theorem requires that $f$ is conserved along orbits, and it therefore satisfies the collisionless Boltzmann equation (CBE) (Arnold, 1989). For $s$ spatial degrees of freedom, this is equivalent to the incompressible flow of a $2s$ dimensional fluid in a velocity field specified by Hamilton’s equations. An introductory description of this can be found in Binney & Tremaine (2008), but we summarize as follows. Let us assume an anharmonic potential in which orbital frequency of test particles is dependent on their amplitudes of oscillation. In the fluid analogy, this means that vorticity of the velocity field depends on the spatial coordinate. A distribution out of equilibrium with such a potential will deform with time as packets of density with different energies orbit at varying frequencies. For a fixed conservative Hamiltonian this continues indefinitely, with different energy orbits becoming increasingly out of phase with each other. In this process, the scale of structure in the distribution decreases. Eventually when the scale becomes so small that adjacent wraps of the mixed distribution become indistinguishable, the system has equilibrated.

Phase mixing in general leads to complex structures, especially for the $s=3$ case present in galactic dynamics and cosmology (Tremaine, 1999; Perrett et al., 2003; Abel et al., 2012). Even for $s=1$ , which applies to considerations of the vertical motion in the Galactic disc, this process is not trivial. With astrometric data from Gaia Collaboration et al. (2018), a one armed spiral was observed in the vertical phase space of Solar Neighborhood stars (Antoja et al., 2018). In reality this is a $s=3$ system, as it is unlikely that the vertical dynamics are decoupled from motion in the plane (Hunt et al., 2021), but much attention has been given to the structure in the univariate case (Schönrich & Binney, 2018; Darling & Widrow, 2019a; Bennett & Bovy, 2018, 2021). At present, no models have reproduced the exact form of the Gaia spiral.

In Darling & Widrow (2019b), it was suggested that the phase mixing process can be represented with the discrete spectrum of a linear time-evolution operator of finite dimension. There, eigenfunctions were estimated numerically from the full temporal history of a system by applying Dynamic Mode Decomposition (DMD) (Mezić, 2005; Rowley et al., 2009; Kutz et al., 2016) to $N$ -body simulations. This served to investigate the claim that self-gravity should not be ignored in phase mixing (Darling & Widrow, 2019a), as well as to explore the representation of this process with persistent oscillatory structures. Stable oscillations such as bending and breathing modes were observed in a self-interacting system in an anharmonic potential. When modifying the relative dominance of self-interaction and anharmonic forcing, these oscillatory structures were more prominent the closer the system was to purely self-interacting. For stronger anharmonic forcing, they were deformed to include spiral structure.

DMD is closely related to Koopman theory (Koopman, 1931), which supposes that a complex, potentially nonlinear system can be represented as a simpler linear one by studying its evolution in terms of observable functions of its state space. This concept was used to interpret the results in Darling & Widrow (2019b), arguing that the binning of $N$ -body simulations constituted a mapping to observables. Because of the numerical nature of that work, it was difficult to study the supposed mechanism of phase mixing from discrete modes, or establish a concrete connection to Koopman theory. The DMD approach also draws comparison to the use of multichannel Singular Spectrum Analysis (mSSA) (Weinberg & Petersen, 2021; Johnson et al., 2023). In both cases, principle component analysis (PCA) based techniques are applied to time-series data. A novel aspect of the mSSA work was the temporal analysis of basis function expansion coefficients. This motivates treating the expansion coefficients as Koopman observables, rather than the distribution function.

In the present work, we aim to investigate a mechanism of phase mixing with a discrete linear operator spectrum, emphasizing the role of representation scale, and properties of the spectrum. The essential premise is that the minimum length scale and degeneracy of evolution operator eigenvalues depend on the dimension of the representation. The degenerate eigenvalues define subspaces of the operator spectrum. Splitting of these eigenvalues by anharmonic forcing causes differential rotation among the subspace eigenvectors, which manifests as phase mixing in $f$ . System dynamics are determined in terms of basis function expansion coefficients, which we treat as functionals of $f$ , adopting a functional Koopman formalism for our calculations. In doing so, we apply techniques from degenerate perturbation theory of the Schrödinger equation. All of our calculations are carried out symbolically.

This paper is organized as follows. In Section 2 we define our coordinates and Hamiltonian, as well as the vector spaces in which we perform our calculations. In section 3 we define the time evolution operator for functionals of $f$ , and then restate the problem in this formalism. In Section 4 we define a matrix representation for our operators, and compute the harmonic oscillator spectrum. Section 5 contains the perturbative treatment of the anharmonic potential, and a description of the eigenvalue splitting mechanism for phase mixing. Sections 6 and 7 contain a discussion of connections to other works, and concluding remarks on foreseeable extensions.

2 Preliminaries

Consider a two-dimensional phase space comprising position $\bar{q}$ and momentum $\bar{p}$ . We concern ourselves with the phase space distribution function, $f(\bar{q},\bar{p})$ . This is defined such that,

\mathrm{P}\bigl{(}\bar{q},\bar{p}\in\mathcal{R}\bigr{)}=\int_{\mathcal{R}}d\bar{q}d\bar{p}\ f(\bar{q},\bar{p}),

(1)

is the probability of finding a particle in the region of phase space, $\mathcal{R}$ . The dynamics of $f$ are prescribed by the Hamiltonian density, $h(\bar{q},\bar{p})$ , according to the CBE,

\frac{\partial f}{\partial t}+[f,h]=0.

(2)

Here $[f,h]=\frac{\partial f}{\partial q}\frac{\partial h}{\partial p}-\frac{\partial f}{\partial p}\frac{\partial h}{\partial q}$ denotes the Poisson bracket.

2.1 System Hamiltonian

We begin with the Hamiltonian density for a harmonic oscillator with frequency $\kappa$ ,

h_{0}(\bar{q},\bar{p})=\tfrac{1}{2}\left(\bar{p}^{2}+\kappa^{2}\bar{q}^{2}\right).

(3)

This is to be understood as the standard kinematic term plus the leading term in a Taylor series of any even potential. By Jeans’ theorem, the Hamiltonian defines the coordinate dependence of the equilibrium distribution function. Let us choose

f_{0}(\bar{q},\bar{p})=\frac{1}{Z}\mathrm{e}^{-\frac{1}{2}\beta h_{0}(\bar{q},\bar{p})}.

(4)

Here, $\beta$ is a coldness parameter inversely proportional to the velocity dispersion, and $Z$ is the partition function. It is convenient to work with scaled dimensionless coordinates, so we let

\bar{q}=\tfrac{1}{\kappa}\sqrt{\tfrac{2}{\beta}}q,\ \ \bar{p}=\sqrt{\tfrac{2}{\beta}}p.

(5)

In these coordinates, the Hamiltonian density and distribution function become

h_{0}(q,p)=\frac{1}{\beta}\left(q^{2}+p^{2}\right),\ \ f_{0}(q,p)=\frac{1}{Z}\mathrm{e}^{-\frac{1}{2}(q^{2}+p^{2})}.

(6)

Now we add an anharmonic correction of the form $\varphi\bar{q}^{4}$ , representative of the next term in the even Taylor series. Our total Hamiltonian density in scaled coordinates is

h(q,p)=h_{0}(q,p)+\varphi\frac{4}{\kappa^{4}\beta^{2}}q^{4}.

(7)

The factor $4\kappa^{-4}\beta^{-2}$ comes from the coordinate transformation in equation 5. We will use $\varphi=-\frac{\beta}{12}(\frac{\kappa}{2})^{4}$ in our calculations. This is chosen to assure stable orbits for a range of initial conditions while introducing vorticity that decreases with position, of which the effects can be seen within a few dynamical times.

2.2 Space of bivariate functions

Any configuration of $f$ is an element of a function space defined for the independent variables $q$ and $p$ . Denote by $\mathscr{H}$ the space of functions of $q$ and $p$ on a domain $\mathcal{D}$ . We treat $\mathscr{H}$ as a Hilbert space equipped with an inner product, which we denote $\langle g_{1},g_{2}\rangle$ for any $g_{1},g_{2}\in\mathscr{H}$ (see equation 59). We further assume that there exists a set of linearly independent functions $\{e_{j,k}\}$ that form a basis in $\mathscr{H}$ . Any function $g\in\mathscr{H}$ can be written in terms of the basis functions by projecting onto them (equation 60).

2.3 Conjugate space of functionals

Given the function space $\mathscr{H}$ , one may consider another, distinct vector space housing its functionals. Such a space is called dual to $\mathscr{H}$ , and is denoted $\mathscr{H}^{*}$ (also a Hilbert space). For the purposes of this work, we refer to a functional of $f$ as any linear mapping

G[f]=\langle f,g\rangle,

(8)

which takes the input function, here the distribution function $f$ , and maps it to $\mathbb{C}$ (the complex numbers) by integration with a given $g$ . For every $g\in\mathscr{H}$ , there is a unique $G\in\mathscr{H}^{*}$ given by equation 8 (Riesz representation theorem, see for example Conway (1994)). In the present context, the input function will always be $f$ , and it is treated as a variable relative to a functional $G[f]\in\mathscr{H}^{*}$ in the same sense that $q$ and $p$ are variables with respect to a function $g(q,p)\in\mathscr{H}$ . The functions $f,g\in\mathscr{H}$ and the functional $G\in\mathscr{H}^{*}$ are further related by the functional derivative, $\frac{\delta G}{\delta f}=g(q,p)$ .

The dual space $\mathscr{H}^{*}$ has an inner product and basis that can be understood in terms of the corresponding constructions in $\mathscr{H}$ . Given a basis $\{e_{j,k}\}$ of $\mathscr{H}$ , there exists a basis for $\mathscr{H}^{*}$ , which we denote $\{E_{j,k}\}$ . The two are related by the biorthonormal condition, $E_{j,k}[e_{j^{\prime},k^{\prime}}]=\delta_{j}^{j^{\prime}}\delta_{k}^{k^{\prime}}$ . The inner product of any two $G_{1},G_{2}\in\mathscr{H}^{*}$ is denoted $\langle G_{1},G_{2}\rangle_{*}$ , and defined in equation 61. The subscript asterisk signifies that the operation is in $\mathscr{H}^{*}$ .

Finally we highlight some important quantities that exist in $\mathscr{H}^{*}$ . The total energy, which is the expectation value of the Hamiltonian with respect to $f$ , is the functional $H[f]=\langle f,h\rangle$ . Additionally, if we project $f$ onto a set of basis functions in $\mathscr{H}$ as in equation 60, the coefficients are the functionals $E_{j,k}^{\dagger}$ . The ^† operation denotes complex conjugation on scalars, and the conjugate transpose on vectors and matrices. If we compute the dynamics of $E_{j,k}$ according to $H$ in $\mathscr{H}^{*}$ , we obtain the time-evolution of the basis function expansion coefficients for $f$ in $\mathscr{H}$ .

2.4 Particular choice of basis functions

We choose as a basis for $\mathscr{H}$ products of univariate Gaussian-Hermite functions. That is,

\displaystyle e_{j,k}(q,p)=N_{j,k}f_{0}(q,p)P_{j}(q)P_{k}(p),

(9)

where $P_{j}(x)$ denotes the $j$ th Hermite polynomial of either $q$ or $p$ , and $N_{j,k}$ is a normalization constant. The corresponding functionals are denoted $E_{j,k}[f]=\langle f,e_{j,k}\rangle$ . In Appendix A we define the polynomials $P_{j}$ , the coefficients $N_{j,k}$ , and state two recurrence relations for the Hermite polynomials we will make use of in Sections 4.1 and 5.1.

3 Dynamics

In what follows, we will use the notion of a flow map. This is an operator on $\mathscr{H}$ denoted $\hat{S}^{t}$ , which takes an initial state $f$ to another state at time $t$ , $\hat{S}^{t}f$ . The particular action of $\hat{S}^{t}$ is prescribed by the CBE (equation 2).

3.1 Time evolution operator for functionals

Knowing the flow map $\hat{S}^{t}$ is tantamount to solving equation 2. Here, we leverage Koopman (1931) to effectively apply the flow map, without dealing explicitly with the CBE. Discussion of the Koopman formalism in the context of astrophysics can be found in Darling & Widrow (2019b) and Darling & Widrow (2021), but for a more rigorous description of the functional realization we will use here, see Nakao & Mezić (2020). Briefly, the idea is that rather than consider the distribution function directly, one can instead study the dynamics of “observables”. In the case of a partial differential equation like the CBE, the observables take the form of functionals, $G[f]$ . Let us define a linear time evolution operator that acts on $\mathscr{H}^{*}$ , which we denote $\hat{U}^{t}$ . This is called the Koopman operator, and its action on any $G\in\mathscr{H}^{*}$ is,

\hat{U}^{t}G[f]=G[\hat{S}^{t}f].

(10)

That is, $\hat{U}^{t}$ applies the flow map to the argument function $f$ , but does not change the form of the functional $G[f]$ . If it is easier to determine the action of $\hat{U}^{t}$ than $\hat{S}^{t}$ , one can compute the functional $\hat{U}^{t}G[f]$ , which encodes information about $f$ at time $t$ . Subsequently, if we know how to infer $f$ from a functional or set of functionals, we can compute the future state $\hat{S}^{t}f$ .

Treating $\hat{U}^{t}G$ as a function of $t$ , we may write its infinitesimal change with respect to $t$ as

\frac{d}{dt}\hat{U}^{t}G[f]=\lim_{\tau\rightarrow 0}\frac{\hat{U}^{t+\tau}G[f]-\hat{U}^{t}G[f]}{\tau}.

(11)

To evaluate the $\tau$ limit, we separate out the $\hat{U}^{t}$ (by linearity), and apply equation 10 for $\hat{U}^{\tau}G$ . That is,

\frac{d}{dt}\hat{U}^{t}G[f]=\hat{U}^{t}\left(\lim_{\tau\rightarrow 0}\frac{G[\hat{S}^{\tau}f]-G[f]}{\tau}\right).

(12)

For small $\tau$ , $\hat{S}^{\tau}f=f+\tau[h,f]+\mathcal{O}(\tau^{2})$ . Replacing $G[\hat{S}^{\tau}f]$ in equation 12 with this takes care of the limit, and we are left with

\frac{d}{dt}\hat{U}^{t}G[f]=\Bigl{\langle}[h,f],\frac{\delta}{\delta f}\hat{U}^{t}G\Bigr{\rangle}\coloneqq\hat{A}\Bigl{(}\hat{U}^{t}G[f]\Bigr{)}.

(13)

Here we have defined as $\hat{A}$ the infinitesimal generator of $\hat{U}^{t}$ . This is another operator on $\mathscr{H}^{*}$ , and will be the focus of much of this work. To be clear, the action of $\hat{A}$ on any $G\in\mathscr{H}^{*}$ is

\hat{A}G=\Bigl{\langle}[h,f],\frac{\delta G}{\delta f}\Bigr{\rangle}=\Bigl{\langle}f,\left[\frac{\delta G}{\delta f},\frac{\delta H}{\delta f}\right]\Bigr{\rangle}.

(14)

The final equality is obtained by expanding the Poisson bracket and performing integration by parts with respect to $p$ on the first term and $q$ on the second. This final form is called a Morrison bracket, which originates in plasma physics (Morrison, 1980). We denote this multiplicative operation between vectors in $\mathscr{H}^{*}$ by $[G,H]_{f}$ , indicating that it is the bracket between $G$ and $H$ with respect to $f$ . We prefer this form, as it makes clear that a functional $G$ acted on by $\hat{A}$ is another vector in $\mathscr{H}^{*}$ . For $g=\frac{\delta G}{\delta f}$ , $\hat{A}G$ maps the expectation value of $g$ to that of $[g,h]$ . An operator of this form has appeared in the gravitational context previously in for example Perez (2005), where the Morrison bracket is used to form a so-called functional Vlasov equation.

Equation 13 is a Schrödinger-type equation. It follows that the operator $\hat{U}^{t}$ satisfies an ordinary differential equation, and has the general solution

\hat{U}^{t}=\mathrm{e}^{\int_{0}^{t}d\tau\hat{A}}.

(15)

The reader familiar with quantum mechanics (QM) may find that this resembles the Heisenberg representation, with $\hat{A}$ analogous to the Hamiltonian operator. Like in QM, we can construct solutions to equation 13 by solving the associated eigenvalue problem of $\hat{A}$ .

3.2 Time evolution of $f$

Denote the $j$ th eigenvalue and eigenfunctional of $\hat{A}$ by $\lambda_{j}$ and $\Psi_{j}[f]$ respectively. The eigenvalue problem is then, $\hat{A}\Psi_{j}=\lambda_{j}\Psi_{j}$ . Let $\psi_{j}(q,p)=\frac{\delta\Psi_{j}}{\delta f}$ . For convenience, we call this quantity an eigenfunction of $\hat{A}$ , but really it is an eigenfunction of the Liouville operator, $[h,\cdot]$ . Suppose that $\{\psi_{j}\}$ forms a basis of $\mathscr{H}$ , and $\{\Psi_{j}\}$ of $\mathscr{H}^{*}$ .

In this paper, we focus on the case where $\frac{\partial\hat{A}}{\partial t}=0$ , which corresponds to a time-independent potential. This holds for our target Hamiltonian in equation 7, but is not true in general for a self-consistent distribution function satisfying both the CBE and Poisson equation. To study the response to both an external potential and self-consistent density perturbations, one must consider $\frac{\partial\hat{A}}{\partial t}\neq 0$ . We will discuss this briefly in Sections 6 and 7, but treatment of that problem is not in the scope of this work.

In the time-independent generator case, equation 15 simplifies to $\hat{U}^{t}=\mathrm{e}^{\hat{A}t}$ , and we have

\hat{U}^{t}\Psi_{j}[f]=\Psi_{j}[\hat{S}^{t}f]=\mathrm{e}^{\lambda_{j}t}\Psi_{j}[f].

(16)

By conjugate symmetry of the inner product, the expansion coefficient of $f$ onto $\psi_{j}$ at time $t$ is then

\langle\psi_{j},\hat{S}^{t}f\rangle=\Psi^{\dagger}_{j}[\hat{S}^{t}f]=\mathrm{e}^{\lambda_{j}^{\dagger}t}\langle\psi_{j},f\rangle.

(17)

Applying $\hat{S}^{t}$ to equation 60 and using this result, $\hat{S}^{t}f$ is

\hat{S}^{t}f(q,p)=\sum_{j=1}^{\infty}\langle\psi_{j},f\rangle\mathrm{e}^{\lambda_{j}^{\dagger}t}\psi_{j}(q,p).

(18)

3.3 Restatement of the problem

Now that we have established the formalism in which we will carry out our calculations, it is advantageous to restate the original problem. Recall that the target Hamiltonian is stated in equation 7. In terms of total energy functionals, this corresponds to $H=H_{0}+\varphi H_{4}$ , where $H_{0}=\langle f,h_{0}\rangle$ and $H_{4}=\langle f,\frac{4}{\kappa^{4}\beta^{2}}q^{4}\rangle$ . It follows that the generators are $\hat{A}_{0}=[\cdot,H_{0}]_{f}$ , $\hat{A}_{4}=[\cdot,H_{4}]_{f}$ , and $\hat{A}=\hat{A}_{0}+\varphi\hat{A}_{4}$ . Our goal then is to compute the spectrum of $\hat{A}$ , so that we can use equation 18.

4 Matrix Representation

To carry out calculations, we map the quantities in $\mathscr{H}^{*}$ to finite dimensional matrices and vectors. We begin by choosing a finite dimension, $M\in\mathbb{N}$ . If we take all bivariate Hermite polynomials up to order $N$ , the dimension of our finite basis is $M=\frac{1}{2}(N+1)(N+2)$ . We define a reference vector for the $\mathscr{H}$ basis as

\mathbf{u}=\left(e_{0,0},\cdots,e_{N,0},\cdots,e_{0,N}\right).

(19)

This is a vector-valued function of the phase space coordinates. For any vector $\mathbf{v}\in\mathbb{C}^{M}$ , the contraction $\mathbf{u}^{\dagger}\cdot\mathbf{v}$ returns a linear combination of the basis functions.

When we choose the dimension $M$ for the matrix representation, we introduce coarse-graining by imposing a minimum length scale. The resultant scale is set by the maximum polynomial order $N$ , and the model parameters that appear in the coordinate scaling (equation 5). The minimum scale can be quantified by the distance between adjacent roots in the highest order polynomial appearing in the basis functions, $e_{j,k}$ . Note that the roots are not necessarily evenly spaced, so the effective resolution must be understood as a function of the phase space coordinates.

4.1 Matrix Elements of $\hat{A}_{0}$

In order to compute the spectrum of $\hat{A}_{0}$ , we construct a $M\times M$ matrix, which we denote $\bm{\mathsf{A}}_{\mathsf{0}}$ . Its elements are obtained by computing the inner products,

\left(\bm{\mathsf{A}}_{\mathsf{0}}\right)_{j^{\prime},k^{\prime}}^{j,k}=\langle E_{j^{\prime},k^{\prime}},\hat{A}_{0}E_{j,k}\rangle_{*}.

(20)

Substituting $G\rightarrow E_{j,k}$ , and $H\rightarrow H_{0}$ into equation 14, and applying the definition of the $\mathscr{H}^{*}$ inner product in equation 61, we obtain

\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{0}}\right)_{j^{\prime},k^{\prime}}^{j,k}=

\displaystyle\langle e_{j^{\prime},k^{\prime}}(q,p),[h_{0}(q,p),e_{j,k}(q,p)]\rangle.

(21)

Expanding the Poisson bracket, and applying equation 63 we are left with the set of integrals,

$\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{0}}\right)_{j^{\prime},k^{\prime}}^{j,k}=$	$\displaystyle\kappa N_{j^{\prime},k^{\prime}}N_{j,k}\int_{\mathcal{D}}dqdpf_{0}^{2}(q,p)$	(22)
	$\displaystyle\times\bigg{(}kqP_{j^{\prime}}(q)P_{j}(q)P_{k^{\prime}}(p)P_{k-1}(p)$
	$\displaystyle-jpP_{k^{\prime}}(p)P_{k}(p)P_{j^{\prime}}(q)P_{j-1}(q)\bigg{)}.$

The integrals are easier to evaluate if we first apply equation 64. Doing so for $qP_{j}(q)$ in the first term and $pP_{k}(p)$ in the second gives

$\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{0}}\right)_{j^{\prime},k^{\prime}}^{j,k}=$	$\displaystyle\frac{\kappa}{2}N_{j^{\prime},k^{\prime}}N_{j,k}\int_{\mathcal{D}}dqdpf_{0}^{2}(q,p)$	(23)
	$\displaystyle\times\bigg{(}kP_{j^{\prime}}(q)P_{j+1}(q)P_{k^{\prime}}(p)P_{k-1}(p)$
	$\displaystyle-jP_{k^{\prime}}(p)P_{k+1}(p)P_{j^{\prime}}(q)P_{j-1}(q)\bigg{)}.$

From the orthogonality of the Hermite polynomials with respect to $f_{0}^{2}$ , this reduces to

\left(\bm{\mathsf{A}}_{\mathsf{0}}\right)_{j^{\prime},k^{\prime}}^{j,k}=\kappa\bigg{(}\sqrt{k(j+1)}\delta_{j^{\prime}}^{j+1}\delta_{k^{\prime}}^{k-1}-\sqrt{j(k+1)}\delta_{k^{\prime}}^{k+1}\delta_{j^{\prime}}^{j-1}\bigg{)}.

(24)

We organize the resulting $M^{2}$ real numbers into an $M\times M$ matrix according to the ordering of the $e_{j,k}$ indices in the reference vector $\mathbf{u}$ (equation 19).

4.2 Spectrum of $\hat{A}_{0}$

With a matrix representation of $\hat{A}_{0}$ , we can compute a subset of its spectrum. Let $\bm{\mathsf{A}}_{\mathsf{0}}\bm{\Psi}^{(0)}=\bm{\Psi}^{(0)}\bm{\Lambda}^{(0)}$ denote the eigendecomposition of $\bm{\mathsf{A}}_{\mathsf{0}}$ , where

\bm{\Psi}^{(0)}=\begin{pmatrix}|&|&&\\ \bm{\psi}_{1}^{(0)}&\bm{\psi}_{2}^{(0)}&\cdots&\\ |&|&&\ \end{pmatrix},\ \ \bm{\Lambda}^{(0)}=\begin{pmatrix}\lambda_{1}^{(0)}&0&\cdots\\ 0&\lambda_{2}^{(0)}&\\ \vdots&&\ddots\\ \end{pmatrix}.

(25)

Here, $\bm{\psi}_{j}^{(0)}\in\mathbb{C}^{M}$ and $\lambda_{j}^{(0)}\in\mathbb{C}$ . We can transform the eigenvectors of $\bm{\mathsf{A}}_{\mathsf{0}}$ into eigenfunctions of $\hat{A}_{0}$ by taking their contraction with the reference vector $\mathbf{u}$ . Denoting the eigenfunctions of $\hat{A}_{0}$ by $\psi_{j}^{(0)}(q,p)$ , we have

\psi_{j}^{(0)}(q,p)=\mathbf{u}^{\dagger}\cdot\bm{\psi}_{j}^{(0)}.

(26)

In general, there can be a combination of real and imaginary eigenvalues. Complex eigenvalues come in conjugate pairs, as do their associated eigenvectors. All $\bm{\mathsf{A}}_{\mathsf{0}}$ eigenvalues $\lambda_{j}^{(0)}$ are determined by the characteristic polynomial,

\mathrm{det}\left(\lambda^{(0)}\bm{\mathsf{I}}-\bm{\mathsf{A}}_{\mathsf{0}}\right)=0,

(27)

where $\bm{\mathsf{I}}$ denotes the $M\times M$ identity matrix. We compute the $\lambda_{j}^{(0)}$ by finding the roots of equation 27. It is possible that some of these roots are repeated, and it is such repeated roots that we refer to as degenerate eigenvalues of $\bm{\mathsf{A}}_{\mathsf{0}}$ . We will refer to the number of times a given eigenvalue is repeated as its multiplicity, denoted $d_{j}$ .

The eigenvectors $\bm{\psi}_{j}^{(0)}$ associated with $\lambda_{j}^{(0)}$ are computed by solving,

\left(\lambda_{j}^{(0)}\bm{\mathsf{I}}-\bm{\mathsf{A}}_{\mathsf{0}}\right)\bm{\psi}_{j}^{(0)}=\mathbf{0},

(28)

for $\bm{\psi}_{j}^{(0)}\neq\mathbf{0}$ . We assume that all of the eigenvectors are linearly independent, including those associated with the degenerate eigenvalues. This means that associated with each eigenvalue $\lambda_{j}^{(0)}$ , is a subspace of $\bm{\mathsf{A}}_{\mathsf{0}}$ , spanned by the corresponding eigenvectors. For the non-degenerate eigenvalues, these subspaces are 1-dimensional, where as for the degenerate eigenvalues they have dimension equal to the degenerate eigenvalue multiplicity, $d_{j}$ .

We show in Fig. 1 and 2 the sets of linearly independent eigenfunctions associated with $\lambda_{j}^{(0)}=-i\kappa$ and $\lambda_{j}^{(0)}=-i2\kappa$ respectively. The former corresponds to the standard bending mode structure, and the latter the breathing mode. In each case, the presented function is the real part of one member of a conjugate pair. When the full complex functions are taken together with their conjugates, multiplication with the relevant complex exponentials produces real valued clockwise rotating bending and breathing patterns.

Refer to caption — Figure 1: Level set contours of $\hat{A}_{0}$ eigenfunctions, denoted $\psi_{j}^{(0)}(q,p)$ , for the degenerate subspace associated with $\lambda_{j}^{(0)}=-\mathrm{i}\kappa$ . Each of these are linearly independent functions associated with their shared eigenvalue. We show only the real part of each, noting that the imaginary part in this case differs only by a rotation of $\pi/2$ . This four dimensional subspace occurs for $N=8$ .

4.3 Dynamics of $f$ from $\hat{A}_{0}$

The dynamics of an arbitrary initial condition according to $h_{0}$ are determined by the spectrum of $\hat{A}_{0}$ . Explicitly, $f$ is mapped to a new function $\hat{S}_{0}^{t}f$ at time $t$ by the flow of equation 2, for $h=h_{0}$ . We have from equation 18,

\hat{S}_{0}^{t}f(q,p)=\sum_{j=1}^{M}\langle\psi_{j}^{(0)},f\rangle\mathrm{e}^{\lambda_{j}^{\dagger(0)}t}\psi_{j}^{(0)}(q,p).

(29)

For the harmonic potential, the dynamics of $f$ are a rigid rotation of the initial condition about the origin. It is not immediately apparent from equation 29, but we can verify that no deformation to the distribution occurs with the conserved quantities in the spectrum of $\hat{A}_{0}$ . There are several eigenvectors of $\bm{\mathsf{A}}_{\mathsf{0}}$ associated with a zero eigenvalue. Each of these corresponds to a conserved quantity. By solving equation 28 for $\lambda_{j}^{(0)}=0$ , we find the sequence:

$\displaystyle\psi_{1}^{(0)}(q,p)$	$\displaystyle=e_{0,0},$	(30)
$\displaystyle\psi_{2}^{(0)}(q,p)$	$\displaystyle=\tfrac{\sqrt{2}}{2}\left(e_{2,0}+e_{0,2}\right),$
$\displaystyle\psi_{3}^{(0)}(q,p)$	$\displaystyle=\tfrac{1}{4}\left(\sqrt{6}e_{4,0}+2e_{2,2}+\sqrt{6}e_{0,4}\right),$
$\displaystyle\psi_{4}^{(0)}(q,p)$	$\displaystyle=\tfrac{1}{4}\left(\sqrt{5}e_{6,0}+\sqrt{3}e_{4,2}+\sqrt{3}e_{2,4}+\sqrt{5}e_{0,6}\right)...$

The first two correspond to conservation of phase space density and energy, and are themselves constants of motion in the sense that $[\psi_{1}^{(0)},h_{0}]=[\psi_{2}^{(0)},h_{0}]=0$ . The other eigenfunctions are not conserved in this sense. It is rather the corresponding functionals which are conserved. This can be understood in terms of the weighted statistical moments of $f$ , defined as $F_{m,n}[f]=\langle f,f_{0}\ q^{m}p^{n}\rangle$ . All of the zero-eigenvalue functions in equation 30 contain even polynomials of $q$ and $p$ . This means that the corresponding functionals $\langle f,\psi_{j}^{(0)}\rangle$ are linear combinations of bivariate moments $F_{m,n}$ corresponding only to even powers in both variables. Such moments are variance, covariance, kurtosis, cokurtosis and so on. These moments quantify the character of symmetric properties of the distribution, ignoring asymmetries like mean and skew. This means that when the integrals over phase space are carried out, the result is the same regardless of where along the harmonic oscillator orbit the distribution is. Any deformation in $f$ will change the values of the integrals in this sequence. Their presence as zero-eigenvalue functionals in the spectrum of $\hat{A}_{0}$ restricts the dynamics to rigid rotation.

5 Anharmonic Potential

With the harmonic oscillator spectrum calculated, we can now proceed to estimating the spectrum of $\hat{A}$ . We begin by computing the matrix elements of $\hat{A}_{4}$ .

5.1 Matrix elements of $\hat{A}_{4}$

The equivalent expression to equation 21 for the $\hat{A}_{4}$ matrix elements is obtained by making the substitutions $G\rightarrow E_{j,k}$ , and $H\rightarrow H_{4}$ in equation 14. We have,

\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{4}}\right)_{j^{\prime},k^{\prime}}^{j,k}=

\displaystyle\Bigl{\langle}e_{j^{\prime},k^{\prime}}(q,p),\left[\tfrac{4}{\kappa^{4}\beta^{2}}q^{4},e_{j,k}(q,p)\right]\Bigr{\rangle}.

(31)

We evaluate the Poisson bracket, applying equation 63 to the Hermite polynomial derivatives, and equation 64 to the $f_{0}$ derivatives. We are left with

	$\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{4}}\right)_{j^{\prime},k^{\prime}}^{j,k}=$	$\displaystyle\frac{8}{\kappa^{3}\beta}N_{j^{\prime},k^{\prime}}N_{j,k}\int_{\mathcal{D}}dqdpf_{0}^{2}(q,p)q^{3}P_{j^{\prime}}(q)P_{j}(q)$		(32)
		$\displaystyle\times P_{k^{\prime}}(p)\left(kP_{k-1}(p)-\tfrac{1}{2}P_{k+1}(p)\right).$		(32)

To evaluate the these integrals for all indices, it is easiest to re-write $q^{3}P_{j}(q)$ strictly in terms of Hermite polynomials. Recursive application of equation 64 yields (for arbitrary $n\in\mathbb{N}$ ) an expansion of the form

x^{n}P_{j}(x)=\sum_{m=0}^{n}b_{m}(j)P_{j+n-2m}(x),

(33)

where the coefficients $b_{m}(j)$ are polynomials in $j$ of order $m$ . Table 1 contains a list of the relevant coefficients for both a quartic and sextic potential. To compute the contribution from $q^{3}P_{j}(q)$ , let us define the integral

\displaystyle I_{j^{\prime}}^{j}

\displaystyle=\int_{-\infty}^{+\infty}dq\ \mathrm{e}^{-q^{2}}q^{3}P_{j^{\prime}}(q)P_{j}(q).

(34)

With this definition, the matrix elements of $\hat{A}_{4}$ (equation 32) are expressed as

	$\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{4}}\right)_{j^{\prime},k^{\prime}}^{j,k}=$	$\displaystyle\frac{8}{Z^{2}\kappa^{3}\beta}N_{j^{\prime},k^{\prime}}N_{j,k}I_{j^{\prime}}^{j}\int_{-\infty}^{+\infty}dp\ \mathrm{e}^{-p^{2}}$		(35)
		$\displaystyle\times P_{k^{\prime}}(p)\left(kP_{k-1}(p)-\tfrac{1}{2}P_{k+1}(p)\right),$		(35)

where we have used $f_{0}^{2}=Z^{-2}\mathrm{e}^{-q^{2}}\mathrm{e}^{-p^{2}}$ .

Application of equation 33, with the coefficients from the $n=3$ column of Table 1 yields

\displaystyle I_{j^{\prime}}^{j}=\sum_{m=0}^{3}b_{m}(j)\int_{-\infty}^{+\infty}dq\ \mathrm{e}^{-q^{2}}P_{j^{\prime}}(q)P_{j+3-2m}(q).

(36)

This is evaluated easily by the orthogonality of the Hermite polynomials, leaving

I_{j^{\prime}}^{j}=\sum_{m=0}^{3}b_{m}(j)\left(\sqrt{\pi}2^{j+3-2m}(j+3-2m)!\right)^{\tfrac{1}{2}}\delta_{j^{\prime}}^{j+3-2m}.

(37)

We then substitute the evaluated $I_{j^{\prime}}^{j}$ into equation 35, and compute the remaining momentum space integral. We are left with

	$\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{4}}\right)_{j^{\prime},k^{\prime}}^{j,k}=$	$\displaystyle\frac{4\varphi}{\kappa^{3}\beta}\sum_{m=0}^{3}b_{m}(j)N_{j,k}\delta_{j^{\prime}}^{j+3-2m}$		(38)
		$\displaystyle\times\left(\frac{2k\delta_{k^{\prime}}^{k-1}}{N_{j+3-2m,k-1}}-\frac{\delta_{k^{\prime}}^{k+1}}{N_{j+3-2m,k+1}}\right).$		(38)

Again, we organize the resulting values into an $M\times M$ matrix according to the ordering of indices in $\mathbf{u}$ .

$m$	$b_{m}$ $(n=3)$	$b_{m}$ $(n=5)$
0	$2^{-3}$	$2^{-5}$
1	$\frac{3}{4}(j+1)$	$\frac{1}{2}(j+\frac{5}{4})$
2	$\frac{3}{2}j^{2}$	$\frac{5}{2}(\frac{1}{2}j^{2}+j+\frac{3}{4})$
3	$j!/(j-3)!$	$\frac{5}{2}j(j^{2}+\frac{1}{2})$
4		$5j(\frac{1}{2}j^{3}-2j^{2}+\frac{5}{2}j-1)$
5		$j!/(j-5)!$

Table 1: Coefficients for Hermite polynomial expansion of matrix element integrand as in equation 33.

5.2 Spectrum of $\hat{A}$

To apply the ordinary differential equation solution for $\hat{U}^{t}$ , and compute $\hat{S}^{t}f$ from equation 18, we need the spectrum of $\hat{A}$ . The matrix

\bm{\mathsf{A}}=\bm{\mathsf{A}}_{\mathsf{0}}+\varphi\bm{\mathsf{A}}_{\mathsf{4}},

(39)

does not admit a diagonalization directly, so we adopt a perturbative treatment. We follow a standard procedure for analysis of the Schrödinger equation in QM (for an introductory description, see for example Griffiths & Schroeter (2018)). Suppose that the eigenvalues and eigenvectors of $\bm{\mathsf{A}}$ can be written as a power series in the parameter $\varphi$ . That is,

	$\displaystyle\lambda_{j}(\varphi)$	$\displaystyle=\lambda_{j}^{(0)}+\varphi\lambda_{j}^{(1)}+\mathcal{O}(\varphi^{2}),$		(40)
	$\displaystyle\bm{\psi}_{j}(\varphi)$	$\displaystyle=\bm{\psi}_{j}^{(0)}+\varphi\bm{\psi}_{j}^{(1)}+\mathcal{O}(\varphi^{2}).$		(40)

We assume that these quantities satisfy the eigenvalue problem,

\left(\bm{\mathsf{A}}_{\mathsf{0}}+\varphi\bm{\mathsf{A}}_{\mathsf{4}}\right)\bm{\psi}_{j}(\varphi)=\lambda_{j}(\varphi)\bm{\psi}_{j}(\varphi).

(41)

Substituting the expressions in equation 40 into 41 and equating terms of equal power in $\varphi$ , we obtain a sequence of equations relating the power series coefficients and the operators $\bm{\mathsf{A}}_{\mathsf{0}}$ and $\bm{\mathsf{A}}_{\mathsf{4}}$ . The zero order equation is the eigenvalue problem for $\bm{\mathsf{A}}_{\mathsf{0}}$ , stated in Section 4.2. The first order equation is

\displaystyle\left(\bm{\mathsf{A}}_{\mathsf{0}}-\lambda_{j}^{(0)}\right)\bm{\psi}_{j}^{(1)}

\displaystyle=\left(\lambda^{(1)}_{j}-\bm{\mathsf{A}}_{\mathsf{4}}\right)\bm{\psi}_{j}^{(0)}.

(42)

We assume that the basis formed by the eigenvectors of $\bm{\mathsf{A}}_{\mathsf{0}}$ spans the eigenspace of $\bm{\mathsf{A}}$ . That is,

\bm{\psi}_{j}(\varphi)=\sum_{k=1}^{M}\left({\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j}(\varphi)\right)\bm{\psi}_{k}^{(0)}.

(43)

Note that ${\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j}(\varphi)$ indicates a dot product in $\mathbb{C}^{M}$ and the result is scalar. Substituting $\bm{\psi}_{j}(\varphi)$ as in equation 40 into equation 43, we express the first order correction as

\bm{\psi}_{j}^{(1)}=\sum_{k=1}^{M}\left({\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j}^{(1)}\right)\bm{\psi}_{k}^{(0)}.

(44)

In order to compute this correction we need an expression for the coefficients, ${\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j}^{(1)}$ . We proceed by contracting equation 42 with ${\bm{\psi}_{k}^{(0)}}^{\dagger}$ . This yields

\left(\lambda_{k}^{(0)}-\lambda_{j}^{(0)}\right){\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\psi}^{(1)}_{j}=\lambda_{j}^{(1)}\delta_{j}^{k}-{\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{\mathsf{4}}\bm{\psi}_{j}^{(0)}.

(45)

For $k=j$ , the left hand side is zero, and we have the first order correction to the eigenvalues,

\lambda_{j}^{(1)}={\bm{\psi}_{j}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{\mathsf{4}}\bm{\psi}_{j}^{(0)}.

(46)

For $k\neq j$ , $\delta_{j}^{k}=0$ and we obtain the coefficients we need for equation 44. That is,

{\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\psi}^{(1)}_{j}=\frac{{\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{\mathsf{4}}\bm{\psi}_{j}^{(0)}}{\lambda_{j}^{(0)}-\lambda_{k}^{(0)}}.

(47)

Summing over these coefficients as in equation 44 yields the $\mathcal{O}(\varphi)$ correction to the eigenvectors of $\bm{\mathsf{A}}$ . Explicitly, the first order correction to the non-degenerate eigenvectors of $\bm{\mathsf{A}}$ is

\bm{\psi}_{j}^{(1)}=\sum_{k=1}^{M}\frac{{\bm{\psi}_{k}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{\mathsf{4}}\bm{\psi}_{j}^{(0)}}{\lambda_{j}^{(0)}-\lambda_{k}^{(0)}}\bm{\psi}_{k}^{(0)}.

(48)

We can use the corrections computed here as-is for the non-degenerate subset of the $\bm{\mathsf{A}}_{\mathsf{0}}$ spectrum, where $\lambda_{j}^{(0)}\neq\lambda_{k}^{(0)}$ if $j\neq k$ . The situation is more complicated if $\lambda_{j}^{(0)}=\lambda_{k}^{(0)}$ . As discussed in Section 4.2, the spectrum of $\bm{\mathsf{A}}_{\mathsf{0}}$ in general contains degenerate subspaces, where there are multiple distinct eigenvectors associated with the same eigenvalue (see for example Fig. 1 and 2). This poses an issue for the correction in equation 48, where we must sum over all of the linearly independent eigenvectors. If $j$ and $k$ are such that both eigenvectors lie within a degenerate subspace, the denominator here is zero. To proceed, we treat the $\bm{\mathsf{A}}_{\mathsf{4}}$ contribution separately for each degenerate subspace.

5.3 Treatment of degenerate eigenvalues

We begin by adding a second index label to the eigenvalues of $\bm{\mathsf{A}}_{\mathsf{0}}$ . That is, let $\lambda_{j,k}^{(0)}$ denote the $k$ th repetition of the $j$ th unique eigenvalue of $\bm{\mathsf{A}}_{\mathsf{0}}$ . The same indexing scheme is applied to the eigenvectors. As stated in Section 4.2, all eigenvectors corresponding to repeated eigenvalues are themselves linearly independent. So although we have $\lambda_{j,k}^{(0)}=\lambda_{j,k^{\prime}}^{(0)}$ , for the eigenvectors, $\bm{\psi}_{j,k}^{(0)}\neq\bm{\psi}_{j,k^{\prime}}^{(0)}$ .

For any fixed $j$ , the $\bm{\psi}_{j,k}^{(0)}$ are linearly independent and span a subspace of the $\bm{\mathsf{A}}_{\mathsf{0}}$ eigenspace corresponding to a single eigenvalue $\lambda_{j}^{(0)}$ . Any linear combination of these vectors is itself an eigenvector of $\bm{\mathsf{A}}_{\mathsf{0}}$ , with eigenvalue $\lambda_{j}^{(0)}$ . We can make use of this property in determining the $\bm{\mathsf{A}}_{\mathsf{4}}$ contribution. We begin by looking at the spectrum of $\bm{\mathsf{A}}_{\mathsf{4}}$ when it is projected onto each degenerate subspace of $\bm{\mathsf{A}}_{\mathsf{0}}$ separately. This process is as follows.

For each eigenvalue $\lambda_{j}^{(0)}$ with multiplicity $d_{j}>1$ , we define the subspace basis as

\bm{\Psi}_{j}^{(0)}=\begin{pmatrix}|&&|\\ \bm{\psi}_{j,1}^{(0)}&\cdots&\bm{\psi}_{j,d_{j}}^{(0)}\\ |&&|\end{pmatrix}.

(49)

We then compute the matrix elements of $\hat{A}_{4}$ projected onto the degenerate subspace. Denoting the $j$ th subspace projection $\tilde{\bm{\mathsf{A}}}_{\mathsf{4}}^{j}$ , we have

(\tilde{\bm{\mathsf{A}}}_{\mathsf{4}}^{j})_{k}^{k^{\prime}}={\bm{\psi}_{j,k}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{\mathsf{4}}\bm{\psi}_{j,k^{\prime}}^{(0)}.

(50)

Note that the projected matrices are $d_{j}\times d_{j}$ rather than $M\times M$ . With the projected generator matrix, we compute its eigenvalues, $\lambda^{(1)}_{j,k}\in\mathbb{C}$ and eigenvectors, $\bm{\xi}_{j,k}\in\mathbb{C}^{d_{j}}$ . These satisfy

\tilde{\bm{\mathsf{A}}}_{\mathsf{4}}^{j}\bm{\xi}_{j,k}=\lambda^{(1)}_{j,k}\bm{\xi}_{j,k}.

(51)

5.4 Simultaneous eigenvectors of $\hat{A}_{0}$ and $\hat{A}_{4}$

The $\bm{\xi}_{j,k}$ comprise weights for linear combinations of the degenerate subspace basis vectors $\bm{\psi}^{(0)}_{j,k}$ . Contracting with the subspace basis $\bm{\Psi}_{j}^{(0)}$ in equation 49, we can get back vectors in $\mathbb{C}^{M}$ . Explicitly, we write

\tilde{\bm{\psi}}_{j,k}^{(0)}=\bm{\Psi}^{(0)}_{j}\bm{\xi}_{j,k},\ \ 1\leq k\leq d_{j}.

(52)

Each of the $\tilde{\bm{\psi}}_{j,k}^{(0)}$ are eigenvectors of both the projected $\bm{\mathsf{A}}_{\mathsf{4}}$ , with eigenvalue $\lambda^{(1)}_{j,k}$ , and of $\bm{\mathsf{A}}_{\mathsf{0}}$ with eigenvalue $\lambda_{j}^{(0)}$ . Because of this, we can replace the $d_{j}$ original eigenvectors of $\bm{\mathsf{A}}_{\mathsf{0}}$ making up the $j$ th degenerate subspace with the new $\tilde{\bm{\psi}}_{j,k}^{(0)}$ .

We need these new eigenvectors to resolve an ambiguous definition of the unperturbed $\bm{\psi}_{j,k}^{(0)}$ . If we did not do this, the transition between $\varphi=0$ and $\varphi>0$ could constitute a discontinuous change in the eigenvectors. We must avoid this to assure that we have a smooth variation of the eigenvectors with respect to change in the perturbation parameter $\varphi$ . Otherwise, the original power series assumption (equation 40) would be invalid. From this point on, we replace all of the degenerate subspace eigenvectors, $\bm{\psi}_{j,k}^{(0)}$ with the new vectors we just found, $\tilde{\bm{\psi}}^{(0)}_{j,k}$ . For notational simplicity, we drop the tilde. Any time from this point on that we refer to the degenerate subspace vectors, it should be assumed that they are the ones defined by equation 52 that diagonalize $\bm{\mathsf{A}}_{\mathsf{4}}$ in their respective subspaces. The eigenfunctions are obtained from the eigenvectors in the same way as for the harmonic oscillator, using an analog to equation 26.

5.5 Degenerate Subspace Corrections

To compute the $\mathcal{O}(\varphi)$ corrections for the degenerate eigenvalues and eigenvectors, we follow a similar procedure to Section 4.2, carrying out the necessary algebra in Appendix B. To summarize, we again assume a power series in $\varphi$ for both the eigenvalues and eigenvectors of $\bm{\mathsf{A}}$ (equation 68), and then compute the coefficients of the $\mathcal{O}(\varphi)$ terms in these series.

5.5.1 Eigenvalues of $\hat{A}$

The expression for the first order corrected eigenvalues is the same as it was in the non-degenerate case, just with the double index on the degenerate subsets. Explicitly,

\lambda_{j,k}(\varphi)=\lambda_{j}^{(0)}+\varphi\lambda_{j,k}^{(1)}+\mathcal{O}(\varphi^{2}),

(53)

where $\lambda_{j,k}^{(1)}$ is given by equation 72, derived in Appendix B.1. This is obtained by diagonalizing $\bm{\mathsf{A}}_{\mathsf{4}}$ in the $j$ th degenerate subspace as in Section 5.3. Here, the eigenvalues of $\bm{\mathsf{A}}_{\mathsf{4}}$ are distinct within any given degenerate subspace $\bm{\Psi}_{j}^{(0)}$ . That is, $\lambda^{(1)}_{j,k}\neq\lambda^{(1)}_{j,k^{\prime}}$ for $k\neq k^{\prime}$ . This means that the degenerate eigenvalues of $\bm{\mathsf{A}}_{\mathsf{0}}$ have been split by $\bm{\mathsf{A}}_{\mathsf{4}}$ . If this is not true, a higher order expansion in $\varphi$ is required.

We show in Fig. 3 the corrected eigenvalues corresponding to half of the degenerate subspaces (the values shown are conjugate partners to those in the omitted half). The $-\mathrm{i}\lambda_{j,k}(\varphi)/\kappa$ curves serve to illustrate the splitting of the degenerate subset of the $\hat{A}_{0}$ spectrum. The degenerate subspace dimension decreases as $|\lambda_{j}^{(0)}|$ increases. The curves corresponding to a single degenerate eigenvalue of $\hat{A}_{0}$ converge at $\varphi=0$ , but possess distinct values otherwise. The spread of the $\lambda_{j,k}(\varphi)$ for fixed $j$ , increases with $\varphi$ . This corresponds to the strength of the anharmonic contribution to the potential. It is this distance between eigenvalues of $\hat{A}$ within a given degenerate subspace that drives differential rotation in the distribution function. The vertical line at $\varphi/\varphi_{0}=1$ indicates the chosen value of $\varphi$ used in our example calculations.

5.5.2 Eigenvectors of $\hat{A}$

The eigenvectors of $\bm{\mathsf{A}}$ are expressed as the power series,

\bm{\psi}_{j,k}(\varphi)=\bm{\psi}_{j,k}^{(0)}+\varphi\bm{\psi}_{j,k}^{(1)}+\mathcal{O}(\varphi^{2}).

(54)

The first order correction $\bm{\psi}_{j,k}^{(1)}$ is derived in Appendix B.2, and can ultimately be written as

\bm{\psi}_{j,k}^{(1)}=\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}\left(\sum_{l\neq k}^{d_{j}}\frac{{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}}{\lambda_{j,l}^{(1)}-\lambda_{j,k}^{(1)}}\bm{\psi}_{j,l}^{(0)}-\bm{\psi}_{m}^{(0)}\right).

(55)

As in Section 4.2, we can transform the eigenvectors of $\bm{\mathsf{A}}$ into eigenfunctions of $\hat{A}$ by taking their contraction with the reference vector, $\mathbf{u}$ . We have

\psi_{j,k}(q,p;\varphi)=\mathbf{u}^{\dagger}\cdot\bm{\psi}_{j,k}(\varphi).

(56)

We show in Fig. 4 an example set of $\hat{A}$ eigenfunctions associated with $\lambda_{j}^{(0)}=-\mathrm{i}\kappa$ . As discussed in Section 4.2, this is the eigenvalue that controls the rotation of the simple bending mode in Fig. 1. The $\hat{A}$ eigenfunctions contain more sign changes along a radial path from the origin than those of the harmonic oscillator. This facilitates a segmenting of phase space density with distance from the origin (proportional to the action in this case). Each of these structures possesses a different orbital frequency, corresponding to the four diverging solid red lines starting at $-\mathrm{i}\lambda_{j,k}(\varphi)/\kappa=1$ in Fig. 3. The relative spacing of these lines determines the relative rates of rotation, and therefore the relative phase of the structures in Fig. 4, as time progresses. The combination of these two factors causes differential rotation in the distribution function, producing the spiral structure characteristic of phase mixing. The same general idea applies to the other degenerate subspace eigenfunctions, although there are more complicated structures present in the larger $|\lambda_{j}^{(0)}|$ eigenfunctions.

To illustrate the importance of degenerate subspaces to the mixing process, we consider the sum over eigenvectors within a given subspace. Let us define the $j$ th subspace sum as

\psi_{j}^{\Sigma}(q,p,t)=\sum_{k=1}^{d_{j}}\langle\psi_{j,k},f\rangle\psi_{j,k}(q,p;\varphi)\mathrm{e}^{\lambda_{j,k}^{\dagger}(\varphi)t}.

(57)

We show this quantity in Fig. 5, with the particular $j$ value corresponding to the index of the $\lambda_{j}^{(0)}=-\mathrm{i}\kappa$ subspace. The initial condition is $f_{0}(q,p+\delta p)$ , with $\delta p=0.3$ . At $t=0$ , the split eigenvalues have no effect, and the net contribution from the subspace is the bending mode. As time progresses, the eigenfunctions in Fig. 4 rotate increasingly out of phase with each other, deforming the bending mode structure into interlocked positive and negative one-armed spirals.

We have observed so far that particular types of structures correspond to degenerate subspaces of the harmonic oscillator spectrum. To reiterate, these subspaces are sets of linearly independent eigenfunctions of $\hat{A}_{0}$ , all associated with the same eigenvalue. When the anharmonic potential contribution is added, the degenerate eigenvalues split, but the correspondence between particular structures (bending, breathing, etc.) and the degenerate subspaces remains. The subspace dimension determines the complexity of representable structure, and taken in combination with the properties of the contained eigenfunctions (spacing of their roots), sets the minimum length scale of that structure.

5.6 Dynamics of $f$ from $\hat{A}$

With an estimated spectrum of the full generator, $\hat{A}$ , we can compute the distribution function at time $t$ . Separating the sum in equation 18 into the distinct contributions from our perturbative analysis we have

\displaystyle\hat{S}^{t}f(q,p)=\sum_{j=1}^{M-D}\langle\psi_{j},f\rangle\psi_{j}(q,p;\varphi)\mathrm{e}^{\lambda_{j}^{\dagger}(\varphi)t}+\sum_{j>M-D}^{M}\psi_{j}^{\Sigma}(q,p,t).

(58)

On the right hand side from left to right these are, the non-degenerate subspace of the spectrum of $\hat{A}$ , and the total contribution from the $D$ degenerate subspaces.

In Fig. 6, we show three snapshots of $\hat{S}^{t}f$ for three values of $N$ . The initial condition is again $f_{0}(q,p+\delta p)$ (the same as for Fig. 5). Time progresses down the rows, and basis size increases across the columns. Looking first at $N=14$ (rightmost column), the distribution function winds into a one-armed spiral as $t$ increases. Let us compare this to the $N=12$ and $N=10$ columns, focusing on $(2\pi)^{-1}\kappa t=8$ (second row). Relative to $N=14$ , the other two cases appear to have more sophisticated structure, with the complexity increasing as $N$ decreases. This apparent structure is however an artifact of the truncated basis representation in a manner similar to the Gibbs phenomenon. That is, a deficiency of terms in the series over the basis functions leads to a poor representation of the true $f$ .

The basis size set by the polynomial order $N$ determines how well a temporally isolated snapshot of the distribution function can be represented. In a finite dimensional representation, the exact form of $f$ is only known at the initial condition, and is projected onto the basis functions. For $t>0$ , every future state $\hat{S}^{t}f$ is determined from the initial projection according to the eigenvalues of $\hat{A}$ . The manner in which the distribution deforms from some $t_{1}$ to another time $t_{2}$ depends on the structure of the eigenfunctions and their associated eigenvalues. In the present context, this hinges on the relative spacing of the $\mathcal{O}(\varphi)$ eigenvalues within each degenerate subspace. As discussed in Section 5.5.2 and demonstrated in Fig. 5, the mechanism for spiral formation is the out of phase rotation of the degenerate subspace eigenvectors. The temporal characteristics of this process are determined by the split eigenvalues in Fig. 3. This occurs within each degenerate subspace, and the total contribution from the mechanism is encapsulated in the rightmost term in equation 58.

The number of eigenvalues, and their multiplicities in the $\hat{A}_{0}$ spectrum increase with $N$ . For $\lambda_{j}^{(0)}=-\mathrm{i}\kappa$ , we have $d_{j}=5,6,7$ for $N=10,12,14$ . When the degenerate eigenvalues have been split by the anharmonic potential, the multiplicity prescribes the number of distinct rates of rotation for the subspace eigenvectors. This can also be understood as an increase in frequency domain coverage in the sense of a Fourier transform. The length scale set by $N$ determines which structures can be well represented by the basis regardless of $t$ . The fanning out in frequency shown in Fig. 3 coupled with the form of the $\psi_{j,k}$ determines the resolvability of spatiotemporal structure.

6 Discussion

The resolvability of structure in $f$ depends on its representation, or means of observation. When one chooses a number of particles in an $N$ -body simulation, a grid resolution in an equation solver, or a finite dimensional basis, a scale is imposed. In this work, we aimed to highlight the relationship between imposed length scale from dimension of representation, and multiplicity of eigenvalues in the spectrum of $\hat{A}_{0}$ . Further, that in such a case splitting of the degenerate eigenvalues by $\hat{A}_{4}$ drives mixing in $f$ .

For the spectrum of $\hat{A}_{0}$ , the number of unique degenerate eigenvalues, and their average multiplicity increases with $M$ , the size of the basis. In this construction, spiral formation is achieved through a linear combination of structures that span the entire $(q,p)$ plane (Fig. 4). This is in contrast to the picture described in Section 1, in which essentially every infinitesimal annulus of phase space volume at a different radius from the origin orbits with a different frequency. In Banik et al. (2022), phase mixing is achieved through a linear response term of the form $\mathrm{e}^{in(\theta-\Omega(I)t)}$ , where $(\theta,I)$ are angle-action coordinates, and $\Omega$ is the oscillatory frequency of the orbits. Since the frequencies depends on the actions, orbits at different actions will possess varying frequencies, leading to deformation of $f$ .

One could suppose that for an infinite dimensional basis, $f$ decomposes into an infinite set of delta functions of $q$ and $p$ , each nonzero at a different point in the plane. In this case, all of the different packets of phase space density may have different orbital frequencies, and we obtain the original picture of the process, essentially moving to a discrete particle representation. Given the premise of increasing dimension corresponding to increasing degeneracy of the eigenvalues, we suppose that in the limit case of an infinite dimensional representation, the discretely split degenerate eigenvalues may become the continuous bands described in Mathur (1990) and Weinberg (1991).

The analysis here omits a self-interaction potential. This means that $\hat{A}$ does not depend on $t$ , and the integral in equation 15 is trivial. Were this not the case, the operator solution $\hat{U}^{t}$ would take the form of a Dyson series, obtained by iterative solution of the implicit equation $\hat{U}^{t}=1+\int_{0}^{t}d\tau\hat{A}(\tau)\hat{U}^{\tau}$ (see for example, Sakurai & Napolitano (2017)). In doing this, it would be sensible to separate the self-interaction contribution to the Hamiltonian, and parameterize its strength with for example, $\alpha$ . Computing the necessary Dyson series to some order in $\alpha$ is technically possible. The main constraint is that the matrix element integrals cannot necessarily be evaluated analytically for a non-polynomial potential in our chosen basis. Using the exact one-dimensional Green’s function of the Poisson equation poses a challenge in this regard. One option is to adopt a Hamiltonian Mean Field (HMF) approach as in Inagaki & Konishi (1993), replacing the absolute value function with a polynomial that preserves some desired properties. In this case, the self-interaction can be expressed in terms of the moments of $f$ as in Section 4.3, which work nicely with the functional formalism used here. The simplest case would be a quadratic interaction, $v(q,q^{\prime})\propto(q-q^{\prime})^{2}$ . In that case, the self-interaction is a quadratic potential well that tracks the expectation value of position with respect to $f$ , which is the moment $\hat{U}^{t}F_{1,0}$ .

In this work we considered systems with only a single spatial degree of freedom. This is applicable to the vertical motions of Solar Neighborhood stars in the limit that vertical and in-plane motions decouple. The general approach described in this work may be extended to higher dimensional phase spaces in order to incorporate radial and/or azimuthal motions. The general framework requires that CBE solutions reside in a Hilbert space $\mathscr{H}$ , and that there exists a dual space of the corresponding functionals $\mathscr{H}^{*}$ . We can in principle assume this for any number of spatial degrees of freedom. It is when computing a representation of the Koopman generator $\hat{A}$ that we must handle the problem a little differently. First it should be noted that increasing the dimension of the phase space will require additional degrees of freedom on the basis functions. In the case of a truncated finite dimensional basis, this means that in order to achieve a comparable resolution to the calculations here, a larger number of basis functions is necessary.

Assuming a Milky Way-like geometry, we briefly outline here possible choices for the basis functions, and discuss how one might proceed. In Section 4, we used Gaussian-Hermite functions for the vertical position and momentum, and can similarly use such functions for the radial and azimuthal momenta. For the radial position, we need orthogonal functions on the interval $[0,\infty)$ that approach zero at infinity. These criteria are met by Laguerre polynomials weighted by a radially decaying exponential function. For the azimuthal position we could use any set of periodic functions orthogonal on $[0,2\pi]$ .

If the increased matrix size poses a computational barrier, one may consider studying the system dynamics with expectation values of particular functions rather than the distribution function itself. One approach is to look at the mean vertical position and/or momentum as a function the radial and azimuthal coordinates as in for example Chequers & Widrow (2017). These partial moments have been represented in terms of Koopman generator eigenfunctions computed with DMD in Widrow et al. (2020). The functional formalism used in this work can be extended to various moments by transforming the Morrison bracket definition of $\hat{A}$ (equation 14) into a form prescribed by Kupershmidt & Manin (1978). This yields an equation of motion for moments rather than the distribution function itself. A description of this and its correspondence to Jeans’ equations can be found in Chapter 3 of Darling (2024).

7 Conclusions

We have described a procedure for determining the CBE dynamics of $f$ , by mapping the problem into a set of functionals $G[f]$ which satisfy a Schrödinger-type equation. To calculate explicit matrix representations of operators, we adopted a finite dimensional set of basis functions that impose a minimum length scale on $f$ . For an illustrative example of phase mixing, we treated a quartic potential perturbatively with respect to a harmonic oscillator solution. We observed that the multiplicity of harmonic oscillator eigenvalues increases with the dimension of the finite basis representation. Subsequently, we showed that the anharmonic potential splits the degenerate eigenvalues, leading to the amplitude dependent frequencies characteristic of phase mixing.

In Section 5.5.2, we computed the $\mathcal{O}(\varphi)$ corrected eigenfunctions of $\hat{A}$ . These structures were segmented into subspaces, with the particular groupings set by the spectrum of $\hat{A}_{0}$ according to the degeneracy of its eigenvalues. Different classes of structure (for example bending or breathing) were associated with each subspace. The scenario depicted in Fig. 5 can be understood as the kinematic response to a bending mode perturbation, projected onto the $\lambda_{j}^{(0)}=-\mathrm{i}\kappa$ subspace. This demonstrated how the standard bending mode structure deforms to contain spiral structure, similar to the observations in Darling & Widrow (2019b). This is of interest in the context of the Gaia spiral, especially with respect to the hypothesis that the observed structure stems from a bending mode excitation (Darling & Widrow, 2019a). The essential takeaway from this is that the $\hat{A}$ spectrum derived from a solvable model like the harmonic oscillator can be used to model both processes characteristic of self-gravity, and phase mixing. For the approach taken in Darling & Widrow (2019b), one could compute basis functions for the evolution of $f$ which were suited to representing the particular dynamics of the simulation they were derived from. The form of the basis functions derived from DMD was heavily dependent on the parameters of the simulation. They were especially disparate between simulations dominated by self-gravity and anharmonic forcing. In the formalism used here, we suggest that one can construct a single basis suited to both cases. The trade-off needed for this is that instead of a single eigenfunction representative of a principal component of the dynamics, one has a subspace of dimension $d_{j}\geq 1$ (usually $>$ ). The eigenfunctions spanning a subspace would at first glance appear suited only to represent the highly symmetric oscillatory structures typical of the harmonic oscillator spectrum, but eigenvalue splitting in proportion to anharmonic forcing magnitude allows for out of phase rotation. Such rotation facilitates spiral structure formation in the subspace contributions and consequently in the distribution function.

In modeling the Gaia spiral, it is possible that there does not exist a purely kinematic description which yields the observed structure. Entertaining that possibility, one must ask how self-interaction changes the standard mechanism for the formation of phase space spirals. It is feasible to consider self-interaction and comparatively static anharmonic forcing as competing effects, with their relative magnitudes impacting the spatiotemporal structure of the distribution function. This relative dominance has previously been quantified by a “live fraction" parameter, denoted $\alpha$ (Darling & Widrow, 2019b; Bennett & Bovy, 2021; Darling & Widrow, 2021). It was observed in Darling & Widrow (2019b) that when evolution is primarily driven by self-interaction ( $\alpha\simeq 0.8$ ), the dominant contributions to the Koopman spectrum resemble the bending and breathing modes that appear in the harmonic oscillator spectrum (Fig. 1 and 2). As the relative strength of the self-interaction was increased, the structures associated with the harmonic oscillator spectrum were deformed to contain spiral structure, resembling quite closely the sum over the $\lambda_{j}^{(0)}=-\mathrm{i}\kappa$ subspace in Fig. 5 for $t>0$ . We suppose that the spiral-bending and spiral-breathing contributions to the Koopman spectrum observed numerically in Darling & Widrow (2019b) can be modeled by the degenerate subspaces of the perturbed harmonic oscillator spectrum associated with their respective structures. For the quartic Hamiltonian, the eigenvalue splitting mechanism represents the effect of the anharmonic forcing. The self-interaction can be treated with time-dependent perturbation theory. Specifically this involves the more general Dyson series for $\frac{\partial\hat{A}}{\partial t}\neq 0$ discussed in Section 6. This treatment can be applied to each subspace individually, so one can observe the effect of self-interaction on the spiral-bending and spiral-breathing modes in a similar way to Darling & Widrow (2019b), but with an analytic model.

A crucial assumption here is that one can determine a basis sufficient for the evolution of the distribution function. If that is the case, the problem is reduced to determining the set of coefficients that produce the correct linear combination. It is the relative magnitudes of the coefficients that determine the form of $f$ at any particular time, but it is the relative phase of the time-dependent coefficients that prescribe the evolution and mixing. Acknowledging the possibility that there is no fully time-independent mapping from the hypothetically “correct" initial condition to the observed structure, the effect of time-dependent forces from the self-interaction of the distribution is essential for successful modeling. We suggest that a subspace-wise analysis of the interplay between anharmonic forcing and self-interaction is a preferable way forward. Preliminary calculations using the time-dependent $\hat{A}$ Dyson series indicate that addition of a self-interaction term to the quartic Hamiltonian leads to a time-varying rate of mixing. This is corroborated by fully self-consistent (Darling & Widrow, 2019a) and live fraction based (Darling & Widrow, 2019b) numerical experiments. In all cases, it was observed that a self-interacting distribution undergoes mostly kinematic phase mixing at early times to form a spiral, but the winding slows down after the initial formation. A detailed exploration of this is beyond the scope of the present work, and is left to a future article.

It is also of interest to further investigate the scale-dependent mixing process studied here in the context of other work around coarse-grained evolution of the CBE, as in Chavanis et al. (1996) or Chavanis, P. H. & Bouchet, F. (2005). In the latter, one of the definitions of a coarse-grained $f$ is a windowed functional. That is, $f$ is taken in convolution with some kernel that sets the representation scale. Preliminary work suggests that the equation of motion in that case is equivalent to equation 13 up to a diffusion current term.

Acknowledgements

This work was supported by a Discovery Grant with the Natural Sciences and Engineering Research Council of Canada. We are thankful to Mike Petersen for insightful comments. We also thank Scott Tremaine, Francesco Cellarosi, Aaron Vincent, and Stephen Hughes for helpful discussions.

Data Availability

Calculations were carried out in MATLAB. Custom software used for this paper is available upon reasonable request. Colormaps used in contour plots are thanks to Thyng et al. (2016).

References

Abel et al. (2012) Abel T., Hahn O., Kaehler R., 2012, MNRAS, 427, 61
Antoja et al. (2018) Antoja T., et al., 2018, Nature, 561, 360
Arnold (1989) Arnold V., 1989, Mathematical methods of classical mechanics. Vol. 60, Springer
Banik et al. (2022) Banik U., Weinberg M. D., van den Bosch F. C., 2022, ApJ, 935, 135
Bennett & Bovy (2018) Bennett M., Bovy J., 2018, MNRAS, 482, 1417
Bennett & Bovy (2021) Bennett M., Bovy J., 2021, MNRAS, 503, 376
Binney & Tremaine (2008) Binney J., Tremaine S., 2008, Galactic Dynamics: Second Edition. Princeton University
Chavanis, P. H. & Bouchet, F. (2005) Chavanis, P. H. Bouchet, F. 2005, A&A, 430, 771
Chavanis et al. (1996) Chavanis P. H., Sommeria J., Robert R., 1996, The Astrophysical Journal, 471, 385
Chequers & Widrow (2017) Chequers M. H., Widrow L. M., 2017, MNRAS, 472, 2751
Conway (1994) Conway J., 1994, A Course in Functional Analysis. Graduate Texts in Mathematics, Springer New York
Darling (2024) Darling K., 2024, Linear Operator Theory of Phase Mixing in Collisionless Systems
Darling & Widrow (2019a) Darling K., Widrow L. M., 2019a, MNRAS, 484, 1050
Darling & Widrow (2019b) Darling K., Widrow L. M., 2019b, MNRAS, 490, 114
Darling & Widrow (2021) Darling K., Widrow L. M., 2021, MNRAS, 506, 3098
Gaia Collaboration et al. (2018) Gaia Collaboration et al., 2018, A&A, 616, A11
Griffiths & Schroeter (2018) Griffiths D. J., Schroeter D. F., 2018, Introduction to Quantum Mechanics, 3 edn. Cambridge University Press
Hunt et al. (2021) Hunt J. A. S., Stelea I. A., Johnston K. V., Gandhi S. S., Laporte C. F. P., Bédorf J., 2021, MNRAS, 508, 1459
Inagaki & Konishi (1993) Inagaki S., Konishi T., 1993, PASJ, 45, 733
Johnson et al. (2023) Johnson A. C., Petersen M. S., Johnston K. V., Weinberg M. D., 2023, MNRAS, 521, 1757
Koopman (1931) Koopman B. O., 1931, Proceedings of the National Academy of Sciences, 17, 315
Kupershmidt & Manin (1978) Kupershmidt B. A., Manin Y. I., 1978, Functional analysis and its applications, 11, 188
Kutz et al. (2016) Kutz J. N., Brunton S. L., Brunton B. W., Proctor J. L., 2016, Dynamic Mode Decomposition: Data-Driven Modeling of Complex Systems. SIAM
Mathur (1990) Mathur S. D., 1990, MNRAS, 243, 529
Mezić (2005) Mezić I., 2005, Nonlinear Dynamics, 41, 309
Morrison (1980) Morrison P. J., 1980, Physics letters. A, 80, 383
Nakao & Mezić (2020) Nakao H., Mezić I., 2020, Chaos, 30, 113131
Perez (2005) Perez J., 2005, Transport Theory and Statistical Physics, 34, 391
Perrett et al. (2003) Perrett K. M., Stiff D. A., Hanes D. A., Bridges T. J., 2003, ApJ, 589, 790
Rowley et al. (2009) Rowley C. W., Mezic I., Bagheri S., Schlatter P., Henningson D. S., 2009, Journal of Fluid Mechanics, 641, 115–127
Sakurai & Napolitano (2017) Sakurai J. J., Napolitano J., 2017, Modern Quantum Mechanics, 2 edn. Cambridge University Press, doi:10.1017/9781108499996
Schönrich & Binney (2018) Schönrich R., Binney J., 2018, MNRAS, 481, 1501
Sethna (2006) Sethna J. P., 2006, Statistical Mechanics: Entropy, Order Parameters and Complexity, first edition edn. Oxford University Press, Great Clarendon Street, Oxford OX2 6DP
Thyng et al. (2016) Thyng K. M., Greene C. A., Hetland R. D., Zimmerle H. M., DiMarco S. F., 2016, Oceanography, 29, 9
Tremaine (1999) Tremaine S., 1999, MNRAS, 307, 877
Weinberg (1991) Weinberg M. D., 1991, ApJ, 373, 391
Weinberg & Petersen (2021) Weinberg M. D., Petersen M. S., 2021, MNRAS, 501, 5408
Widrow et al. (2020) Widrow L. M., Darling K., Li H., 2020, in Valluri M., Sellwood J. A., eds, Vol. 353, Galactic Dynamics in the Era of Large Surveys. pp 65–70, doi:10.1017/S1743921319009049

Appendix A Preliminary Definitions

A.1 Inner products

The inner product in $\mathscr{H}$ is

\langle g_{1},g_{2}\rangle=\int_{\mathcal{D}}dqdp\ g_{1}(q,p)g_{2}^{\dagger}(q,p),

(59)

where ^† indicates complex conjugation. This product possesses conjugate symmetry, $\langle g_{1},g_{2}\rangle^{\dagger}=\langle g_{2},g_{1}\rangle$ . For a set of linearly independent basis functions $\{e_{j,k}\}$ as defined in Section 2, the inner product facilitates projection of an arbitrary function $g\in\mathscr{H}$ . Such an expansion may be expressed as

g(q,p)=\sum_{j=0}^{\infty}\sum_{k=0}^{\infty}\langle e_{j,k},g\rangle e_{j,k}(q,p).

(60)

The inner product in $\mathscr{H}^{*}$ is

\langle G_{1},G_{2}\rangle_{*}=\bigg{\langle}\frac{\delta G_{2}}{\delta f},\frac{\delta G_{1}}{\delta f}\bigg{\rangle}.

(61)

An expansion analogous to equation 60 applies to arbitrary functionals $G\in\mathscr{H}^{*}$ .

A.2 Hermite polynomials

The Hermite polynomials $P_{j}(x)$ introduced in Section 2.4 are defined by the Rodrigues formula,

P_{j}(x)=(-1)^{n}\mathrm{e}^{x^{2}}\frac{d^{j}}{dx^{j}}\left(\mathrm{e}^{-x^{2}}\right).

(62)

Hermite polynomials satisfy the following two recurrence relations:

\frac{d}{dx}P_{j}(x)=2jP_{j-1}(x),

(63)

xP_{j}(x)=\frac{1}{2}P_{j+1}(x)+jP_{j-1}(x).

(64)

The normalization constants for the Gaussian-Hermite basis functions defined in Section 2.4 are

N_{j,k}=\frac{Z}{\sqrt{\pi 2^{j+k}j!k!}}.

(65)

Our chosen bivariate functions inherit orthogonality from the Hermite polynomials. Explicitly,

\langle e_{j,k},e_{j^{\prime},k^{\prime}}\rangle=\delta_{j^{\prime}}^{j}\delta_{k^{\prime}}^{k}.

(66)

This applies to the entire real number line in both variables, so our domain $\mathcal{D}$ is set to the entire phase space.

Appendix B Perturbative treatment of degenerate eigenvalues

We aim to find eigenvectors $\bm{\psi}_{j,k}(\varphi)$ and eigenvalues $\lambda_{j,k}(\varphi)$ that satisfy the perturbed eigenvalue problem

\left(\bm{\mathsf{A}}_{\mathsf{0}}+\varphi\bm{\mathsf{A}}_{\mathsf{4}}\right)\bm{\psi}_{j,k}=\lambda_{j,k}\bm{\psi}_{j,k}.

(67)

This is achieved by assuming a power series in $\varphi$ for both quantities,

	$\displaystyle\lambda_{j,k}(\varphi)$	$\displaystyle=\lambda_{j,k}^{(0)}+\varphi\lambda_{j,k}^{(1)}+\mathcal{O}(\varphi^{2}),$		(68)
	$\displaystyle\bm{\psi}_{j,k}(\varphi)$	$\displaystyle=\bm{\psi}_{j,k}^{(0)}+\varphi\bm{\psi}_{j,k}^{(1)}+\mathcal{O}(\varphi^{2}).$		(68)

This requires that both the eigenvectors and eigenvalues vary smoothly with respect to change in the perturbation parameter $\varphi$ . Since the eigenvalues are scalar, this is the case by default. We assure a smooth variation in the eigenvectors by the procedure described in Section 5.4.

Substituting the assumed power series into equation 67, and equating terms of equal power in $\varphi$ , one finds a set of equations. Those corresponding to the first three powers of $\varphi$ are,

$\displaystyle\left(\bm{\mathsf{A}}_{0}-\lambda_{j}^{(0)}\right)\bm{\psi}_{j,k}^{(0)}$	$\displaystyle=0,$	(69)
$\displaystyle\left(\bm{\mathsf{A}}_{0}-\lambda_{j}^{(0)}\right)\bm{\psi}_{j,k}^{(1)}$	$\displaystyle=\left(\lambda^{(1)}_{j,k}-\bm{\mathsf{A}}_{4}\right)\bm{\psi}_{j,k}^{(0)},$
$\displaystyle\left(\bm{\mathsf{A}}_{0}-\lambda_{j}^{(0)}\right)\bm{\psi}^{(2)}_{j,k}$	$\displaystyle=\left(\lambda_{j,k}^{(1)}-\bm{\mathsf{A}}_{4}\right)\bm{\psi}_{j,k}^{(1)}+\lambda_{j,k}^{(2)}\bm{\psi}_{j,k}^{(0)}.$

B.1 Degenerate case: eigenvalues

To determine the first order correction to the eigenvalues, we begin by contracting the $\mathcal{O}(\varphi)$ case (second line) in equation 69 with ${\bm{\psi}^{(0)}_{j,l}}^{\dagger}$ . That is

{\bm{\psi}^{(0)}_{j,l}}^{\dagger}\cdot\left(\bm{\mathsf{A}}_{0}-\lambda_{j}^{(0)}\right)\bm{\psi}_{j,k}^{(1)}={\bm{\psi}^{(0)}_{j,l}}^{\dagger}\cdot\left(\lambda^{(1)}_{j,k}-\bm{\mathsf{A}}_{4}\right)\bm{\psi}_{j,k}^{(0)}.

(70)

Since ${\bm{\psi}^{(0)}_{j,k}}^{\dagger}$ is a left-eigenvector of $\bm{\mathsf{A}}_{0}$ , ${\bm{\psi}^{(0)}_{j,k}}^{\dagger}\bm{\mathsf{A}}_{0}=\lambda_{j,k}^{(0)}{\bm{\psi}^{(0)}_{j,k}}^{\dagger}$ , and the left hand side is zero. We are left with

\lambda_{j,k}^{(1)}{\bm{\psi}^{(0)}_{j,l}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(0)}={\bm{\psi}^{(0)}_{j,l}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}.

(71)

Since ${\bm{\psi}^{(0)}_{j,l}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(0)}=\delta_{l}^{k}$ , equation 71 then reduces to

\lambda_{j,k}^{(1)}={\bm{\psi}^{(0)}_{j,k}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}.

(72)

That is, the $\mathcal{O}(\varphi)$ corrections to the degenerate eigenvalues are the diagonal entries of the perturbation, $\bm{\mathsf{A}}_{4}$ , when it is projected onto the degenerate subspace corresponding to $\lambda^{(0)}_{j,k}$ . In other words, if we project $\bm{\mathsf{A}}_{4}$ onto a degenerate subspace, the eigenvalues of the projected matrix are the first order corrections $\lambda_{j,k}^{(1)}$ . The first order correction $\lambda_{j,k}^{(1)}$ in equation 72 goes into the power series definition for the eigenvalues of $\hat{A}$ in Section 5.5.1 (equation 53).

B.2 Degenerate case: eigenvectors

In general, the eigenvectors of $\bm{\mathsf{A}}$ may contain components from the entire space, including vectors from both in and not in the degenerate subspace. That is, we want to know how to project $\bm{\psi}_{j,k}^{(1)}$ onto the sets of $\bm{\psi}_{j}^{(0)}$ and $\bm{\psi}_{j,k}^{(0)}$ . It follows that we can do this if we have general expressions for both of the contractions, ${\bm{\psi}_{j}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}$ and ${\bm{\psi}_{j,k}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}$ .

We will treat both cases separately, first computing the contribution from the subset of the complete basis that excludes the degenerate subspace. We begin by contracting the first order case in equation 69 with ${\bm{\psi}_{j}^{(0)}}^{\dagger}$ . We have

{\bm{\psi}_{l}^{(0)}}^{\dagger}\cdot\left(\bm{\mathsf{A}}_{0}-\lambda_{j}^{(0)}\right)\bm{\psi}_{j,k}^{(1)}={\bm{\psi}_{l}^{(0)}}^{\dagger}\cdot\left(\lambda^{(1)}_{j,k}-\bm{\mathsf{A}}_{4}\right)\bm{\psi}_{j,k}^{(0)}.

(73)

Similar to when we computed the eigenvalue correction, we note that ${\bm{\psi}_{j}^{(0)}}^{\dagger}\bm{\mathsf{A}}_{0}=\lambda_{j}^{(0)}{\bm{\psi}_{j}^{(0)}}^{\dagger}$ . Additionally, we know that ${\bm{\psi}_{l}^{(0)}}^{\dagger}$ and $\bm{\psi}_{j,k}^{(0)}$ are orthogonal for all $l,j,k$ , since one of them is within the degenerate subspace and the other is not. From these two arguments, we can simplify equation 73 to

{\bm{\psi}_{l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}=\frac{{\bm{\psi}_{l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{l}^{(0)}-\lambda_{j}^{(0)}}.

(74)

For the degenerate subset contribution, we do not get any new information by attempting a contraction of ${\bm{\psi}_{j,k}^{(0)}}^{\dagger}$ with the first order case in equation 69. Let us instead try the $\mathcal{O}(\varphi^{2})$ equation. We take

	$\displaystyle{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\left(\bm{\mathsf{A}}_{0}-\lambda_{j}^{(0)}\right)\bm{\psi}^{(2)}_{j,k}={\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\left(\lambda_{j,k}^{(1)}-\bm{\mathsf{A}}_{4}\right)\bm{\psi}_{j,k}^{(1)}$		(75)
	$\displaystyle+\lambda_{j,k}^{(2)}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(0)}.$		(75)

Note that we use the same index $j$ on the first index for the introduced vector, as if we allowed for a different index, we would be considering the case that one degenerate subspace is projected onto another. All of the distinct subspaces are orthogonal, so this can only result in zero.

We can proceed by again noting that ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\bm{\mathsf{A}}_{0}=\lambda_{j,l}^{(1)}$ . Since $\lambda_{j,l}=\lambda_{j}\ \forall\ l$ , the left hand side is zero. Further, ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(0)}=\delta_{l}^{k}$ , so we are are left with

\lambda_{j,k}^{(2)}\delta_{l}^{k}=-{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\left(\lambda_{j,k}^{(1)}-\bm{\mathsf{A}}_{4}\right)\bm{\psi}_{j,k}^{(1)}.

(76)

Of course, we do not know $\bm{\psi}_{j,k}^{(1)}$ , as that is what we are trying to determine here. We do know however that it can have contributions from $\bm{\psi}_{j}^{(0)}$ and $\bm{\psi}_{j,l}^{(0)}$ . Let us suppose that it takes the form

\bm{\psi}_{j,k}^{(1)}=\sum_{m\neq j}^{M}\left({\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right)\bm{\psi}_{m}^{(0)}+\sum_{l}^{d_{j}}\left({\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right)\bm{\psi}_{j,l}^{(0)}.

(77)

We have already found what we need to express the first sum explicitly in equation 74. With this, we may write

\bm{\psi}_{j,k}^{(1)}=\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}\bm{\psi}_{m}^{(0)}+\sum_{l}^{d_{j}}\left({\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right)\bm{\psi}_{j,l}^{(0)}.

(78)

Continuing with equation 76, we have

$\displaystyle\lambda_{j,k}^{(2)}\delta_{l}^{k}=$	$\displaystyle-\lambda_{j,k}^{(1)}\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{m}^{(0)}$	(79)
	$\displaystyle+\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}$
	$\displaystyle-\sum_{n}^{d_{j}}\lambda_{j,k}^{(1)}\left({\bm{\psi}_{j,n}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right)\left({\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,n}^{(0)}\right)$
	$\displaystyle+\sum_{n}^{d_{j}}\left({\bm{\psi}_{j,n}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right){\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,n}^{(0)}$

The first term on the right hand side is zero since ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{m}^{(0)}$ are orthogonal for $m\neq j$ . The sum in the term on the right hand side collapses because ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,n}^{(0)}=\delta_{l}^{n}$ . Making these simplifications leaves us with,

$\displaystyle\lambda_{j,k}^{(2)}\delta_{l}^{k}=$	$\displaystyle\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}$	(80)
	$\displaystyle-\lambda_{j,k}^{(1)}\left({\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right)$
	$\displaystyle+\sum_{n}^{d_{j}}\left({\bm{\psi}_{j,n}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right){\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,n}^{(0)}.$

Since the degenerate subspace basis vectors $\bm{\psi}_{j,k}^{(0)}$ are chosen such that they diagonalize $\bm{\mathsf{A}}_{4}$ , the only nonzero matrix elements ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,n}^{(0)}$ are those along the diagonal. We may therefore simplify this term as

	$\displaystyle\lambda_{j,k}^{(2)}\delta_{l}^{k}=$	$\displaystyle\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}$		(81)
		$\displaystyle-\lambda_{j,k}^{(1)}\left({\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right)+\sum_{n}\lambda_{j,l}^{(1)}\delta_{l}^{n}\left({\bm{\psi}_{j,n}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}\right).$		(81)

Collapsing the sum according to the $\delta_{l}^{n}$ we are left with

	$\displaystyle\lambda_{j,k}^{(2)}\delta_{l}^{k}=\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}$		(82)
	$\displaystyle+\left(\lambda_{j,l}^{(1)}-\lambda_{j,k}^{(1)}\right){\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}.$		(82)

This equation has two cases. First, if $l\neq k$ , we obtain the explicit expression for ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}$ that we have been looking for. That is

{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}=\frac{1}{\lambda_{j,l}^{(1)}-\lambda_{j,k}^{(1)}}\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}.

(83)

Taking the case $l=k$ yields the $\mathcal{O}(\varphi^{2})$ correction to the eigenvalues. That is,

\displaystyle\lambda_{j,k}^{(2)}=\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}{\bm{\psi}_{j,k}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}.

(84)

These second order corrections are shown in the dotted lines in Fig. 3.

Now that we have explicit expressions for both ${\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}$ and ${\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\psi}_{j,k}^{(1)}$ we can express the first order corrections to the eigenvectors of $\bm{\mathsf{A}}_{4}$ as we had outlined in equation 77.

	$\displaystyle\bm{\psi}_{j,k}^{(1)}=$	$\displaystyle-\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}\bm{\psi}_{m}^{(0)}$		(85)
		$\displaystyle+\sum_{l\neq k}^{d_{j}}\frac{1}{\lambda_{j,l}^{(1)}-\lambda_{j,k}^{(1)}}\sum_{m\neq j}^{M}\frac{{\bm{\psi}_{m}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{j,k}^{(0)}}{\lambda_{m}^{(0)}-\lambda_{j}^{(0)}}\left({\bm{\psi}_{j,l}^{(0)}}^{\dagger}\cdot\bm{\mathsf{A}}_{4}\bm{\psi}_{m}^{(0)}\right)\bm{\psi}_{j,l}^{(0)}.$		(85)

Factoring, this can be written more compactly as equation 55, which is the final result we use in Section 5.5.2.