\WarningFilter

latexA float is stuck \WarningFilternamerefThe definition of

On reduced modeling of the modulational dynamics in magnetohydrodynamics

S. Jin\aff1, 2 \corresp [email protected] I. Y. Dodin\aff1, 2 \aff1Department of Astrophysical Sciences, Princeton University, Princeton, New Jersey 08543, USA \aff2Princeton Plasma Physics Laboratory, Princeton, New Jersey 08540, USA

Abstract

This paper explores structure formation in two-dimensional magnetohydrodynamic (MHD) turbulence as a modulational instability (MI) of turbulent fluctuations. We focus on the early stages of structure formation and consider simple backgrounds that allow for a tractable model of the MI while retaining the full chain of modulational harmonics. This approach allows us to systematically examine the validity of popular closures such as the quasilinear approximation and other low-order truncations. We find that, although such simple closures can provide quantitatively accurate approximations of the MI growth rates in some regimes, they can fail to capture the modulational dynamics in adjacent regimes even qualitatively, falsely predicting MI when the system is actually stable. We find that this discrepancy is due to the excitation of propagating spectral waves (PSWs) which can ballistically transport energy along the modulational spectrum, unimpeded until dissipative scales, thereby breaking the feedback loops that would otherwise sustain MIs. PSWs can be self-maintained as global modes with real frequencies and drain energy from the primary structure at a constant rate until the primary structure is depleted. To describe these waves within a reduced model, we propose an approximate spectral closure that captures them and MIs on the same footing. We also find that introducing corrections to ideal MHD, conservative or dissipative, can suppress PSWs and reinstate the accuracy of the quasilinear approximation. In this sense, ideal MHD is a ‘singular’ system that is particularly sensitive to the accuracy of the closure within mean-field models.

1 Introduction

1.1 Motivation

Coherent-structure formation from turbulence is ubiquitous in nature, intrinsically compelling, and one of the precious few aspects of turbulence that are relatively yielding to analytical efforts (Hussain, 1986; Karimabadi et al., 2013; Smolyakov et al., 2000; Krashennikov et al., 2008). For magnetohydrodynamic (MHD) turbulence (Biskamp, 2003; Beresnyak, 2019; Schekochihin, 2022) and the turbulent-dynamo problem (Brandenburg et al., 2012; Tobias, 2021; Rincon et al., 2016), this has spawned a vast body of work known as mean-field electrodynamics, or mean-field dynamo theory (Rädler, 2007; Krause & Rädler, 1980; Moffatt, 1978; Brandenburg, 2018; Gruzinov & Diamond, 1994). In such approaches, the velocity and magnetic fields are split into fluctuations and mean, or average, fields (the definition of average depends on the problem); then, various closures are implemented to obtain the mean-field evolution in response to the fluctuations (Blackman & Field, 2002; Brandenburg & Subramanian, 2005b; Pouquet et al., 1976; Kraichnan, 1977; Nicklaus & Stix, 1988).

These closures ultimately rely on a truncation of a formally infinite chain of correlation functions of increasingly high order.¹¹1Here, we use the term ‘truncation’ to refer to low-order closures in general, as opposed to precisely the closure choice of setting all correlations to zero after some cutoff. The latter, which we will refer to as a simple truncation, is done in the quasilinear approximation discussed below but not, for example, in the various $\smash{\tau}$ approximations (Brandenburg & Subramanian, 2005b; Blackman & Field, 2002). This is a common approach to reduced modeling of nonlinear systems, and one can expect it to work when the fluctuations are small compared to the mean field. However, in turbulent structure formation, the usual roles of perturbation and background are switched, that is, mean fields emerge as small modulations of order-one correlation functions that characterize turbulent fluctuations. Hence, the question of the validity of low-order closures becomes nontrivial.

In this paper, we are concerned with the popular closure called the first-order smoothing (Krause & Rädler, 1980) or second-order correlation approximation (Rädler, 1982). It implies that the mean-field evolution is determined only by second-order correlations of the fluctuations, which, in a broader context, is also known as the quasilinear approximation (Dodin, 2022). On the one hand, one can expect its validity domain to be extremely limited. It can be rigorously justified only for small magnetic Reynolds numbers or short correlation times (Brandenburg & Subramanian, 2005a; Rincon, 2019), and comparisons with direct numerical simulations often corroborate this expectation (Pipin & Proctor, 2008; Schrinner et al., 2005). On the other hand, the quasilinear approximation endures in its popularity as a workhorse that is too convenient to cast aside in analytical calculations (Squire & Bhattacharjee, 2015; Lingam & Bhattacharjee, 2016; Masada & Sano, 2014; Gopalakrishnan & Singh, 2023). Furthermore, in some cases, quasilinear calculations have been found to produce good agreement with direct numerical simulations (DNS) even outside of their formal validity domain (Käpylä et al., 2006).²²2Confusingly, various analytical closures and DNS can conflict and agree in various combinations (Zhou & Blackman, 2021). But this issue is beyond the scope of the present paper. It is therefore important to understand when, and how exactly, the quasilinear approximation fails. Answering this question requires exploring a mean-field model that retains high-order correlations. That is what the present paper is about.

1.2 Outline

Here, we explore structure formation in MHD turbulence as a modulational instability (Zakharov & Ostrovsky, 2009) (MI) of the flow-velocity and magnetic-field fluctuations comprising the background turbulence. (For a recent overview of this approach in application to drift-wave and hydrodynamic turbulence, see Zhu & Dodin (2021); Tsiolis et al. (2020).) For simplicity, we restrict our consideration to two-dimensional (2-D) turbulence. Although bounded planar flows cannot sustain long-lasting magnetic fields in two dimensions (an anti-dynamo theorem by Zeldovich (1957)), 2-D turbulence nevertheless hosts rich physics and has long served as popular testing grounds for various aspects of MHD turbulence theory due to its (relative) tractability (Pouquet, 1978; Biskamp & Schwarz, 2001; Nazarenko, 2007; Giorgio et al., 2017; Biskamp et al., 1996). The goal of this paper is to complement those works with a detailed study of MIs using both analytical calculations and DNS.

Specifically, we study how energy is transferred from primary fluctuations to mean magnetic field and mean velocity during the early stages of structure formation, when these mean fields are weak and the primary structure can be treated as fixed. To analyze this process in detail, we assume a maximally reduced model of the turbulent fluctuations, namely, a spatially monochromatic primary structure. The simplicity of this model has been detracting attention from it in the literature in favor of more realistic turbulence settings, but there is more to it that meets the eye. Even with a monochromatic primary structure, the modulational dynamics remains remarkably intricate and, depending on the system parameters, can change drastically (figure 1) in ways that are not captured by the common closures or obvious from the standard eigenmode analysis (Friedberg, 1987). However, the simplicity of our model helps unravel this mystery.

We find that, when the primary structure truly experiences a MI, this MI can be described well with typical truncated models that neglect high-order correlations. However, already in adjacent regimes, such truncated models can fail spectacularly and produce false positives for instability. To study this process in detail, we propose an ‘extended’ quasilinear theory (XQLT) that treats the primary structure as fixed but includes the entire spectrum of modulational harmonics (as opposed to just the low-order harmonics, as usual). From XQLT, also corroborated by DNS, we find that the difference between said regimes is due to a fundamental difference in the structure of the modulational spectrum. For unstable modes, the spectrum is exponentially localised at low harmonic numbers, so truncated models are justified. But this localization does not always occur. At other parameters, modulational modes turn into constant-amplitude waves propagating down the spectrum, unimpeded until dissipative scales. These spectral waves are self-maintained as global modes with real frequencies and cause ballistic energy transport along the spectrum, breaking the feedback loops that could otherwise sustain MI. The ballistic transport drains energy from the primary structure at a constant rate until the primary structure is depleted. Notably, these effects are overlooked if one assumes from the start that the modulation spectrum is confined and that, accordingly, dissipation entirely vanishes in the limit of zero viscosity and resistivity.

A comment is due here regarding how the established truncated MHD closures fit into this picture. Given the indispensability of such simple closures, a higher-resolution understanding of their limitations can help guide and interpret their inevitable application. For example, we show that both conservative and dissipative corrections to ideal MHD tend to restore the validity of the quasilinear approximation. In this sense, ideal incompressible MHD is a ‘singular’ theory that is particularly sensitive to the accuracy of the closure within mean-field models. That said, it is an important conclusion of our work that, unless deviations from ideal incompressible MHD are substantial, changing the turbulence parameters even slightly can produce qualitative inaccuracies in an otherwise workable reduced model. Therefore, efforts to extrapolate closures or infer them from general considerations should be supplemented with first-principle calculations based on the underlying modulational dynamics. As a step in this direction, we propose a spectral closure that captures both the propagating spectral waves and MIs within our model. (But, of course, more work remains to be done for generic MHD turbulence.)

The paper is organised as follows. In section 2, we present the basic equations and derive the XQLT. In section 3, we examine the ability of truncated models of low-order to predict the MI rate. In section 4, we introduce spectral waves and describe their properties. In section 5, we propose a closure that can simultaneously capture MIs and spectral waves. In section 6, we demonstrate the impact of corrections to ideal MHD on the modulational dynamics. In section 7, we summarise our main results. Auxiliary calculations are presented in appendices.

Refer to caption — Figure 1: DNS of modulational dynamics seeded at $\smash{\tau=0}$ with random noise. (This figure is discussed in detail in later sections. For the notation, see sections 2.1 and 2.2.) Figures (a), (c), and (e) correspond to a modulationally unstable primary mode (17) with $\smash{\theta^{\pm}=\pm\upi/6}$ , and (b), (d), and (f) correspond to a modulationally stable primary mode with $\smash{\theta^{\pm}=\pm\upi/3}$ . Figures (a) and (b) show $\smash{z_{x}^{+}(t,y)}$ , (c) and (d) show $\smash{z_{y}^{+}(t,x)}$ , and (e) and (f) show the corresponding energy breakdown. Specifically, $\smash{\mathcal{E}_{v}}$ is the normalised kinetic energy (21c), $\smash{\mathcal{E}_{b}=E_{b}/E_{\text{tot}}}$ is the normalised magnetic energy (21d), $\smash{\mathcal{E}_{\text{mod}}}$ is the normalised modulational-mode energy (21b), and $\smash{\mathcal{E}_{\text{pri}}}$ is the normalised primary-mode energy (21a). The colour bar shows the field amplitudes normalised to $\smash{\mathcal{A}/p}$ . The spatial coordinates are normalised to $\smash{1/p}$ .

2 Extended quasilinear theory

2.1 Base model

2.1.1 Incompressible resistive MHD

As a base model, we assume incompressible resistive MHD with homogeneous mass density $\smash{\rho=\text{const}}$ . The corresponding governing equations are


	$\displaystyle\partial_{t}\boldsymbol{v}+(\boldsymbol{v}\cdot\nabla)\boldsymbol{v}=(\boldsymbol{b}\cdot\nabla)\boldsymbol{b}-\nabla P+\nu\nabla^{2}\boldsymbol{v},$	(1a)
	$\displaystyle\partial_{t}\boldsymbol{b}=\nabla\times(\boldsymbol{v}\times\boldsymbol{b})+\eta\nabla^{2}\boldsymbol{b}.$	(1b)
Here $\smash{\boldsymbol{v}}$ is the fluid velocity; $\smash{\boldsymbol{b}\doteq\boldsymbol{B}/\sqrt{4\upi\rho}}$ is the local Alfvén velocity (the symbol $\smash{\doteq}$ denotes definitions), i.e. rescaled magnetic field $\smash{\boldsymbol{B}}$ ; and $\smash{P\doteq(P_{\text{kin}}+B^{2}/8\upi)/\rho}$ is the normalised total pressure, with $\smash{P_{\text{kin}}}$ being the kinetic pressure. Also, $\smash{\nu}$ is the viscosity and $\smash{\eta}$ is the resistivity, both of which are assumed either constant or negligible (except at small enough scales). Due to incompressibility and magnetic Gauss’s law, one also has

	$\displaystyle\nabla\cdot\boldsymbol{v}=0,\qquad\nabla\cdot\boldsymbol{b}=0.$	(1c)

In order to put (1) in a more symmetric form, let us rewrite them in terms of the two Elsässer fields $\smash{\boldsymbol{z}^{\pm}}$ (Elsasser, 1950), which are also solenoidal:

\displaystyle\boldsymbol{z}^{\pm}\doteq\boldsymbol{v}\pm\boldsymbol{b},\qquad\nabla\cdot\boldsymbol{z}^{\pm}=0.

(2)

This leads to two coupled equations for $\smash{\boldsymbol{z}^{\pm}}$ :

\displaystyle\partial_{t}\boldsymbol{z}^{\pm}=-(\boldsymbol{z}^{\mp}\cdot\nabla)\boldsymbol{z}^{\pm}-\nabla P+\nu_{+}\nabla^{2}\boldsymbol{z}^{\pm}+\nu_{-}\nabla^{2}\boldsymbol{z}^{\mp},

(3)

where $\smash{\nu_{+}\doteq(\nu+\eta)/2}$ and $\smash{\nu_{-}\doteq(\nu-\eta)/2}$ . The normalised total pressure $\smash{P}$ can be found as follows. By taking the divergence of (1a) and using (2), one obtains

\displaystyle\nabla^{2}P=-\nabla\cdot[(\boldsymbol{z}^{\mp}\cdot\nabla)\boldsymbol{z}^{\pm}].

(4)

Let us introduce the wavevector operator $\smash{\hat{\boldsymbol{k}}\doteq-\mathrm{i}\nabla}$ , so $\smash{\hat{k}^{2}\doteq\smash{\hat{\boldsymbol{k}}}^{2}=-\nabla^{2}}$ . Let us also assume some appropriate (say, periodic) boundary conditions. Then, (4) yields

\displaystyle P=-\hat{k}^{-2}\hat{\boldsymbol{k}}\cdot[(\boldsymbol{z}^{\mp}\cdot\hat{\boldsymbol{k}})\boldsymbol{z}^{\pm}]+\text{const},

(5)

whence $\smash{\nabla P}$ in (3) can be expressed through $\smash{\boldsymbol{z}^{\pm}}$ . Alternatively, $\smash{\nabla P}$ can be eliminated from (3) by taking the curl of this equation and rewriting the result in terms of the Elsässer vorticities

\displaystyle\boldsymbol{w}^{\pm}\doteq\nabla\times\boldsymbol{z}^{\pm}.

(6)

Indeed, due to (2), one can express $\smash{\boldsymbol{z}^{\pm}}$ using a vector potential $\smash{\boldsymbol{a}^{\pm}}$ such that $\smash{\boldsymbol{z}^{\pm}=\nabla\times\boldsymbol{a}^{\pm}}$ . Let us assume the gauge such that $\smash{\nabla\cdot\boldsymbol{a}^{\pm}=0}$ . Then,

\displaystyle\boldsymbol{w}^{\pm}=\nabla\times(\nabla\times\boldsymbol{a}^{\pm})=-\nabla^{2}\boldsymbol{a}^{\pm}\equiv\hat{k}^{2}\boldsymbol{a}^{\pm},

(7)

whence

\displaystyle\boldsymbol{z}^{\pm}=\mathrm{i}\hat{k}^{-2}(\hat{\boldsymbol{k}}\times\boldsymbol{w}^{\pm}).

(8)

(Here, we assume that $\smash{\boldsymbol{z}^{\pm}}$ have zero spatial average, so $\smash{\hat{k}^{2}}$ is invertible.) Then, (3) can be expressed through $\smash{\boldsymbol{w}^{\pm}}$ alone:

\displaystyle\partial_{t}\boldsymbol{w}^{\pm}=-\hat{\boldsymbol{k}}\times\{[(\hat{\boldsymbol{k}}\times\hat{k}^{-2}\boldsymbol{w}^{\mp})\cdot\hat{\boldsymbol{k}}](\hat{\boldsymbol{k}}\times\hat{k}^{-2}\boldsymbol{w}^{\pm})\}-\hat{k}^{2}(\nu_{+}\boldsymbol{w}^{\pm}+\nu_{-}\boldsymbol{w}^{\mp}).

(9)

2.1.2 2-D collisionless model

For simplicity, below we adopt a 2-D model, meaning that $\smash{\boldsymbol{z}^{\pm}}$ lie in the $\smash{(x,y)}$ plane and $\smash{\partial_{z}=0}$ . In this case, the only potentially nonzero component of $\smash{\boldsymbol{w}^{\pm}}$ is the $\smash{z}$ -component, $\smash{w^{\pm}\doteq(\nabla\times\boldsymbol{z}^{\pm})_{z}}$ . We also forgo an explicit treatment of the viscosities $\smash{\nu_{\pm}}$ below. (However small, though, these viscosities remain nonnegligible in that they determine absorbing boundary conditions at infinity in the spectral space; see section 4.) Then, the vector equation (9) can be replaced with a scalar equation for $\smash{w^{\pm}}$ :

\displaystyle\partial_{t}w^{\pm}=\epsilon_{lmz}\Bigg{[}\left(\frac{\hat{k}_{m}}{\hat{k}^{2}}\,w^{\mp}\right)\left(\hat{k}_{l}w^{\pm}\right)-\left(\frac{\hat{k}_{n}\hat{k}_{l}}{\hat{k}^{2}}\,w^{\mp}\right)\left(\frac{\hat{k}_{n}\hat{k}_{m}}{\hat{k}^{2}}\,w^{\pm}\right)\Bigg{]},

(10)

where $\smash{\epsilon_{lmz}}$ is the Levi–Civita symbol with the third index fixed to $\smash{z}$ . Note that the right-hand side of (10) vanishes whenever $\smash{w^{+}}$ or $\smash{w^{-}}$ is zero. This means that any stationary $\smash{w^{+}}$ is a solution as long as $\smash{w^{-}}$ is zero and vice versa.

2.1.3 Fourier representation

Assuming the Fourier representation

\displaystyle w^{\pm}=\sum_{\boldsymbol{k}}w_{\boldsymbol{k}}^{\pm}(t)\mathrm{e}^{\mathrm{i}\boldsymbol{k}\cdot\boldsymbol{x}},

(11)

where $\smash{\boldsymbol{x}\doteq(x,y)}$ , one arrives at the following equation for the Fourier coefficients $\smash{w_{\boldsymbol{k}}^{\pm}}$ :

\displaystyle\partial_{t}w^{\pm}_{\boldsymbol{k}}=\sum_{\boldsymbol{k}_{1},\boldsymbol{k}_{2}}T(\boldsymbol{k}_{1},\boldsymbol{k}_{2})w^{\mp}_{\boldsymbol{k}_{1}}w^{\pm}_{\boldsymbol{k}_{2}}\delta_{\boldsymbol{k},\boldsymbol{k}_{2}+\boldsymbol{k}_{1}},

(12)

where the dot is the time derivative, $\smash{\delta}$ is the Kronecker delta, $\smash{T(\boldsymbol{k}_{1},\boldsymbol{k}_{2})}$ are the coupling coefficients given by

\displaystyle T(\boldsymbol{k}_{1},\boldsymbol{k}_{2})=[\boldsymbol{e}_{z}\cdot(\boldsymbol{k}_{2}\times\boldsymbol{k}_{1})]\,\frac{(\boldsymbol{k}_{2}+\boldsymbol{k}_{1})\cdot\boldsymbol{k}_{2}}{k_{1}^{2}k_{2}^{2}},

(13)

and $\smash{\boldsymbol{e}_{j}}$ is the unit vector along the $\smash{j}$ th axis. Accordingly, the kinetic and magnetic energy are given by


$\displaystyle E_{v}$	$\displaystyle=\frac{\rho}{8}\,\sum_{\boldsymbol{k}}\frac{1}{k^{2}}\,\|w_{\boldsymbol{k}}^{+}+w_{\boldsymbol{k}}^{-}\|^{2}\equiv\sum_{\boldsymbol{k}}E_{{v},\boldsymbol{k}},$	(14a)
$\displaystyle E_{b}$	$\displaystyle=\frac{\rho}{8}\,\sum_{\boldsymbol{k}}\frac{1}{k^{2}}\,\|w_{\boldsymbol{k}}^{+}-w_{\boldsymbol{k}}^{-}\|^{2}\equiv\sum_{\boldsymbol{k}}E_{{b},\boldsymbol{k}},$	(14b)

where $\smash{E_{{v},\boldsymbol{k}}}$ , and $\smash{E_{{b},\boldsymbol{k}}}$ are the corresponding spectral energy densities. Then, the total energy density is

\displaystyle E=\frac{\rho}{4}\,\sum_{\boldsymbol{k}}\frac{1}{k^{2}}\,(|w_{\boldsymbol{k}}^{+}|^{2}+|w_{\boldsymbol{k}}^{-}|^{2})\equiv\sum_{\boldsymbol{k}}E_{\boldsymbol{k}},

(15)

where $\smash{E_{\boldsymbol{k}}}$ is the spectral total-energy density.

2.2 Harmonic coupling mediated by a primary structure

2.2.1 Equation for the modulational spectrum

Let us explore modulational stability of a Fourier harmonic with a given wavevector $\smash{\boldsymbol{k}=\boldsymbol{p}}$ and the corresponding order-one Fourier amplitudes $\smash{w_{\boldsymbol{p}}^{\pm}}$ . We will call it the primary harmonic or, if multiple $\smash{\boldsymbol{p}}$ s are present, primary structure. Let us choose axes such that $\smash{\boldsymbol{p}=p\boldsymbol{e}_{y}}$ and assume a perturbation with a wavevector $\smash{\boldsymbol{q}}$ . As can be seen from (13), nontrivial coupling requires a component of $\smash{\boldsymbol{q}}$ perpendicular to $\smash{\boldsymbol{p}}$ , and the component of $\smash{\boldsymbol{q}}$ parallel to $\smash{\boldsymbol{p}}$ does not qualitatively affect the modulational dynamics.³³3As shown below, the modulational dynamics is determined mostly by the asymptotic form of the coefficients $\smash{\alpha_{n}^{\pm}}$ and $\smash{\beta_{n}^{\pm}}$ (introduced in section 2.2.2) at large $\smash{n}$ , and this form is insensitive to the component of $\smash{\boldsymbol{q}}$ parallel to $\smash{\boldsymbol{p}}$ . For simplicity then, let us assume $\smash{\boldsymbol{q}=q\boldsymbol{e}_{x}}$ . Let us also assume $\smash{w_{\boldsymbol{p}}^{\pm}=\mathcal{O}(1)}$ , $\smash{w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}=\mathcal{O}(\epsilon)}$ , and same for any other $\smash{w_{\boldsymbol{k}\neq\boldsymbol{p}}^{\pm}}$ , where $\smash{\epsilon\ll 1}$ is a small parameter. (Remember that $\smash{\mathcal{O}(\epsilon)}$ means, loosely speaking, ‘scales as $\smash{\epsilon}$ or smaller’ but not necessarily ‘of order $\smash{\epsilon}$ ’.) Then, by linearizing (12) in $\smash{\epsilon}$ , one obtains the following chain of equations:

	$\displaystyle\partial_{t}w^{\pm}_{\boldsymbol{k}}\approx\,\,$	$\displaystyle T(\boldsymbol{p},\boldsymbol{k}_{-})w^{\mp}_{\boldsymbol{p}}w^{\pm}_{\boldsymbol{k}_{-}}+T(-\boldsymbol{p},\boldsymbol{k}_{+})w^{\mp}_{-\boldsymbol{p}}w^{\pm}_{\boldsymbol{k}_{+}}$
	$\displaystyle+$	$\displaystyle T(\boldsymbol{k}_{-},\boldsymbol{p})w^{\mp}_{\boldsymbol{k}_{-}}w^{\pm}_{\boldsymbol{p}}+T(\boldsymbol{k}_{+},-\boldsymbol{p})w^{\mp}_{\boldsymbol{k}_{+}}w^{\pm}_{-\boldsymbol{p}},$		(16)

where $\smash{\boldsymbol{k}_{\pm}\doteq\boldsymbol{k}\pm\boldsymbol{p}}$ and $\smash{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}$ with integer $\smash{n}$ . For all other $\smash{\boldsymbol{k}}$ , one obtains $\smash{\partial_{t}w^{\pm}_{\boldsymbol{k}}\approx 0}$ . In particular, this means that $\smash{w^{\pm}_{\boldsymbol{p}}}$ can be considered fixed; i.e. $\smash{A^{\pm}\doteq w^{\pm}_{\boldsymbol{p}}\approx\text{const}}$ . In terms of the Elsässer fields, this corresponds to a primary structure

\displaystyle\boldsymbol{z}^{\pm}=-\frac{2}{p}\,\mathcal{A}^{\pm}\sin(py+\theta^{\pm})\boldsymbol{e}_{x},

(17)

where $\smash{\mathcal{A^{\pm}}\doteq|A^{\pm}|}$ and $\smash{\theta^{\pm}\doteq\arg A^{\pm}}$ .

For the modulation energy density and the primary-wave energy density,


	$\displaystyle E_{\text{mod}}\doteq\sum_{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}E_{\boldsymbol{k}}=\sum_{n}\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{\|w_{\boldsymbol{q}+n\boldsymbol{p}}^{\sigma}\|^{2}}{q^{2}+n^{2}p^{2}},$		(18a)
	$\displaystyle E_{\text{pri}}\doteq E_{\boldsymbol{p}}=\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{\|w_{\boldsymbol{p}}^{\sigma}\|^{2}}{p^{2}},$		(18b)

one has


	$\displaystyle\frac{\mathrm{d}E_{\text{mod}}}{\mathrm{d}t}=\sum_{n}\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{w_{\boldsymbol{q}+n\boldsymbol{p}}^{\sigma*}\,\partial_{t}w_{\boldsymbol{q}+n\boldsymbol{p}}^{\sigma}}{q^{2}+n^{2}p^{2}}+\text{c.c.},$		(19a)
	$\displaystyle\frac{\mathrm{d}E_{\text{pri}}}{\mathrm{d}t}=\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{w_{\boldsymbol{p}}^{\sigma*}\dot{w}_{\boldsymbol{p}}^{\sigma}}{p^{2}}+\text{c.c.},$		(19b)

where ‘c.c.’ stands for ‘complex conjugate’. Using

\displaystyle\partial_{t}w_{\boldsymbol{p}}^{\sigma}=\sum_{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}T(\boldsymbol{k},\boldsymbol{p-k})w_{\boldsymbol{k}}^{-\sigma}w_{\boldsymbol{p-k}}^{\sigma},

(20)

for $\smash{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}$ , one finds that our approximate equations conserve the total energy density $\smash{E_{\text{tot}}=E_{\text{pri}}+E_{\text{mod}}}$ exactly within this approximation. (For brevity, from now on we will refer to the energy densities simply as energies.) We will assume that the initial modulational energy is negligible, so $\smash{E_{\text{tot}}=\rho\mathcal{A}^{2}/4p^{2}}$ , where $\smash{\mathcal{A}^{2}\doteq(\mathcal{A}^{+})^{2}+(\mathcal{A}^{-})^{2}}$ , and we will also use the following dimensionless energies:


	$\displaystyle\mathcal{E}_{\text{pri}}=E_{\text{pri}}/E_{\text{tot}},$		(21a)
	$\displaystyle\mathcal{E}_{\text{mod}}=E_{\text{mod}}/E_{\text{tot}},$		(21b)
	$\displaystyle\mathcal{E}_{v}=E_{v}/E_{\text{tot}},$		(21c)
	$\displaystyle\mathcal{E}_{v}=E_{b}/E_{\text{tot}}.$		(21d)

2.2.2 Dimensionless equations

In order to reduce the number of free parameters, let us perform a variable transformation $\smash{w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}\mapsto\psi_{n}^{\pm}}$ :

\displaystyle w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}=\sqrt{\frac{\mathcal{A}^{3}(q^{2}+n^{2}p^{2})}{\mathcal{A}^{\mp}p^{2}}}\exp\left(\mathrm{i}n\,\frac{\theta^{+}+\theta^{-}}{2}\right)\psi_{n}^{\pm},

(22)

where $\smash{\psi_{n}^{\pm}=\mathcal{O}(\epsilon)}$ . Then, (16) becomes

	$\displaystyle\partial_{t}\psi_{n}^{\pm}=\quad$	$\displaystyle\mathcal{A}^{\mp}\mathrm{e}^{\mp\mathrm{i}\theta}\alpha_{n}^{-}\psi_{n-1}^{\pm}+\sqrt{\mathcal{A}^{+}\mathcal{A}^{-}}\mathrm{e}^{\pm\mathrm{i}\theta}\beta_{n}^{-}\psi_{n-1}^{\mp}$
	$\displaystyle+$	$\displaystyle\mathcal{A^{\mp}}\mathrm{e}^{\pm\mathrm{i}\theta}\alpha_{n}^{+}\psi_{n+1}^{\pm}+\sqrt{\mathcal{A}^{+}\mathcal{A}^{-}}\mathrm{e}^{\mp\mathrm{i}\theta}\beta_{n}^{+}\psi_{n+1}^{\mp},$		(23)

where $\smash{\theta\doteq(\theta^{+}-\theta^{-})/2}$ and $\smash{\alpha_{n}^{\pm}}$ and $\smash{\beta_{n}^{\pm}}$ are given by


	$\displaystyle\alpha_{n}^{\pm}\doteq\mp r\,\frac{r^{2}+n(n\pm 1)}{\sqrt{(r^{2}+n^{2})(r^{2}+(n\pm 1)^{2}})},$		(24a)
	$\displaystyle\beta_{n}^{\pm}\doteq-r\,\frac{n}{\sqrt{(r^{2}+n^{2})(r^{2}+(n\pm 1)^{2})}}.$		(24b)

Note that these coefficients depend only on the ratio $\smash{r\doteq q/p}$ rather than on $\smash{q}$ and $\smash{p}$ separately. From (2.2.2), one can also see that the individual phases $\smash{\theta^{+}}$ and $\smash{\theta^{-}}$ per se are, expectedly, unimportant and the dynamics is affected by these phases only in the combination $\smash{\theta^{+}-\theta^{-}}$ .

It is also convenient for our purposes to introduce

\displaystyle\tau\doteq\mathcal{A}\,t

(25)

as the new dimensionless time, as well as $\smash{\phi\doteq\arctan(\mathcal{A}^{-}/\mathcal{A}^{+})\in[0,\upi/2]}$ , so that

\displaystyle\mathcal{A}^{+}=\mathcal{A}\cos\phi,\qquad\mathcal{A}^{-}=\mathcal{A}\sin\phi.

(26)

In particular, $\smash{\phi=0}$ and $\smash{\phi=\upi/2}$ correspond to structures consisting only of $\smash{w_{\boldsymbol{p}}^{+}}$ and $\smash{w_{\boldsymbol{p}}^{-}}$ , respectively, and $\smash{\phi=\upi/4}$ corresponds to a ‘balanced’ background with $\smash{\mathcal{A}^{+}=\mathcal{A}^{-}}$ . For simplicity, $\smash{\phi=\upi/4}$ will be assumed from now on throughout the paper (except where specified otherwise). In this case, one has

\displaystyle\frac{E_{\text{pri},b}}{E_{\text{pri},v}}=\frac{|w_{\boldsymbol{p}}^{+}-w_{\boldsymbol{p}}^{-}|^{2}}{|w_{\boldsymbol{p}}^{+}+w_{\boldsymbol{p}}^{-}|^{2}}=\tan^{2}\theta,

(27)

where $\smash{E_{\text{pri},v}}$ and $\smash{E_{\text{pri},b}}$ are the kinetic and magnetic energy densities in the primary mode respectively. Hence, $\smash{\theta}$ can be understood as the parameter that determines the relative weights of the kinetic and magnetic-field energy in the primary-mode energy. In particular, $\smash{\theta=0}$ corresponds to a primary structure that involves only velocity perturbations, while $\smash{\theta=\upi/2}$ corresponds to a primary structure that involves only magnetic-field perturbations.

Using this notation, one can express (2.2.2) in the following compact form:

\displaystyle\dot{\boldsymbol{\psi}}_{n}=\sum_{\sigma=\pm 1}\boldsymbol{G}_{n}^{\sigma}\boldsymbol{\psi}_{n+\sigma}

(28)

(the dot denotes $\smash{\partial_{\tau}}$ ), or even more compactly as

\displaystyle\dot{\boldsymbol{\psi}}=\boldsymbol{M}\boldsymbol{\psi}.

(29)

Here, $\smash{\boldsymbol{\psi}_{n}\doteq(\psi_{n}^{-},\psi_{n}^{+})^{\intercal}}$ is a two-component column vector (the symbol ^⊺ denotes transposition), $\smash{\boldsymbol{\psi}\doteq(\ldots,\boldsymbol{\psi}_{-1},\boldsymbol{\psi}_{0},\boldsymbol{\psi}_{1}\ldots)^{\intercal}}$ is an infinite-dimensional block vector consisting of $\smash{\boldsymbol{\psi}_{n}}$ , the matrices $\smash{\boldsymbol{G}_{n}^{\sigma}}$ are given by

\displaystyle\boldsymbol{G}_{n}^{\sigma}\doteq\begin{pmatrix}\alpha_{n}^{\sigma}\cos\phi\,\mathrm{e}^{-\mathrm{i}\sigma\theta}&\beta_{n}^{\sigma}\sqrt{\cos\phi\sin\phi}\,\mathrm{e}^{\mathrm{i}\sigma\theta}\\ \beta_{n}^{\sigma}\sqrt{\cos\phi\sin\phi}\,\mathrm{e}^{-\mathrm{i}\sigma\theta}&\alpha_{n}^{\sigma}\sin\phi\,\mathrm{e}^{\mathrm{i}\sigma\theta}\end{pmatrix},

(30)

and $\smash{\boldsymbol{M}}$ is the following block matrix:

\boldsymbol{M}\doteq\medsize\begin{pmatrix}\ddots&&&\vdots&&&\iddots\\ &0&\boldsymbol{G}_{-2}^{+}&0&0&0&\\ &\boldsymbol{G}_{-1}^{-}&0&\boldsymbol{G}_{-1}^{+}&0&0&\\ \ldots&0&\boldsymbol{G}_{0}^{-}&0&\boldsymbol{G}_{0}^{+}&0&\ldots\\ &0&0&\boldsymbol{G}_{1}^{-}&0&\boldsymbol{G}_{1}^{+}\\ &0&0&0&\boldsymbol{G}_{2}^{-}&0&\\ \iddots&&&\vdots&&&\ddots\end{pmatrix}.

It is readily seen then that the system’s dynamics is controlled only by the dimensionless parameters $\smash{r}$ , $\smash{\theta}$ , and $\smash{\phi}$ , while $\smash{\mathcal{A}}$ and $\smash{p}$ determine only its characteristic temporal and spatial scales, respectively.

Equations (28) and (29) are similar to the equations that describe a one-dimensional chain of coupled linear oscillators. The $\smash{n}$ th elementary cell of the chain consists of two different oscillators characterised by $\smash{\psi_{n}^{+}}$ and $\smash{\psi_{n}^{-}}$ , respectively. Similar equations emerge in studies of phase-space turbulence for the coupling between Hermite moments of the particle phase-space probability distribution (Hammett et al., 1993; Nastac et al., 2023; Adkins & Schekochihin, 2018; Parker, 2016; Kanekar et al., 2015). Throughout this work, we use (28) and (29) to study the nature of collective oscillations in the chain of modulational harmonics, which can be understood as Floquet modes of the linearised system. Unstable oscillations of this system (i.e. MIs) lead to structure formation on top of the primary structure.

One can also consider (29) as a Schrödinger equation with a Hamiltonian $\smash{\mathrm{i}\boldsymbol{M}}$ . This Hamiltonian is not Hermitian, because modulations are parametrically coupled with the primary mode (through which energy can be either gained or lost) and dissipate at $\smash{n\to\infty}$ . At the same time, this Hamiltonian is invariant under the time-reversal transformation ( $\smash{\mathrm{i}\to-\mathrm{i}}$ ) and the parity transformation in the spectral space ( $\smash{n\rightarrow-n}$ , $\smash{\sigma\rightarrow-\sigma}$ ). This makes the system (28) $\smash{\mathcal{PT}}$ -symmetric (Bender, 2005; Bender et al., 2019). Depending on the balance of sources and sinks, such systems can support modes with entirely real frequencies (unbroken $\smash{\mathcal{PT}}$ symmetry) and pairs of modes whose frequencies are mutually complex-conjugate (broken $\smash{\mathcal{PT}}$ symmetry). This will be discussed further in section 4.

As long as the underlying ordering assumption [ $\smash{|w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}|/|w_{\boldsymbol{p}}^{\pm}|\sim\mathcal{O}(\epsilon)}$ ] holds, XQLT (28) provides an excellent approximation of the nonlinear modulational dynamics of (3) (figure 2). However, practical applications of XQLT require truncations of this system. In the remainder of this work, we discuss to what extent such truncations can be adequate and compare our analytical results to DNS of (28), in lieu of the full (3).

3 Truncated models

3.1 Basic equations

A common way to truncate a problem like (29) is to postulate that all $\smash{\psi_{n}}$ with $\smash{|n|>N}$ are negligible. One can also understand this as imposing reflective boundary conditions in the oscillator chain $\smash{\{\boldsymbol{\psi}_{n}^{\sigma}\}}$ :

\displaystyle\boldsymbol{\psi}_{N+1}^{\sigma}=\boldsymbol{\psi}_{-(N+1)}^{\sigma}=0.

(31)

Retained in this case are harmonics with wavevectors $\smash{\boldsymbol{p}}$ and $\smash{\boldsymbol{q}+n\boldsymbol{p}}$ , with $n=0,\pm 1,\ldots,\pm N$ . This means that the resulting system has $\smash{Q=1+(1+2N)}$ degrees of freedom. We call this procedure $\smash{Q}$ -mode truncation ( $\smash{Q}$ MT). The corresponding truncation of $\smash{\boldsymbol{\psi}}$ will be denoted as $\smash{\boldsymbol{\psi}_{Q\text{MT}}}$ , and the corresponding truncation of $\smash{\boldsymbol{M}}$ will be denoted as $\smash{\boldsymbol{M}_{Q\text{MT}}}$ , so (29) becomes

\displaystyle\dot{\boldsymbol{\psi}}_{Q\text{MT}}=\boldsymbol{M}_{Q\text{MT}}\boldsymbol{\psi}_{Q\text{MT}}.

(32)

Because of (31), eigenmodes of this system can be understood as standing waves in the oscillator chain $\smash{\{\boldsymbol{\psi}_{n}^{\sigma}\}}$ and have the form $\smash{\boldsymbol{\psi}_{Q\text{MT}}=\boldsymbol{Y}\exp(-\mathrm{i}\omega\tau)}$ , where $\smash{\omega}$ are global frequencies and $\smash{\boldsymbol{Y}}$ are constant ‘polarization vectors’. According to (29), these vectors satisfy $\smash{\boldsymbol{D}\boldsymbol{Y}=0}$ , where $\smash{\boldsymbol{D}=\mathrm{i}\omega\mathbb{1}+\boldsymbol{M}_{Q\text{MT}}}$ and $\smash{\mathbb{1}}$ is a unit $\smash{Q\times Q}$ matrix. Then, $\smash{\omega}$ can be found from

\displaystyle\det\boldsymbol{D}=0

(33)

and MIs correspond to modes with $\smash{\operatorname{Im}\omega>0}$ .

The success of this approach hinges on the assumption that higher harmonics truly remain negligible. In other words, a $\smash{Q}$ MT should be able to adequately describe the evolution of a modulational mode if its eigenvector $\smash{\boldsymbol{\psi}_{Q\text{MT}}}$ is heavily weighted towards the low-order harmonics. Below, we explore to what extent this assumption is satisfied in various regimes.

3.2 Growth rates

The lowest-order $\smash{Q}$ MT is 4MT, which corresponds to $\smash{N=1}$ . Within this model, the primary harmonic with $\smash{\boldsymbol{k}=\boldsymbol{p}}$ is assumed to interact with the modulational mode at $\smash{\boldsymbol{k}=\boldsymbol{q}}$ only through two sidebands $\smash{\boldsymbol{k}=\boldsymbol{q}\pm\boldsymbol{p}}$ ; then, $\smash{\boldsymbol{\psi}_{\text{4MT}}=(\boldsymbol{\psi}_{-1},\boldsymbol{\psi}_{0},\boldsymbol{\psi}_{1})^{\intercal}}$ and

\displaystyle\boldsymbol{M}_{\text{4MT}}=\medsize\begin{pmatrix}0&\boldsymbol{G}_{-1}^{+}&0\\ \boldsymbol{G}_{0}^{-}&0&\boldsymbol{G}_{0}^{+}\\ 0&\boldsymbol{G}_{1}^{-}&0\end{pmatrix}.

(34)

(This model can be understood as quasilinear, because, although the modulational dynamics remains nonlinear, the second- and higher-order harmonics of the perturbation are neglected.) The 4MT equations yield

\displaystyle\omega^{2}=\frac{r^{4}}{1+r^{2}}\left(1\pm\sqrt{1-(1-r^{-4}\cos^{2}2\theta)\sin^{2}2\phi}\right).

(35)

[Note that these are normalised frequencies corresponding to the normalised time $\smash{\tau}$ . To obtain the actual physical frequencies, one must multiply the right-hand side of (35) by $\smash{\mathcal{A}}$ .] Assuming $\smash{\phi=\upi/4}$ , one finds that two out of four branches of this dispersion relation are unstable at $\smash{|r|<1}$ . The corresponding growth rate $\smash{\Gamma\doteq\operatorname{Im}\omega}$ is maximised at $\smash{\theta=0}$ and $\smash{\theta=\upi/2}$ at the value

\displaystyle\Gamma=r\sqrt{\frac{1-r^{2}}{1+r^{2}}}

(36)

(from now on, we assume $\smash{r>0}$ for clarity of notation), and the corresponding polarization vectors are as follows:

\displaystyle\boldsymbol{Y}_{0}=\begin{cases}(1,+1)^{\intercal},&\theta=0,\\ (1,-1)^{\intercal},&\theta=\upi/2.\\ \end{cases}

(37)

The case $\smash{\theta=0}$ can be recognised as a purely hydrodynamic Kelvin–Helmholtz instability (KHI), i.e. one that does not involve magnetic field. The case $\smash{0<\theta<\upi/4}$ can be understood as the MHD generalization of the KHI.

Potentially more accurate is a model with $\smash{N=2}$ , or 6MT, which accounts for harmonics with $\smash{\boldsymbol{k}=\boldsymbol{q}\pm 2\boldsymbol{p}}$ ; then, $\smash{\boldsymbol{\psi}_{\text{4MT}}=(\boldsymbol{\psi}_{-2},\boldsymbol{\psi}_{-1},\boldsymbol{\psi}_{0},\boldsymbol{\psi}_{1},\boldsymbol{\psi}_{2})^{\intercal}}$ and

\displaystyle\boldsymbol{M}_{\text{6MT}}=\medsize\begin{pmatrix}0&\boldsymbol{G}_{-2}^{+}&0&0&0\\ \boldsymbol{G}_{-1}^{-}&0&\boldsymbol{G}_{-1}^{+}&0&0\\ 0&\boldsymbol{G}_{0}^{-}&0&\boldsymbol{G}_{0}^{+}&0\\ 0&0&\boldsymbol{G}_{1}^{-}&0&\boldsymbol{G}_{1}^{+}\\ 0&0&0&\boldsymbol{G}_{2}^{-}&0\end{pmatrix}.

(38)

One can derive an analytical expression for $\smash{\omega}$ from here just like we derived (35) from (34). However, this expression is cumbersome and not particularly instructive, so it is not presented explicitly. Truncations with larger $\smash{Q}$ , albeit also explored by us, will not be presented either for the same reason.

Figure 3 shows a comparison of the 4MT and 6MT predictions and the MI growth rates inferred numerically from nonlinear DNS of (9). Note that the truncated models quantitatively agree with each other and with the DNS at some parameters but drastically disagree at other parameters. In particular, the 4MT predicts that $\smash{\Gamma}$ is an even function of $\smash{\upi/2-\theta}$ , while the 6MT predicts that this is not the case. Furthermore, the DNS predicts that, for example, at $\smash{r=0.5}$ , the system is stable at all $\smash{\theta>\upi/2}$ [figure 3(b)]. Let us discuss this in detail.

As expected from the analytical formula (36) based on the 4MT, and as we have also found numerically, MI is typically suppressed at $\smash{r\gtrsim 1}$ ; hence, below we focus on the regime $\smash{r\in(0,1)}$ . A comparison of the growth rate predicted by nonlinear DNS and truncated models in this range is presented figure 5. The corresponding primary modes with $\smash{\theta\lesssim\upi/4}$ are modulationally unstable, and the growth rates predicted by 4MT and 6MT are in good agreement with DNS. Such primary structures are dominated by the velocity field, $\smash{\mathcal{E}_{\text{pri},v}>\mathcal{E}_{\text{pri},b}}$ [see (27)], and thus will be called $\smash{\boldsymbol{v}}$ -dominated primary modes (VDPMs). At $\smash{\theta\gtrsim\upi/4}$ , primary structures are dominated by the magnetic field, $\smash{\mathcal{E}_{\text{pri},b}>\mathcal{E}_{\text{pri},v}}$ , and thus will be called $\smash{\boldsymbol{b}}$ -dominated primary modes (BDPMs). As seen in figure 5, BDPMs are modulationally stable, but truncated models predict otherwise. Figures 4(a) and (b) show the discrepancy between the predictions of nonlinear DNS, 4MT, and 6MT for representative parameters, specifically, $\smash{\theta\in(0,\upi/2)}$ and $\smash{\phi=\upi/4}$ . (Stability is primarily determined by $\smash{\theta}$ and $\smash{r}$ . Deviations of $\smash{\phi}$ from $\smash{\upi/4}$ result only in quantitative adjustments, unless $\smash{\phi}$ is close to $\smash{0}$ or $\smash{\upi/2}$ .) In other words, truncated modes tend to overestimate the growth rate and systematically produce false positives for instability. Figure 5 shows that, although the magnitude of disagreement decreases with $\smash{N}$ , the systematic overprediction of instability persists.

The reason for this overprediction can be understood by analyzing the eigenmode structure. (See also appendix A for an alternative explanation.) As to be discussed in section 4, unstable modes are evanescent spectral waves, whose $\smash{|Y_{n}|}$ exponentially decrease with $\smash{|n|}$ [figure 6(a)]:

\displaystyle|Y_{n}|\propto\exp(-|n|/l).

(39)

(We assume $\smash{\phi=\upi/4}$ for simplicity, so $\smash{|Y_{n}^{+}|=|Y_{n}^{-}|}$ and the upper index in $\smash{|Y_{n}^{\sigma}|}$ can be omitted.) That is why truncated models correctly predict MI for modes that are actually unstable. The spectral scale $\smash{l}$ depends on the system parameters, and the smaller $\smash{l}$ the more accurate $\smash{Q}$ MT is. At some parameters, though, particularly when $\smash{\theta\rightarrow\upi/4}$ , $\smash{l}$ becomes large or even infinite, such that $\smash{|Y_{n}|}$ asymptotes to a nonzero constant at $\smash{|n|\to\infty}$ [figure 6(b)]. In this case, $\smash{Q}$ MT is bound to fail. The corresponding modes are propagating spectral waves (PSWs). While they also receive energy from the primary structure at low $\smash{n}$ , PSWs transport this energy along the spectrum to $\smash{|n|\to\infty}$ . There, the energy is unavoidably dissipated by viscosity or resistivity, however small those are. This dissipation makes PSWs more stable than any $\smash{Q}$ MT would predict, because $\smash{Q}$ MTs assume that the mode energy forever resides at small $\smash{n}$ , where dissipation is small or zero. Below, we discuss these effects in more detail.

4 Spectral waves

4.1 Dynamics at large $|n|$

First, let us consider spectral waves at $\smash{|n|\gg 1}$ . In this limit, one has $\smash{\alpha_{n}^{\pm}\rightarrow\mp r}$ and $\smash{\beta_{n}^{\pm}\rightarrow 0}$ , so (29) yields two decoupled wave equations for $\smash{\psi_{n}^{+}}$ and $\smash{\psi_{n}^{-}}$ :


$\displaystyle\dot{\psi}_{n}^{-}$	$\displaystyle=r\cos\phi\big{(}\mathrm{e}^{\mathrm{i}\theta}\psi_{n-1}^{-}-\mathrm{e}^{-\mathrm{i}\theta}\psi_{n+1}^{-}\big{)},$	(40a)
$\displaystyle\dot{\psi}_{n}^{+}$	$\displaystyle=r\sin\phi\big{(}\mathrm{e}^{-\mathrm{i}\theta}\psi_{n-1}^{+}-\mathrm{e}^{\mathrm{i}\theta}\psi_{n+1}^{+}\big{)}.$	(40b)

These equations allow solutions in the form of monochromatic waves:

\displaystyle\psi_{n}^{\sigma}=\Psi^{\sigma}\exp(-\mathrm{i}\omega\tau+\mathrm{i}K^{\sigma}n),

(41)

with constant amplitude $\smash{\Psi^{\sigma}}$ , frequency $\smash{\omega}$ , and ‘wavenumber’ $\smash{K^{\sigma}}$ . The quotation marks are added as a reminder that the corresponding waves propagate in the spectral space rather than in the physical space. Also note that $\smash{K^{\pm}}$ are defined unambiguously only within the first Brillouin zone, $\smash{K^{\pm}\in(-\upi,\upi)}$ .

From (40), one readily finds that spectral waves obey the following dispersion relations:

\displaystyle\omega=\omega_{0}^{\sigma}\sin\kappa^{\sigma},\qquad\kappa^{\pm}\doteq K^{\pm}\pm\theta,

(42)

where $\smash{\omega_{0}^{+}=2r\sin\phi}$ and $\smash{\omega_{0}^{-}=2r\cos\phi}$ . Accordingly,spectral waves in the $\smash{n}$ space along the $\smash{\psi^{\sigma}}$ channel have the group velocity

\displaystyle v_{g}^{\sigma}=\omega_{0}^{\sigma}\cos\kappa^{\sigma}

(43)

and can propagate either up or down the spectrum (figure 7). The dependence of $\smash{\omega}$ and $\smash{v_{g}^{\pm}}$ on $\smash{\theta}$ can be removed by a gauge transformation. Specifically, $\smash{\kappa^{\pm}}$ becomes the true wavenumber if one adopts the variables $\smash{\bar{\psi}_{n}^{\pm}\doteq\psi_{n}^{\pm}\mathrm{e}^{\mp\mathrm{i}n\theta}}$ instead of $\smash{\psi_{n}^{\pm}}$ . Also notably, to the extent that the (large) index $\smash{n}$ can be approximately considered a continuous variable, (40) can be written as usual nondispersive-wave equations for $\smash{\zeta^{\pm}(\tau,n)\doteq\bar{\psi}_{n}^{\pm}(\tau)}$ :

\displaystyle\partial_{\tau}\zeta^{\pm}=-\omega_{0}^{\pm}\,\partial_{n}\zeta^{\pm}.

(44)

This model corresponds to the small- $\smash{\kappa^{\sigma}}$ limit of (42), that is, $\smash{\omega\approx\omega_{0}^{\sigma}\kappa^{\sigma}}$ .

The above equations can be used to describe PSW packets localised at large $\smash{|n|}$ but not the global eigenmodes (because the latter involve dynamics at small $\smash{|n|}$ ). In particular, (42) are not enough to find the global-mode frequencies. Yet they allow one to determine the mode structure at $\smash{|n|\gg 1}$ if one knows the mode frequency from other considerations (or, vice versa, to find the mode frequency if $\smash{\kappa^{\pm}}$ are known). In particular, unstable modes have complex $\smash{\omega}$ , so, according to (42), their asymptotic wavenumber cannot be real. This explains why unstable modes are evanescent. In contrast, for real $\smash{\omega}$ , (42) allows for real wavenumbers (provided that $\smash{|\omega|\leq\omega_{0}^{\pm}}$ ), i.e. predicts PSWs. These results are corroborated by DNS of (29). For example, figure 6 shows the mode structures inferred from the asymptotic dynamics of $\smash{\psi_{n}^{\sigma}(\tau)}$ at large enough $\smash{\tau}$ for the initial conditions

\displaystyle\psi_{n}^{\sigma}(\tau=0)=\begin{cases}\epsilon\exp(\mathrm{i}\xi^{\sigma}_{n}),&n=\pm 1,\\ 0,&n\neq\pm 1.\end{cases}

(45)

Here, $\smash{\epsilon}$ is a constant small amplitude (within the linear approximation, one can always rescale $\smash{\psi_{n}^{\sigma}}$ such that $\smash{\epsilon=1}$ ), and $\smash{\xi_{n}}$ are parameters that change the polarization of the initial conditions while keeping the initial modulation energy fixed. (The specific values of $\smash{\xi_{n}^{\sigma}}$ are given in the captions of figures in which initial condition dependent results of DNS are presented.) This form of the initial conditions is chosen to emulate a simple modulation of the primary mode.

The fact that the properties of spectral waves are determined only by the asymptotic properties of $\smash{\alpha_{n}^{\pm}}$ and $\smash{\beta_{n}^{\pm}}$ makes these waves particularly robust. In particular, note that the asymptotic form of $\smash{\alpha_{n}^{\pm}}$ and $\smash{\beta_{n}^{\pm}}$ at $\smash{|n|\to\infty}$ remains the same even when $\smash{\boldsymbol{p}}$ and $\smash{\boldsymbol{q}}$ are not orthogonal, so the above results readily extend to general $\smash{\boldsymbol{q}}$ . Likewise, a similar picture holds also for three-dimensional interactions, although the mode polarization can be more complicated in this case. Notably, PSWs provide ballistic rather than diffusive (or super-diffusive) energy transport along the spectrum. This distinguishes them from the cascades that are commonly discussed in turbulence theories and result from eddy–eddy interactions, or wave–wave collisions (Galtier et al., 2000).

4.2 Global modes in the form of PSWs

In realistic settings, spectral waves are excited at finite $\smash{|n|}$ and propagate down the spectrum. Evanescent waves reach $\smash{|n|\sim l}$ [see (39)] in finite time $\smash{\tau_{0}}$ , get reflected, and eventually (asymptotically) settle as stationary standing waves on time scales of the order of a few $\smash{\tau_{0}}$ . Such waves can be adequately described by truncated models. In contrast, PSWs continue to propagate toward infinity indefinitely (until they dissipate at scales where viscosity or resistivity is no longer negligible). Thus, they never develop components propagating up the spectrum at large $\smash{|n|}$ and so they never become quite like the standing-wave eigenmodes predicted by truncated models (section 3.1). This means that actual PSWs cannot be adequately described by analyzing $\smash{Q}$ MT in principle. Instead, they should be considered in the context of the initial-value problem, much like Langmuir waves appear in plasma kinetic theory within Landau’s initial-value approach as opposed to the Case–van Kampen true-eigenmode approach (Stix, 1992).

Consequently, even though (29) has infinitely many degrees of freedom, the number of relevant global modes, i.e. those that propagate towards infinity, is finite. We find that there are only two of them, which can be attributed to the fact that $\smash{\dim\boldsymbol{\psi}_{n}=2}$ . (Likewise, at a given wavenumber, only one, Langmuir, branch of relevant electrostatic waves in electron plasma remains relevant after transients are gone.) For example, for $\smash{\phi=\upi/4}$ , we find from DNS that

\displaystyle\omega=\pm\sqrt{2}r\cos\theta.

(46)

This dependence and the corresponding mode structures are illustrated in figure 8. Each of the two modes has two wavenumbers associated with it: one determines $\smash{\psi^{\sigma}_{n}}$ at $\smash{n\to+\infty}$ , and the other one determines $\smash{\psi^{-\sigma}_{n}}$ at $\smash{n\to-\infty}$ (figure 9). Thus, there are four $\smash{\kappa^{\sigma}_{d}}$ overall, where $\smash{\sigma}$ denotes the corresponding component of $\smash{\boldsymbol{\psi}_{n}}$ , and $\smash{d}$ denotes the direction of propagation ( $\smash{d=\pm 1}$ is the sign of the group velocity at $\smash{|n|\to\infty}$ ). They can be found by combining (46) with (42) and applying $\smash{\operatorname{sgn}v_{g}=d}$ , i.e. $\smash{\omega_{0}^{\sigma}d\cos\kappa^{\sigma}_{d}>0}$ . This yields


$\displaystyle\omega>0:\quad$	$\displaystyle\kappa^{+}_{+}=\frac{\upi}{2}-\theta,\kern 10.0pt$	$\displaystyle\kappa^{-}_{-}=-\frac{3\upi}{2}+\theta,$	(47a)
$\displaystyle\omega<0:\qquad$	$\displaystyle\kappa^{+}_{-}=\frac{3\upi}{2}-\theta,\kern 10.0pt$	$\displaystyle\kappa^{-}_{+}=-\frac{\upi}{2}+\theta,$	(47b)

where $\smash{\kappa^{\sigma}_{d}=K^{\sigma}_{d}+\sigma\theta}$ and the values of $\smash{K^{\sigma}_{d}}$ are defined modulo the Brillouin zone (i.e. $\smash{K^{-}_{-}=-3\upi/2}$ should be understood as $\smash{\upi/2}$ and so on). Remember, though, that this applies only to global eigenmodes. Transient waves propagating in the region $\smash{|n|\gg 1}$ can have any $\smash{\kappa^{\sigma}}$ (figure 7).

The fact that the frequencies of these modes are real indicates that the modes are self-sustained notwithstanding the dissipation that they unavoidably experience at $\smash{|n|\to\infty}$ . The corresponding evolution of the wave spectrum is illustrated in figure 9(a) and the corresponding evolution in the physical space is shown in figure 10. This remarkable dynamics is understood from the fact that the energy drain via this dissipation is balanced by the energy injection from the primary mode at small $\smash{|n|}$ . As seen from (19) and (16), the dimensionless modulational energy (21b), or

\displaystyle\mathcal{E}_{\text{mod}}\doteq\sum_{\sigma}\frac{\mathcal{A}}{\mathcal{A}^{-\sigma}}\,|\psi_{n}^{\sigma}|^{2}\equiv\sum_{n}\mathcal{E}_{n},

(48)

is governed by

\displaystyle\dot{\mathcal{E}}_{n}=\sum_{\sigma}(I_{n}^{\sigma}+F_{n}^{\sigma})+\text{c.c.},

(49)

where $\smash{I_{n}}$ and $\smash{F_{n}}$ are given by

	$\displaystyle I_{n}^{\sigma}=\frac{\mathcal{A}\sqrt{\mathcal{A}^{\sigma}}}{(\mathcal{A}^{-\sigma})^{3/2}}\,(\beta_{n}^{-}\mathrm{e}^{\mathrm{i}\sigma\theta}\psi_{n-1}^{-\sigma}+\beta_{n}^{+}\mathrm{e}^{-\mathrm{i}\sigma\theta}\psi_{n+1}^{-\sigma})\psi_{n}^{\sigma*},$		(50)
	$\displaystyle F_{n}^{\sigma}=\frac{\mathcal{A}}{\mathcal{A}^{-\sigma}}\,(\alpha_{n}^{-}\mathrm{e}^{-\mathrm{i}\sigma\theta}\psi_{n-1}^{\sigma}+\alpha_{n}^{+}\mathrm{e}^{\mathrm{i}\sigma\theta}\psi_{n+1}^{\sigma})\psi_{n}^{\sigma*}.$		(51)

Because $\smash{\sum_{n}F_{n}^{\sigma}=0}$ , the terms $\smash{F_{n}^{\sigma}}$ represent the energy flux that is carried along the modulation spectrum and is conserved within each sub-channel $\smash{\sigma}$ . In contrast, $\smash{I_{n}}$ can be understood as injection terms in that they also appear, with the opposite signs, in the equation for the primary wave,

\displaystyle\dot{\mathcal{E}}_{\text{pri}}=-\sum_{n}\sum_{\sigma=\pm 1}I_{n}^{\sigma}+\text{c.c.},

(52)

which follows from (20).

The linear spectral waves discussed so far correspond to the regime when the change of $\smash{\mathcal{E}_{\text{pri}}}$ is negligible. Eventually, though, as higher harmonics are populated and thus absorb more energy, the primary wave will be depleted. This means, in particular, that no primary wave can be truly stationary. Even an arbitrarily small perturbation will generally launch a PSW, and this PSW will eventually deplete the primary wave.

4.3 Anomalous dissipation

Since a PSW mode carries energy towards $\smash{|n|\rightarrow\infty}$ , eventually, a large enough $\smash{|n|}$ is reached where the energy is dissipated regardless of how small the viscosity and resistivity are. Thus, a PSW exhibits an effective, or ‘anomalous’, dissipation rate $\smash{\gamma}$ that is independent of $\smash{\nu}$ and $\smash{\eta}$ in the limit $\smash{\nu,\eta\to 0}$ . This effect is different from the anomalous transport caused by eddy–eddy collisions in turbulence (for example, see Donzis et al. (2005)) in that the energy transport caused by a PSW is ballistic. As a result, $\smash{\gamma}$ is straightforward to calculate, which is done as follows.

As discussed in section 4.2, a global PSW at large $\smash{|n|}$ has a flat mode structure, i.e. a structure with $\smash{\mathcal{E}_{n}}$ independent of $\smash{n}$ . This structure establishes itself as an expanding ‘shelf’ whose edges (wave fronts) propagate across the modulational spectrum to $\smash{n\to\pm\infty}$ at the group velocity $\smash{v_{g}}$ (figure 11). Since the height of the shelf, $\smash{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu}$ (the overbar here denotes averaging over the PSW temporal period and over all $\smash{n}$ within the shelf), remains constant, this process drains energy from the primary mode linearly in time:

\displaystyle\dot{\mathcal{E}}_{\text{pri}}=-2|v_{g}|\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu=\text{const},

(53)

where we consider the dynamics average over the PSW temporal period. Hence,

\displaystyle\gamma\doteq-\frac{\dot{\mathcal{E}}_{\text{pri}}}{\mathcal{E}_{\text{pri}}}=2|v_{g}|\,\frac{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu}{\mathcal{E}_{\text{pri}}}\sim\omega_{0}^{\pm}\epsilon^{2}.

(54)

(The rate relative to the physical time $\smash{t}$ , as opposed to $\smash{\tau}$ , is $\smash{\mathcal{A}}$ times this $\smash{\gamma}$ .)

Let us again assume the initial conditions (45) and $\smash{\phi=\upi/4}$ . Through DNS of (29), we find that the global PSW-mode amplitude is maximised when $\smash{|\arg(\xi^{\sigma}_{\pm 1}/\xi^{-\sigma}_{\pm 1})|=|\arg(\xi^{\sigma}_{\pm 1}/\xi^{\sigma}_{\mp 1})|=\upi}$ , irrespective of $\smash{\theta}$ . (Any particular choice of $\smash{\xi^{\sigma}_{n}}$ that satisfies this condition affects only phases of the modulational dynamics and does not impact averaged quantities that determine $\smash{\gamma}$ .) This corresponds to the case when all modulational-seed energy is in the magnetic field; i.e. $\smash{\mathcal{E}_{b}=\mathcal{E}}$ and $\smash{\mathcal{E}_{v}=0}$ . Figure 11 shows the corresponding anomalous dissipation rate normalised to the seed energy, along with its determining factors – the system-dependent PSW group velocity $\smash{v_{g}}$ and the average spectral energy density $\smash{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu}$ , which is determined by the initial conditions. It can be seen that, for the MI unstable VDPMs ( $\smash{\theta<\upi/4}$ ), spectral waves are relatively slow and have a small amplitude. The transition to modulational stability occurs at $\smash{|v_{g}|\sim\Gamma}$ . This condition can be understood as a threshold beyond which (i.e. at $\smash{|v_{g}|\gtrsim\Gamma}$ ) PSWs provide a sufficiently fast escape route for the energy injected at small $\smash{|n|}$ , such that the positive feedback loops [see (50)] supporting MIs can no longer be sustained. In the context of $\smash{\mathcal{PT}}$ symmetry (section 2.2.2), the spectral group velocity can be understood as the effective coupling between the links in the oscillator chain, connecting the energy injection from the primary mode at small $\smash{n}$ to the energy sink at $\smash{n\to\infty}$ . As $\smash{|v_{g}|}$ increases with $\smash{\theta}$ , the system transitions from broken to unbroken $\smash{\mathcal{PT}}$ symmetry.

Also, notice the following. If modulations at multiple $\smash{\boldsymbol{q}}$ s are present, they launch independent cascades along $\smash{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}$ and thus the drain on the shared primary mode will be additive. This regime is illustrated by figures 1(d)-(f), which show the modulational dynamics for a BDPM ( $\smash{\theta=\upi/3}$ ) seeded with noise. The rather nondescript modulational dynamics seen in figures 1(d) and (e) can be understood as the superposition of the many structures like that in figure 10 for various $\smash{\boldsymbol{q}}$ . Also, the linear-in-time depletion of the primary mode seen in figure 1(f) can be understood as the sum of the PSW-driven energy fluxes along the constituent modulational channels.

5 Unified closure

As discussed in section 3.1, naive $\smash{Q}$ MTs assume the perfectly reflecting boundary conditions (31) for spectral waves, thus precluding PSWs and ignoring potential dissipation at large $\smash{|n|}$ . One can expect that, instead of (31), it would be more accurate to adopt the following closure in anticipation of spectral waves:

\displaystyle\psi_{N+1}^{\sigma}=\mathrm{e}^{\mathrm{i}K^{\sigma}}\psi_{N}^{\sigma}

(55)

(and similarly for $\smash{\psi_{-(N+1)}^{\sigma}}$ ). The value of $\smash{K^{\sigma}}$ is found, for given $\smash{\omega}$ , from (42):

\displaystyle\mathrm{e}^{\mathrm{i}K^{\sigma}}=\mathrm{e}^{-\mathrm{i}\sigma\theta}\Big{[}\mathrm{i}\Omega^{\sigma}+\sqrt{1-(\Omega^{\sigma})^{2}}\Big{]},

(56)

where $\smash{\Omega^{\sigma}\doteq\omega/\omega_{0}^{\sigma}}$ , and we restrict our attention to the range $\smash{|\operatorname{Re}\Omega\,|\leq 1}$ to avoid discontinuities, a choice which is consistent with the range of observed solutions. The sign of the square root is chosen to enforce outward propagation at $\smash{\operatorname{Im}\omega=0}$ , which also enforces that unstable solutions exponentially decay in the direction of propagation $\smash{|\exp(\mathrm{i}K^{\sigma}_{+})|<1}$ .⁴⁴4A different choice of signs would also yield formally valid solutions, but would not have the properties that correspond to PSWs of physical interest. (Conversely, damped solutions grow in the direction of propagation.) As seen from figure 12, the resulting closure

\displaystyle\psi_{N+1}^{\sigma}=\mathrm{e}^{-\mathrm{i}\sigma\theta}\,\Big{[}\mathrm{i}\Omega^{\sigma}+\sqrt{1-(\Omega^{\sigma})^{2}}\Big{]}\psi_{N}^{\sigma}

(57)

leads to reasonably accurate results already $\smash{N=2}$ , both for MIs and PSWs.

Because the closure becomes exact only in the limit $\smash{N\rightarrow\infty}$ , it also yields spurious solutions (gray curves in figure 12) at finite $\smash{N}$ . One can understand this from the analogy with the Case–van Kampen modes mentioned in section 4.2. These solutions become increasingly stable, and thus less of an issue, as $\smash{N}$ increases. Also note that the closure adequately describes true MIs. This is explained by the fact that $\smash{\psi_{n}^{\sigma}}$ at large $\smash{|n|}$ are exponentially small and do not matter for unstable modes anyway.

6 Quasilinear approximation beyond ideal MHD

As discussed in the previous sections, the existence of PSWs undermines the standard quasilinear approximation in application to ideal incompressible MHD. Given the ubiquity of quasilinear modeling in the literature, it may seem concerning that the quasilinear approximation can fail so spectacularly. Interestingly, though, the quasilinear approximation is somewhat more robust beyond the ideal-MHD limit, both due to conservative corrections and dissipation.

Let us discuss the former first. Without attempting to describe any particular physical system, let us consider the following modified version of (3):

\displaystyle\partial_{t}\boldsymbol{z}^{\pm}=-(\boldsymbol{z}^{\mp}\cdot\nabla)\boldsymbol{z}^{\pm}-\nabla P+\lambda\partial_{x}\nabla^{2}\boldsymbol{z}^{\pm},

(58)

where $\smash{\lambda}$ is a constant parameter. The last term is intended as a simple ad hoc correction that, while causing deviation from ideal MHD, leaves the primary-mode evolution unaffected and preserves MHD’s key invariants, specifically, the energy and cross helicity (appendix B). Perhaps notably, this term introduces rotational asymmetry in the $\smash{(x,y)}$ plane. Similar terms can appear due to a background magnetic field or differential rotation (Heinonen et al., 2023).

Taking $\smash{\phi=\upi/4}$ for simplicity, one arrives at the following corresponding modification of the linear (2.2.2):

\displaystyle\sqrt{2}(\partial_{\tau}+\mathrm{i}\delta_{n})\psi_{n}^{\pm}=\quad\mathrm{e}^{\mp\mathrm{i}\theta}\alpha_{n}^{-}\psi_{n-1}^{\pm}+\mathrm{e}^{\pm\mathrm{i}\theta}\beta_{n}^{-}\psi_{n-1}^{\mp}+\mathrm{e}^{\pm\mathrm{i}\theta}\alpha_{n}^{+}\psi_{n+1}^{\pm}+\mathrm{e}^{\mp\mathrm{i}\theta}\beta_{n}^{+}\psi_{n+1}^{\mp},

(59)

where $\smash{\delta_{n}^{\pm}\doteq\Lambda r(r^{2}+n^{2})}$ , $\smash{\Lambda\doteq\lambda/\mathcal{A}p^{3}}$ , and $\smash{\alpha_{n}^{\pm}}$ , $\smash{\beta_{n}^{\pm}}$ are as in (24). In the large- $\smash{|n|}$ limit, one has $\smash{\alpha_{n}^{\pm}\sim 1}$ , $\smash{\beta_{n}^{\pm}\to 0}$ , $\smash{\delta_{n}\sim\Lambda n^{2}}$ , so one obtains the following scaling:

\displaystyle|\psi_{n+1}^{\pm}|\sim\frac{|\psi_{n}^{\pm}|}{\Lambda n^{2}}.

(60)

This shows that the harmonic magnitude $\smash{|\psi_{n}|}$ decreases rapidly (super-exponentially) with $\smash{|n|}$ , and thus low-order truncations may be justified. Note that although the opposite scaling, $\smash{|\psi_{n-1}^{\pm}|\sim|\psi_{n}^{\pm}|/\Lambda n^{2}}$ is also formally possible, such modes cannot be excited, as is the case with inward propagating PSWs. Figure 13(a) shows that, indeed, the agreement between analytical QL growth rates and nonlinear DNS improves as the parameter $\smash{\Lambda}$ increases. The agreement becomes nearly perfect for $\smash{\Lambda\gtrsim 0.5}$ . Figure 13(b) shows that the same effect can be achieved if, instead of the $\smash{\lambda}$ term in (58), one introduces sufficiently strong viscosity. In this case, one also has a modified (2.2.2) with the exact form of (59), but with $\smash{\delta_{n}^{\pm}\doteq\mu(r^{2}+n^{2})}$ , where $\smash{\mu\doteq\nu_{+}/\mathcal{A}p^{2}}$ (with $\smash{\nu_{-}=0}$ for simplicity). Again, the agreement becomes nearly exact for $\smash{\mu\gtrsim 0.5}$ .

7 Summary

In this paper, we explore structure formation in two-dimensional MHD turbulence as a modulational instability (MI) of turbulent fluctuations. We focus on the early stages of structure formation and consider simple backgrounds that allow for a tractable model of the MI while retaining the full chain of modulational harmonics. This approach allows for a systematic examination of the importance of high-order correlations that are typically ignored in mean-field theories.

We find that, when the primary structure truly experiences a MI, this MI can be described well with typical truncated models that neglect high-order correlations. However, already in adjacent regimes, such truncated models can fail spectacularly and produce false positives for instability. To study this process in detail, we propose an ‘extended’ quasilinear theory (XQLT) that treats the primary structure as fixed but includes the entire spectrum of modulational harmonics (as opposed to just the low-order harmonics, as usual). From XQLT, also corroborated by DNS, we find that the difference between said regimes is due to a fundamental difference in the structures of the modulational spectra. For unstable modes, the spectrum is exponentially localised at low harmonic numbers, so truncated models are justified. But this localization does not always occur. At other parameters, modulational modes turn into constant-amplitude waves propagating down the spectrum, unimpeded until dissipative scales. These spectral waves are self-maintained as global modes with real frequencies and cause ballistic energy transport along the spectrum, breaking the feedback loops that could otherwise sustain MI.

The ballistic transport by PSWs drains energy from the primary structure at a constant rate until the primary structure is depleted. Because global PSWs exist at almost all parameters, this means that almost any primary structure in ideal incompressible MHD will eventually be depleted. This means, in particular, that sustainability of MHD structures is not entirely limited to the issue of exponentially growing linear instabilities, as PSWs must also be taken into consideration. To describe them within a reduced model, we propose a closure that successfully captures both the propagating spectral waves and MIs for the assumed primary structure.

Finally, we find that departures from ideal MHD constrains the form of modulational eigenmodes, in turn suppressing the amplitude and impact of high harmonics. This allows us to end on an informed yet optimistic note regarding the applicability of the quasilinear approximation for structure formation in dispersive forms of MHD. That said, it is an important conclusion of our work that, unless deviations from ideal incompressible MHD are substantial, changing the turbulence parameters even slightly can utterly destroy the applicability of an otherwise workable reduced model. An understanding of the complex modulational dynamics supported by (nearly) incompressible MHD provides important context in the interpretation of existing simple closures and potentially opens the path to building more reliable alternatives.

This research was supported by the U.S. Department of Energy through contract No. DE-AC02-09CH11466.

Appendix A Quasilinear time scale

The quasilinear approximation holds when the quasilinear MI growth rate exceeds the rate at which energy escapes to higher harmonics. In the main text, we take the latter to be the group velocity of the PSW global mode. However, one might object that the properties of PSWs are well defined only at large $\smash{n}$ , while the departure from quasilinear occurs at the modest $\smash{n=2}$ . It may be instructive then to develop an alternative argument that would not assume large $\smash{n}$ . Here, we propose such an argument by considering the initial-value problem.

For simplicity, let us assume the initial conditions such that only $\smash{\boldsymbol{\psi}_{0}(\tau=0)}$ is nonzero, with all other $\smash{\boldsymbol{\psi}_{n\neq 0}(\tau=0)}$ are zero. We also adopt $\smash{(\phi,\theta)=(\upi/4,0)}$ and $\smash{(\phi,\theta)=(\upi/4,\upi/2)}$ as representative cases for VDPMs and BDPMs respectively. For these parameters, (28) can be further reduced due to the parity of the initial conditions. Also, the coupling matrices $\smash{\boldsymbol{G_{n}^{\sigma}}}$ become particularly simple:

\displaystyle\boldsymbol{G_{n}^{\sigma}}=\begin{cases}\frac{1}{\sqrt{2}}\begin{pmatrix}\alpha_{n}^{\sigma}&\beta_{n}^{\sigma}\\ \beta_{n}^{\sigma}&\alpha_{n}^{\sigma}\end{pmatrix},&\theta=0,\\ \frac{\mathrm{i}\sigma}{\sqrt{2}}\begin{pmatrix}-\alpha_{n}^{\sigma}&\beta_{n}^{\sigma}\\ -\beta_{n}^{\sigma}&\alpha_{n}^{\sigma}\end{pmatrix},&\theta=\frac{\upi}{2}.\end{cases}

(61)

For the case $\smash{\theta=0}$ , we assume assume $\smash{\psi_{0}^{+}=\psi_{0}^{-}}$ at $\smash{\tau=0}$ . Then, $\smash{\dot{\psi}_{n}^{+}(\tau=0)=\dot{\psi}_{n}^{-}(\tau=0)}$ as well, and thus $\smash{\psi_{n}^{+}=\psi_{n}^{-}}$ at all times, i.e. $\smash{\boldsymbol{b}\equiv 0}$ . We will refer to this case as $\smash{\boldsymbol{v}}$ -only initial conditions (VIC). The corresponding dynamics can be described by a single function $\smash{\psi_{n}\doteq\psi_{n}^{+}=\psi_{n}^{-}}$ , which satisfies the equation

\displaystyle\dot{\psi}_{n}=\frac{\alpha_{n}^{-}+\beta_{n}^{-}}{\sqrt{2}}\,\psi_{n-1}+\frac{\alpha_{n}^{+}+\beta_{n}^{+}}{\sqrt{2}}\,\psi_{n+1}.

(62)

For the case $\smash{\theta=\upi/2}$ , we assume $\smash{\psi_{0}^{+}=-\psi_{0}^{-}}$ at $\smash{\tau=0}$ . Similarly to the VIC case, this implies that $\smash{\boldsymbol{v}\equiv 0}$ . We will refer to this case as $\smash{\boldsymbol{b}}$ -only initial conditions (BIC). The corresponding dynamics can be described by a single function $\smash{\psi_{n}\doteq-\mathrm{i}^{n}\psi_{n}^{+}=(-\mathrm{i})^{n}\psi_{n}^{-}}$ , which satisfies the equation

\displaystyle\dot{\psi}_{n}=\frac{\alpha_{n}^{-}-(-1)^{n}\beta_{n}^{-}}{\sqrt{2}}\,\psi_{n-1}+\frac{\alpha_{n}^{+}-(-1)^{n}\beta_{n}^{+}}{\sqrt{2}}\,\psi_{n+1}.

(63)

To estimate the nonlinear time scale $\smash{\tau_{\text{NL}}}$ on which $\smash{\psi_{2}}$ might become comparable with $\smash{\psi_{0}}$ , let us consider (62) and (63) for $\smash{n=2}$ at early times, when $\smash{\psi_{3}}$ remains negligible:

\displaystyle\ddot{\psi_{2}}=\begin{cases}\frac{\alpha_{2}^{-}+\beta_{2}^{-}}{\sqrt{2}}\left(\frac{\alpha_{1}^{-}+\beta_{1}^{-}}{\sqrt{2}}\,\psi_{0}+\frac{\alpha_{1}^{+}+\beta_{1}^{+}}{\sqrt{2}}\,\psi_{2}\right)&\text{for VIC},\\ \frac{\alpha_{2}^{-}-\beta_{2}^{-}}{\sqrt{2}}\left(\frac{\alpha_{1}^{-}+\beta_{1}^{-}}{\sqrt{2}}\,\psi_{0}+\frac{\alpha_{1}^{+}+\beta_{1}^{+}}{\sqrt{2}}\,\psi_{2}\right)&\text{for BIC}.\end{cases}

(64)

Initially, $\smash{|\psi_{0}|\gg|\psi_{2}|}$ , so the second term in the brackets can be neglected in both cases and the nonlinear time scale can be readily estimated:

\displaystyle\tau_{\text{NL}}^{-2}=\frac{1}{2}\times\begin{cases}(\alpha_{2}^{-}+\beta_{2}^{-})(\alpha_{1}^{-}+\beta_{1}^{-})&\text{for VIC},\\ (\alpha_{2}^{-}-\beta_{2}^{-})(\alpha_{1}^{-}+\beta_{1}^{-})&\text{for BIC}.\end{cases}

(65)

In contrast, the quasilinear time scale $\smash{\tau_{\text{QL}}}$ , which follows from the 4MT equations (34), is identical in both cases:

\displaystyle\tau_{\text{QL}}^{-2}=\alpha_{0}^{-}(\alpha_{1}^{-}+\beta_{1}^{-}).

(66)

The ratio of time scales is then

\displaystyle\left(\frac{\tau_{\text{QL}}}{\tau_{\text{NL}}}\right)^{2}=\begin{cases}2\sqrt{r^{2}+4}&\text{for VIC},\\ \frac{2r\sqrt{r^{2}+4}}{r+4}&\text{for BIC}.\end{cases}

(67)

For the range of interest ( $\smash{|r|<1}$ ), this always yields

\displaystyle\frac{\tau_{\text{QL}}}{\tau_{\text{NL,BIC}}}<1<\frac{\tau_{\text{QL}}}{\tau_{\text{NL,VIC}}}.

(68)

(For example, at $\smash{r=0.5}$ , one has $\smash{\tau_{\text{QL}}/\tau_{\text{NL,VIC}}\approx 0.5}$ , and $\smash{\tau_{\text{QL}}/\tau_{\text{NL,BIC}}\approx 1.5}$ .) Thus, quasilinear is sufficient for VIC but not for BIC, in agreement with our numerical results.

Appendix B Properties of (58)

The operator introduced in (58) conserves energy (here it is normalised to the constant mass density):

\displaystyle E\doteq\frac{1}{4}\int\mathrm{d}\boldsymbol{x}\,(|\boldsymbol{z}^{+}|^{2}+|\boldsymbol{z}^{-}|^{2}),

(69)

and cross helicity:

\displaystyle H_{C}\doteq\frac{1}{4}\int\mathrm{d}\boldsymbol{x}\,(|\boldsymbol{z}^{+}|^{2}-|\boldsymbol{z}^{-}|^{2}).

(70)

These can be expressed in terms of the Elsasser energies

\displaystyle E^{\pm}\doteq\int\mathrm{d}\boldsymbol{x}\,|\boldsymbol{z}^{\pm}|^{2}/4,

(71)

\displaystyle E=E^{+}+E^{-},\qquad H_{C}=E^{+}-E^{-}.

(72)

Equation (58) yields

\displaystyle\begin{split}\frac{\mathrm{d}E^{\pm}}{\mathrm{d}t}&=\frac{1}{4}\int\mathrm{d}\boldsymbol{x}\,\partial_{t}|\boldsymbol{z}^{\pm}|^{2}\\ &=\frac{1}{2}\int\mathrm{d}\boldsymbol{x}\,\boldsymbol{z}^{\pm}\cdot\partial_{t}\boldsymbol{z}^{\pm}\\ &=\frac{\lambda}{2}\int\mathrm{d}\boldsymbol{x}\,\boldsymbol{z}^{\pm}\cdot\partial_{x}\nabla^{2}\boldsymbol{z}^{\pm}\\ &=\frac{\lambda}{2}\int\mathrm{d}\boldsymbol{x}\,z^{\pm}_{l}\partial_{x}\partial_{m}^{2}z^{\pm}_{l}\\ &=-\frac{\lambda}{2}\int\mathrm{d}\boldsymbol{x}\,(\partial_{m}z^{\pm}_{l})(\partial_{x}\partial_{m}z^{\pm}_{l})\\ &=-\frac{\lambda}{4}\int\mathrm{d}\boldsymbol{x}\partial_{x}\,(\partial_{m}z^{\pm}_{l}\partial_{m}z^{\pm}_{l})\\ &=0.\end{split}

(73)

Thus,

	$\displaystyle\frac{\mathrm{d}E}{\mathrm{d}t}=\frac{\mathrm{d}E^{+}}{\mathrm{d}t}+\frac{\mathrm{d}E^{-}}{\mathrm{d}t}=0,$		(74)
	$\displaystyle\frac{\mathrm{d}H_{C}}{\mathrm{d}t}=\frac{\mathrm{d}E^{+}}{\mathrm{d}t}-\frac{\mathrm{d}E^{-}}{\mathrm{d}t}=0,$		(75)

meaning that (58) conserves both $\smash{E}$ and $\smash{H_{C}}$ .

References

Adkins & Schekochihin (2018) Adkins, T. & Schekochihin, A. A. 2018 A solvable model of Vlasov-kinetic plasma turbulence in Fourier–Hermite phase space. J. Plasma Phys. 84 (1), 905840107.
Bender (2005) Bender, C. M. 2005 Introduction to $\mathcal{PT}$ -symmetric quantum theory. Contemp. Phys. 46, 277.
Bender et al. (2019) Bender, C. M., Dorey, P. E., Dunning, C., Fring, A., Hook, D. W., Jones, H. F., Kuzhel, S., Lévai, G. & Tateo, R. 2019 PT Symmetry. World Scientific Publishing Europe Ltd.
Beresnyak (2019) Beresnyak, A. 2019 MHD turbulence. Living Rev. Comput. Astrophys. 5 (1), 2.
Biskamp (2003) Biskamp, D. 2003 Magnetohydrodynamic Turbulence. Cambridge Univ. Press.
Biskamp & Schwarz (2001) Biskamp, D. & Schwarz, E. 2001 On two-dimensional magnetohydrodynamic turbulence. Phys. Plasmas 8 (7), 3282–3292.
Biskamp et al. (1996) Biskamp, D., Schwarz, E. & Drake, J. F. 1996 Two-dimensional electron magnetohydrodynamic turbulence. Phys. Rev. Lett. 76 (8), 1264.
Blackman & Field (2002) Blackman, E. G. & Field, G. B. 2002 New dynamical mean-field dynamo theory and closure approach. Phys. Rev. Lett. 89, 265007.
Brandenburg (2018) Brandenburg, A. 2018 Advances in mean-field dynamo theory and applications to astrophysical turbulence. J. Plasma Phys. 84 (4), 735840404.
Brandenburg et al. (2012) Brandenburg, A., Sokoloff, D. & Subramanian, K. 2012 Current status of turbulent dynamo theory: from large-scale to small-scale dynamos. Space Sci. Rev. 169, 123–157.
Brandenburg & Subramanian (2005a) Brandenburg, A. & Subramanian, K. 2005a Astrophysical magnetic fields and nonlinear dynamo theory. Phys. Rep. 417 (1), 1–209.
Brandenburg & Subramanian (2005b) Brandenburg, A. & Subramanian, K. 2005b Minimal tau approximation and simulations of the alpha effect. Astron. Astrophys. 439 (3), 835–843.
Dodin (2022) Dodin, I. Y. 2022 Quasilinear theory for inhomogeneous plasma. J. Plasma Phys. 88, 905880407.
Donzis et al. (2005) Donzis, D. A., Sreenivasan, K. R. & Yeung, P. K. 2005 Scalar dissipation rate and dissipative anomaly in isotropic turbulence. J. Fluid Mech. 532, 199–216.
Elsasser (1950) Elsasser, W. M. 1950 The hydromagnetic equations. Phys. Rev. 79 (1), 183.
Friedberg (1987) Friedberg, J. P. 1987 Ideal Magnetohydrodynamics. New York: Plenum Press.
Galtier et al. (2000) Galtier, S., Nazarenko, S. V., Newell, A. C. & Pouquet, A. 2000 A weak turbulence theory for incompressible magnetohydrodynamics. J. Plasma Phys. 63 (5), 447–488.
Giorgio et al. (2017) Giorgio, E. De, Servidio, S. & Veltri, P. 2017 Coherent structure formation through nonlinear interactions in 2D magnetohydrodynamic turbulence. Sci. Rep. 7, 13849.
Gopalakrishnan & Singh (2023) Gopalakrishnan, K. & Singh, N. K. 2023 Mean-field dynamo due to spatio-temporal fluctuations of the turbulent kinetic energy. J. Fluid Mech. 973, A29.
Gruzinov & Diamond (1994) Gruzinov, A. V. & Diamond, P. H. 1994 Self-consistent theory of mean-field electrodynamics. Phys. Rev. Lett. 72 (11), 1651.
Hammett et al. (1993) Hammett, G. W., Beer, M. A., Dorland, W., Cowley, S. C. & Smith, S. A. 1993 Developments in the gyrofluid approach to tokamak turbulence simulations. Plasma Phys. Control. Fusion 35, 973.
Heinonen et al. (2023) Heinonen, R. A., Diamond, P. H., Katz, M. F. D. & Ronimo, G. E. 2023 Generation of momentum transport in weakly turbulent $\beta$ -plane magnetohydrodynamics. Phys. Rev. E 107, 025202.
Hussain (1986) Hussain, A. K. M. Fazle 1986 Coherent structures and turbulence. J. Fluid Mech. 173, 303–356.
Kanekar et al. (2015) Kanekar, A., Schekochihin, A. A., Dorland, W. & Loureiro, N. F. 2015 Fluctuation-dissipation relations for a plasma-kinetic Langevin equation. J. Plasma Phys. 81 (1), 305810104.
Käpylä et al. (2006) Käpylä, P. J., Korpi, M. J., Ossendrijver, M. & Stix, M. 2006 Magnetoconvection and dynamo coefficients - III. $\alpha$ -effect and magnetic pumping in the rapid rotation regime. Astron. Astrophys. 455 (2), 401–412.
Karimabadi et al. (2013) Karimabadi, H., Roytershteyn, V., Wan, M., Matthaeus, W. H., Daughton, W., Wu, P., Shay, M., Loring, B., Borovsky, J., Leonardis, E., Chapman, S. C. & Nakamura, T. K. M. 2013 Coherent structures, intermittent turbulence, and dissipation in high-temperature plasmas. Phys. Plasmas 20 (1), 012303.
Kraichnan (1977) Kraichnan, R. H. 1977 Eulerian and Lagrangian renormalization in turbulence theory. J. Fluid Mech. 83 (2), 349–374.
Krashennikov et al. (2008) Krashennikov, S. I., D’Ippolito, D. A. & Myra, J. R. 2008 Recent theoretical progress in understanding coherent structures in edge and SOL turbulence. J. Plasma Phys. 74 (5), 679–717.
Krause & Rädler (1980) Krause, F. & Rädler, K. H. 1980 Mean-field Magnetohydrodynamics and Dynamo Theory. Oxford: Pergamon Press.
Lingam & Bhattacharjee (2016) Lingam, M. & Bhattacharjee, A. 2016 Hall current effects in mean-field dynamo theory. Astrophys. J. 829 (1), 51.
Masada & Sano (2014) Masada, Y. & Sano, T. 2014 Mean-field modeling of an $\alpha^{2}$ dynamo coupled with direct numerical simulations of rigidly rotating convection. Astrophys. J. Lett. 794 (1), L6.
Moffatt (1978) Moffatt, H. K. 1978 Field Generation in Electrically Conducting Fluids. Cambridge Univ. Press.
Nastac et al. (2023) Nastac, M. L., Ewart, R. J., Sengupta, W., Schekochihin, A. A., Barnes, M. & Dorland, W. D. 2023 Phase-space entropy cascade and irreversibility of stochastic heating in nearly collisionless plasma turbulence, arXiv: 2310.18211.
Nazarenko (2007) Nazarenko, S. 2007 2D enslaving of MHD turbulence. New J. Phys. 9 (8), 307.
Nicklaus & Stix (1988) Nicklaus, B. & Stix, M. 1988 Corrections to first order smoothing in mean-field electrodynamics. Geophys. Astrophys. Fluid Dyn. 43 (2), 149–166.
Parker (2016) Parker, J. T. 2016 Gyrokinetic simulations of fusion plasmas using a spectral velocity space representation, arXiv: 1603.04727.
Pipin & Proctor (2008) Pipin, V. V. & Proctor, M. R. E. 2008 Closure tests for mean field magnetohydrodynamics using a self-consistent reduced model. Mon. Not. R. Astron. Soc. 388 (1), 367–374.
Pouquet (1978) Pouquet, A. 1978 On two-dimensional magnetohydrodynamic turbulence. J. Fluid Mech. 88 (1), 1–16.
Pouquet et al. (1976) Pouquet, A., Frisch, U. & Léorat, J. 1976 Strong MHD helical turbulence and the nonlinear dynamo effect. J. Fluid Mech. 77 (2), 321–354.
Rädler (1982) Rädler, K. H. 1982 On dynamo action in the high-conductivity limit. Geophys. Astrophys. Fluid Dyn. 20 (3-4), 191–211.
Rädler (2007) Rädler, K. H. 2007 Mean-Field Dynamo Theory: Early Ideas and Today’s Problems, pp. 55–72. Dordrecht: Springer Netherlands.
Rincon (2019) Rincon, F. 2019 Dynamo theories. J. Plasma Phys. 85 (4), 205850401.
Rincon et al. (2016) Rincon, F., Califano, F., Schekochihin, A. A. & Valentini, F. 2016 Turbulent dynamo in a collisionless plasma. Proc. Natl. Acad. Sci. U.S.A. 113 (15), 3950–3953.
Schekochihin (2022) Schekochihin, A. A. 2022 MHD turbulence: a biased review. J. Plasma Phys. 88 (5), 155880501.
Schrinner et al. (2005) Schrinner, M., K.-H. Rädler, D., Schmitt, Rheinhardt, M. & Christensen, U. 2005 Mean-field view on rotating magnetoconvection and a geodynamo model. Astron. Nachrichten 326 (3-4), 245–249.
Smolyakov et al. (2000) Smolyakov, A. I., Diamond, P. H. & Malkov, M. 2000 Coherent structure phenomena in drift wave–zonal flow turbulence. Phys. Rev. Lett. 84, 491–494.
Squire & Bhattacharjee (2015) Squire, J. & Bhattacharjee, A. 2015 Electromotive force due to magnetohydrodynamic fluctuations in sheared rotating turbulence. Phys. Rev. E 92, 053101.
Stix (1992) Stix, T. H. 1992 Waves in Plasmas. New York: AIP.
Tobias (2021) Tobias, S. M. 2021 The turbulent dynamo. J. Fluid Mech. 912, P1.
Tsiolis et al. (2020) Tsiolis, V., Zhou, Y. & Dodin, I. Y. 2020 Structure formation in turbulence as an instability of effective quantum plasma. Phys. Lett. A 384, 126377.
Zakharov & Ostrovsky (2009) Zakharov, V. E. & Ostrovsky, L. A. 2009 Modulation instability: The beginning. Physica D 238, 540.
Zeldovich (1957) Zeldovich, Y. B. 1957 The magnetic field in the two-dimensional motion of a conducting turbulent fluid. Sov. Phys. JETP 4, 460–462.
Zhou & Blackman (2021) Zhou, H. & Blackman, E. G. 2021 On the shear-current effect: toward understanding why theories and simulations have mutually and separately conflicted. Mon. Not. R. Astron. Soc. 507 (4), 5732–5746.
Zhu & Dodin (2021) Zhu, H. & Dodin, I. Y. 2021 Wave-kinetic approach to zonal-flow dynamics: recent advances. Phys. Plasmas 28, 032303.