This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\WarningFilter

latexA float is stuck \WarningFilternamerefThe definition of

On reduced modeling of the modulational dynamics in magnetohydrodynamics

S. Jin\aff1, 2 \corresp [email protected]    I. Y. Dodin\aff1, 2 \aff1Department of Astrophysical Sciences, Princeton University, Princeton, New Jersey 08543, USA \aff2Princeton Plasma Physics Laboratory, Princeton, New Jersey 08540, USA
Abstract

This paper explores structure formation in two-dimensional magnetohydrodynamic (MHD) turbulence as a modulational instability (MI) of turbulent fluctuations. We focus on the early stages of structure formation and consider simple backgrounds that allow for a tractable model of the MI while retaining the full chain of modulational harmonics. This approach allows us to systematically examine the validity of popular closures such as the quasilinear approximation and other low-order truncations. We find that, although such simple closures can provide quantitatively accurate approximations of the MI growth rates in some regimes, they can fail to capture the modulational dynamics in adjacent regimes even qualitatively, falsely predicting MI when the system is actually stable. We find that this discrepancy is due to the excitation of propagating spectral waves (PSWs) which can ballistically transport energy along the modulational spectrum, unimpeded until dissipative scales, thereby breaking the feedback loops that would otherwise sustain MIs. PSWs can be self-maintained as global modes with real frequencies and drain energy from the primary structure at a constant rate until the primary structure is depleted. To describe these waves within a reduced model, we propose an approximate spectral closure that captures them and MIs on the same footing. We also find that introducing corrections to ideal MHD, conservative or dissipative, can suppress PSWs and reinstate the accuracy of the quasilinear approximation. In this sense, ideal MHD is a ‘singular’ system that is particularly sensitive to the accuracy of the closure within mean-field models.

1 Introduction

1.1 Motivation

Coherent-structure formation from turbulence is ubiquitous in nature, intrinsically compelling, and one of the precious few aspects of turbulence that are relatively yielding to analytical efforts (Hussain, 1986; Karimabadi et al., 2013; Smolyakov et al., 2000; Krashennikov et al., 2008). For magnetohydrodynamic (MHD) turbulence (Biskamp, 2003; Beresnyak, 2019; Schekochihin, 2022) and the turbulent-dynamo problem (Brandenburg et al., 2012; Tobias, 2021; Rincon et al., 2016), this has spawned a vast body of work known as mean-field electrodynamics, or mean-field dynamo theory (Rädler, 2007; Krause & Rädler, 1980; Moffatt, 1978; Brandenburg, 2018; Gruzinov & Diamond, 1994). In such approaches, the velocity and magnetic fields are split into fluctuations and mean, or average, fields (the definition of average depends on the problem); then, various closures are implemented to obtain the mean-field evolution in response to the fluctuations (Blackman & Field, 2002; Brandenburg & Subramanian, 2005b; Pouquet et al., 1976; Kraichnan, 1977; Nicklaus & Stix, 1988).

These closures ultimately rely on a truncation of a formally infinite chain of correlation functions of increasingly high order.111Here, we use the term ‘truncation’ to refer to low-order closures in general, as opposed to precisely the closure choice of setting all correlations to zero after some cutoff. The latter, which we will refer to as a simple truncation, is done in the quasilinear approximation discussed below but not, for example, in the various τ\smash{\tau} approximations (Brandenburg & Subramanian, 2005b; Blackman & Field, 2002). This is a common approach to reduced modeling of nonlinear systems, and one can expect it to work when the fluctuations are small compared to the mean field. However, in turbulent structure formation, the usual roles of perturbation and background are switched, that is, mean fields emerge as small modulations of order-one correlation functions that characterize turbulent fluctuations. Hence, the question of the validity of low-order closures becomes nontrivial.

In this paper, we are concerned with the popular closure called the first-order smoothing (Krause & Rädler, 1980) or second-order correlation approximation (Rädler, 1982). It implies that the mean-field evolution is determined only by second-order correlations of the fluctuations, which, in a broader context, is also known as the quasilinear approximation (Dodin, 2022). On the one hand, one can expect its validity domain to be extremely limited. It can be rigorously justified only for small magnetic Reynolds numbers or short correlation times (Brandenburg & Subramanian, 2005a; Rincon, 2019), and comparisons with direct numerical simulations often corroborate this expectation (Pipin & Proctor, 2008; Schrinner et al., 2005). On the other hand, the quasilinear approximation endures in its popularity as a workhorse that is too convenient to cast aside in analytical calculations (Squire & Bhattacharjee, 2015; Lingam & Bhattacharjee, 2016; Masada & Sano, 2014; Gopalakrishnan & Singh, 2023). Furthermore, in some cases, quasilinear calculations have been found to produce good agreement with direct numerical simulations (DNS) even outside of their formal validity domain (Käpylä et al., 2006).222Confusingly, various analytical closures and DNS can conflict and agree in various combinations (Zhou & Blackman, 2021). But this issue is beyond the scope of the present paper. It is therefore important to understand when, and how exactly, the quasilinear approximation fails. Answering this question requires exploring a mean-field model that retains high-order correlations. That is what the present paper is about.

1.2 Outline

Here, we explore structure formation in MHD turbulence as a modulational instability (Zakharov & Ostrovsky, 2009) (MI) of the flow-velocity and magnetic-field fluctuations comprising the background turbulence. (For a recent overview of this approach in application to drift-wave and hydrodynamic turbulence, see Zhu & Dodin (2021); Tsiolis et al. (2020).) For simplicity, we restrict our consideration to two-dimensional (2-D) turbulence. Although bounded planar flows cannot sustain long-lasting magnetic fields in two dimensions (an anti-dynamo theorem by Zeldovich (1957)), 2-D turbulence nevertheless hosts rich physics and has long served as popular testing grounds for various aspects of MHD turbulence theory due to its (relative) tractability (Pouquet, 1978; Biskamp & Schwarz, 2001; Nazarenko, 2007; Giorgio et al., 2017; Biskamp et al., 1996). The goal of this paper is to complement those works with a detailed study of MIs using both analytical calculations and DNS.

Specifically, we study how energy is transferred from primary fluctuations to mean magnetic field and mean velocity during the early stages of structure formation, when these mean fields are weak and the primary structure can be treated as fixed. To analyze this process in detail, we assume a maximally reduced model of the turbulent fluctuations, namely, a spatially monochromatic primary structure. The simplicity of this model has been detracting attention from it in the literature in favor of more realistic turbulence settings, but there is more to it that meets the eye. Even with a monochromatic primary structure, the modulational dynamics remains remarkably intricate and, depending on the system parameters, can change drastically (figure 1) in ways that are not captured by the common closures or obvious from the standard eigenmode analysis (Friedberg, 1987). However, the simplicity of our model helps unravel this mystery.

We find that, when the primary structure truly experiences a MI, this MI can be described well with typical truncated models that neglect high-order correlations. However, already in adjacent regimes, such truncated models can fail spectacularly and produce false positives for instability. To study this process in detail, we propose an ‘extended’ quasilinear theory (XQLT) that treats the primary structure as fixed but includes the entire spectrum of modulational harmonics (as opposed to just the low-order harmonics, as usual). From XQLT, also corroborated by DNS, we find that the difference between said regimes is due to a fundamental difference in the structure of the modulational spectrum. For unstable modes, the spectrum is exponentially localised at low harmonic numbers, so truncated models are justified. But this localization does not always occur. At other parameters, modulational modes turn into constant-amplitude waves propagating down the spectrum, unimpeded until dissipative scales. These spectral waves are self-maintained as global modes with real frequencies and cause ballistic energy transport along the spectrum, breaking the feedback loops that could otherwise sustain MI. The ballistic transport drains energy from the primary structure at a constant rate until the primary structure is depleted. Notably, these effects are overlooked if one assumes from the start that the modulation spectrum is confined and that, accordingly, dissipation entirely vanishes in the limit of zero viscosity and resistivity.

A comment is due here regarding how the established truncated MHD closures fit into this picture. Given the indispensability of such simple closures, a higher-resolution understanding of their limitations can help guide and interpret their inevitable application. For example, we show that both conservative and dissipative corrections to ideal MHD tend to restore the validity of the quasilinear approximation. In this sense, ideal incompressible MHD is a ‘singular’ theory that is particularly sensitive to the accuracy of the closure within mean-field models. That said, it is an important conclusion of our work that, unless deviations from ideal incompressible MHD are substantial, changing the turbulence parameters even slightly can produce qualitative inaccuracies in an otherwise workable reduced model. Therefore, efforts to extrapolate closures or infer them from general considerations should be supplemented with first-principle calculations based on the underlying modulational dynamics. As a step in this direction, we propose a spectral closure that captures both the propagating spectral waves and MIs within our model. (But, of course, more work remains to be done for generic MHD turbulence.)

The paper is organised as follows. In section 2, we present the basic equations and derive the XQLT. In section 3, we examine the ability of truncated models of low-order to predict the MI rate. In section 4, we introduce spectral waves and describe their properties. In section 5, we propose a closure that can simultaneously capture MIs and spectral waves. In section 6, we demonstrate the impact of corrections to ideal MHD on the modulational dynamics. In section 7, we summarise our main results. Auxiliary calculations are presented in appendices.

Refer to caption
Figure 1: DNS of modulational dynamics seeded at τ=0\smash{\tau=0} with random noise. (This figure is discussed in detail in later sections. For the notation, see sections 2.1 and 2.2.) Figures (a), (c), and (e) correspond to a modulationally unstable primary mode (17) with θ±=±\upi/6\smash{\theta^{\pm}=\pm\upi/6}, and (b), (d), and (f) correspond to a modulationally stable primary mode with θ±=±\upi/3\smash{\theta^{\pm}=\pm\upi/3}. Figures (a) and (b) show zx+(t,y)\smash{z_{x}^{+}(t,y)}, (c) and (d) show zy+(t,x)\smash{z_{y}^{+}(t,x)}, and (e) and (f) show the corresponding energy breakdown. Specifically, v\smash{\mathcal{E}_{v}} is the normalised kinetic energy (21c), b=Eb/Etot\smash{\mathcal{E}_{b}=E_{b}/E_{\text{tot}}} is the normalised magnetic energy (21d), mod\smash{\mathcal{E}_{\text{mod}}} is the normalised modulational-mode energy (21b), and pri\smash{\mathcal{E}_{\text{pri}}} is the normalised primary-mode energy (21a). The colour bar shows the field amplitudes normalised to 𝒜/p\smash{\mathcal{A}/p}. The spatial coordinates are normalised to 1/p\smash{1/p}.

2 Extended quasilinear theory

2.1 Base model

2.1.1 Incompressible resistive MHD

As a base model, we assume incompressible resistive MHD with homogeneous mass density ρ=const\smash{\rho=\text{const}}. The corresponding governing equations are

t𝒗+(𝒗)𝒗=(𝒃)𝒃P+ν2𝒗,\displaystyle\partial_{t}\boldsymbol{v}+(\boldsymbol{v}\cdot\nabla)\boldsymbol{v}=(\boldsymbol{b}\cdot\nabla)\boldsymbol{b}-\nabla P+\nu\nabla^{2}\boldsymbol{v}, (1a)
t𝒃=×(𝒗×𝒃)+η2𝒃.\displaystyle\partial_{t}\boldsymbol{b}=\nabla\times(\boldsymbol{v}\times\boldsymbol{b})+\eta\nabla^{2}\boldsymbol{b}. (1b)
Here 𝒗\smash{\boldsymbol{v}} is the fluid velocity; 𝒃𝑩/4\upiρ\smash{\boldsymbol{b}\doteq\boldsymbol{B}/\sqrt{4\upi\rho}} is the local Alfvén velocity (the symbol \smash{\doteq} denotes definitions), i.e. rescaled magnetic field 𝑩\smash{\boldsymbol{B}}; and P(Pkin+B2/8\upi)/ρ\smash{P\doteq(P_{\text{kin}}+B^{2}/8\upi)/\rho} is the normalised total pressure, with Pkin\smash{P_{\text{kin}}} being the kinetic pressure. Also, ν\smash{\nu} is the viscosity and η\smash{\eta} is the resistivity, both of which are assumed either constant or negligible (except at small enough scales). Due to incompressibility and magnetic Gauss’s law, one also has
𝒗=0,𝒃=0.\displaystyle\nabla\cdot\boldsymbol{v}=0,\qquad\nabla\cdot\boldsymbol{b}=0. (1c)

In order to put (1) in a more symmetric form, let us rewrite them in terms of the two Elsässer fields 𝒛±\smash{\boldsymbol{z}^{\pm}} (Elsasser, 1950), which are also solenoidal:

𝒛±𝒗±𝒃,𝒛±=0.\displaystyle\boldsymbol{z}^{\pm}\doteq\boldsymbol{v}\pm\boldsymbol{b},\qquad\nabla\cdot\boldsymbol{z}^{\pm}=0. (2)

This leads to two coupled equations for 𝒛±\smash{\boldsymbol{z}^{\pm}}:

t𝒛±=(𝒛)𝒛±P+ν+2𝒛±+ν2𝒛,\displaystyle\partial_{t}\boldsymbol{z}^{\pm}=-(\boldsymbol{z}^{\mp}\cdot\nabla)\boldsymbol{z}^{\pm}-\nabla P+\nu_{+}\nabla^{2}\boldsymbol{z}^{\pm}+\nu_{-}\nabla^{2}\boldsymbol{z}^{\mp}, (3)

where ν+(ν+η)/2\smash{\nu_{+}\doteq(\nu+\eta)/2} and ν(νη)/2\smash{\nu_{-}\doteq(\nu-\eta)/2}. The normalised total pressure P\smash{P} can be found as follows. By taking the divergence of (1a) and using (2), one obtains

2P=[(𝒛)𝒛±].\displaystyle\nabla^{2}P=-\nabla\cdot[(\boldsymbol{z}^{\mp}\cdot\nabla)\boldsymbol{z}^{\pm}]. (4)

Let us introduce the wavevector operator 𝒌^i\smash{\hat{\boldsymbol{k}}\doteq-\mathrm{i}\nabla}, so k^2𝒌^2=2\smash{\hat{k}^{2}\doteq\smash{\hat{\boldsymbol{k}}}^{2}=-\nabla^{2}}. Let us also assume some appropriate (say, periodic) boundary conditions. Then, (4) yields

P=k^2𝒌^[(𝒛𝒌^)𝒛±]+const,\displaystyle P=-\hat{k}^{-2}\hat{\boldsymbol{k}}\cdot[(\boldsymbol{z}^{\mp}\cdot\hat{\boldsymbol{k}})\boldsymbol{z}^{\pm}]+\text{const}, (5)

whence P\smash{\nabla P} in (3) can be expressed through 𝒛±\smash{\boldsymbol{z}^{\pm}}. Alternatively, P\smash{\nabla P} can be eliminated from (3) by taking the curl of this equation and rewriting the result in terms of the Elsässer vorticities

𝒘±×𝒛±.\displaystyle\boldsymbol{w}^{\pm}\doteq\nabla\times\boldsymbol{z}^{\pm}. (6)

Indeed, due to (2), one can express 𝒛±\smash{\boldsymbol{z}^{\pm}} using a vector potential 𝒂±\smash{\boldsymbol{a}^{\pm}} such that 𝒛±=×𝒂±\smash{\boldsymbol{z}^{\pm}=\nabla\times\boldsymbol{a}^{\pm}}. Let us assume the gauge such that 𝒂±=0\smash{\nabla\cdot\boldsymbol{a}^{\pm}=0}. Then,

𝒘±=×(×𝒂±)=2𝒂±k^2𝒂±,\displaystyle\boldsymbol{w}^{\pm}=\nabla\times(\nabla\times\boldsymbol{a}^{\pm})=-\nabla^{2}\boldsymbol{a}^{\pm}\equiv\hat{k}^{2}\boldsymbol{a}^{\pm}, (7)

whence

𝒛±=ik^2(𝒌^×𝒘±).\displaystyle\boldsymbol{z}^{\pm}=\mathrm{i}\hat{k}^{-2}(\hat{\boldsymbol{k}}\times\boldsymbol{w}^{\pm}). (8)

(Here, we assume that 𝒛±\smash{\boldsymbol{z}^{\pm}} have zero spatial average, so k^2\smash{\hat{k}^{2}} is invertible.) Then, (3) can be expressed through 𝒘±\smash{\boldsymbol{w}^{\pm}} alone:

t𝒘±=𝒌^×{[(𝒌^×k^2𝒘)𝒌^](𝒌^×k^2𝒘±)}k^2(ν+𝒘±+ν𝒘).\displaystyle\partial_{t}\boldsymbol{w}^{\pm}=-\hat{\boldsymbol{k}}\times\{[(\hat{\boldsymbol{k}}\times\hat{k}^{-2}\boldsymbol{w}^{\mp})\cdot\hat{\boldsymbol{k}}](\hat{\boldsymbol{k}}\times\hat{k}^{-2}\boldsymbol{w}^{\pm})\}-\hat{k}^{2}(\nu_{+}\boldsymbol{w}^{\pm}+\nu_{-}\boldsymbol{w}^{\mp}). (9)

2.1.2 2-D collisionless model

For simplicity, below we adopt a 2-D model, meaning that 𝒛±\smash{\boldsymbol{z}^{\pm}} lie in the (x,y)\smash{(x,y)} plane and z=0\smash{\partial_{z}=0}. In this case, the only potentially nonzero component of 𝒘±\smash{\boldsymbol{w}^{\pm}} is the z\smash{z}-component, w±(×𝒛±)z\smash{w^{\pm}\doteq(\nabla\times\boldsymbol{z}^{\pm})_{z}}. We also forgo an explicit treatment of the viscosities ν±\smash{\nu_{\pm}} below. (However small, though, these viscosities remain nonnegligible in that they determine absorbing boundary conditions at infinity in the spectral space; see section 4.) Then, the vector equation (9) can be replaced with a scalar equation for w±\smash{w^{\pm}}:

tw±=ϵlmz[(k^mk^2w)(k^lw±)(k^nk^lk^2w)(k^nk^mk^2w±)],\displaystyle\partial_{t}w^{\pm}=\epsilon_{lmz}\Bigg{[}\left(\frac{\hat{k}_{m}}{\hat{k}^{2}}\,w^{\mp}\right)\left(\hat{k}_{l}w^{\pm}\right)-\left(\frac{\hat{k}_{n}\hat{k}_{l}}{\hat{k}^{2}}\,w^{\mp}\right)\left(\frac{\hat{k}_{n}\hat{k}_{m}}{\hat{k}^{2}}\,w^{\pm}\right)\Bigg{]}, (10)

where ϵlmz\smash{\epsilon_{lmz}} is the Levi–Civita symbol with the third index fixed to z\smash{z}. Note that the right-hand side of (10) vanishes whenever w+\smash{w^{+}} or w\smash{w^{-}} is zero. This means that any stationary w+\smash{w^{+}} is a solution as long as w\smash{w^{-}} is zero and vice versa.

2.1.3 Fourier representation

Assuming the Fourier representation

w±=𝒌w𝒌±(t)ei𝒌𝒙,\displaystyle w^{\pm}=\sum_{\boldsymbol{k}}w_{\boldsymbol{k}}^{\pm}(t)\mathrm{e}^{\mathrm{i}\boldsymbol{k}\cdot\boldsymbol{x}}, (11)

where 𝒙(x,y)\smash{\boldsymbol{x}\doteq(x,y)}, one arrives at the following equation for the Fourier coefficients w𝒌±\smash{w_{\boldsymbol{k}}^{\pm}}:

tw𝒌±=𝒌1,𝒌2T(𝒌1,𝒌2)w𝒌1w𝒌2±δ𝒌,𝒌2+𝒌1,\displaystyle\partial_{t}w^{\pm}_{\boldsymbol{k}}=\sum_{\boldsymbol{k}_{1},\boldsymbol{k}_{2}}T(\boldsymbol{k}_{1},\boldsymbol{k}_{2})w^{\mp}_{\boldsymbol{k}_{1}}w^{\pm}_{\boldsymbol{k}_{2}}\delta_{\boldsymbol{k},\boldsymbol{k}_{2}+\boldsymbol{k}_{1}}, (12)

where the dot is the time derivative, δ\smash{\delta} is the Kronecker delta, T(𝒌1,𝒌2)\smash{T(\boldsymbol{k}_{1},\boldsymbol{k}_{2})} are the coupling coefficients given by

T(𝒌1,𝒌2)=[𝒆z(𝒌2×𝒌1)](𝒌2+𝒌1)𝒌2k12k22,\displaystyle T(\boldsymbol{k}_{1},\boldsymbol{k}_{2})=[\boldsymbol{e}_{z}\cdot(\boldsymbol{k}_{2}\times\boldsymbol{k}_{1})]\,\frac{(\boldsymbol{k}_{2}+\boldsymbol{k}_{1})\cdot\boldsymbol{k}_{2}}{k_{1}^{2}k_{2}^{2}}, (13)

and 𝒆j\smash{\boldsymbol{e}_{j}} is the unit vector along the j\smash{j}th axis. Accordingly, the kinetic and magnetic energy are given by

Ev\displaystyle E_{v} =ρ8𝒌1k2|w𝒌++w𝒌|2𝒌Ev,𝒌,\displaystyle=\frac{\rho}{8}\,\sum_{\boldsymbol{k}}\frac{1}{k^{2}}\,|w_{\boldsymbol{k}}^{+}+w_{\boldsymbol{k}}^{-}|^{2}\equiv\sum_{\boldsymbol{k}}E_{{v},\boldsymbol{k}}, (14a)
Eb\displaystyle E_{b} =ρ8𝒌1k2|w𝒌+w𝒌|2𝒌Eb,𝒌,\displaystyle=\frac{\rho}{8}\,\sum_{\boldsymbol{k}}\frac{1}{k^{2}}\,|w_{\boldsymbol{k}}^{+}-w_{\boldsymbol{k}}^{-}|^{2}\equiv\sum_{\boldsymbol{k}}E_{{b},\boldsymbol{k}}, (14b)

where Ev,𝒌\smash{E_{{v},\boldsymbol{k}}}, and Eb,𝒌\smash{E_{{b},\boldsymbol{k}}} are the corresponding spectral energy densities. Then, the total energy density is

E=ρ4𝒌1k2(|w𝒌+|2+|w𝒌|2)𝒌E𝒌,\displaystyle E=\frac{\rho}{4}\,\sum_{\boldsymbol{k}}\frac{1}{k^{2}}\,(|w_{\boldsymbol{k}}^{+}|^{2}+|w_{\boldsymbol{k}}^{-}|^{2})\equiv\sum_{\boldsymbol{k}}E_{\boldsymbol{k}}, (15)

where E𝒌\smash{E_{\boldsymbol{k}}} is the spectral total-energy density.

2.2 Harmonic coupling mediated by a primary structure

2.2.1 Equation for the modulational spectrum

Let us explore modulational stability of a Fourier harmonic with a given wavevector 𝒌=𝒑\smash{\boldsymbol{k}=\boldsymbol{p}} and the corresponding order-one Fourier amplitudes w𝒑±\smash{w_{\boldsymbol{p}}^{\pm}}. We will call it the primary harmonic or, if multiple 𝒑\smash{\boldsymbol{p}}s are present, primary structure. Let us choose axes such that 𝒑=p𝒆y\smash{\boldsymbol{p}=p\boldsymbol{e}_{y}} and assume a perturbation with a wavevector 𝒒\smash{\boldsymbol{q}}. As can be seen from (13), nontrivial coupling requires a component of 𝒒\smash{\boldsymbol{q}} perpendicular to 𝒑\smash{\boldsymbol{p}}, and the component of 𝒒\smash{\boldsymbol{q}} parallel to 𝒑\smash{\boldsymbol{p}} does not qualitatively affect the modulational dynamics.333As shown below, the modulational dynamics is determined mostly by the asymptotic form of the coefficients αn±\smash{\alpha_{n}^{\pm}} and βn±\smash{\beta_{n}^{\pm}} (introduced in section 2.2.2) at large n\smash{n}, and this form is insensitive to the component of 𝒒\smash{\boldsymbol{q}} parallel to 𝒑\smash{\boldsymbol{p}}. For simplicity then, let us assume 𝒒=q𝒆x\smash{\boldsymbol{q}=q\boldsymbol{e}_{x}}. Let us also assume w𝒑±=𝒪(1)\smash{w_{\boldsymbol{p}}^{\pm}=\mathcal{O}(1)}, w𝒒+n𝒑±=𝒪(ϵ)\smash{w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}=\mathcal{O}(\epsilon)}, and same for any other w𝒌𝒑±\smash{w_{\boldsymbol{k}\neq\boldsymbol{p}}^{\pm}}, where ϵ1\smash{\epsilon\ll 1} is a small parameter. (Remember that 𝒪(ϵ)\smash{\mathcal{O}(\epsilon)} means, loosely speaking, ‘scales as ϵ\smash{\epsilon} or smaller’ but not necessarily ‘of order ϵ\smash{\epsilon}’.) Then, by linearizing (12) in ϵ\smash{\epsilon}, one obtains the following chain of equations:

tw𝒌±\displaystyle\partial_{t}w^{\pm}_{\boldsymbol{k}}\approx\,\, T(𝒑,𝒌)w𝒑w𝒌±+T(𝒑,𝒌+)w𝒑w𝒌+±\displaystyle T(\boldsymbol{p},\boldsymbol{k}_{-})w^{\mp}_{\boldsymbol{p}}w^{\pm}_{\boldsymbol{k}_{-}}+T(-\boldsymbol{p},\boldsymbol{k}_{+})w^{\mp}_{-\boldsymbol{p}}w^{\pm}_{\boldsymbol{k}_{+}}
+\displaystyle+ T(𝒌,𝒑)w𝒌w𝒑±+T(𝒌+,𝒑)w𝒌+w𝒑±,\displaystyle T(\boldsymbol{k}_{-},\boldsymbol{p})w^{\mp}_{\boldsymbol{k}_{-}}w^{\pm}_{\boldsymbol{p}}+T(\boldsymbol{k}_{+},-\boldsymbol{p})w^{\mp}_{\boldsymbol{k}_{+}}w^{\pm}_{-\boldsymbol{p}}, (16)

where 𝒌±𝒌±𝒑\smash{\boldsymbol{k}_{\pm}\doteq\boldsymbol{k}\pm\boldsymbol{p}} and 𝒌=𝒒+n𝒑\smash{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}} with integer n\smash{n}. For all other 𝒌\smash{\boldsymbol{k}}, one obtains tw𝒌±0\smash{\partial_{t}w^{\pm}_{\boldsymbol{k}}\approx 0}. In particular, this means that w𝒑±\smash{w^{\pm}_{\boldsymbol{p}}} can be considered fixed; i.e. A±w𝒑±const\smash{A^{\pm}\doteq w^{\pm}_{\boldsymbol{p}}\approx\text{const}}. In terms of the Elsässer fields, this corresponds to a primary structure

𝒛±=2p𝒜±sin(py+θ±)𝒆x,\displaystyle\boldsymbol{z}^{\pm}=-\frac{2}{p}\,\mathcal{A}^{\pm}\sin(py+\theta^{\pm})\boldsymbol{e}_{x}, (17)

where 𝒜±|A±|\smash{\mathcal{A^{\pm}}\doteq|A^{\pm}|} and θ±argA±\smash{\theta^{\pm}\doteq\arg A^{\pm}}.

For the modulation energy density and the primary-wave energy density,

Emod𝒌=𝒒+n𝒑E𝒌=nρ4σ=±1|w𝒒+n𝒑σ|2q2+n2p2,\displaystyle E_{\text{mod}}\doteq\sum_{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}E_{\boldsymbol{k}}=\sum_{n}\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{|w_{\boldsymbol{q}+n\boldsymbol{p}}^{\sigma}|^{2}}{q^{2}+n^{2}p^{2}}, (18a)
EpriE𝒑=ρ4σ=±1|w𝒑σ|2p2,\displaystyle E_{\text{pri}}\doteq E_{\boldsymbol{p}}=\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{|w_{\boldsymbol{p}}^{\sigma}|^{2}}{p^{2}}, (18b)

one has

dEmoddt=nρ4σ=±1w𝒒+n𝒑σtw𝒒+n𝒑σq2+n2p2+c.c.,\displaystyle\frac{\mathrm{d}E_{\text{mod}}}{\mathrm{d}t}=\sum_{n}\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{w_{\boldsymbol{q}+n\boldsymbol{p}}^{\sigma*}\,\partial_{t}w_{\boldsymbol{q}+n\boldsymbol{p}}^{\sigma}}{q^{2}+n^{2}p^{2}}+\text{c.c.}, (19a)
dEpridt=ρ4σ=±1w𝒑σw˙𝒑σp2+c.c.,\displaystyle\frac{\mathrm{d}E_{\text{pri}}}{\mathrm{d}t}=\frac{\rho}{4}\sum_{\sigma=\pm 1}\frac{w_{\boldsymbol{p}}^{\sigma*}\dot{w}_{\boldsymbol{p}}^{\sigma}}{p^{2}}+\text{c.c.}, (19b)

where ‘c.c.’ stands for ‘complex conjugate’. Using

tw𝒑σ=𝒌=𝒒+n𝒑T(𝒌,𝒑𝒌)w𝒌σw𝒑𝒌σ,\displaystyle\partial_{t}w_{\boldsymbol{p}}^{\sigma}=\sum_{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}T(\boldsymbol{k},\boldsymbol{p-k})w_{\boldsymbol{k}}^{-\sigma}w_{\boldsymbol{p-k}}^{\sigma}, (20)

for 𝒌=𝒒+n𝒑\smash{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}}, one finds that our approximate equations conserve the total energy density Etot=Epri+Emod\smash{E_{\text{tot}}=E_{\text{pri}}+E_{\text{mod}}} exactly within this approximation. (For brevity, from now on we will refer to the energy densities simply as energies.) We will assume that the initial modulational energy is negligible, so Etot=ρ𝒜2/4p2\smash{E_{\text{tot}}=\rho\mathcal{A}^{2}/4p^{2}}, where 𝒜2(𝒜+)2+(𝒜)2\smash{\mathcal{A}^{2}\doteq(\mathcal{A}^{+})^{2}+(\mathcal{A}^{-})^{2}}, and we will also use the following dimensionless energies:

pri=Epri/Etot,\displaystyle\mathcal{E}_{\text{pri}}=E_{\text{pri}}/E_{\text{tot}}, (21a)
mod=Emod/Etot,\displaystyle\mathcal{E}_{\text{mod}}=E_{\text{mod}}/E_{\text{tot}}, (21b)
v=Ev/Etot,\displaystyle\mathcal{E}_{v}=E_{v}/E_{\text{tot}}, (21c)
v=Eb/Etot.\displaystyle\mathcal{E}_{v}=E_{b}/E_{\text{tot}}. (21d)

2.2.2 Dimensionless equations

In order to reduce the number of free parameters, let us perform a variable transformation w𝒒+n𝒑±ψn±\smash{w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}\mapsto\psi_{n}^{\pm}}:

w𝒒+n𝒑±=𝒜3(q2+n2p2)𝒜p2exp(inθ++θ2)ψn±,\displaystyle w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}=\sqrt{\frac{\mathcal{A}^{3}(q^{2}+n^{2}p^{2})}{\mathcal{A}^{\mp}p^{2}}}\exp\left(\mathrm{i}n\,\frac{\theta^{+}+\theta^{-}}{2}\right)\psi_{n}^{\pm}, (22)

where ψn±=𝒪(ϵ)\smash{\psi_{n}^{\pm}=\mathcal{O}(\epsilon)}. Then, (16) becomes

tψn±=\displaystyle\partial_{t}\psi_{n}^{\pm}=\quad 𝒜eiθαnψn1±+𝒜+𝒜e±iθβnψn1\displaystyle\mathcal{A}^{\mp}\mathrm{e}^{\mp\mathrm{i}\theta}\alpha_{n}^{-}\psi_{n-1}^{\pm}+\sqrt{\mathcal{A}^{+}\mathcal{A}^{-}}\mathrm{e}^{\pm\mathrm{i}\theta}\beta_{n}^{-}\psi_{n-1}^{\mp}
+\displaystyle+ 𝒜e±iθαn+ψn+1±+𝒜+𝒜eiθβn+ψn+1,\displaystyle\mathcal{A^{\mp}}\mathrm{e}^{\pm\mathrm{i}\theta}\alpha_{n}^{+}\psi_{n+1}^{\pm}+\sqrt{\mathcal{A}^{+}\mathcal{A}^{-}}\mathrm{e}^{\mp\mathrm{i}\theta}\beta_{n}^{+}\psi_{n+1}^{\mp}, (23)

where θ(θ+θ)/2\smash{\theta\doteq(\theta^{+}-\theta^{-})/2} and αn±\smash{\alpha_{n}^{\pm}} and βn±\smash{\beta_{n}^{\pm}} are given by

αn±rr2+n(n±1)(r2+n2)(r2+(n±1)2),\displaystyle\alpha_{n}^{\pm}\doteq\mp r\,\frac{r^{2}+n(n\pm 1)}{\sqrt{(r^{2}+n^{2})(r^{2}+(n\pm 1)^{2}})}, (24a)
βn±rn(r2+n2)(r2+(n±1)2).\displaystyle\beta_{n}^{\pm}\doteq-r\,\frac{n}{\sqrt{(r^{2}+n^{2})(r^{2}+(n\pm 1)^{2})}}. (24b)

Note that these coefficients depend only on the ratio rq/p\smash{r\doteq q/p} rather than on q\smash{q} and p\smash{p} separately. From (2.2.2), one can also see that the individual phases θ+\smash{\theta^{+}} and θ\smash{\theta^{-}} per se are, expectedly, unimportant and the dynamics is affected by these phases only in the combination θ+θ\smash{\theta^{+}-\theta^{-}}.

It is also convenient for our purposes to introduce

τ𝒜t\displaystyle\tau\doteq\mathcal{A}\,t (25)

as the new dimensionless time, as well as ϕarctan(𝒜/𝒜+)[0,\upi/2]\smash{\phi\doteq\arctan(\mathcal{A}^{-}/\mathcal{A}^{+})\in[0,\upi/2]}, so that

𝒜+=𝒜cosϕ,𝒜=𝒜sinϕ.\displaystyle\mathcal{A}^{+}=\mathcal{A}\cos\phi,\qquad\mathcal{A}^{-}=\mathcal{A}\sin\phi. (26)

In particular, ϕ=0\smash{\phi=0} and ϕ=\upi/2\smash{\phi=\upi/2} correspond to structures consisting only of w𝒑+\smash{w_{\boldsymbol{p}}^{+}} and w𝒑\smash{w_{\boldsymbol{p}}^{-}}, respectively, and ϕ=\upi/4\smash{\phi=\upi/4} corresponds to a ‘balanced’ background with 𝒜+=𝒜\smash{\mathcal{A}^{+}=\mathcal{A}^{-}}. For simplicity, ϕ=\upi/4\smash{\phi=\upi/4} will be assumed from now on throughout the paper (except where specified otherwise). In this case, one has

Epri,bEpri,v=|w𝒑+w𝒑|2|w𝒑++w𝒑|2=tan2θ,\displaystyle\frac{E_{\text{pri},b}}{E_{\text{pri},v}}=\frac{|w_{\boldsymbol{p}}^{+}-w_{\boldsymbol{p}}^{-}|^{2}}{|w_{\boldsymbol{p}}^{+}+w_{\boldsymbol{p}}^{-}|^{2}}=\tan^{2}\theta, (27)

where Epri,v\smash{E_{\text{pri},v}} and Epri,b\smash{E_{\text{pri},b}} are the kinetic and magnetic energy densities in the primary mode respectively. Hence, θ\smash{\theta} can be understood as the parameter that determines the relative weights of the kinetic and magnetic-field energy in the primary-mode energy. In particular, θ=0\smash{\theta=0} corresponds to a primary structure that involves only velocity perturbations, while θ=\upi/2\smash{\theta=\upi/2} corresponds to a primary structure that involves only magnetic-field perturbations.

Using this notation, one can express (2.2.2) in the following compact form:

𝝍˙n=σ=±1𝑮nσ𝝍n+σ\displaystyle\dot{\boldsymbol{\psi}}_{n}=\sum_{\sigma=\pm 1}\boldsymbol{G}_{n}^{\sigma}\boldsymbol{\psi}_{n+\sigma} (28)

(the dot denotes τ\smash{\partial_{\tau}}), or even more compactly as

𝝍˙=𝑴𝝍.\displaystyle\dot{\boldsymbol{\psi}}=\boldsymbol{M}\boldsymbol{\psi}. (29)

Here, 𝝍n(ψn,ψn+)\smash{\boldsymbol{\psi}_{n}\doteq(\psi_{n}^{-},\psi_{n}^{+})^{\intercal}} is a two-component column vector (the symbol denotes transposition), 𝝍(,𝝍1,𝝍0,𝝍1)\smash{\boldsymbol{\psi}\doteq(\ldots,\boldsymbol{\psi}_{-1},\boldsymbol{\psi}_{0},\boldsymbol{\psi}_{1}\ldots)^{\intercal}} is an infinite-dimensional block vector consisting of 𝝍n\smash{\boldsymbol{\psi}_{n}}, the matrices 𝑮nσ\smash{\boldsymbol{G}_{n}^{\sigma}} are given by

𝑮nσ(αnσcosϕeiσθβnσcosϕsinϕeiσθβnσcosϕsinϕeiσθαnσsinϕeiσθ),\displaystyle\boldsymbol{G}_{n}^{\sigma}\doteq\begin{pmatrix}\alpha_{n}^{\sigma}\cos\phi\,\mathrm{e}^{-\mathrm{i}\sigma\theta}&\beta_{n}^{\sigma}\sqrt{\cos\phi\sin\phi}\,\mathrm{e}^{\mathrm{i}\sigma\theta}\\ \beta_{n}^{\sigma}\sqrt{\cos\phi\sin\phi}\,\mathrm{e}^{-\mathrm{i}\sigma\theta}&\alpha_{n}^{\sigma}\sin\phi\,\mathrm{e}^{\mathrm{i}\sigma\theta}\end{pmatrix}, (30)

and 𝑴\smash{\boldsymbol{M}} is the following block matrix:

𝑴{medsize}(0𝑮2+000𝑮10𝑮1+000𝑮00𝑮0+000𝑮10𝑮1+000𝑮20).\boldsymbol{M}\doteq\medsize\begin{pmatrix}\ddots&&&\vdots&&&\iddots\\ &0&\boldsymbol{G}_{-2}^{+}&0&0&0&\\ &\boldsymbol{G}_{-1}^{-}&0&\boldsymbol{G}_{-1}^{+}&0&0&\\ \ldots&0&\boldsymbol{G}_{0}^{-}&0&\boldsymbol{G}_{0}^{+}&0&\ldots\\ &0&0&\boldsymbol{G}_{1}^{-}&0&\boldsymbol{G}_{1}^{+}\\ &0&0&0&\boldsymbol{G}_{2}^{-}&0&\\ \iddots&&&\vdots&&&\ddots\end{pmatrix}.

It is readily seen then that the system’s dynamics is controlled only by the dimensionless parameters r\smash{r}, θ\smash{\theta}, and ϕ\smash{\phi}, while 𝒜\smash{\mathcal{A}} and p\smash{p} determine only its characteristic temporal and spatial scales, respectively.

Equations (28) and (29) are similar to the equations that describe a one-dimensional chain of coupled linear oscillators. The n\smash{n}th elementary cell of the chain consists of two different oscillators characterised by ψn+\smash{\psi_{n}^{+}} and ψn\smash{\psi_{n}^{-}}, respectively. Similar equations emerge in studies of phase-space turbulence for the coupling between Hermite moments of the particle phase-space probability distribution (Hammett et al., 1993; Nastac et al., 2023; Adkins & Schekochihin, 2018; Parker, 2016; Kanekar et al., 2015). Throughout this work, we use (28) and (29) to study the nature of collective oscillations in the chain of modulational harmonics, which can be understood as Floquet modes of the linearised system. Unstable oscillations of this system (i.e. MIs) lead to structure formation on top of the primary structure.

One can also consider (29) as a Schrödinger equation with a Hamiltonian i𝑴\smash{\mathrm{i}\boldsymbol{M}}. This Hamiltonian is not Hermitian, because modulations are parametrically coupled with the primary mode (through which energy can be either gained or lost) and dissipate at n\smash{n\to\infty}. At the same time, this Hamiltonian is invariant under the time-reversal transformation (ii\smash{\mathrm{i}\to-\mathrm{i}}) and the parity transformation in the spectral space (nn\smash{n\rightarrow-n}, σσ\smash{\sigma\rightarrow-\sigma}). This makes the system (28) 𝒫𝒯\smash{\mathcal{PT}}-symmetric (Bender, 2005; Bender et al., 2019). Depending on the balance of sources and sinks, such systems can support modes with entirely real frequencies (unbroken 𝒫𝒯\smash{\mathcal{PT}} symmetry) and pairs of modes whose frequencies are mutually complex-conjugate (broken 𝒫𝒯\smash{\mathcal{PT}} symmetry). This will be discussed further in section 4.

Refer to caption
Figure 2: Comparison of the time evolution of various ψn+\smash{\psi_{n}^{+}} as predicted by DNS of the nonlinear equation (3) (red dots) and XQLT (28) (black curves) for two representative cases of the MI (θ=\upi/6\smash{\theta=\upi/6}) [(a), (c), (e)] and stable modulational dynamics (θ=\upi/3\smash{\theta=\upi/3}) [(b), (d), (f)]. Nonlinear DNS are seeded with random noise, resulting in a broadband modulational dynamics, but the results shown are for a modulation corresponding to r=0.5\smash{r=0.5}, and the XQLT simulations are initialised accordingly. For the θ=\upi/6\smash{\theta=\upi/6} case, the harmonics shown are: (a) n=0\smash{n=0}, (b) n=1\smash{n=1}, and (c) n=2\smash{n=2}. In this case, XQLT adequately approximates the nonlinear dynamics until about τ30\smash{\tau\sim 30}. For the θ=\upi/3\smash{\theta=\upi/3} case, the harmonics shown are: (d) n=0\smash{n=0}, (e) n=25\smash{n=25}, and (f) n=50\smash{n=50}. In this case, XQLT adequately approximates the nonlinear dynamics indefinitely.

As long as the underlying ordering assumption [|w𝒒+n𝒑±|/|w𝒑±|𝒪(ϵ)\smash{|w_{\boldsymbol{q}+n\boldsymbol{p}}^{\pm}|/|w_{\boldsymbol{p}}^{\pm}|\sim\mathcal{O}(\epsilon)}] holds, XQLT (28) provides an excellent approximation of the nonlinear modulational dynamics of (3) (figure 2). However, practical applications of XQLT require truncations of this system. In the remainder of this work, we discuss to what extent such truncations can be adequate and compare our analytical results to DNS of (28), in lieu of the full (3).

3 Truncated models

3.1 Basic equations

A common way to truncate a problem like (29) is to postulate that all ψn\smash{\psi_{n}} with |n|>N\smash{|n|>N} are negligible. One can also understand this as imposing reflective boundary conditions in the oscillator chain {𝝍nσ}\smash{\{\boldsymbol{\psi}_{n}^{\sigma}\}}:

𝝍N+1σ=𝝍(N+1)σ=0.\displaystyle\boldsymbol{\psi}_{N+1}^{\sigma}=\boldsymbol{\psi}_{-(N+1)}^{\sigma}=0. (31)

Retained in this case are harmonics with wavevectors 𝒑\smash{\boldsymbol{p}} and 𝒒+n𝒑\smash{\boldsymbol{q}+n\boldsymbol{p}}, with n=0,±1,,±Nn=0,\pm 1,\ldots,\pm N. This means that the resulting system has Q=1+(1+2N)\smash{Q=1+(1+2N)} degrees of freedom. We call this procedure Q\smash{Q}-mode truncation (Q\smash{Q}MT). The corresponding truncation of 𝝍\smash{\boldsymbol{\psi}} will be denoted as 𝝍QMT\smash{\boldsymbol{\psi}_{Q\text{MT}}}, and the corresponding truncation of 𝑴\smash{\boldsymbol{M}} will be denoted as 𝑴QMT\smash{\boldsymbol{M}_{Q\text{MT}}}, so (29) becomes

𝝍˙QMT=𝑴QMT𝝍QMT.\displaystyle\dot{\boldsymbol{\psi}}_{Q\text{MT}}=\boldsymbol{M}_{Q\text{MT}}\boldsymbol{\psi}_{Q\text{MT}}. (32)

Because of (31), eigenmodes of this system can be understood as standing waves in the oscillator chain {𝝍nσ}\smash{\{\boldsymbol{\psi}_{n}^{\sigma}\}} and have the form 𝝍QMT=𝒀exp(iωτ)\smash{\boldsymbol{\psi}_{Q\text{MT}}=\boldsymbol{Y}\exp(-\mathrm{i}\omega\tau)}, where ω\smash{\omega} are global frequencies and 𝒀\smash{\boldsymbol{Y}} are constant ‘polarization vectors’. According to (29), these vectors satisfy 𝑫𝒀=0\smash{\boldsymbol{D}\boldsymbol{Y}=0}, where 𝑫=iω𝟙+𝑴QMT\smash{\boldsymbol{D}=\mathrm{i}\omega\mathbb{1}+\boldsymbol{M}_{Q\text{MT}}} and 𝟙\smash{\mathbb{1}} is a unit Q×Q\smash{Q\times Q} matrix. Then, ω\smash{\omega} can be found from

det𝑫=0\displaystyle\det\boldsymbol{D}=0 (33)

and MIs correspond to modes with Imω>0\smash{\operatorname{Im}\omega>0}.

The success of this approach hinges on the assumption that higher harmonics truly remain negligible. In other words, a Q\smash{Q}MT should be able to adequately describe the evolution of a modulational mode if its eigenvector 𝝍QMT\smash{\boldsymbol{\psi}_{Q\text{MT}}} is heavily weighted towards the low-order harmonics. Below, we explore to what extent this assumption is satisfied in various regimes.

3.2 Growth rates

The lowest-order Q\smash{Q}MT is 4MT, which corresponds to N=1\smash{N=1}. Within this model, the primary harmonic with 𝒌=𝒑\smash{\boldsymbol{k}=\boldsymbol{p}} is assumed to interact with the modulational mode at 𝒌=𝒒\smash{\boldsymbol{k}=\boldsymbol{q}} only through two sidebands 𝒌=𝒒±𝒑\smash{\boldsymbol{k}=\boldsymbol{q}\pm\boldsymbol{p}}; then, 𝝍4MT=(𝝍1,𝝍0,𝝍1)\smash{\boldsymbol{\psi}_{\text{4MT}}=(\boldsymbol{\psi}_{-1},\boldsymbol{\psi}_{0},\boldsymbol{\psi}_{1})^{\intercal}} and

𝑴4MT={medsize}(0𝑮1+0𝑮00𝑮0+0𝑮10).\displaystyle\boldsymbol{M}_{\text{4MT}}=\medsize\begin{pmatrix}0&\boldsymbol{G}_{-1}^{+}&0\\ \boldsymbol{G}_{0}^{-}&0&\boldsymbol{G}_{0}^{+}\\ 0&\boldsymbol{G}_{1}^{-}&0\end{pmatrix}. (34)

(This model can be understood as quasilinear, because, although the modulational dynamics remains nonlinear, the second- and higher-order harmonics of the perturbation are neglected.) The 4MT equations yield

ω2=r41+r2(1±1(1r4cos22θ)sin22ϕ).\displaystyle\omega^{2}=\frac{r^{4}}{1+r^{2}}\left(1\pm\sqrt{1-(1-r^{-4}\cos^{2}2\theta)\sin^{2}2\phi}\right). (35)

[Note that these are normalised frequencies corresponding to the normalised time τ\smash{\tau}. To obtain the actual physical frequencies, one must multiply the right-hand side of (35) by 𝒜\smash{\mathcal{A}}.] Assuming ϕ=\upi/4\smash{\phi=\upi/4}, one finds that two out of four branches of this dispersion relation are unstable at |r|<1\smash{|r|<1}. The corresponding growth rate ΓImω\smash{\Gamma\doteq\operatorname{Im}\omega} is maximised at θ=0\smash{\theta=0} and θ=\upi/2\smash{\theta=\upi/2} at the value

Γ=r1r21+r2\displaystyle\Gamma=r\sqrt{\frac{1-r^{2}}{1+r^{2}}} (36)

(from now on, we assume r>0\smash{r>0} for clarity of notation), and the corresponding polarization vectors are as follows:

𝒀0={(1,+1),θ=0,(1,1),θ=\upi/2.\displaystyle\boldsymbol{Y}_{0}=\begin{cases}(1,+1)^{\intercal},&\theta=0,\\ (1,-1)^{\intercal},&\theta=\upi/2.\\ \end{cases} (37)

The case θ=0\smash{\theta=0} can be recognised as a purely hydrodynamic Kelvin–Helmholtz instability (KHI), i.e. one that does not involve magnetic field. The case 0<θ<\upi/4\smash{0<\theta<\upi/4} can be understood as the MHD generalization of the KHI.

Potentially more accurate is a model with N=2\smash{N=2}, or 6MT, which accounts for harmonics with 𝒌=𝒒±2𝒑\smash{\boldsymbol{k}=\boldsymbol{q}\pm 2\boldsymbol{p}}; then, 𝝍4MT=(𝝍2,𝝍1,𝝍0,𝝍1,𝝍2)\smash{\boldsymbol{\psi}_{\text{4MT}}=(\boldsymbol{\psi}_{-2},\boldsymbol{\psi}_{-1},\boldsymbol{\psi}_{0},\boldsymbol{\psi}_{1},\boldsymbol{\psi}_{2})^{\intercal}} and

𝑴6MT={medsize}(0𝑮2+000𝑮10𝑮1+000𝑮00𝑮0+000𝑮10𝑮1+000𝑮20).\displaystyle\boldsymbol{M}_{\text{6MT}}=\medsize\begin{pmatrix}0&\boldsymbol{G}_{-2}^{+}&0&0&0\\ \boldsymbol{G}_{-1}^{-}&0&\boldsymbol{G}_{-1}^{+}&0&0\\ 0&\boldsymbol{G}_{0}^{-}&0&\boldsymbol{G}_{0}^{+}&0\\ 0&0&\boldsymbol{G}_{1}^{-}&0&\boldsymbol{G}_{1}^{+}\\ 0&0&0&\boldsymbol{G}_{2}^{-}&0\end{pmatrix}. (38)

One can derive an analytical expression for ω\smash{\omega} from here just like we derived (35) from (34). However, this expression is cumbersome and not particularly instructive, so it is not presented explicitly. Truncations with larger Q\smash{Q}, albeit also explored by us, will not be presented either for the same reason.

Figure 3 shows a comparison of the 4MT and 6MT predictions and the MI growth rates inferred numerically from nonlinear DNS of (9). Note that the truncated models quantitatively agree with each other and with the DNS at some parameters but drastically disagree at other parameters. In particular, the 4MT predicts that Γ\smash{\Gamma} is an even function of \upi/2θ\smash{\upi/2-\theta}, while the 6MT predicts that this is not the case. Furthermore, the DNS predicts that, for example, at r=0.5\smash{r=0.5}, the system is stable at all θ>\upi/2\smash{\theta>\upi/2} [figure 3(b)]. Let us discuss this in detail.

Refer to caption
Figure 3: (a) The growth rate Γ\smash{\Gamma} (of the most unstable mode) versus the normalised wavenumber r\smash{r} at θ=0\smash{\theta=0}: 4MT (blue), 6MT (red), and nonlinear DNS of (9) (black markers). (b) Same for Γ\smash{\Gamma} versus ϕ\smash{\phi} at r=0.5\smash{r=0.5}. The 4MT predicts identical growth rates at ϕ=0\smash{\phi=0} and ϕ=\upi\smash{\phi=\upi}, while the 6MT and nonlinear DNS predict that the system is unstable at ϕ=0\smash{\phi=0} and stable at ϕ=\upi\smash{\phi=\upi}. The growth rates are calculated with nonlinear DNS by adding random noise with ky=0\smash{k_{y}=0} noise to the primary mode, allowing this modulational noise to evolve, and then picking out the growth rate of the most unstable mode. The noise amplitude is taken small enough such that it does not affect the results.
Refer to caption
Figure 4: The difference between the growth rate predicted by a Q\smash{Q}MT and that inferred from XQLT DNS, ΓQMTΓDNS\smash{\Gamma_{Q\text{MT}}-\Gamma_{\text{DNS}}}, versus (θ,r)\smash{(\theta,r)} for ϕ=\upi/4\smash{\phi=\upi/4}: (a) 4MT, (b) 6MT. The dashed curves mark the stability boundary as determined by a parameter scan with nonlinear DNS. The preponderance of the red colour, especially outside the instability domain, indicates that truncated models tend to produce false positives for instability.

As expected from the analytical formula (36) based on the 4MT, and as we have also found numerically, MI is typically suppressed at r1\smash{r\gtrsim 1}; hence, below we focus on the regime r(0,1)\smash{r\in(0,1)}. A comparison of the growth rate predicted by nonlinear DNS and truncated models in this range is presented figure 5. The corresponding primary modes with θ\upi/4\smash{\theta\lesssim\upi/4} are modulationally unstable, and the growth rates predicted by 4MT and 6MT are in good agreement with DNS. Such primary structures are dominated by the velocity field, pri,v>pri,b\smash{\mathcal{E}_{\text{pri},v}>\mathcal{E}_{\text{pri},b}} [see (27)], and thus will be called 𝒗\smash{\boldsymbol{v}}-dominated primary modes (VDPMs). At θ\upi/4\smash{\theta\gtrsim\upi/4}, primary structures are dominated by the magnetic field, pri,b>pri,v\smash{\mathcal{E}_{\text{pri},b}>\mathcal{E}_{\text{pri},v}}, and thus will be called 𝒃\smash{\boldsymbol{b}}-dominated primary modes (BDPMs). As seen in figure 5, BDPMs are modulationally stable, but truncated models predict otherwise. Figures 4(a) and (b) show the discrepancy between the predictions of nonlinear DNS, 4MT, and 6MT for representative parameters, specifically, θ(0,\upi/2)\smash{\theta\in(0,\upi/2)} and ϕ=\upi/4\smash{\phi=\upi/4}. (Stability is primarily determined by θ\smash{\theta} and r\smash{r}. Deviations of ϕ\smash{\phi} from \upi/4\smash{\upi/4} result only in quantitative adjustments, unless ϕ\smash{\phi} is close to 0\smash{0} or \upi/2\smash{\upi/2}.) In other words, truncated modes tend to overestimate the growth rate and systematically produce false positives for instability. Figure 5 shows that, although the magnitude of disagreement decreases with N\smash{N}, the systematic overprediction of instability persists.

The reason for this overprediction can be understood by analyzing the eigenmode structure. (See also appendix A for an alternative explanation.) As to be discussed in section 4, unstable modes are evanescent spectral waves, whose |Yn|\smash{|Y_{n}|} exponentially decrease with |n|\smash{|n|} [figure 6(a)]:

|Yn|exp(|n|/l).\displaystyle|Y_{n}|\propto\exp(-|n|/l). (39)

(We assume ϕ=\upi/4\smash{\phi=\upi/4} for simplicity, so |Yn+|=|Yn|\smash{|Y_{n}^{+}|=|Y_{n}^{-}|} and the upper index in |Ynσ|\smash{|Y_{n}^{\sigma}|} can be omitted.) That is why truncated models correctly predict MI for modes that are actually unstable. The spectral scale l\smash{l} depends on the system parameters, and the smaller l\smash{l} the more accurate Q\smash{Q}MT is. At some parameters, though, particularly when θ\upi/4\smash{\theta\rightarrow\upi/4}, l\smash{l} becomes large or even infinite, such that |Yn|\smash{|Y_{n}|} asymptotes to a nonzero constant at |n|\smash{|n|\to\infty} [figure 6(b)]. In this case, Q\smash{Q}MT is bound to fail. The corresponding modes are propagating spectral waves (PSWs). While they also receive energy from the primary structure at low n\smash{n}, PSWs transport this energy along the spectrum to |n|\smash{|n|\to\infty}. There, the energy is unavoidably dissipated by viscosity or resistivity, however small those are. This dissipation makes PSWs more stable than any Q\smash{Q}MT would predict, because Q\smash{Q}MTs assume that the mode energy forever resides at small n\smash{n}, where dissipation is small or zero. Below, we discuss these effects in more detail.

Refer to caption
Figure 5: The growth rates Γ\smash{\Gamma} obtained through a Q\smash{Q}MT for θ=0\smash{\theta=0} (blue markers) and θ=\upi/2\smash{\theta=\upi/2} (orange markers) versus the number of modes retained, N\smash{N}. The solid coloured lines indicate the corresponding DNS growth rates, to which the rates predicted by the truncated models converge at large N\smash{N}.

4 Spectral waves

4.1 Dynamics at large |n||n|

First, let us consider spectral waves at |n|1\smash{|n|\gg 1}. In this limit, one has αn±r\smash{\alpha_{n}^{\pm}\rightarrow\mp r} and βn±0\smash{\beta_{n}^{\pm}\rightarrow 0}, so (29) yields two decoupled wave equations for ψn+\smash{\psi_{n}^{+}} and ψn\smash{\psi_{n}^{-}}:

ψ˙n\displaystyle\dot{\psi}_{n}^{-} =rcosϕ(eiθψn1eiθψn+1),\displaystyle=r\cos\phi\big{(}\mathrm{e}^{\mathrm{i}\theta}\psi_{n-1}^{-}-\mathrm{e}^{-\mathrm{i}\theta}\psi_{n+1}^{-}\big{)}, (40a)
ψ˙n+\displaystyle\dot{\psi}_{n}^{+} =rsinϕ(eiθψn1+eiθψn+1+).\displaystyle=r\sin\phi\big{(}\mathrm{e}^{-\mathrm{i}\theta}\psi_{n-1}^{+}-\mathrm{e}^{\mathrm{i}\theta}\psi_{n+1}^{+}\big{)}. (40b)

These equations allow solutions in the form of monochromatic waves:

ψnσ=Ψσexp(iωτ+iKσn),\displaystyle\psi_{n}^{\sigma}=\Psi^{\sigma}\exp(-\mathrm{i}\omega\tau+\mathrm{i}K^{\sigma}n), (41)

with constant amplitude Ψσ\smash{\Psi^{\sigma}}, frequency ω\smash{\omega}, and ‘wavenumber’ Kσ\smash{K^{\sigma}}. The quotation marks are added as a reminder that the corresponding waves propagate in the spectral space rather than in the physical space. Also note that K±\smash{K^{\pm}} are defined unambiguously only within the first Brillouin zone, K±(\upi,\upi)\smash{K^{\pm}\in(-\upi,\upi)}.

From (40), one readily finds that spectral waves obey the following dispersion relations:

ω=ω0σsinκσ,κ±K±±θ,\displaystyle\omega=\omega_{0}^{\sigma}\sin\kappa^{\sigma},\qquad\kappa^{\pm}\doteq K^{\pm}\pm\theta, (42)

where ω0+=2rsinϕ\smash{\omega_{0}^{+}=2r\sin\phi} and ω0=2rcosϕ\smash{\omega_{0}^{-}=2r\cos\phi}. Accordingly,spectral waves in the n\smash{n} space along the ψσ\smash{\psi^{\sigma}} channel have the group velocity

vgσ=ω0σcosκσ\displaystyle v_{g}^{\sigma}=\omega_{0}^{\sigma}\cos\kappa^{\sigma} (43)

and can propagate either up or down the spectrum (figure 7). The dependence of ω\smash{\omega} and vg±\smash{v_{g}^{\pm}} on θ\smash{\theta} can be removed by a gauge transformation. Specifically, κ±\smash{\kappa^{\pm}} becomes the true wavenumber if one adopts the variables ψ¯n±ψn±einθ\smash{\bar{\psi}_{n}^{\pm}\doteq\psi_{n}^{\pm}\mathrm{e}^{\mp\mathrm{i}n\theta}} instead of ψn±\smash{\psi_{n}^{\pm}}. Also notably, to the extent that the (large) index n\smash{n} can be approximately considered a continuous variable, (40) can be written as usual nondispersive-wave equations for ζ±(τ,n)ψ¯n±(τ)\smash{\zeta^{\pm}(\tau,n)\doteq\bar{\psi}_{n}^{\pm}(\tau)}:

τζ±=ω0±nζ±.\displaystyle\partial_{\tau}\zeta^{\pm}=-\omega_{0}^{\pm}\,\partial_{n}\zeta^{\pm}. (44)

This model corresponds to the small-κσ\smash{\kappa^{\sigma}} limit of (42), that is, ωω0σκσ\smash{\omega\approx\omega_{0}^{\sigma}\kappa^{\sigma}}.

The above equations can be used to describe PSW packets localised at large |n|\smash{|n|} but not the global eigenmodes (because the latter involve dynamics at small |n|\smash{|n|}). In particular, (42) are not enough to find the global-mode frequencies. Yet they allow one to determine the mode structure at |n|1\smash{|n|\gg 1} if one knows the mode frequency from other considerations (or, vice versa, to find the mode frequency if κ±\smash{\kappa^{\pm}} are known). In particular, unstable modes have complex ω\smash{\omega}, so, according to (42), their asymptotic wavenumber cannot be real. This explains why unstable modes are evanescent. In contrast, for real ω\smash{\omega}, (42) allows for real wavenumbers (provided that |ω|ω0±\smash{|\omega|\leq\omega_{0}^{\pm}}), i.e. predicts PSWs. These results are corroborated by DNS of (29). For example, figure 6 shows the mode structures inferred from the asymptotic dynamics of ψnσ(τ)\smash{\psi_{n}^{\sigma}(\tau)} at large enough τ\smash{\tau} for the initial conditions

ψnσ(τ=0)={ϵexp(iξnσ),n=±1,0,n±1.\displaystyle\psi_{n}^{\sigma}(\tau=0)=\begin{cases}\epsilon\exp(\mathrm{i}\xi^{\sigma}_{n}),&n=\pm 1,\\ 0,&n\neq\pm 1.\end{cases} (45)

Here, ϵ\smash{\epsilon} is a constant small amplitude (within the linear approximation, one can always rescale ψnσ\smash{\psi_{n}^{\sigma}} such that ϵ=1\smash{\epsilon=1}), and ξn\smash{\xi_{n}} are parameters that change the polarization of the initial conditions while keeping the initial modulation energy fixed. (The specific values of ξnσ\smash{\xi_{n}^{\sigma}} are given in the captions of figures in which initial condition dependent results of DNS are presented.) This form of the initial conditions is chosen to emulate a simple modulation of the primary mode.

Refer to caption
Figure 6: Typical structures of eigenmodes, specifically, |Yn|\smash{|Y_{n}|} versus n\smash{n}, at r=0.5\smash{r=0.5} and ϕ=\upi/4\smash{\phi=\upi/4} for various θ\smash{\theta}: (a) unstable modes supported by VDPMs; (b) oscillatory solutions supported by BDPMs. (The upper index in |Ynσ|\smash{|Y_{n}^{\sigma}|} is omitted because |Yn+|=|Yn|\smash{|Y_{n}^{+}|=|Y_{n}^{-}|} for the assumed parameters.) |Yn|\smash{|Y_{n}|} decreases exponentially with n\smash{n} for the former (notice the logarithmic scale) but asymptote to nonzero constants for the latter. The asymptotes are indicated by the dotted lines. These results are obtained through DNS of (29) using the initial conditions of the form (45) and normalised such that |Y0|=1\smash{|Y_{0}|=1}.

The fact that the properties of spectral waves are determined only by the asymptotic properties of αn±\smash{\alpha_{n}^{\pm}} and βn±\smash{\beta_{n}^{\pm}} makes these waves particularly robust. In particular, note that the asymptotic form of αn±\smash{\alpha_{n}^{\pm}} and βn±\smash{\beta_{n}^{\pm}} at |n|\smash{|n|\to\infty} remains the same even when 𝒑\smash{\boldsymbol{p}} and 𝒒\smash{\boldsymbol{q}} are not orthogonal, so the above results readily extend to general 𝒒\smash{\boldsymbol{q}}. Likewise, a similar picture holds also for three-dimensional interactions, although the mode polarization can be more complicated in this case. Notably, PSWs provide ballistic rather than diffusive (or super-diffusive) energy transport along the spectrum. This distinguishes them from the cascades that are commonly discussed in turbulence theories and result from eddy–eddy interactions, or wave–wave collisions (Galtier et al., 2000).

Refer to caption
Figure 7: Numerical simulations showing PSW packets propagating: (a) down the spectrum (|n|\smash{|n|} of the packet center increases with time) and (b) up the spectrum (|n|\smash{|n|} of the packet center decreases with time). In both cases, r=0.5\smash{r=0.5} and θ=\upi/3\smash{\theta=\upi/3}. Both packets are initialised using ψn+(τ=0)=exp[(nn0)2/ς+iK+n]\smash{\psi_{n}^{+}(\tau=0)=\exp[-(n-n_{0})^{2}/\varsigma+\mathrm{i}K^{+}n]} with n0=200\smash{n_{0}=200}, ς=150\smash{\varsigma=150}, and: (a) K+=\upi/6\smash{K^{+}=-\upi/6}, (b) K+=\upi/2\smash{K^{+}=\upi/2}.

4.2 Global modes in the form of PSWs

In realistic settings, spectral waves are excited at finite |n|\smash{|n|} and propagate down the spectrum. Evanescent waves reach |n|l\smash{|n|\sim l} [see (39)] in finite time τ0\smash{\tau_{0}}, get reflected, and eventually (asymptotically) settle as stationary standing waves on time scales of the order of a few τ0\smash{\tau_{0}}. Such waves can be adequately described by truncated models. In contrast, PSWs continue to propagate toward infinity indefinitely (until they dissipate at scales where viscosity or resistivity is no longer negligible). Thus, they never develop components propagating up the spectrum at large |n|\smash{|n|} and so they never become quite like the standing-wave eigenmodes predicted by truncated models (section 3.1). This means that actual PSWs cannot be adequately described by analyzing Q\smash{Q}MT in principle. Instead, they should be considered in the context of the initial-value problem, much like Langmuir waves appear in plasma kinetic theory within Landau’s initial-value approach as opposed to the Case–van Kampen true-eigenmode approach (Stix, 1992).

Consequently, even though (29) has infinitely many degrees of freedom, the number of relevant global modes, i.e. those that propagate towards infinity, is finite. We find that there are only two of them, which can be attributed to the fact that dim𝝍n=2\smash{\dim\boldsymbol{\psi}_{n}=2}. (Likewise, at a given wavenumber, only one, Langmuir, branch of relevant electrostatic waves in electron plasma remains relevant after transients are gone.) For example, for ϕ=\upi/4\smash{\phi=\upi/4}, we find from DNS that

ω=±2rcosθ.\displaystyle\omega=\pm\sqrt{2}r\cos\theta. (46)

This dependence and the corresponding mode structures are illustrated in figure 8. Each of the two modes has two wavenumbers associated with it: one determines ψnσ\smash{\psi^{\sigma}_{n}} at n+\smash{n\to+\infty}, and the other one determines ψnσ\smash{\psi^{-\sigma}_{n}} at n\smash{n\to-\infty} (figure 9). Thus, there are four κdσ\smash{\kappa^{\sigma}_{d}} overall, where σ\smash{\sigma} denotes the corresponding component of 𝝍n\smash{\boldsymbol{\psi}_{n}}, and d\smash{d} denotes the direction of propagation (d=±1\smash{d=\pm 1} is the sign of the group velocity at |n|\smash{|n|\to\infty}). They can be found by combining (46) with (42) and applying sgnvg=d\smash{\operatorname{sgn}v_{g}=d}, i.e. ω0σdcosκdσ>0\smash{\omega_{0}^{\sigma}d\cos\kappa^{\sigma}_{d}>0}. This yields

ω>0:\displaystyle\omega>0:\quad κ++=\upi2θ,\displaystyle\kappa^{+}_{+}=\frac{\upi}{2}-\theta,\kern 10.0pt κ=3\upi2+θ,\displaystyle\kappa^{-}_{-}=-\frac{3\upi}{2}+\theta, (47a)
ω<0:\displaystyle\omega<0:\qquad κ+=3\upi2θ,\displaystyle\kappa^{+}_{-}=\frac{3\upi}{2}-\theta,\kern 10.0pt κ+=\upi2+θ,\displaystyle\kappa^{-}_{+}=-\frac{\upi}{2}+\theta, (47b)

where κdσ=Kdσ+σθ\smash{\kappa^{\sigma}_{d}=K^{\sigma}_{d}+\sigma\theta} and the values of Kdσ\smash{K^{\sigma}_{d}} are defined modulo the Brillouin zone (i.e. K=3\upi/2\smash{K^{-}_{-}=-3\upi/2} should be understood as \upi/2\smash{\upi/2} and so on). Remember, though, that this applies only to global eigenmodes. Transient waves propagating in the region |n|1\smash{|n|\gg 1} can have any κσ\smash{\kappa^{\sigma}} (figure 7).

Refer to caption
Figure 8: (a) The dependence of the global-mode frequency ω>0\smash{\omega>0} on θ\smash{\theta} for various r\smash{r}. The dashed lines are the inferred solutions (46). (b) |Yn+|\smash{|Y_{n}^{+}|} (blue) and |Yn|\smash{|Y_{n}^{-}|} (orange) for the mode with ω>0\smash{\omega>0}. The eigenmode amplitudes for the ω<0\smash{\omega<0} mode are identical with the roles of |Yn+|\smash{|Y_{n}^{+}|} and |Yn|\smash{|Y_{n}^{-}|} switched.
Refer to caption
Figure 9: DNS of (29) showing global-mode PSWs for r=0.5\smash{r=0.5}, θ=\upi/3\smash{\theta=\upi/3}, and ϕ=\upi/4\smash{\phi=\upi/4}: (a) Reψn+/ϵ\smash{\operatorname{Re}\psi_{n}^{+}/\epsilon} for a PSW seeded by the initial conditions (45) with ξdσ=dexp(σi\upi/4)\smash{\xi^{\sigma}_{d}=d\exp(\sigma\mathrm{i}\upi/4)} (colour bar). The dashed lines indicate the fronts propagating at: (i) the maximum spectral speed 2r\smash{\sqrt{2}r} and (ii) the actual group velocity of the mode. The field between these dashed lines consists of transients, which are negligible behind the second front. (b) The total normalised energy of the modulation, mod/ϵ2\smash{\mathcal{E}_{\text{mod}}/\epsilon^{2}}, versus τ\smash{\tau} (black), along with its kinetic (red) and magnetic (blue) components. (c) The profiles of the spectral energy density at τ=100, 150, 200\smash{\tau=100,\,150,\,200}, clearly exhibiting left- and right-propagating fronts. The horizontal dashed line indicates the average spectral energy density n¯\smash{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu}, where the average is taken over the PSW period and over n\smash{n} between the energy fronts.
Refer to caption
Figure 10: The global-mode PSW mode with ω>0\smash{\omega>0}, r=0.5\smash{r=0.5}, and θ=\upi/3\smash{\theta=\upi/3}: same as figure 9, but in the real space as opposed to the spectral space. Specifically shown are: (a) zy+\smash{z_{y}^{+}} as a function of (τ,x)\smash{(\tau,x)} at y=0\smash{y=0}, (b) zx+\smash{z_{x}^{+}} as a function of (τ,y)\smash{(\tau,y)} at x=0\smash{x=0}.

The fact that the frequencies of these modes are real indicates that the modes are self-sustained notwithstanding the dissipation that they unavoidably experience at |n|\smash{|n|\to\infty}. The corresponding evolution of the wave spectrum is illustrated in figure 9(a) and the corresponding evolution in the physical space is shown in figure 10. This remarkable dynamics is understood from the fact that the energy drain via this dissipation is balanced by the energy injection from the primary mode at small |n|\smash{|n|}. As seen from (19) and (16), the dimensionless modulational energy (21b), or

modσ𝒜𝒜σ|ψnσ|2nn,\displaystyle\mathcal{E}_{\text{mod}}\doteq\sum_{\sigma}\frac{\mathcal{A}}{\mathcal{A}^{-\sigma}}\,|\psi_{n}^{\sigma}|^{2}\equiv\sum_{n}\mathcal{E}_{n}, (48)

is governed by

˙n=σ(Inσ+Fnσ)+c.c.,\displaystyle\dot{\mathcal{E}}_{n}=\sum_{\sigma}(I_{n}^{\sigma}+F_{n}^{\sigma})+\text{c.c.}, (49)

where In\smash{I_{n}} and Fn\smash{F_{n}} are given by

Inσ=𝒜𝒜σ(𝒜σ)3/2(βneiσθψn1σ+βn+eiσθψn+1σ)ψnσ,\displaystyle I_{n}^{\sigma}=\frac{\mathcal{A}\sqrt{\mathcal{A}^{\sigma}}}{(\mathcal{A}^{-\sigma})^{3/2}}\,(\beta_{n}^{-}\mathrm{e}^{\mathrm{i}\sigma\theta}\psi_{n-1}^{-\sigma}+\beta_{n}^{+}\mathrm{e}^{-\mathrm{i}\sigma\theta}\psi_{n+1}^{-\sigma})\psi_{n}^{\sigma*}, (50)
Fnσ=𝒜𝒜σ(αneiσθψn1σ+αn+eiσθψn+1σ)ψnσ.\displaystyle F_{n}^{\sigma}=\frac{\mathcal{A}}{\mathcal{A}^{-\sigma}}\,(\alpha_{n}^{-}\mathrm{e}^{-\mathrm{i}\sigma\theta}\psi_{n-1}^{\sigma}+\alpha_{n}^{+}\mathrm{e}^{\mathrm{i}\sigma\theta}\psi_{n+1}^{\sigma})\psi_{n}^{\sigma*}. (51)

Because nFnσ=0\smash{\sum_{n}F_{n}^{\sigma}=0}, the terms Fnσ\smash{F_{n}^{\sigma}} represent the energy flux that is carried along the modulation spectrum and is conserved within each sub-channel σ\smash{\sigma}. In contrast, In\smash{I_{n}} can be understood as injection terms in that they also appear, with the opposite signs, in the equation for the primary wave,

˙pri=nσ=±1Inσ+c.c.,\displaystyle\dot{\mathcal{E}}_{\text{pri}}=-\sum_{n}\sum_{\sigma=\pm 1}I_{n}^{\sigma}+\text{c.c.}, (52)

which follows from (20).

The linear spectral waves discussed so far correspond to the regime when the change of pri\smash{\mathcal{E}_{\text{pri}}} is negligible. Eventually, though, as higher harmonics are populated and thus absorb more energy, the primary wave will be depleted. This means, in particular, that no primary wave can be truly stationary. Even an arbitrarily small perturbation will generally launch a PSW, and this PSW will eventually deplete the primary wave.

4.3 Anomalous dissipation

Since a PSW mode carries energy towards |n|\smash{|n|\rightarrow\infty}, eventually, a large enough |n|\smash{|n|} is reached where the energy is dissipated regardless of how small the viscosity and resistivity are. Thus, a PSW exhibits an effective, or ‘anomalous’, dissipation rate γ\smash{\gamma} that is independent of ν\smash{\nu} and η\smash{\eta} in the limit ν,η0\smash{\nu,\eta\to 0}. This effect is different from the anomalous transport caused by eddy–eddy collisions in turbulence (for example, see Donzis et al. (2005)) in that the energy transport caused by a PSW is ballistic. As a result, γ\smash{\gamma} is straightforward to calculate, which is done as follows.

As discussed in section 4.2, a global PSW at large |n|\smash{|n|} has a flat mode structure, i.e. a structure with n\smash{\mathcal{E}_{n}} independent of n\smash{n}. This structure establishes itself as an expanding ‘shelf’ whose edges (wave fronts) propagate across the modulational spectrum to n±\smash{n\to\pm\infty} at the group velocity vg\smash{v_{g}} (figure 11). Since the height of the shelf, n¯\smash{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu} (the overbar here denotes averaging over the PSW temporal period and over all n\smash{n} within the shelf), remains constant, this process drains energy from the primary mode linearly in time:

˙pri=2|vg|n¯=const,\displaystyle\dot{\mathcal{E}}_{\text{pri}}=-2|v_{g}|\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu=\text{const}, (53)

where we consider the dynamics average over the PSW temporal period. Hence,

γ˙pripri=2|vg|n¯priω0±ϵ2.\displaystyle\gamma\doteq-\frac{\dot{\mathcal{E}}_{\text{pri}}}{\mathcal{E}_{\text{pri}}}=2|v_{g}|\,\frac{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu}{\mathcal{E}_{\text{pri}}}\sim\omega_{0}^{\pm}\epsilon^{2}. (54)

(The rate relative to the physical time t\smash{t}, as opposed to τ\smash{\tau}, is 𝒜\smash{\mathcal{A}} times this γ\smash{\gamma}.)

Let us again assume the initial conditions (45) and ϕ=\upi/4\smash{\phi=\upi/4}. Through DNS of (29), we find that the global PSW-mode amplitude is maximised when |arg(ξ±1σ/ξ±1σ)|=|arg(ξ±1σ/ξ1σ)|=\upi\smash{|\arg(\xi^{\sigma}_{\pm 1}/\xi^{-\sigma}_{\pm 1})|=|\arg(\xi^{\sigma}_{\pm 1}/\xi^{\sigma}_{\mp 1})|=\upi}, irrespective of θ\smash{\theta}. (Any particular choice of ξnσ\smash{\xi^{\sigma}_{n}} that satisfies this condition affects only phases of the modulational dynamics and does not impact averaged quantities that determine γ\smash{\gamma}.) This corresponds to the case when all modulational-seed energy is in the magnetic field; i.e. b=\smash{\mathcal{E}_{b}=\mathcal{E}} and v=0\smash{\mathcal{E}_{v}=0}. Figure 11 shows the corresponding anomalous dissipation rate normalised to the seed energy, along with its determining factors – the system-dependent PSW group velocity vg\smash{v_{g}} and the average spectral energy density n¯\smash{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu}, which is determined by the initial conditions. It can be seen that, for the MI unstable VDPMs (θ<\upi/4\smash{\theta<\upi/4}), spectral waves are relatively slow and have a small amplitude. The transition to modulational stability occurs at |vg|Γ\smash{|v_{g}|\sim\Gamma}. This condition can be understood as a threshold beyond which (i.e. at |vg|Γ\smash{|v_{g}|\gtrsim\Gamma}) PSWs provide a sufficiently fast escape route for the energy injected at small |n|\smash{|n|}, such that the positive feedback loops [see (50)] supporting MIs can no longer be sustained. In the context of 𝒫𝒯\smash{\mathcal{PT}} symmetry (section 2.2.2), the spectral group velocity can be understood as the effective coupling between the links in the oscillator chain, connecting the energy injection from the primary mode at small n\smash{n} to the energy sink at n\smash{n\to\infty}. As |vg|\smash{|v_{g}|} increases with θ\smash{\theta}, the system transitions from broken to unbroken 𝒫𝒯\smash{\mathcal{PT}} symmetry.

Refer to caption
Figure 11: (a) The magnitude of the group velocity |vg|\smash{|v_{g}|} of the global PSW mode (i.e. the speed at which the energy front propagates along the spectrum) versus θ\smash{\theta} for various r\smash{r}. (b) Same for the average spectral energy density n¯/ϵ2\smash{\mkern 1.5mu\overline{\mkern-1.5mu\mathcal{E}_{n}\mkern-1.5mu}\mkern 1.5mu/\epsilon^{2}}. (c) Same for the resulting average drain rate γ/ϵ2\smash{\gamma/\epsilon^{2}}, where γ\smash{\gamma} is defined in (54). The initial conditions used throughout these figures are given by (45), with ξ1σ=ξ1σ=exp(σi\upi/2).\smash{\xi_{1}^{\sigma}=-\xi_{-1}^{\sigma}=\exp(\sigma\mathrm{i}\upi/2).}

Also, notice the following. If modulations at multiple 𝒒\smash{\boldsymbol{q}}s are present, they launch independent cascades along 𝒌=𝒒+n𝒑\smash{\boldsymbol{k}=\boldsymbol{q}+n\boldsymbol{p}} and thus the drain on the shared primary mode will be additive. This regime is illustrated by figures 1(d)-(f), which show the modulational dynamics for a BDPM (θ=\upi/3\smash{\theta=\upi/3}) seeded with noise. The rather nondescript modulational dynamics seen in figures 1(d) and (e) can be understood as the superposition of the many structures like that in figure 10 for various 𝒒\smash{\boldsymbol{q}}. Also, the linear-in-time depletion of the primary mode seen in figure 1(f) can be understood as the sum of the PSW-driven energy fluxes along the constituent modulational channels.

5 Unified closure

As discussed in section 3.1, naive Q\smash{Q}MTs assume the perfectly reflecting boundary conditions (31) for spectral waves, thus precluding PSWs and ignoring potential dissipation at large |n|\smash{|n|}. One can expect that, instead of (31), it would be more accurate to adopt the following closure in anticipation of spectral waves:

ψN+1σ=eiKσψNσ\displaystyle\psi_{N+1}^{\sigma}=\mathrm{e}^{\mathrm{i}K^{\sigma}}\psi_{N}^{\sigma} (55)

(and similarly for ψ(N+1)σ\smash{\psi_{-(N+1)}^{\sigma}}). The value of Kσ\smash{K^{\sigma}} is found, for given ω\smash{\omega}, from (42):

eiKσ=eiσθ[iΩσ+1(Ωσ)2],\displaystyle\mathrm{e}^{\mathrm{i}K^{\sigma}}=\mathrm{e}^{-\mathrm{i}\sigma\theta}\Big{[}\mathrm{i}\Omega^{\sigma}+\sqrt{1-(\Omega^{\sigma})^{2}}\Big{]}, (56)

where Ωσω/ω0σ\smash{\Omega^{\sigma}\doteq\omega/\omega_{0}^{\sigma}}, and we restrict our attention to the range |ReΩ|1\smash{|\operatorname{Re}\Omega\,|\leq 1} to avoid discontinuities, a choice which is consistent with the range of observed solutions. The sign of the square root is chosen to enforce outward propagation at Imω=0\smash{\operatorname{Im}\omega=0}, which also enforces that unstable solutions exponentially decay in the direction of propagation |exp(iK+σ)|<1\smash{|\exp(\mathrm{i}K^{\sigma}_{+})|<1}.444A different choice of signs would also yield formally valid solutions, but would not have the properties that correspond to PSWs of physical interest. (Conversely, damped solutions grow in the direction of propagation.) As seen from figure 12, the resulting closure

ψN+1σ=eiσθ[iΩσ+1(Ωσ)2]ψNσ\displaystyle\psi_{N+1}^{\sigma}=\mathrm{e}^{-\mathrm{i}\sigma\theta}\,\Big{[}\mathrm{i}\Omega^{\sigma}+\sqrt{1-(\Omega^{\sigma})^{2}}\Big{]}\psi_{N}^{\sigma} (57)

leads to reasonably accurate results already N=2\smash{N=2}, both for MIs and PSWs.

Because the closure becomes exact only in the limit N\smash{N\rightarrow\infty}, it also yields spurious solutions (gray curves in figure 12) at finite N\smash{N}. One can understand this from the analogy with the Case–van Kampen modes mentioned in section 4.2. These solutions become increasingly stable, and thus less of an issue, as N\smash{N} increases. Also note that the closure adequately describes true MIs. This is explained by the fact that ψnσ\smash{\psi_{n}^{\sigma}} at large |n|\smash{|n|} are exponentially small and do not matter for unstable modes anyway.

Refer to caption
Figure 12: The θ\smash{\theta}-dependence of the global-mode frequencies derived from truncated models with the closure (57) for various N\smash{N}: Reω\smash{\operatorname{Re}\omega} (left column) and Imω\smash{\operatorname{Im}\omega} (right column). (a) and (b): N=2\smash{N=2}. (c) and (d): N=3\smash{N=3}. (e) and (f): N=4\smash{N=4}. The red and blue curves indicate the branches that are the best match to the, respectively, unstable and stable solutions obtained through DNS (black dashed lines). The gray curves show the remaining spurious solutions due to finite N\smash{N}. Note that MIs exist only for θ(0,\upi/4)\smash{\theta\in(0,\upi/4)}, while PSWs exist in the entire range θ(0,\upi/2)\smash{\theta\in(0,\upi/2)}.

6 Quasilinear approximation beyond ideal MHD

As discussed in the previous sections, the existence of PSWs undermines the standard quasilinear approximation in application to ideal incompressible MHD. Given the ubiquity of quasilinear modeling in the literature, it may seem concerning that the quasilinear approximation can fail so spectacularly. Interestingly, though, the quasilinear approximation is somewhat more robust beyond the ideal-MHD limit, both due to conservative corrections and dissipation.

Let us discuss the former first. Without attempting to describe any particular physical system, let us consider the following modified version of (3):

t𝒛±=(𝒛)𝒛±P+λx2𝒛±,\displaystyle\partial_{t}\boldsymbol{z}^{\pm}=-(\boldsymbol{z}^{\mp}\cdot\nabla)\boldsymbol{z}^{\pm}-\nabla P+\lambda\partial_{x}\nabla^{2}\boldsymbol{z}^{\pm}, (58)

where λ\smash{\lambda} is a constant parameter. The last term is intended as a simple ad hoc correction that, while causing deviation from ideal MHD, leaves the primary-mode evolution unaffected and preserves MHD’s key invariants, specifically, the energy and cross helicity (appendix B). Perhaps notably, this term introduces rotational asymmetry in the (x,y)\smash{(x,y)} plane. Similar terms can appear due to a background magnetic field or differential rotation (Heinonen et al., 2023).

Taking ϕ=\upi/4\smash{\phi=\upi/4} for simplicity, one arrives at the following corresponding modification of the linear (2.2.2):

2(τ+iδn)ψn±=eiθαnψn1±+e±iθβnψn1+e±iθαn+ψn+1±+eiθβn+ψn+1,\displaystyle\sqrt{2}(\partial_{\tau}+\mathrm{i}\delta_{n})\psi_{n}^{\pm}=\quad\mathrm{e}^{\mp\mathrm{i}\theta}\alpha_{n}^{-}\psi_{n-1}^{\pm}+\mathrm{e}^{\pm\mathrm{i}\theta}\beta_{n}^{-}\psi_{n-1}^{\mp}+\mathrm{e}^{\pm\mathrm{i}\theta}\alpha_{n}^{+}\psi_{n+1}^{\pm}+\mathrm{e}^{\mp\mathrm{i}\theta}\beta_{n}^{+}\psi_{n+1}^{\mp}, (59)

where δn±Λr(r2+n2)\smash{\delta_{n}^{\pm}\doteq\Lambda r(r^{2}+n^{2})}, Λλ/𝒜p3\smash{\Lambda\doteq\lambda/\mathcal{A}p^{3}}, and αn±\smash{\alpha_{n}^{\pm}}, βn±\smash{\beta_{n}^{\pm}} are as in (24). In the large-|n|\smash{|n|} limit, one has αn±1\smash{\alpha_{n}^{\pm}\sim 1}, βn±0\smash{\beta_{n}^{\pm}\to 0}, δnΛn2\smash{\delta_{n}\sim\Lambda n^{2}}, so one obtains the following scaling:

|ψn+1±||ψn±|Λn2.\displaystyle|\psi_{n+1}^{\pm}|\sim\frac{|\psi_{n}^{\pm}|}{\Lambda n^{2}}. (60)

This shows that the harmonic magnitude |ψn|\smash{|\psi_{n}|} decreases rapidly (super-exponentially) with |n|\smash{|n|}, and thus low-order truncations may be justified. Note that although the opposite scaling, |ψn1±||ψn±|/Λn2\smash{|\psi_{n-1}^{\pm}|\sim|\psi_{n}^{\pm}|/\Lambda n^{2}} is also formally possible, such modes cannot be excited, as is the case with inward propagating PSWs. Figure 13(a) shows that, indeed, the agreement between analytical QL growth rates and nonlinear DNS improves as the parameter Λ\smash{\Lambda} increases. The agreement becomes nearly perfect for Λ0.5\smash{\Lambda\gtrsim 0.5}. Figure 13(b) shows that the same effect can be achieved if, instead of the λ\smash{\lambda} term in (58), one introduces sufficiently strong viscosity. In this case, one also has a modified (2.2.2) with the exact form of (59), but with δn±μ(r2+n2)\smash{\delta_{n}^{\pm}\doteq\mu(r^{2}+n^{2})}, where μν+/𝒜p2\smash{\mu\doteq\nu_{+}/\mathcal{A}p^{2}} (with ν=0\smash{\nu_{-}=0} for simplicity). Again, the agreement becomes nearly exact for μ0.5\smash{\mu\gtrsim 0.5}.

Refer to caption
Figure 13: The growth rate Γ\smash{\Gamma}, at ν=0\smash{\nu_{-}=0}, versus: (a) Λλ/𝒜p3\smash{\Lambda\doteq\lambda/\mathcal{A}p^{3}} and (b) μν+/𝒜p2\smash{\mu\doteq\nu_{+}/\mathcal{A}p^{2}}. The corresponding ‘center of energy" of the unstable eigenmode, n¯nnn2/\smash{\mkern 1.5mu\overline{\mkern-1.5mun\mkern-1.5mu}\mkern 1.5mu\doteq\sqrt{\sum_{n}\mathcal{E}_{n}n^{2}/\mathcal{E}}}, is shown in (c) and (d), respectively. In all figures, the colour markers indicate the results obtained through DNS of (29), while the black solid curves indicate solutions obtained from the 4MT truncation of (59). The results are presented for the representative cases θ=0\smash{\theta=0} and θ=\upi/2\smash{\theta=\upi/2}, both at r=0.5\smash{r=0.5} and ϕ=\upi/4\smash{\phi=\upi/4}.

7 Summary

In this paper, we explore structure formation in two-dimensional MHD turbulence as a modulational instability (MI) of turbulent fluctuations. We focus on the early stages of structure formation and consider simple backgrounds that allow for a tractable model of the MI while retaining the full chain of modulational harmonics. This approach allows for a systematic examination of the importance of high-order correlations that are typically ignored in mean-field theories.

We find that, when the primary structure truly experiences a MI, this MI can be described well with typical truncated models that neglect high-order correlations. However, already in adjacent regimes, such truncated models can fail spectacularly and produce false positives for instability. To study this process in detail, we propose an ‘extended’ quasilinear theory (XQLT) that treats the primary structure as fixed but includes the entire spectrum of modulational harmonics (as opposed to just the low-order harmonics, as usual). From XQLT, also corroborated by DNS, we find that the difference between said regimes is due to a fundamental difference in the structures of the modulational spectra. For unstable modes, the spectrum is exponentially localised at low harmonic numbers, so truncated models are justified. But this localization does not always occur. At other parameters, modulational modes turn into constant-amplitude waves propagating down the spectrum, unimpeded until dissipative scales. These spectral waves are self-maintained as global modes with real frequencies and cause ballistic energy transport along the spectrum, breaking the feedback loops that could otherwise sustain MI.

The ballistic transport by PSWs drains energy from the primary structure at a constant rate until the primary structure is depleted. Because global PSWs exist at almost all parameters, this means that almost any primary structure in ideal incompressible MHD will eventually be depleted. This means, in particular, that sustainability of MHD structures is not entirely limited to the issue of exponentially growing linear instabilities, as PSWs must also be taken into consideration. To describe them within a reduced model, we propose a closure that successfully captures both the propagating spectral waves and MIs for the assumed primary structure.

Finally, we find that departures from ideal MHD constrains the form of modulational eigenmodes, in turn suppressing the amplitude and impact of high harmonics. This allows us to end on an informed yet optimistic note regarding the applicability of the quasilinear approximation for structure formation in dispersive forms of MHD. That said, it is an important conclusion of our work that, unless deviations from ideal incompressible MHD are substantial, changing the turbulence parameters even slightly can utterly destroy the applicability of an otherwise workable reduced model. An understanding of the complex modulational dynamics supported by (nearly) incompressible MHD provides important context in the interpretation of existing simple closures and potentially opens the path to building more reliable alternatives.

This research was supported by the U.S. Department of Energy through contract No. DE-AC02-09CH11466.

Appendix A Quasilinear time scale

The quasilinear approximation holds when the quasilinear MI growth rate exceeds the rate at which energy escapes to higher harmonics. In the main text, we take the latter to be the group velocity of the PSW global mode. However, one might object that the properties of PSWs are well defined only at large n\smash{n}, while the departure from quasilinear occurs at the modest n=2\smash{n=2}. It may be instructive then to develop an alternative argument that would not assume large n\smash{n}. Here, we propose such an argument by considering the initial-value problem.

For simplicity, let us assume the initial conditions such that only 𝝍0(τ=0)\smash{\boldsymbol{\psi}_{0}(\tau=0)} is nonzero, with all other 𝝍n0(τ=0)\smash{\boldsymbol{\psi}_{n\neq 0}(\tau=0)} are zero. We also adopt (ϕ,θ)=(\upi/4,0)\smash{(\phi,\theta)=(\upi/4,0)} and (ϕ,θ)=(\upi/4,\upi/2)\smash{(\phi,\theta)=(\upi/4,\upi/2)} as representative cases for VDPMs and BDPMs respectively. For these parameters, (28) can be further reduced due to the parity of the initial conditions. Also, the coupling matrices 𝑮𝒏𝝈\smash{\boldsymbol{G_{n}^{\sigma}}} become particularly simple:

𝑮𝒏𝝈={12(αnσβnσβnσαnσ),θ=0,iσ2(αnσβnσβnσαnσ),θ=\upi2.\displaystyle\boldsymbol{G_{n}^{\sigma}}=\begin{cases}\frac{1}{\sqrt{2}}\begin{pmatrix}\alpha_{n}^{\sigma}&\beta_{n}^{\sigma}\\ \beta_{n}^{\sigma}&\alpha_{n}^{\sigma}\end{pmatrix},&\theta=0,\\ \frac{\mathrm{i}\sigma}{\sqrt{2}}\begin{pmatrix}-\alpha_{n}^{\sigma}&\beta_{n}^{\sigma}\\ -\beta_{n}^{\sigma}&\alpha_{n}^{\sigma}\end{pmatrix},&\theta=\frac{\upi}{2}.\end{cases} (61)

For the case θ=0\smash{\theta=0}, we assume assume ψ0+=ψ0\smash{\psi_{0}^{+}=\psi_{0}^{-}} at τ=0\smash{\tau=0}. Then, ψ˙n+(τ=0)=ψ˙n(τ=0)\smash{\dot{\psi}_{n}^{+}(\tau=0)=\dot{\psi}_{n}^{-}(\tau=0)} as well, and thus ψn+=ψn\smash{\psi_{n}^{+}=\psi_{n}^{-}} at all times, i.e. 𝒃0\smash{\boldsymbol{b}\equiv 0}. We will refer to this case as 𝒗\smash{\boldsymbol{v}}-only initial conditions (VIC). The corresponding dynamics can be described by a single function ψnψn+=ψn\smash{\psi_{n}\doteq\psi_{n}^{+}=\psi_{n}^{-}}, which satisfies the equation

ψ˙n=αn+βn2ψn1+αn++βn+2ψn+1.\displaystyle\dot{\psi}_{n}=\frac{\alpha_{n}^{-}+\beta_{n}^{-}}{\sqrt{2}}\,\psi_{n-1}+\frac{\alpha_{n}^{+}+\beta_{n}^{+}}{\sqrt{2}}\,\psi_{n+1}. (62)

For the case θ=\upi/2\smash{\theta=\upi/2}, we assume ψ0+=ψ0\smash{\psi_{0}^{+}=-\psi_{0}^{-}} at τ=0\smash{\tau=0}. Similarly to the VIC case, this implies that 𝒗0\smash{\boldsymbol{v}\equiv 0}. We will refer to this case as 𝒃\smash{\boldsymbol{b}}-only initial conditions (BIC). The corresponding dynamics can be described by a single function ψninψn+=(i)nψn\smash{\psi_{n}\doteq-\mathrm{i}^{n}\psi_{n}^{+}=(-\mathrm{i})^{n}\psi_{n}^{-}}, which satisfies the equation

ψ˙n=αn(1)nβn2ψn1+αn+(1)nβn+2ψn+1.\displaystyle\dot{\psi}_{n}=\frac{\alpha_{n}^{-}-(-1)^{n}\beta_{n}^{-}}{\sqrt{2}}\,\psi_{n-1}+\frac{\alpha_{n}^{+}-(-1)^{n}\beta_{n}^{+}}{\sqrt{2}}\,\psi_{n+1}. (63)

To estimate the nonlinear time scale τNL\smash{\tau_{\text{NL}}} on which ψ2\smash{\psi_{2}} might become comparable with ψ0\smash{\psi_{0}}, let us consider (62) and (63) for n=2\smash{n=2} at early times, when ψ3\smash{\psi_{3}} remains negligible:

ψ2¨={α2+β22(α1+β12ψ0+α1++β1+2ψ2)for VIC,α2β22(α1+β12ψ0+α1++β1+2ψ2)for BIC.\displaystyle\ddot{\psi_{2}}=\begin{cases}\frac{\alpha_{2}^{-}+\beta_{2}^{-}}{\sqrt{2}}\left(\frac{\alpha_{1}^{-}+\beta_{1}^{-}}{\sqrt{2}}\,\psi_{0}+\frac{\alpha_{1}^{+}+\beta_{1}^{+}}{\sqrt{2}}\,\psi_{2}\right)&\text{for VIC},\\ \frac{\alpha_{2}^{-}-\beta_{2}^{-}}{\sqrt{2}}\left(\frac{\alpha_{1}^{-}+\beta_{1}^{-}}{\sqrt{2}}\,\psi_{0}+\frac{\alpha_{1}^{+}+\beta_{1}^{+}}{\sqrt{2}}\,\psi_{2}\right)&\text{for BIC}.\end{cases} (64)

Initially, |ψ0||ψ2|\smash{|\psi_{0}|\gg|\psi_{2}|}, so the second term in the brackets can be neglected in both cases and the nonlinear time scale can be readily estimated:

τNL2=12×{(α2+β2)(α1+β1)for VIC,(α2β2)(α1+β1)for BIC.\displaystyle\tau_{\text{NL}}^{-2}=\frac{1}{2}\times\begin{cases}(\alpha_{2}^{-}+\beta_{2}^{-})(\alpha_{1}^{-}+\beta_{1}^{-})&\text{for VIC},\\ (\alpha_{2}^{-}-\beta_{2}^{-})(\alpha_{1}^{-}+\beta_{1}^{-})&\text{for BIC}.\end{cases} (65)

In contrast, the quasilinear time scale τQL\smash{\tau_{\text{QL}}}, which follows from the 4MT equations (34), is identical in both cases:

τQL2=α0(α1+β1).\displaystyle\tau_{\text{QL}}^{-2}=\alpha_{0}^{-}(\alpha_{1}^{-}+\beta_{1}^{-}). (66)

The ratio of time scales is then

(τQLτNL)2={2r2+4for VIC,2rr2+4r+4for BIC.\displaystyle\left(\frac{\tau_{\text{QL}}}{\tau_{\text{NL}}}\right)^{2}=\begin{cases}2\sqrt{r^{2}+4}&\text{for VIC},\\ \frac{2r\sqrt{r^{2}+4}}{r+4}&\text{for BIC}.\end{cases} (67)

For the range of interest (|r|<1\smash{|r|<1}), this always yields

τQLτNL,BIC<1<τQLτNL,VIC.\displaystyle\frac{\tau_{\text{QL}}}{\tau_{\text{NL,BIC}}}<1<\frac{\tau_{\text{QL}}}{\tau_{\text{NL,VIC}}}. (68)

(For example, at r=0.5\smash{r=0.5}, one has τQL/τNL,VIC0.5\smash{\tau_{\text{QL}}/\tau_{\text{NL,VIC}}\approx 0.5}, and τQL/τNL,BIC1.5\smash{\tau_{\text{QL}}/\tau_{\text{NL,BIC}}\approx 1.5}.) Thus, quasilinear is sufficient for VIC but not for BIC, in agreement with our numerical results.

Appendix B Properties of (58)

The operator introduced in (58) conserves energy (here it is normalised to the constant mass density):

E14d𝒙(|𝒛+|2+|𝒛|2),\displaystyle E\doteq\frac{1}{4}\int\mathrm{d}\boldsymbol{x}\,(|\boldsymbol{z}^{+}|^{2}+|\boldsymbol{z}^{-}|^{2}), (69)

and cross helicity:

HC14d𝒙(|𝒛+|2|𝒛|2).\displaystyle H_{C}\doteq\frac{1}{4}\int\mathrm{d}\boldsymbol{x}\,(|\boldsymbol{z}^{+}|^{2}-|\boldsymbol{z}^{-}|^{2}). (70)

These can be expressed in terms of the Elsasser energies

E±d𝒙|𝒛±|2/4,\displaystyle E^{\pm}\doteq\int\mathrm{d}\boldsymbol{x}\,|\boldsymbol{z}^{\pm}|^{2}/4, (71)

as

E=E++E,HC=E+E.\displaystyle E=E^{+}+E^{-},\qquad H_{C}=E^{+}-E^{-}. (72)

Equation (58) yields

dE±dt=14d𝒙t|𝒛±|2=12d𝒙𝒛±t𝒛±=λ2d𝒙𝒛±x2𝒛±=λ2d𝒙zl±xm2zl±=λ2d𝒙(mzl±)(xmzl±)=λ4d𝒙x(mzl±mzl±)=0.\displaystyle\begin{split}\frac{\mathrm{d}E^{\pm}}{\mathrm{d}t}&=\frac{1}{4}\int\mathrm{d}\boldsymbol{x}\,\partial_{t}|\boldsymbol{z}^{\pm}|^{2}\\ &=\frac{1}{2}\int\mathrm{d}\boldsymbol{x}\,\boldsymbol{z}^{\pm}\cdot\partial_{t}\boldsymbol{z}^{\pm}\\ &=\frac{\lambda}{2}\int\mathrm{d}\boldsymbol{x}\,\boldsymbol{z}^{\pm}\cdot\partial_{x}\nabla^{2}\boldsymbol{z}^{\pm}\\ &=\frac{\lambda}{2}\int\mathrm{d}\boldsymbol{x}\,z^{\pm}_{l}\partial_{x}\partial_{m}^{2}z^{\pm}_{l}\\ &=-\frac{\lambda}{2}\int\mathrm{d}\boldsymbol{x}\,(\partial_{m}z^{\pm}_{l})(\partial_{x}\partial_{m}z^{\pm}_{l})\\ &=-\frac{\lambda}{4}\int\mathrm{d}\boldsymbol{x}\partial_{x}\,(\partial_{m}z^{\pm}_{l}\partial_{m}z^{\pm}_{l})\\ &=0.\end{split} (73)

Thus,

dEdt=dE+dt+dEdt=0,\displaystyle\frac{\mathrm{d}E}{\mathrm{d}t}=\frac{\mathrm{d}E^{+}}{\mathrm{d}t}+\frac{\mathrm{d}E^{-}}{\mathrm{d}t}=0, (74)
dHCdt=dE+dtdEdt=0,\displaystyle\frac{\mathrm{d}H_{C}}{\mathrm{d}t}=\frac{\mathrm{d}E^{+}}{\mathrm{d}t}-\frac{\mathrm{d}E^{-}}{\mathrm{d}t}=0, (75)

meaning that (58) conserves both E\smash{E} and HC\smash{H_{C}}.

References

  • Adkins & Schekochihin (2018) Adkins, T. & Schekochihin, A. A. 2018 A solvable model of Vlasov-kinetic plasma turbulence in Fourier–Hermite phase space. J. Plasma Phys. 84 (1), 905840107.
  • Bender (2005) Bender, C. M. 2005 Introduction to 𝒫𝒯\mathcal{PT}-symmetric quantum theory. Contemp. Phys. 46, 277.
  • Bender et al. (2019) Bender, C. M., Dorey, P. E., Dunning, C., Fring, A., Hook, D. W., Jones, H. F., Kuzhel, S., Lévai, G. & Tateo, R. 2019 PT Symmetry. World Scientific Publishing Europe Ltd.
  • Beresnyak (2019) Beresnyak, A. 2019 MHD turbulence. Living Rev. Comput. Astrophys. 5 (1), 2.
  • Biskamp (2003) Biskamp, D. 2003 Magnetohydrodynamic Turbulence. Cambridge Univ. Press.
  • Biskamp & Schwarz (2001) Biskamp, D. & Schwarz, E. 2001 On two-dimensional magnetohydrodynamic turbulence. Phys. Plasmas 8 (7), 3282–3292.
  • Biskamp et al. (1996) Biskamp, D., Schwarz, E. & Drake, J. F. 1996 Two-dimensional electron magnetohydrodynamic turbulence. Phys. Rev. Lett. 76 (8), 1264.
  • Blackman & Field (2002) Blackman, E. G. & Field, G. B. 2002 New dynamical mean-field dynamo theory and closure approach. Phys. Rev. Lett. 89, 265007.
  • Brandenburg (2018) Brandenburg, A. 2018 Advances in mean-field dynamo theory and applications to astrophysical turbulence. J. Plasma Phys. 84 (4), 735840404.
  • Brandenburg et al. (2012) Brandenburg, A., Sokoloff, D. & Subramanian, K. 2012 Current status of turbulent dynamo theory: from large-scale to small-scale dynamos. Space Sci. Rev. 169, 123–157.
  • Brandenburg & Subramanian (2005a) Brandenburg, A. & Subramanian, K. 2005a Astrophysical magnetic fields and nonlinear dynamo theory. Phys. Rep. 417 (1), 1–209.
  • Brandenburg & Subramanian (2005b) Brandenburg, A. & Subramanian, K. 2005b Minimal tau approximation and simulations of the alpha effect. Astron. Astrophys. 439 (3), 835–843.
  • Dodin (2022) Dodin, I. Y. 2022 Quasilinear theory for inhomogeneous plasma. J. Plasma Phys. 88, 905880407.
  • Donzis et al. (2005) Donzis, D. A., Sreenivasan, K. R. & Yeung, P. K. 2005 Scalar dissipation rate and dissipative anomaly in isotropic turbulence. J. Fluid Mech. 532, 199–216.
  • Elsasser (1950) Elsasser, W. M. 1950 The hydromagnetic equations. Phys. Rev. 79 (1), 183.
  • Friedberg (1987) Friedberg, J. P. 1987 Ideal Magnetohydrodynamics. New York: Plenum Press.
  • Galtier et al. (2000) Galtier, S., Nazarenko, S. V., Newell, A. C. & Pouquet, A. 2000 A weak turbulence theory for incompressible magnetohydrodynamics. J. Plasma Phys. 63 (5), 447–488.
  • Giorgio et al. (2017) Giorgio, E. De, Servidio, S. & Veltri, P. 2017 Coherent structure formation through nonlinear interactions in 2D magnetohydrodynamic turbulence. Sci. Rep. 7, 13849.
  • Gopalakrishnan & Singh (2023) Gopalakrishnan, K. & Singh, N. K. 2023 Mean-field dynamo due to spatio-temporal fluctuations of the turbulent kinetic energy. J. Fluid Mech. 973, A29.
  • Gruzinov & Diamond (1994) Gruzinov, A. V. & Diamond, P. H. 1994 Self-consistent theory of mean-field electrodynamics. Phys. Rev. Lett. 72 (11), 1651.
  • Hammett et al. (1993) Hammett, G. W., Beer, M. A., Dorland, W., Cowley, S. C. & Smith, S. A. 1993 Developments in the gyrofluid approach to tokamak turbulence simulations. Plasma Phys. Control. Fusion 35, 973.
  • Heinonen et al. (2023) Heinonen, R. A., Diamond, P. H., Katz, M. F. D. & Ronimo, G. E. 2023 Generation of momentum transport in weakly turbulent β\beta-plane magnetohydrodynamics. Phys. Rev. E 107, 025202.
  • Hussain (1986) Hussain, A. K. M. Fazle 1986 Coherent structures and turbulence. J. Fluid Mech. 173, 303–356.
  • Kanekar et al. (2015) Kanekar, A., Schekochihin, A. A., Dorland, W. & Loureiro, N. F. 2015 Fluctuation-dissipation relations for a plasma-kinetic Langevin equation. J. Plasma Phys. 81 (1), 305810104.
  • Käpylä et al. (2006) Käpylä, P. J., Korpi, M. J., Ossendrijver, M. & Stix, M. 2006 Magnetoconvection and dynamo coefficients - III. α\alpha-effect and magnetic pumping in the rapid rotation regime. Astron. Astrophys. 455 (2), 401–412.
  • Karimabadi et al. (2013) Karimabadi, H., Roytershteyn, V., Wan, M., Matthaeus, W. H., Daughton, W., Wu, P., Shay, M., Loring, B., Borovsky, J., Leonardis, E., Chapman, S. C. & Nakamura, T. K. M. 2013 Coherent structures, intermittent turbulence, and dissipation in high-temperature plasmas. Phys. Plasmas 20 (1), 012303.
  • Kraichnan (1977) Kraichnan, R. H. 1977 Eulerian and Lagrangian renormalization in turbulence theory. J. Fluid Mech. 83 (2), 349–374.
  • Krashennikov et al. (2008) Krashennikov, S. I., D’Ippolito, D. A. & Myra, J. R. 2008 Recent theoretical progress in understanding coherent structures in edge and SOL turbulence. J. Plasma Phys. 74 (5), 679–717.
  • Krause & Rädler (1980) Krause, F. & Rädler, K. H. 1980 Mean-field Magnetohydrodynamics and Dynamo Theory. Oxford: Pergamon Press.
  • Lingam & Bhattacharjee (2016) Lingam, M. & Bhattacharjee, A. 2016 Hall current effects in mean-field dynamo theory. Astrophys. J. 829 (1), 51.
  • Masada & Sano (2014) Masada, Y. & Sano, T. 2014 Mean-field modeling of an α2\alpha^{2} dynamo coupled with direct numerical simulations of rigidly rotating convection. Astrophys. J. Lett. 794 (1), L6.
  • Moffatt (1978) Moffatt, H. K. 1978 Field Generation in Electrically Conducting Fluids. Cambridge Univ. Press.
  • Nastac et al. (2023) Nastac, M. L., Ewart, R. J., Sengupta, W., Schekochihin, A. A., Barnes, M. & Dorland, W. D. 2023 Phase-space entropy cascade and irreversibility of stochastic heating in nearly collisionless plasma turbulence, arXiv: 2310.18211.
  • Nazarenko (2007) Nazarenko, S. 2007 2D enslaving of MHD turbulence. New J. Phys. 9 (8), 307.
  • Nicklaus & Stix (1988) Nicklaus, B. & Stix, M. 1988 Corrections to first order smoothing in mean-field electrodynamics. Geophys. Astrophys. Fluid Dyn. 43 (2), 149–166.
  • Parker (2016) Parker, J. T. 2016 Gyrokinetic simulations of fusion plasmas using a spectral velocity space representation, arXiv: 1603.04727.
  • Pipin & Proctor (2008) Pipin, V. V. & Proctor, M. R. E. 2008 Closure tests for mean field magnetohydrodynamics using a self-consistent reduced model. Mon. Not. R. Astron. Soc. 388 (1), 367–374.
  • Pouquet (1978) Pouquet, A. 1978 On two-dimensional magnetohydrodynamic turbulence. J. Fluid Mech. 88 (1), 1–16.
  • Pouquet et al. (1976) Pouquet, A., Frisch, U. & Léorat, J. 1976 Strong MHD helical turbulence and the nonlinear dynamo effect. J. Fluid Mech. 77 (2), 321–354.
  • Rädler (1982) Rädler, K. H. 1982 On dynamo action in the high-conductivity limit. Geophys. Astrophys. Fluid Dyn. 20 (3-4), 191–211.
  • Rädler (2007) Rädler, K. H. 2007 Mean-Field Dynamo Theory: Early Ideas and Today’s Problems, pp. 55–72. Dordrecht: Springer Netherlands.
  • Rincon (2019) Rincon, F. 2019 Dynamo theories. J. Plasma Phys. 85 (4), 205850401.
  • Rincon et al. (2016) Rincon, F., Califano, F., Schekochihin, A. A. & Valentini, F. 2016 Turbulent dynamo in a collisionless plasma. Proc. Natl. Acad. Sci. U.S.A. 113 (15), 3950–3953.
  • Schekochihin (2022) Schekochihin, A. A. 2022 MHD turbulence: a biased review. J. Plasma Phys. 88 (5), 155880501.
  • Schrinner et al. (2005) Schrinner, M., K.-H. Rädler, D., Schmitt, Rheinhardt, M. & Christensen, U. 2005 Mean-field view on rotating magnetoconvection and a geodynamo model. Astron. Nachrichten 326 (3-4), 245–249.
  • Smolyakov et al. (2000) Smolyakov, A. I., Diamond, P. H. & Malkov, M. 2000 Coherent structure phenomena in drift wave–zonal flow turbulence. Phys. Rev. Lett. 84, 491–494.
  • Squire & Bhattacharjee (2015) Squire, J. & Bhattacharjee, A. 2015 Electromotive force due to magnetohydrodynamic fluctuations in sheared rotating turbulence. Phys. Rev. E 92, 053101.
  • Stix (1992) Stix, T. H. 1992 Waves in Plasmas. New York: AIP.
  • Tobias (2021) Tobias, S. M. 2021 The turbulent dynamo. J. Fluid Mech. 912, P1.
  • Tsiolis et al. (2020) Tsiolis, V., Zhou, Y. & Dodin, I. Y. 2020 Structure formation in turbulence as an instability of effective quantum plasma. Phys. Lett. A 384, 126377.
  • Zakharov & Ostrovsky (2009) Zakharov, V. E. & Ostrovsky, L. A. 2009 Modulation instability: The beginning. Physica D 238, 540.
  • Zeldovich (1957) Zeldovich, Y. B. 1957 The magnetic field in the two-dimensional motion of a conducting turbulent fluid. Sov. Phys. JETP 4, 460–462.
  • Zhou & Blackman (2021) Zhou, H. & Blackman, E. G. 2021 On the shear-current effect: toward understanding why theories and simulations have mutually and separately conflicted. Mon. Not. R. Astron. Soc. 507 (4), 5732–5746.
  • Zhu & Dodin (2021) Zhu, H. & Dodin, I. Y. 2021 Wave-kinetic approach to zonal-flow dynamics: recent advances. Phys. Plasmas 28, 032303.