This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\stackMath

Spectral stability of shock-fronted travelling waves under viscous relaxation

Ian Lizarraga   and Robert Marangell School of Mathematics and Statistics, The University of Sydney, [email protected] (corresponding author)School of Mathematics and Statistics, The University of Sydney, [email protected].
Abstract

Reaction-nonlinear diffusion partial differential equations can exhibit shock-fronted travelling wave solutions. Prior work by Li et al. (2021) has demonstrated the existence of such waves for two classes of regularisations, including viscous relaxation (see [22]). Their analysis uses geometric singular perturbation theory: for sufficiently small values of a parameter ε>0\varepsilon>0 characterising the ‘strength’ of the regularisation, the waves are constructed as perturbations of a singular heteroclinic orbit. Here we show rigorously that these waves are spectrally stable for the case of viscous relaxation. Our approach is to show that for sufficiently small ε>0\varepsilon>0, the ‘full’ eigenvalue problem of the regularised system is controlled by a reduced slow eigenvalue problem defined for ε=0\varepsilon=0. In the course of our proof, we examine the ways in which this geometric construction complements and differs from constructions of other reduced eigenvalue problems that are known in the wave stability literature.

1   Introduction

Regularised PDEs exhibiting travelling wave solutions can frequently be regarded as singularly perturbed dynamical systems, since the regularisation is modelled by the inclusion of strictly higher-order partial derivatives multiplied by a small constant. The singular perturbation in turn has a physical/dynamical interpretation reflecting an inherent hierarchy of spatiotemporal scales, which we refer to more succinctly as a fast-slow structure. We consider the following reaction-nonlinear diffusion PDE with a viscous relaxation term, exhibiting (regularised) shock-fronted travelling waves:111Throughout the paper we use bar notation to refer to phase space variables. Later on we will use unbarred variables to denote variables defined on the linearized subspaces along the curve, and carets to denote projectivisations of these variables.

U¯t\displaystyle\frac{\partial\bar{U}}{\partial t} =x(D(U¯)U¯x)+R(U¯)+ε3U¯x2t\displaystyle=\frac{\partial}{\partial x}\left(D(\bar{U})\frac{\partial\bar{U}}{\partial x}\right)+R(\bar{U})+\varepsilon\frac{\partial^{3}\bar{U}}{\partial x^{2}\partial t} (1)

for (x,t)×(x,t)\in\mathbb{R}\times\mathbb{R} and ε0\varepsilon\geq 0 the singular perturbation parameter characterising the strength of the the regularisation. The nonlinear diffusion and reaction terms given by the quadratic resp. cubic polynomials D(U¯)D(\bar{U}) and R(U¯)R(\bar{U}). We suppose that D(U¯)<0D(\bar{U})<0 within an interval (a,b)(0,1)(a,b)\subset(0,1); the potential function F(U¯):=D(U¯)𝑑U¯F(\bar{U}):=\int D(\bar{U})d\bar{U} is hence nonmonotone. Nonlinear diffusion processes with nonmonotone potential functions serve as prototypical models for the formation of coherent solutions with sharp fronts; see e.g. [37, 38]. Arguably the canonical system from a physical context is the Cahn-Hilliard model [27]. The system (1) for ε=0\varepsilon=0 is derived as a particular continuum limit of stochastic agent-based models of invasion processes [31, 32]. The choice of viscous regularisation in (1) appears in the shock regularisation literature [25, 26, 38], but there are other physically-motivated high-order regularising terms that give rise to (smoothed families of) shock-fronted travelling waves. We refer the reader to [22] for a comparison of the existence problems under viscous resp. nonlocal regularising terms relative to system (1).

In the case of existence, the effectiveness of geometric singular perturbation theory (GSPT) is now well established. When (1) is written in a coordinate frame that follows the travelling wave, it takes the structure of a closed fast-slow system of ODEs. A one-parameter family of travelling waves for 0<ε10<\varepsilon\ll 1 is then constructed rigorously as a perturbation of a singular heteroclinic orbit, composed of segments defined by the slow flow along a so-called critical manifold, concatenated with fast jumps along the fast fibres, according to the dynamics of the layer problem. Li et. al. follow this approach in [22], numerically demonstrating the existence of smooth travelling waves of (1) for small values of ε>0\varepsilon>0. These one-parameter families of waves limit to ‘genuine’ (piecewise continuous) shock-fronted waves as ε0\varepsilon\to 0; furthermore, such singular limits are nonunique, strongly depending on the regularisation chosen. Li and coauthors have also studied smooth travelling-wave solutions of the unperturbed problem arising as so-called ‘hole in the wall’ solutions; see [23] and the footnote on page 9. Generally speaking, one can attempt to construct shock-fronted limiting solutions directly as weak solutions, but there may be infinitely many; see eg. [13].

It is worth re-emphasizing the effect of the singular perturbation in (1), in terms of coordinate representations of the travelling wave. With respect to a natural Liénard-type representation, the unperturbed problem can be thought of as modelling the slow dynamics on the critical manifold itself. The introduction of a regularisation term is then tantamount to embedding this slow flow within a higher-dimensional space and introducing a rule (i.e. the fast layer flow) for connecting the wavefront smoothly. In [22], it is explicitly shown that viscous relaxation adds a one-dimensional layer flow, while a fourth-order nonlocal regularization term adds a two-dimensional layer flow.

The objective of this paper is to demonstrate the spectral stability of travelling waves arising in (1). To determine the spectral stability along a wave, we must find the spectrum σ(L)\sigma(L) of a corresponding linearized operator LL. The total spectrum is decomposed into its continuous and point components as σ(L)=σe(L)σp(L)\sigma(L)=\sigma_{e}(L)\cup\sigma_{p}(L). Typically, most of the work involves deducing the point spectrum σp(L)\sigma_{p}(L), by recasting the eigenvalue problem (LλI)v=0(L-\lambda I)v=0 as a bifurcation problem posed on the underlying linearized subspaces along the wave. The major point that we want to highlight is that although the existence problem remains amenable to the usual GSPT techniques, the analysis of the stability problem (in particular, the computation of the point spectrum) is quite distinct.

The literature on geometric and analytic techniques for linearized stability problems is substantial (see e.g. [18] and [28] for comprehensive surveys). In this paper we follow the geometric framework developed in a series of seminal papers by Alexander, Gardner, and Jones [1, 8, 14]. The key point is that LL also inherits a slow-fast structure, enabling a reduction via linearized subsystems defined along either the slow manifold or along the fast fibres. The typical approach is then to hope that the spectrum of the full system, for ε>0\varepsilon>0 sufficiently small, is close to the ‘fast’ and/or ‘slow’ spectra of these linearized subsystems, which are defined for ε=0\varepsilon=0.

Our principal result is Theorem 2.11. As a consequence of the abovementioned high order of the singular perturbation term representing the regularisation, the stability problem for small ε>0\varepsilon>0 is now close to a slow eigenvalue problem defined on the critical manifold. We give an outline of the proof in Section 6.2. To summarize, we show in Sections 6 and 7 that the eigenvalues for the perturbed problem are the same as the eigenvalues for the reduced problem, which are calculated in Section 8. Together with the fact that the essential spectrum lies in the left half complex plane as shown in Section 5, we can conclude that there is no spectrum of solutions to (1) in any fixed contour KK in the right half complex plane.

The principal estimate for constructing these reduced eigenvalue problems is the elephant trunk lemma, which characterises an attracting set over a fast unstable subbundle in the linearized space when λ\lambda is not a fast eigenvalue; see the construction in the paper of Gardner & Jones [8]. We highlight that a nondegenerate fast eigenvalue problem is a requirement to construct linked elephant trunks over the entire wave. Furthermore, the construction of a slow eigenvalue problem, corresponding to the singular limit of some slow subbundle, is essentially auxiliary to that of the fast problem, and many of the key properties of the slow subbundle are indirectly enforced by the elephant trunk over the fast subbundle.

In order for us to define the promised slow eigenvalue problem, it is essential that we determine how exactly slow linear data on one branch of the slow manifold is transported across fast fibres to another branch of the slow manifold. When an elephant trunk lemma is available, it enforces certain nice properties, such as continuity of the slow subbundle across the fast layer in the singular limit. In other words, slow linear data from one branch of the slow manifolds is transported across fast fibres to the other branch identically, allowing a straightforward definition of slow eigenvalue problems across disjoint subsets of the slow manifolds.

We will show that our problem does not allow for elephant trunk-type estimates over the fast layer. The main obstruction is that, relative to a natural set of coordinates for the linearized system so that setting λ=0\lambda=0 recovers the usual variational equations along the flow, the parameter λ\lambda enters the equations ‘weakly,’ i.e. only through 𝒪(ελ)\mathcal{O}(\varepsilon\lambda) terms. The singular limit of the fast linearized system therefore degenerates such that there is ‘always a fast eigenvalue,’ i.e. there is a connection made between unstable and stable directions across the wavefront for all λ\lambda\in\mathbb{C} as ε0\varepsilon\to 0. In the present case the eigenvalue problem is low-dimensional enough that we can partly avoid this problem by concentrating on the slow problem only, but we return to this issue in the conclusion when we consider fourth-order regularizations.

Consequently, it must be the case that the slow linearized dynamics is responsible for the generation of eigenvalues near the tails of the wave. In this context, we think of our work as complementing Jones’ early result on the stability of a travelling pulse in the FitzHugh-Nagumo system, in which the eigenvalue problem of the full system is strongly controlled by the fast eigenvalue problems along the wave front and back [14]; as well as the result of Gardner and Jones on the stability of travelling waves in predator-prey systems, where both fast and slow eigenvalue problems are constructed [8].

While we do not have access to a fast elephant trunk over the entire wave, we nevertheless retain control of the linearized dynamics for small values of ε\varepsilon since λ\lambda enters the equations only weakly. The key observation here is that the eigenvalue problem can be thought of as a ‘weak’ (𝒪(ε)\mathcal{O}(\varepsilon))-perturbation of the standard variational problem. We will show that the corresponding transport of slow data is not the identity map, but instead is replaced by a nontrivial jump map that we can write down explicitly in the limit as ε0\varepsilon\to 0. The key technical tools that we use are desingularized linearized slow flows; ε\varepsilon-dependent rescalings along the fast fibres; and a continuous differentiability criterion that holds across the fold. We remark that this analysis holds also for the λ=0\lambda=0 case (corresponding to the regular variational dynamics carried by the wave), but to our knowledge such a statement about jump compatibility is new in the GSPT literature.

The exchange lemma, another fundamental GSPT technique (see e.g. [16, 17]),222Strictly speaking, in this paper we consider the case of an inclination lemma (see e.g. [5, 30]). We also point out that the adapation of the standard (k+σ)(k+\sigma)-exchange lemma to the case of zero unstable fast directions is straightforward. can also be adapted to our eigenvalue problem, since it is 𝒪(ε)\mathcal{O}(\varepsilon)-close to the standard variational problem. Exchange lemma-type estimates allow us to track solutions of the eigenvalue problem as they leave the fast layer and enter small neighborhoods of the slow manifolds, where they enter partial elephant trunks and remain well-controlled along the tails of the wave. The key result here is that solutions entering neighborhoods of the slow manifolds in a generic way will be aligned closely to a slow subbundle defined on the slow manifold after sufficiently long times. This subbundle is defined from the eigenvalue problem, and is therefore not generally equivalent to the tangent bundle of the slow manifold (but it is nearby).

These new techniques cause no extra trouble for the topological arguments that allow us to calculate the point spectrum of the perturbed problem: the corresponding evaluation of first Chern numbers of certain augmented unstable bundles (defined in [1]) follows, as usual. Following the approach of Gardner & Jones in [8], we construct a homotopy between the augmented unstable bundle ε(K)\mathcal{E}_{\varepsilon}(K) of the ‘full’ problem with ε>0\varepsilon>0 and that of a reduced problem 0(K)\mathcal{E}_{0}(K), when ε\varepsilon is sufficiently small. The reduced vector bundle is defined over separate hemispheres using the slow eigenvalue problem. The jump map now plays an essential role in defining the appropriate clutching function, which glues these vector bundles together along the edges of the hemispheres. We also highlight that the translational eigenvalue at λ=0\lambda=0 is now counted by the slow problem.

In the course of proving stability in our problem, we also demonstrate explicitly the geometric consequence of eigenvalue crossings in the full system for 0<ε10<\varepsilon\ll 1: the projectivized solutions wind around according to the number of eigenvalues crossed, and these winds occur entirely within the slow dynamics of the critical manifold. The propagation of winding in the reduced problem, as well as the generation of new winds (eigenvalues) can be identified by locating the poles and zeroes, respectively, of a meromorphic Riccati-Evans function [10], defined with respect to a judiciously chosen section of the projective dynamics. This also gives us another opportunity to demonstrate the computational utility of the Riccati-Evans function to count eigenvalues via the argument principle.

We wish to highlight some results in the literature in the context of our present problem. Slow eigenvalues appear in a doubly-diffusive Fitzhugh-Nagumo system which is considered as an early application of the augmented unstable bundle theory in [1], but here the extra fast dynamics is relatively trivial; furthermore, the ‘slow’ problem is in fact fast-slow relative to another small parameter, with its eigenvalues arising from the corresponding fast front and back. The occurrence of such a nontrivial slow eigenvalue problem as described in our current analysis appears to be quite atypical; we highlight the work of Bose [3] in this vein as one of the only comparable examples we have found in the literature. Bose provides similar geometrical results for pulse (meta)stability in singularly-perturbed nonlocal RDEs. The singular perturbation occurs at the second order, so the resulting nonlocal eigenvalue problem can then be studied with traditional Sturm-Liouville techniques, and an oscillation theorem is proven. Relative to Bose’s result, we have two additional technical issues to settle: (i) we must first ‘reduce’ our singularly-perturbed third-order eigenvalue problem (see (24)) by showing closeness to a second-order eigenvalue problem defined on the critical manifold, and (ii) we must also find the corresponding compatibility condition to connect the eigenvalue problem across disjoint branches of the critical manifold.

We also wish to highlight de Rijk et. al.’s recent work on analytical slow-fast factorizations of Evans functions [6]; the relationship between the geometric obstructions discussed above and analytic conditions to produce reduced Evans functions in de Rijk et. al.’s paper bears further exploration, but we do not attempt this in our paper.

We find it sensible to closely follow the general structure in [8] in organizing our paper. In Sec. 2, we describe the construction of a one-parameter family of travelling waves limiting onto a singular shock-fronted travelling wave using geometric singular perturbation theory. In Sec. 3, we define the relevant geometric spaces in which our objects of interest lie and write down some key facts about them. In Sec. 4 we write down the spatial eigenvalue problem for the associated linearized operator of (1), and we write down some leading-order estimates for the eigenvalues and eigenvectors of the associated asymptotic systems. We also define the linearized slow and fast subsystems that we will use often to estimate the linear dynamics of the ‘full’ system. In Sec. 5, we describe some general stability theory for travelling waves and show that the essential spectrum is bounded away from the imaginary axis in the left-half complex plane. In Sec. 6, we construct the augmented unstable vector bundles over a complex contour enclosing the remaining point spectrum, and we also take the opportunity to outline the proof of the main stability theorem in the paper. In Sec. 7, we prove the main uniform estimates for the linearized dynamics in terms of the singular subsystems. We use these estimates to construct a homotopy between some augmented unstable bundles. In Sec. 8, we characterize the point spectrum of the reduced subsystem, finally allowing us to deduce the stability of the wave of the full system for ε>0\varepsilon>0 sufficiently small. We conclude in Sec. 9, and the most technical lemmas are relegated to the Appendices.

2   Existence of travelling waves

Our first task is to construct constant-speed travelling waves for the PDE (1). In this section we summarise the setup and analysis in [22]. We highlight that the techniques and terminology used here are entirely standard in the context of GSPT; we point to the usual references [15, 21] for general definitions.

2.1   The travelling wave equations

The PDE (1) is expressed in terms of the frame (ζ,t)=(xct,t)(\zeta,t^{\prime})=(x-ct,t) (where cc is a constant parameterizing the wavespeed) is given by

U¯t\displaystyle\bar{U}_{t^{\prime}} =cU¯ζ+ζ(D(U¯)U¯ζ)+R(U¯)+ε(3U¯ζ2tc3U¯ζ3).\displaystyle=c\bar{U}_{\zeta}+\frac{\partial}{\partial\zeta}\left(D(\bar{U})\frac{\partial\bar{U}}{\partial\zeta}\right)+R(\bar{U})+\varepsilon\left(\frac{\partial^{3}\bar{U}}{\partial\zeta^{2}\partial t^{\prime}}-c\frac{\partial^{3}\bar{U}}{\partial\zeta^{3}}\right). (2)

Li et al. (2021) provide the precise conditions necessary on D(U¯)D(\bar{U}) and R(U¯)R(\bar{U}) for the existence of a shock-fronted travelling wave solution. We refer the reader to [22] for an in-depth discussion about the modelling assumptions and potential generalisations underlying these definitions. Here we continue with the definitions used by [22] in their computations for the sake of consistency: we take a quadratic nonlinear diffusion term

D(U¯)\displaystyle D(\bar{U}) =6(U¯7/12)(U¯3/4)\displaystyle=6\left(\bar{U}-7/12\right)\left(\bar{U}-3/4\right) (3)

and cubic reaction term

R(U¯)\displaystyle R(\bar{U}) =5U¯(1U¯)(U¯1/5).\displaystyle=5\bar{U}(1-\bar{U})(\bar{U}-1/5). (4)

We also record the following potential function (i.e. integral) of D(U¯)D(\bar{U}), which will be used to define the vector field in the travelling wave frame:

F(U¯)\displaystyle F(\bar{U}) =2U¯34U¯2+218U¯.\displaystyle=2\bar{U}^{3}-4\bar{U}^{2}+\frac{21}{8}\bar{U}. (5)

Note that the diffusion term is negative within the range 7/12<U¯<3/47/12<\bar{U}<3/4, and the reaction term is pinned at U¯=0\bar{U}=0 and U¯=1\bar{U}=1.

A travelling wave solution is found as a steady-state to the above equation:

R(U¯)\displaystyle-R(\bar{U}) =(cU¯+F(U¯)ζεc2U¯ζ2)ζ.\displaystyle=\left(c\bar{U}+\frac{\partial F(\bar{U})}{\partial\zeta}-\varepsilon c\frac{\partial^{2}\bar{U}}{\partial\zeta^{2}}\right)_{\zeta}. (6)

Letting

P¯\displaystyle\bar{P} =cU¯+Fζεc2U¯ζ2\displaystyle=c\bar{U}+\frac{\partial F}{\partial\zeta}-\varepsilon c\frac{\partial^{2}\bar{U}}{\partial\zeta^{2}}
V¯\displaystyle\bar{V} =F(U¯)εcU¯ζ,\displaystyle=F(\bar{U})-\varepsilon c\bar{U}_{\zeta},

we arrive at the following slow travelling wave equations (where ˙:=d/dζ\dot{~{}}:=d/d\zeta)

εU¯˙\displaystyle\varepsilon\dot{\bar{U}} =1c(F(U¯)V¯)\displaystyle=\frac{1}{c}\left(F(\bar{U})-\bar{V}\right)
P¯˙\displaystyle\dot{\bar{P}} =R(U¯)\displaystyle=-R(\bar{U}) (7)
V¯˙\displaystyle\dot{\bar{V}} =P¯cU¯.\displaystyle=\bar{P}-c\bar{U}.

We also record the equivalent fast travelling wave equations in terms of the stretched variable ξ:=ζ/ε\xi:=\zeta/\varepsilon:333From now on we use lowercase letters throughout to distinguish the variables and functions defined with respect to the stretched scaling, following the convention in [8].

u¯\displaystyle\bar{u}^{\prime} =1c(F(u¯)v¯)\displaystyle=\frac{1}{c}\left(F(\bar{u})-\bar{v}\right)
p¯\displaystyle\bar{p}^{\prime} =εR(u¯)\displaystyle=-\varepsilon R(\bar{u}) (8)
v¯\displaystyle\bar{v}^{\prime} =ε(p¯cu¯),\displaystyle=\varepsilon(\bar{p}-c\bar{u}),

where :=d/dξ{~{}}^{\prime}:=d/d\xi. From the definitions of RR and FF, the systems (7)–(8) evidently admit three fixed points for ε>0\varepsilon>0. Two of them are

z¯\displaystyle\bar{z}^{-} =(u¯,p¯,v¯)=(0,0,0)\displaystyle=(\bar{u}^{-},\bar{p}^{-},\bar{v}^{-})=(0,0,0) (9)
z¯+\displaystyle\bar{z}^{+} =(u¯+,p¯+,v¯+)=(1,c,5/8),\displaystyle=(\bar{u}^{+},\bar{p}^{+},\bar{v}^{+})=(1,c,5/8),

lying on S0a,S^{a,-}_{0} and S0a,+S^{a,+}_{0}, respectively, and the third is the point z¯m=(0.2,0.2c,0.381)\bar{z}^{m}=(0.2,0.2c,0.381). Travelling waves of (1) linking the states U¯=0\bar{U}=0 and U¯=1\bar{U}=1 correspond to heteroclinic connections between the fixed points z¯\bar{z}^{-} and z¯+\bar{z}^{+}.

Since the systems (7)–(8) are odd-dimensional, it is convenient to choose an orientation that minimizes the dimension of the unstable manifolds Wu(z¯)W^{u}(\bar{z}^{-}) and Wu(z¯+)W^{u}(\bar{z}^{+}). We therefore apply the orientation reversal ζζ\zeta\mapsto-\zeta and ξξ\xi\mapsto-\xi in the subsequent analysis and work henceforth with the pair of systems

εU¯˙\displaystyle\varepsilon\dot{\bar{U}} =1c(V¯F(U¯))\displaystyle=\frac{1}{c}\left(\bar{V}-F(\bar{U})\right)
P¯˙\displaystyle\dot{\bar{P}} =R(U¯)\displaystyle=R(\bar{U}) (10)
V¯˙\displaystyle\dot{\bar{V}} =cU¯P¯\displaystyle=c\bar{U}-\bar{P}

and

u¯\displaystyle\bar{u}^{\prime} =1c(v¯F(u¯))\displaystyle=\frac{1}{c}\left(\bar{v}-F(\bar{u})\right)
p¯\displaystyle\bar{p}^{\prime} =εR(u¯)\displaystyle=\varepsilon R(\bar{u}) (11)
v¯\displaystyle\bar{v}^{\prime} =ε(cu¯p¯).\displaystyle=\varepsilon(c\bar{u}-\bar{p}).

Let us now assess the linear stability of the fixed points in (9).

Lemma 2.1.

Let c>0c>0 be fixed. For each sufficiently small ε>0\varepsilon>0, the fixed points z¯\bar{z}^{-} and z¯+\bar{z}^{+} of (11) are both saddle-type equilibria. Furthermore, the three eigenvalues of the linearisation of (11) evaluated at both z¯\bar{z}^{-} and z¯+\bar{z}^{+} are all real, having the hierarchy

μfμs,1<0<μs,2.\displaystyle\mu_{f}\ll\mu_{s,1}<0<\mu_{s,2}. (12)

Specifically, with respect to the scaling in (11) we have μf=𝒪(1)\mu_{f}=\mathcal{O}(1) and μs,i=𝒪(ε)\mu_{s,i}=\mathcal{O}(\varepsilon) where i=1,2i=1,2.

Proof: We verify the spectral hierarchy for z¯\bar{z}^{-}; the steps for z¯+\bar{z}^{+} are identical. Let f(u¯,p¯,v¯)f(\bar{u},\bar{p},\bar{v}) denote the vector field of (11). The associated Jacobian matrix is

Df(u¯,p¯,v¯)\displaystyle Df(\bar{u},\bar{p},\bar{v}) =(D(u¯)c01cεR(u¯)00εcε0).\displaystyle=\begin{pmatrix}-\frac{D(\bar{u})}{c}&0&\frac{1}{c}\\ \varepsilon R^{\prime}(\bar{u})&0&0\\ \varepsilon c&-\varepsilon&0\end{pmatrix}. (13)

At z¯\bar{z}^{-} we have D(0)=21/8D(0)=21/8 and R(0)=1R^{\prime}(0)=-1. The characteristic polynomial of Df(z¯)Df(\bar{z}^{-}) is

p(μ)\displaystyle p(\mu) =μ3218cμ2+εμ+1c2ε2.\displaystyle=-\mu^{3}-\frac{21}{8c}\mu^{2}+\varepsilon\mu+\frac{1}{c^{2}}\varepsilon^{2}.

Therefore, the eigenvalue of largest magnitude has the expansion

μf(ε)\displaystyle\mu_{f}(\varepsilon) =218c+𝒪(ε).\displaystyle=\frac{-21}{8c}+\mathcal{O}(\varepsilon). (14)

Thus μf(ε)<0\mu_{f}(\varepsilon)<0 and uniformly bounded away from 0 for ε\varepsilon sufficiently small when c>0c>0. To estimate the remaining two 𝒪(ε)\mathcal{O}(\varepsilon) eigenvalues μs,i\mu_{s,i}, i=1,2i=1,2, we consider the scaling μ=εν\mu=\varepsilon\nu. Then

p(ν)\displaystyle p(\nu) =ε3ν3218cε2ν2+ε2ν+1c2ε2.\displaystyle=-\varepsilon^{3}\nu^{3}-\frac{21}{8c}\varepsilon^{2}\nu^{2}+\varepsilon^{2}\nu+\frac{1}{c^{2}}\varepsilon^{2}.

Dividing a factor of ε2\varepsilon^{2}, regular perturbation theory again gives the solution of p(ν)=0p(\nu)=0 in orders of ε\varepsilon as

ν±\displaystyle\nu_{\pm} =221(2±4+42c2)+𝒪(ε).\displaystyle=\frac{2}{21}\left(2\pm\sqrt{4+\frac{42}{c^{2}}}\right)+\mathcal{O}(\varepsilon). (15)

Since the roots limit to distinct real values as ε0\varepsilon\to 0, the pair must perturb to distinct real values. Furthermore μs,1=εν<0<εν+=μs,2\mu_{s,1}=\varepsilon\nu_{-}<0<\varepsilon\nu_{+}=\mu_{s,2} for each ε>0\varepsilon>0 sufficiently small. \Box

Following standard GSPT, we now define two subproblems which characterize the slow and fast singular limits of (10) and (11). These are used to define a singular heteroclinic orbit by concatenating solutions of these subsystems.

2.2   Fast and slow singular limits

We first characterize the fast dynamics by considering the singular limit ε0\varepsilon\to 0 of (11).

Definition 2.2.

The layer problem of (11) is given by

u¯\displaystyle\bar{u}^{\prime} =1c(v¯F(u¯))\displaystyle=\frac{1}{c}\left(\bar{v}-F(\bar{u})\right)
p¯\displaystyle\bar{p}^{\prime} =0\displaystyle=0 (16)
v¯\displaystyle\bar{v}^{\prime} =0.\displaystyle=0.

\triangle

In this limit, the slow variables p¯,v¯\bar{p},\bar{v} are constant, parametrising the fast fiber bundle; away from the zero set of uu^{\prime}, the layer problem then specifies the one-dimensional fast dynamics fiberwise.

Definition 2.3.

The critical manifold is S0={(u¯,p¯,v¯):v¯=F(u¯)}S_{0}=\{(\bar{u},\bar{p},\bar{v}):\,\bar{v}=F(\bar{u})\}. We decompose S0S_{0} into the pieces

S0\displaystyle S_{0} =S0a,FS0rF+S0a,+,\displaystyle=S^{a,-}_{0}\cup F_{-}\cup S^{r}_{0}\cup F_{+}\cup S^{a,+}_{0},

with

S0a,\displaystyle S^{a,-}_{0} =S0{u¯<7/12}\displaystyle=S_{0}\cap\{\bar{u}<7/12\}
F\displaystyle F_{-} =S0{u¯=u¯F:=7/12}\displaystyle=S_{0}\cap\{\bar{u}=\bar{u}_{F}:=7/12\}
S0r\displaystyle S^{r}_{0} =S0{7/12<u¯<3/4}\displaystyle=S_{0}\cap\{7/12<\bar{u}<3/4\}
F+\displaystyle F_{+} =S0{u¯=3/4}\displaystyle=S_{0}\cap\{\bar{u}=3/4\}
S0a,+\displaystyle S^{a,+}_{0} =S0{u¯>3/4}.\displaystyle=S_{0}\cap\{\bar{u}>3/4\}.

Here, FF_{-} and F+F_{+} denote resp. the left and right folds of the cubic manifold S0S_{0}, where D(U¯)D(\bar{U}) vanishes. We also define the jump-on curve JaS0a,+J_{a}\subset S^{a,+}_{0}, which is the horizontal projection of the fold FF_{-} onto S0a,+S^{a,+}_{0}:

Ja\displaystyle J_{a} =S0{u¯=u¯J:=5/6}.\displaystyle=S_{0}\cap\{\bar{u}=\bar{u}_{J}:=5/6\}.

These sets are drawn in Fig. 1(a). \triangle

The Jacobian matrix of (16) has two trivial zero eigenvalues for each xS0x\in S_{0}, corresponding to the locally two-dimensional tangent space at each point. The sign of the remaining nontrivial eigenvalue determines the dynamics along the fast fibers nearby.

Definition 2.4.

We say that S0S_{0} is normally hyperbolic at xS0x\in S_{0} if its nontrivial eigenvalue λ\lambda does not lie on the imaginary axis. Furthermore, we say that S0S_{0} is normally hyperbolic attracting if λ<0\lambda<0 and normally hyperbolic repelling if λ>0\lambda>0. \triangle

The nontrivial eigenvalue of the linearization of (16) is

λ\displaystyle\lambda =D(u¯)c,\displaystyle=-\frac{D(\bar{u})}{c},

from which it immediately follows that S0a,S^{a,-}_{0} and S0a,+S^{a,+}_{0} are normally hyperbolic attracting and S0rS^{r}_{0} is normally hyperbolic repelling when c>0c>0; see Fig. 1(b). Well-known theorems of Fenichel [7] specify the existence of normally hyperbolic invariant slow manifolds Sεa,±S^{a,\pm}_{\varepsilon} and SεrS^{r}_{\varepsilon} and fast fiber bundles near to compact normally hyperbolic subsets of their singular counterparts S0a,±S^{a,\pm}_{0} resp. S0rS^{r}_{0}.

(a)Refer to caption
(b)Refer to caption

Figure 1: (a) Depiction of the critical manifold S0S_{0} from Def. 2.3 in (u¯,p¯,v¯)(\bar{u},\bar{p},\bar{v})-space. (b) Dynamics of the layer problem (11) away from S0S_{0}, projected onto the (u¯,p¯)(\bar{u},\bar{p}) plane.

The critical manifold loses normal hyperbolicity via a simple zero eigenvalue crossing at the folds F±F_{\pm}. Slow dynamics may be defined on S0S_{0} away from F±F_{\pm} by considering the singular limit of (10). Differentiating both sides of V¯=F(U¯)\bar{V}=F(\bar{U}) with respect to ζ\zeta we have

V¯˙\displaystyle\dot{\bar{V}} =D(U¯)U¯˙.\displaystyle=D(\bar{U})\dot{\bar{U}}. (17)

Solutions of the reduced problem blow up as they approach F±F_{\pm}, and so we rescale ζ\zeta by the function D(U¯)D(\bar{U}) to extend solutions of the reduced problem smoothly across the folds. Altogether, we define the following two systems.

Definition 2.5.

The reduced problem defined on S0S_{0} is given by

D(U¯)U¯˙\displaystyle D(\bar{U})\dot{\bar{U}} =cU¯P¯\displaystyle=c\bar{U}-\bar{P} (18)
P¯˙\displaystyle\dot{\bar{P}} =R(U¯).\displaystyle=R(\bar{U}).

Rescaling ζ\zeta by the diffusion D(U¯)D(\bar{U}), we obtain the (desingularized) slow flow

U¯˙\displaystyle\dot{\bar{U}} =cU¯P¯\displaystyle=c\bar{U}-\bar{P} (19)
P¯˙\displaystyle\dot{\bar{P}} =R(U¯)D(U¯).\displaystyle=R(\bar{U})D(\bar{U}).

\triangle

The slow flow extends the solutions of the reduced problem across the lines of singularities given by F±F_{\pm} in finite time. Note that although topological equivalence of solutions of (18) and (19) is assured on S0a,±S^{a,\pm}_{0} and S0rS^{r}_{0}, there is an orientation reversal on S0rS^{r}_{0} owing to the change in sign of D(U¯)D(\bar{U}).

The slow flow (19) admits five fixed points corresponding to the zeroes of R(U¯)R(\bar{U}) and D(U¯)D(\bar{U}). A calculation of the eigenvalues of the Jacobian of (19) shows that the fixed points

Z¯\displaystyle\bar{Z}^{-} =(0,0)\displaystyle=(0,0) (20)
Z¯+\displaystyle\bar{Z}^{+} =(1,c)\displaystyle=(1,c)

are saddle-type when c>0c>0. These correspond to the singular limits of the fixed points (9) of the full system as ε0\varepsilon\to 0.

Remark 2.6.

The set FF_{-} consists mostly of so-called regular (or generic) fold points; we refer the reader to the precise definition in [21].444There are also isolated canard points, corresponding to folded singularities where trajectories beginning near an attracting branch of a slow manifold are able to cross over to the repelling branch, tracking it for 𝒪(1)\mathcal{O}(1) time. In the travelling wave literature such points are referred to as ‘holes in the wall,’ relating to the observation that D(U¯)=0D(\bar{U})=0 defines ‘walls’ of singularities corresponding to the fold lines; see [9], [23], and [36]. One of the seminal achievements of GSPT is to rigorously characterise the flow of typical orbits across folds, where Fenichel theory breaks down. This requires extending the slow manifolds (which a priori exist only over compact normally hyperbolic subsets of the critical manifold) across such folded singularities using geometric blow-up theory; see e.g. [20, 34, 35]. \triangle

We are interested in the existence of a singular heteroclinic orbit that connects Z¯\bar{Z}^{-} to Z¯+\bar{Z}^{+} by jumping across a regular fold point on FF_{-}. This singular orbit is a hybrid object defined by concatenating solutions of (16) and (19).

Hypothesis 2.7.

There exists a singular heteroclinic orbit Γ0\Gamma_{0} for c=c0>0c=c_{0}>0, connecting U¯=0\bar{U}=0 to U¯=1\bar{U}=1. This orbit is formally defined as the concatenation of the following solution segments:

  • the portion of the unstable manifold Wu(Z¯)W^{u}(\bar{Z}^{-}) which connects Z¯\bar{Z}^{-} to a regular fold point (7/12,p¯f)(7/12,\bar{p}_{f}) on FF_{-}, corresponding to a unique trajectory XR(ζ)X_{R}(\zeta) of (19) defined on ζ(,0]\zeta\in(-\infty,0]

  • the horizontal fast fiber connecting Wu(Z¯)FW^{u}(\bar{Z}^{-})\cap F_{-} to a point (5/6,p¯f,v¯f)(5/6,\bar{p}_{f},\bar{v}_{f}) on the jump curve JaJ_{a}, corresponding to a unique trajectory xR(ξ)x_{R}(\xi) of (16) defined on ξ(,)\xi\in(-\infty,\infty), and

  • the portion of the stable manifold Ws(Z¯+)W^{s}(\bar{Z}^{+}) which connects (5/6,p¯f)(5/6,\bar{p}_{f}) to Z¯+\bar{Z}^{+}, corresponding to a unique trajectory XR(ζ)X_{R}(\zeta) of (19) defined on ζ[0,)\zeta\in[0,\infty). We also suppose that the slow flow is transverse to JaJ_{a} at Ws(Z¯+)JaW^{s}(\bar{Z}^{+})\cap J_{a}.

We furthermore suppose that the heteroclinic connection Γ0\Gamma_{0} is transversal with respect to variation in cc; i.e. extending the reduced systems with the equation c=0c^{\prime}=0, we have Γ0=Wu(Z¯,c)Ws(Z¯+,c)\Gamma_{0}=W^{u}(\bar{Z}^{-},c)\mathrel{\text{\vbox{ \halign{#\cr\smash{$-$}\crcr$\pitchfork$\crcr} }}}W^{s}(\bar{Z}^{+},c), where Wu(Z¯,c)=cWu(Z¯(c))W^{u}(\bar{Z}^{-},c)=\cup_{c}W^{u}(\bar{Z}^{-}(c)) and Ws(Z¯+,c)=cWs(Z¯+(c))W^{s}(\bar{Z}^{+},c)=\cup_{c}W^{s}(\bar{Z}^{+}(c)) denote, respectively, the extended two-dimensional center-unstable (resp. three-dimensional center-stable) manifolds, continued across the fast jump in the usual way.

Refer to caption
Figure 2: Segments of Wu(Z¯)W^{u}(\bar{Z}^{-}) and Ws(Z¯+)W^{s}(\bar{Z}^{+}) arising from the slow flow (19)–(20), as cc is varied within the range [0.19,0.23][0.19,0.23]. The stable manifolds are extended slightly past JaJ_{a} to illustrate the transversality of the slow flow at JaJ_{a}. The singular homoclinic orbit Γ0\Gamma_{0} is depicted by the red curve for c00.199362c_{0}\approx 0.199362. The blue arrows indicate how the (un)stable manifolds vary as cc is increased. The magenta dashed line denotes the U¯\bar{U}-nullcline for c=c0c=c_{0}.

Hypothesis 2.7 can be readily verified numerically; this was done already in [22]. Here we report a singular heteroclinic connection for c00.199362c_{0}\approx 0.199362 which is clearly transversal with respect to variation in cc; see Fig. 2. Based on the relative location of Γ0\Gamma_{0} with respect to the U¯\bar{U}-nullcline, we also record the following monotonicity hypothesis.

Hypothesis 2.8.

The portions of Γ0\Gamma_{0} (from Hypothesis 2.7) lying in S0a,±S^{a,\pm}_{0} are strictly monotone in U¯\bar{U}; i.e. U¯˙(ζ)>0\dot{\bar{U}}(\zeta)>0 (equivalently, V¯˙(ζ)>0\dot{\bar{V}}(\zeta)>0) for all ζ<0\zeta<0 on S0a,S_{0}^{a,-} and for all ζ>0\zeta>0 on S0a,+S_{0}^{a,+}.

This monotonicity condition turns out to be useful when analyzing the reduced eigenvalue problem introduced in Sec. 8: it is equivalent to the statement that the projectivized dynamics of the slow coordinates lives entirely on one chart of \mathbb{CP} specified by S=(P/V)S=(P/V), V0V\neq 0, when the temporal eigenvalue parameter λ=0\lambda=0. Here PP and VV denote respectively the linearisations of P¯\bar{P} and V¯\bar{V}.555Our notation is equivalent to the “δ\delta” convention to denote linearized variables, i.e. V=δV¯V=\delta\bar{V}.

2.3   Travelling waves for 0<ε10<\varepsilon\ll 1

We now use hypothesis 2.7, to show the existence of a one-parameter family of travelling waves {Γε}\{\Gamma_{\varepsilon}\} for sufficiently small values of ε>0\varepsilon>0; we write it in such a way as to make clear the uniform convergence onto the singular wave Γ0\Gamma_{0}.

Lemma 2.9.

For any δ>0\delta>0, there exists ε¯>0\bar{\varepsilon}>0 such that for each ε(0,ε¯)\varepsilon\in(0,\bar{\varepsilon}), there exists a wavespeed c(ε)c(\varepsilon) and a heteroclinic orbit Γε(ξ)\Gamma_{\varepsilon}(\xi) for (11) connecting z¯\bar{z}^{-} to z¯+\bar{z}^{+} with dH(Γε,Γ0)<δd_{H}(\Gamma_{\varepsilon},\Gamma_{0})<\delta and |c(ε)c0|<δ|c(\varepsilon)-c_{0}|<\delta.

Refer to caption
Figure 3: Numerical approximation of a heteroclinic orbit (black solid curve) connecting the saddle equilibria (0,0,0)(0,0,0) and (1,c,F(1))(1,c,F(1)) for (ε,c)=(0.001,0.20637)(\varepsilon,c)=(0.001,0.20637), together with the singular heteroclinic orbit Γ0\Gamma_{0} (red dashed curve) from Hypothesis 2.7.

Shock-fronted travelling waves for small fixed values of ε>0\varepsilon>0 are numerically approximated in [22]; see also Fig. 3. We sketch a geometrical proof showing that Hypothesis 2.7 implies that Lemma 2.9 holds. Our sketch follows standard arguments for constructing heteroclinic orbits by estimating the relative orientations of slow manifolds and their fast fiber bundles for ε>0\varepsilon>0 small, see eg. [33]. Here we also address the issue that the jump occurs across a regular fold, necessitating a slightly different estimate.

Proof: Fix α>0\alpha>0 small enough that Γ0\Gamma_{0} is the unique transverse singular heteroclinic orbit for cBα:=(c0α,c0+α)c\in B_{\alpha}:=(c_{0}-\alpha,c_{0}+\alpha). We also fix a section which intersects Γ0\Gamma_{0} in its interior in the middle of the jump, say Σ={u¯=0.7}\Sigma=\{\bar{u}=0.7\}. For ε>0\varepsilon>0 sufficiently small, Fenichel theory provides the existence of slow manifolds Sεa,S^{a,-}_{\varepsilon} and Sεa,+S^{a,+}_{\varepsilon}, which are 𝒪(ε)\mathcal{O}(\varepsilon)-close in Hausdorff distance to compact normally hyperbolic subsets of S0a,S^{a,-}_{0} and S0a,+S^{a,+}_{0}.

The slow manifold Sεa,S^{a,-}_{\varepsilon} can be continuously extended across a regular fold of a two-dimensional critical manifold in 3\mathbb{R}^{3} using blow-up theory; after crossing the fold, Sεa,S^{a,-}_{\varepsilon} lies 𝒪(ε2/3)\mathcal{O}(\varepsilon^{2/3}) close to the singular fast fiber subbundle 𝒥\mathscr{J} extending from the fold FF_{-} to the jump-on curve JaJ_{a}, at the section Σ\Sigma (see Theorem 1 in [34] for details). The slow manifold Sa,S^{a,-} then flows along the fast fiber bundle, reaching Σ\Sigma in 𝒪(1)\mathcal{O}(1) time with respect to the scaling (11) (this intersection must exist for ε>0\varepsilon>0 sufficiently small since Γ0\Gamma_{0} intersects Σ\Sigma by construction). Thus, Sεa,S^{a,-}_{\varepsilon} remains 𝒪(ε2/3)\mathcal{O}(\varepsilon^{2/3})-close to 𝒥\mathscr{J} at the point of intersection on Σ\Sigma .

On the other hand, consider a ball VV inside W=cBαWs(z¯(c))W=\cup_{c\in B_{\alpha}}W^{s}(\bar{z}^{-}(c)) which straddles JaJ_{a}. Fenichel theory implies that the (nonlinear) fast fiber bundle over the basepoint set VV perturbs to an O(ε)O(\varepsilon)-close fast fiber bundle over the corresponding basepoints now given on Sεa,+S^{a,+}_{\varepsilon} [7]. Extend this fiber bundle backwards to Σ\Sigma. Therefore at Σ\Sigma, the transverse intersection of the projection of FF with the stable fast fiber bundle over S0a,+S^{a,+}_{0} persists over a small range in ε\varepsilon, and for each such ε\varepsilon the intersection occurs at a corresponding wavespeed c(ε)c(\varepsilon). Evaluated at Σ\Sigma, the perturbed intersection is 𝒪(ε2/3)\mathcal{O}(\varepsilon^{2/3})-close to the original intersection in the extended (u¯,p¯,v¯,c)(\bar{u},\bar{p},\bar{v},c)-space, i.e. c(ε)c0c(\varepsilon)\to c_{0} as ε0\varepsilon\to 0. \Box

Remark 2.10.

We highlight a high-dimensional generalization of this construction, due to Lin and Wechselberger [24]. They prove a generalization of Theorem 1 in [34], and they also require exchange/lambda lemma-type estimates to pick out unique waves from two-parameter families. In our case, the transversal intersection Γε\Gamma_{\varepsilon} for each corresponding wavespeed c(ε)c(\varepsilon) must be unique, which can be verified directly with dimension counting. \triangle

We are now able to state the main theorem proved in this paper.

Theorem 2.11.

Suppose Hypotheses 2.7, and 2.8 hold. Fix a contour KK in the right half of the complex plane. Then there exists ε¯>0\bar{\varepsilon}>0 so that for each 0<εε¯0<\varepsilon\leq\bar{\varepsilon}, the wave Γε\Gamma_{\varepsilon} does not have spectrum inside of KK.

The proof of this theorem is developed mostly in the second half of the paper; see Sec. 6.2 for a general outline of the strategy. A characterization of the spectral stability of a travelling wave must first be developed. This is accomplished by the end of Sec. 6.

We finish this section with a preliminary estimate showing that we have strong control over the dynamics of the wave via the reduced subsystems away from the transition sets FF_{-} and JaJ_{a}; c.f. Corollary 2.2 in [8].

Corollary 2.12.

Given ε>0\varepsilon>0 sufficiently small, let X(ζ,ε)=x(ξ,ε)X(\zeta,\varepsilon)=x(\xi,\varepsilon) parametrize the travelling wave solution from Hypothesis 2.9 corresponding to the wavespeed c=c(ε)c=c(\varepsilon). The following are true for each fixed a>0a>0:

limε0x(ξ,ε)\displaystyle\lim_{\varepsilon\to 0}x(\xi,\varepsilon) =xR(ξ) uniformly for |ξ|a\displaystyle=x_{R}(\xi)\,\text{ uniformly for }|\xi|\leq a
limε0x(ζ,ε)\displaystyle\lim_{\varepsilon\to 0}x(\zeta,\varepsilon) =XR(ζ) uniformly for |ζ|a.\displaystyle=X_{R}(\zeta)\text{ uniformly for }|\zeta|\geq a.

In the second limit, we take XRX_{R} to mean the embedding of the corresponding parametrised solutions of Γ0|S0a,S0a,+\Gamma_{0}|_{S^{a,-}_{0}\cup S^{a,+}_{0}} into 3\mathbb{R}^{3} via the map (U¯,P¯)(U¯,P¯,F(U¯))S0(\bar{U},\bar{P})\mapsto(\bar{U},\bar{P},F(\bar{U}))\in S_{0}.

Proof: We sketch the argument for the first uniform limit. Fix an a>0a>0. Then |x(ξ,ε)xR(ξ)||x(\xi,\varepsilon)-x_{R}(\xi)| can be uniformly bounded through a combination of Gronwall’s inequality and the distance estimate in Hypothesis 2.9. The argument for the second uniform limit is similar. \Box

3   Geometric preliminaries

Ahead of our analysis in the remaining sections, we explicitly describe the geometric spaces in which our objects of interest live. We will track the dynamics of fibers of complex vector bundles, corresponding to points in 𝐆𝐫(k,n)\mathbf{Gr}(k,n), the Grassmannian of complex kk-planes in n\mathbb{C}^{n}. The general set-up is standard: we will study dynamics on the Grassmannian indirectly via embedded systems on complex projective space.

We first introduce some essential notation for complex projective space. The space n\mathbb{CP}^{n} is the quotient space (n+1{0})/(\mathbb{C}^{n+1}-\{0\})/\sim{} subject to the equivalence relation that identifies complex rays, i.e. y1,y2n+1{0}y_{1},y_{2}\in\mathbb{C}^{n+1}-\{0\} are identified if y1=αy2y_{1}=\alpha y_{2} for some α\alpha\in\mathbb{C}. We denote the projection map by π:n+1{0}n\pi:\mathbb{C}^{n+1}-\{0\}\to\mathbb{CP}^{n}, and for each yny\in\mathbb{C}^{n} we use the notation [y][y] or y^\hat{y} for the image π(y)n\pi(y)\in\mathbb{CP}^{n} (this notation also holds for sets Sn{0}S\subset\mathbb{C}^{n}-\{0\}).

Later, in Section 7.2 we will use the following elementary lemma to interpret metric neighborhoods on 2\mathbb{CP}^{2} in terms of those on 3\mathbb{C}^{3}.

Lemma 3.1.

Let ρ\rho be any metric on n\mathbb{CP}^{n}. Then there is a constant K>0K>0, depending only on ρ\rho, such that the following is true for each δ>0\delta>0:

If yn+1y\in\mathbb{C}^{n+1} is a complex vector with unit modulus and if Pn+1P\subset\mathbb{C}^{n+1} is a complex nn-plane passing through the origin with y^Nδ(P^)\hat{y}\notin N_{\delta}(\hat{P}), then

|w|Kδ,|w|\geq K\delta,

where ww is the component of yy lying orthogonal to PP, and Nδ(P^)N_{\delta}(\hat{P}) denotes the δ\delta-neighborhood of P^\hat{P} with respect to ρ\rho, i.e. Nδ(P^)={x^n:ρ(x^,p^)<δ for some p^P^}.N_{\delta}(\hat{P})=\{\hat{x}\in\mathbb{CP}^{n}:\rho(\hat{x},\hat{p})<\delta\text{ for some }\hat{p}\in\hat{P}\}.

Remark 3.2.

The norm |x|:=x¯Tx|x|:=\sqrt{\bar{x}^{T}x} in Lemma 3.1 is the usual Euclidean norm, and the orthogonal component is defined using the Hermitian inner product xy:=i=1nxiy¯ix\cdot_{\mathbb{C}}y:=\sum_{i=1}^{n}x_{i}\bar{y}_{i}. Essentially the lemma says that if a vector is inclined far away from a plane, then the component of the vector lying orthogonal to the plane cannot be too small, up to a scaling factor that depends on the details of how this inclination is measured. This lemma was implicitly used in Lemma 5.4 in [8] for the case n=3n=3, so we provide an explicit proof for arbitrary nn. \triangle

Proof of Lemma 3.1: Suppose that there is no such uniform constant for some fixed choice of δ>0\delta>0. Then we can find a sequence {yk}\{y_{k}\} lying on the unit sphere in n+1\mathbb{C}^{n+1} such that for each kk, we have y^kNδ(P^)\hat{y}_{k}\notin N_{\delta}(\hat{P}) and |wk|<δ/k|w_{k}|<\delta/k, where wkw_{k} denotes the orthogonal projection of yky_{k} with respect to PP. Choose a convergent subsequence ykyy_{k}\to y_{*}, relabeling kk if necessary. Then yy_{*} clearly has zero component in the direction orthogonal to PP, implying that yPy_{*}\in P and hence that y^Nδ(P^)\hat{y}_{*}\in N_{\delta}(\hat{P}). On the other hand, we have y^ky^Nδ(P^)\hat{y}_{k}\to\hat{y}_{*}\notin N_{\delta}(\hat{P}) since the quotient map is continuous and the complement of Nδ(P^)N_{\delta}(\hat{P}) is closed, giving a contradiction. \Box

Let v1,,vknv_{1},\cdots,v_{k}\in\mathbb{C}^{n} denote a spanning set of vectors for the kk-plane WW. The well-known Plücker embedding

𝐆𝐫(k,n)\displaystyle\mathbf{Gr}(k,n) 𝜓(kn)(nk)1\displaystyle\xhookrightarrow{\psi}\mathbb{P}(\bigwedge^{k}\mathbb{C}^{n})\cong\mathbb{CP}^{\binom{n}{k}-1}
ψ(W)\displaystyle\psi(W) :=[v1vk]\displaystyle:=[v_{1}\wedge\cdots\wedge v_{k}]

provides homogeneous coordinates for the Grassmannian, called Plücker coordinates. The image ψ(W)\psi(W) can be coordinatized by writing down the maximal minors/cofactors of the (n×k)(n\times k) matrix MW=[v1vk]M_{W}=[v_{1}~{}\cdots~{}v_{k}].

Example. Consider a complex 2-plane W𝐆𝐫(2,3)W\in\mathbf{Gr}(2,3) with spanning complex vectors v1=(a1,b1,c1)Tv_{1}=(a_{1},b_{1},c_{1})^{T} and v2=(a2,b2,c2)Tv_{2}=(a_{2},b_{2},c_{2})^{T}. A coordinate representation of [v1v2]2[v_{1}\wedge v_{2}]\in\mathbb{C}\mathbb{P}^{2} is given by the list of 2×22\times 2 cofactors of the 3×23\times 2 matrix [v1v2][v_{1}~{}~{}v_{2}], namely

NW:=(b1c2c1b2a2c1a1c2a1b2a2b1).\displaystyle N_{W}:=\begin{pmatrix}b_{1}c_{2}-c_{1}b_{2}\\ a_{2}c_{1}-a_{1}c_{2}\\ a_{1}b_{2}-a_{2}b_{1}\end{pmatrix}.

Observe that the Plücker coordinate representation NWN_{W} can be interpreted as a parametrisation of the plane WW by a (nontrivial) normal vector.

Remark 3.3.

The Plücker coordinates are generally redundant: we have (nk)\binom{n}{k} of them, whereas the dimension of 𝐆𝐫(k,n)\mathbf{Gr}(k,n) is k(nk)k(n-k). In this paper our analysis is restricted to the cases 𝐆𝐫(1,3)\mathbf{Gr}(1,3) and 𝐆𝐫(2,3)\mathbf{Gr}(2,3), since we will be occupied with tracking complex line (resp. plane) bundles. In the case 𝐆𝐫(1,3)\mathbf{Gr}(1,3), the redundancy arises from nonunique scalings of the complex ray. In the case 𝐆𝐫(2,3)\mathbf{Gr}(2,3), an analogous scaling nonuniqueness arises when choosing a normal vector to represent a complex plane (see the example above). \triangle

The Grassmannian spaces are metrizable. Here we construct an explicit metric, because we will need a way to measure ‘angles’ between lines and planes. Using a complex version of the construction given in [17], we take advantage of the Plücker embedding to equip 𝐆𝐫(k,n)\mathbf{Gr}(k,n) with the Fubini-Study metric. Complex projective space can be embedded into the space of Hermitian (n+1)×(n+1)(n+1)\times(n+1) matrices \mathcal{H} via the following isometric immersion:

n\displaystyle\mathbb{CP}^{n} 𝜑\displaystyle\xhookrightarrow{\varphi}\mathcal{H}
φ([x])\displaystyle\varphi([x]) :=xx¯T|x|2,\displaystyle:=\frac{x\bar{x}^{T}}{|x|^{2}},

where xn+1{0}x\in\mathbb{C}^{n+1}-\{0\} represents the ray x\mathbb{C}x. The space of Hermitian matrices is in turn endowed with an inner product given by

d(P,Q)\displaystyle d(P,Q) =12Tr(PQ),\displaystyle=\frac{1}{2}\text{Tr}(PQ),

which allows us to define a norm P=d(P,P)1/2\lVert P\rVert=d(P,P)^{1/2}, and hence a metric induced by the norm given by dist(P,Q):=PQ\text{dist}(P,Q):=\lVert P-Q\rVert. We can pull back this metric on \mathcal{H} to one on n\mathbb{CP}^{n}, and hence on the Grassmannian:

dist([x],[y])\displaystyle\text{dist}([x],[y]) :=xx¯T|x|2yy¯T|y|2\displaystyle:=\lVert\frac{x\bar{x}^{T}}{|x|^{2}}-\frac{y\bar{y}^{T}}{|y|^{2}}\rVert
=12Tr[(xx¯|x|2yy¯T|y|2)2]\displaystyle=\sqrt{\frac{1}{2}\text{Tr}\left[\left(\frac{x\bar{x}}{|x|^{2}}-\frac{y\bar{y}^{T}}{|y|^{2}}\right)^{2}\right]}
=1|(xy)|2|x|2|y|2.\displaystyle=\sqrt{1-\frac{|(x\cdot_{\mathbb{C}}y)|^{2}}{|x|^{2}|y|^{2}}}.

We observe that dist([x],[y])\text{dist}([x],[y]) recovers the sine of the so-called Hermitian angle [29] between a corresponding pair of nontrivial complex vectors representing [x],[y][x],\,[y]. For example, this metric on 𝐆𝐫(2,3)\mathbf{Gr}(2,3) measures the Hermitian angle between a pair of normal vectors parametrising a corresponding pair of complex 22-planes.

We will use the Plücker embedding to study (linear) dynamics which describe the evolution of kk-planes in the Grassmannian. Concretely, let y=b(ξ)yy^{\prime}=b(\xi)y denote a nonautonomous linear system on n+1\mathbb{C}^{n+1}. This linear system induces a nonlinear flow on n\mathbb{CP}^{n}, and hence on the corresponding Grassmannian, which we denote by

y^\displaystyle\hat{y}^{\prime} =b^(y^,ξ).\displaystyle=\hat{b}(\hat{y},\xi). (21)

We mention two elementary facts about about these induced flows. The first is that yy is an eigenvector of b(ξ)b(\xi) if and only if y^\hat{y} is a critical point of b^(y^,ξ)\hat{b}(\hat{y},\xi). The second is that if {λi}\{\lambda_{i}\} denotes the eigenvalues of the linearization of a (constant) matrix bb and if yiy_{i} is an eigenvector corresponding to the eigenvalue λi\lambda_{i}, then the eigenvalues of the linearization of b^\hat{b} near the corresponding critical point y^i\hat{y}_{i} are λjλi\lambda_{j}-\lambda_{i} for jij\neq i. As a consequence of these facts, if the largest eigenvalue of a linear system is simple, then the corresponding eigendirection in the projectivized system is a stable fixed point.

We will find it useful to study the system (21) in its own right as a nonlinear flow on appropriate charts on n\mathbb{CP}^{n}, but it is also convenient to treat the natural system y=b(ξ)yy^{\prime}=b(\xi)y as the Plücker coordinate representation of the system on projective space. To clarify this, we introduce the notion of induced variational fields. We repeat the general treatment of [17], Sec. 3.2. Let VV be a vector space of dimension nn and let 1σn1\leq\sigma\leq n. Given a linear map A:VVA:V\to V, an induced derivation of order σ\sigma is the multilinear map Aσ:σVσVA^{\sigma}:\bigwedge^{\sigma}V\to\bigwedge^{\sigma}V defined by

v1vσi=1σv1Avivσ.\displaystyle v_{1}\wedge\cdots\wedge v_{\sigma}\mapsto\sum_{i=1}^{\sigma}v_{1}\wedge\cdots\wedge Av_{i}\wedge\cdots\wedge v_{\sigma}. (22)

We have

Lemma 3.4.

Let {λi}i=1n\{\lambda_{i}\}_{i=1}^{n} denote the set of eigenvalues of A(1)=AA^{(1)}=A. Then the set of eigenvalues of A(2)A^{(2)} is given by {λi+λj}i<j,i=1i=n\{\lambda_{i}+\lambda_{j}\}_{i<j,i=1}^{i=n}.

Proof: Choose an ordered basis for VV consisting of eigenvectors of AA, and use it to construct an ordered basis for 2V\bigwedge^{2}V. The conclusion follows from a straightforward calculation using the definition (22). \Box

If x=Axx^{\prime}=Ax is a linear vector field, then P=A(σ)PP^{\prime}=A^{(\sigma)}P defines a linear vector field on the σ\sigmath exterior power space. By using the Plücker embedding, the latter induced vector field can be interpreted as evolving points in 𝐆𝐫(σ,n)\mathbf{Gr}(\sigma,n) (see Sec. 3.2 in [17] for a complete description of this construction in the real setting; the extension to complex kk-planes is direct).

4   Spatial eigenvalue problems

We now assume that Hypotheses 2.7, 2.8, and 2.9 hold, and we proceed to define the spatial eigenvalue problem associated with the one-parameter family of waves (Γε,c(ε))(\Gamma_{\varepsilon},c({\varepsilon})). Let us fix ε>0\varepsilon>0 sufficiently small and represent Γε\Gamma_{\varepsilon} by the solution X¯(ζ,ε)=(U¯(ζ,ε),P¯(ζ,ε),V¯(ζ,ε))\bar{X}(\zeta,\varepsilon)=(\bar{U}(\zeta,\varepsilon),\bar{P}(\zeta,\varepsilon),\bar{V}(\zeta,\varepsilon)). We consider perturbations of the form

U~(ζ,t)\displaystyle\tilde{U}(\zeta,t) =U¯(ζ,ε)+δeλtU(ζ)+𝒪(δ2).\displaystyle=\bar{U}(\zeta,\varepsilon)+\delta e^{\lambda t}U(\zeta)+\mathcal{O}(\delta^{2}). (23)

Assuming that U~(ζ,t)\tilde{U}(\zeta,t) solves (1) and applying the fact that U¯(ζ,ε)\bar{U}(\zeta,\varepsilon) is a stationary solution of (2), we arrive at the eigenvalue problem by collecting 𝒪(δ)\mathcal{O}(\delta) terms:

cUζ+(D(U¯)U)ζζ+R(U¯)U+ε(λUζζcUζζζ)\displaystyle cU_{\zeta}+(D(\bar{U})U)_{\zeta\zeta}+R^{\prime}(\bar{U})U+\varepsilon(\lambda U_{\zeta\zeta}-cU_{\zeta\zeta\zeta}) =λU.\displaystyle=\lambda U. (24)

Collecting derivatives and defining

V\displaystyle V =D(U¯)U+ε(λUcUζ)\displaystyle=D(\bar{U})U+\varepsilon(\lambda U-cU_{\zeta})
P\displaystyle P =cU+Vζ,\displaystyle=cU+V_{\zeta},

we obtain the following closed three-dimensional system:

εU˙\displaystyle\varepsilon\dot{U} =1c((ελ+D(U¯))UV)\displaystyle=\frac{1}{c}\left((\varepsilon\lambda+D(\bar{U}))U-V\right)
P˙\displaystyle\dot{P} =(λR(U¯))U\displaystyle=(\lambda-R^{\prime}(\bar{U}))U (25)
V˙\displaystyle\dot{V} =PcU.\displaystyle=P-cU.

In the stretched scaling ξ=ζ/ε\xi=\zeta/\varepsilon, we obtain

u\displaystyle u^{\prime} =1c(v+(ελ+D(u¯))u)\displaystyle=\frac{1}{c}\left(-v+(\varepsilon\lambda+D(\bar{u}))u\right)
p\displaystyle p^{\prime} =ε(λR(u¯))u\displaystyle=\varepsilon(\lambda-R^{\prime}(\bar{u}))u (26)
v\displaystyle v^{\prime} =ε(pcu).\displaystyle=\varepsilon(p-cu).

In view of the orientation reversal giving the pair of wave equations (10)–(11), we will instead work with the following pair of linearized systems throughout the paper:

εU˙\displaystyle\varepsilon\dot{U} =1c(V(ελ+D(U¯))U)\displaystyle=\frac{1}{c}(V-(\varepsilon\lambda+D(\bar{U}))U)
P˙\displaystyle\dot{P} =(R(U¯)λ)U\displaystyle=(R^{\prime}(\bar{U})-\lambda)U (27)
V˙\displaystyle\dot{V} =cUP.\displaystyle=cU-P.

and

u\displaystyle u^{\prime} =1c(v(ελ+D(u¯))u)\displaystyle=\frac{1}{c}(v-(\varepsilon\lambda+D(\bar{u}))u)
p\displaystyle p^{\prime} =ε(R(u¯)λ)u\displaystyle=\varepsilon(R^{\prime}(\bar{u})-\lambda)u (28)
v\displaystyle v^{\prime} =ε(cup).\displaystyle=\varepsilon(cu-p).
Remark 4.1.

Setting λ=0\lambda=0 in (27)–(28) gives the standard variational equations of (10)–(11) along the wave (Γε,c(ε))(\Gamma_{\varepsilon},c(\varepsilon)) for each ε>0\varepsilon>0. In particular, we can account for the existence of the translational eigenvalue; the variational equations are satisfied by a nontrivial uniformly bounded solution, namely the derivative of the wave (i.e. the vector field). \triangle

Collecting y=(u,p,v)y=(u,p,v) and Y=(U,P,V)Y=(U,P,V), we write the pair of systems (27)–(28) more compactly in terms of the matrices A,aA,a:

Y˙\displaystyle\dot{Y} =A(U¯,λ,ε)Y\displaystyle=A(\bar{U},\lambda,\varepsilon)Y (29)
y\displaystyle y^{\prime} =a(u¯,λ,ε)y.\displaystyle=a(\bar{u},\lambda,\varepsilon)y. (30)

We end this section by recording the fast projectivized eigenvalue problem y^=a^(u¯,λ,ε)\hat{y}=\hat{a}(\bar{u},\lambda,\varepsilon) corresponding to the linear eigenvalue problem (28). This system is defined on the space 2\mathbb{CP}^{2}; with respect to the chart (β1,β2)=(p/u,v/u)(\beta_{1},\beta_{2})=(p/u,v/u) with u0u\neq 0, we have

β1\displaystyle\beta_{1}^{\prime} =1c(D(u¯)β1β1β2)+ε(R(u¯)+λ(β1/c1))\displaystyle=\frac{1}{c}(D(\bar{u})\beta_{1}-\beta_{1}\beta_{2})+\varepsilon(R^{\prime}(\bar{u})+\lambda(\beta_{1}/c-1)) (31)
β2\displaystyle\beta_{2}^{\prime} =1c(D(u¯)β2β22)+ε(cβ1+λcβ2).\displaystyle=\frac{1}{c}(D(\bar{u})\beta_{2}-\beta_{2}^{2})+\varepsilon\left(c-\beta_{1}+\frac{\lambda}{c}\beta_{2}\right).

Slow projectivized eigenvalue problems can be similarly defined using (27). We record for future convenience the following system defined with respect to the chart (β1,β2)=(U/V,P/V)(\beta_{1},\beta_{2})=(U/V,P/V) with V0V\neq 0:

εβ1˙\displaystyle\varepsilon\dot{\beta_{1}} =1c(1D(U¯)β1)+ε(β1β2cβ12(λ/c)β1)\displaystyle=\frac{1}{c}\left(1-D(\bar{U})\beta_{1}\right)+\varepsilon(\beta_{1}\beta_{2}-c\beta_{1}^{2}-(\lambda/c)\beta_{1}) (32)
β2˙\displaystyle\dot{\beta_{2}} =β22cβ1β2+(R(U¯)λ)β1.\displaystyle=\beta_{2}^{2}-c\beta_{1}\beta_{2}+(R^{\prime}(\bar{U})-\lambda)\beta_{1}.

4.1   Asymptotic and far-field hyperbolicity

We turn to the asymptotic systems associated with (29)–(30). Broadly speaking, these systems encode ‘far-field’ information about the wave; for instance, they are useful in determining the boundaries of the essential spectrum (see [18]). The matrices a(ξ,λ,ε)a(\xi,\lambda,\varepsilon) and A(ζ(ξ),λ,ε)A(\zeta(\xi),\lambda,\varepsilon) tend to the limits a±(λ,ε)a^{\pm}(\lambda,\varepsilon) and A±(λ,ε)A^{\pm}(\lambda,\varepsilon) as ξ±\xi\to\pm\infty. Here the minus (resp. plus) superscript corresponds to an evaluation at z¯\bar{z}^{-} (resp. z¯+\bar{z}^{+}). We have

a±(λ,ε)\displaystyle a^{\pm}(\lambda,\varepsilon) =(1c(D(u±)+ελ)01cε(R(u±)λ)00εcε0),\displaystyle=\begin{pmatrix}-\frac{1}{c}(D(u^{\pm})+\varepsilon\lambda)&0&\frac{1}{c}\\ \varepsilon(R^{\prime}(u^{\pm})-\lambda)&0&0\\ \varepsilon c&-\varepsilon&0\end{pmatrix}, (33)
A±(λ,ε)\displaystyle A^{\pm}(\lambda,\varepsilon) =(1/ε)a±(λ,ε).\displaystyle=(1/\varepsilon)a^{\pm}(\lambda,\varepsilon).

Note from (4) that R(1)<R(0)<0R^{\prime}(1)<R^{\prime}(0)<0. Let β\beta be some fixed real number with R(0)<β<0R^{\prime}(0)<\beta<0 and define

Ω\displaystyle\Omega ={λ:Reλ>β}.\displaystyle=\{\lambda\in\mathbb{C}:\text{Re}\,\lambda>\beta\}. (34)

We will show in Sec. 5 that the essential spectrum lies entirely in the left-half complex plane, and bounded away from Ω\Omega. We now record some estimates for the eigenvalues and eigenvectors of the asymptotic matrices (33) for any λΩ\lambda\in\Omega, which will be important in determining the point spectrum.

Lemma 4.2.

Let λΩ\lambda\in\Omega. Then for ε>0\varepsilon>0 sufficiently small, the asymptotic matrices (33) have three distinct eigenvalues μj±(λ,ε)\mu^{\pm}_{j}(\lambda,\varepsilon) for a±(λ,ε)a^{\pm}(\lambda,\varepsilon) (resp. Δj±(λ,ε)\Delta^{\pm}_{j}(\lambda,\varepsilon) for A±(λ,ε)A^{\pm}(\lambda,\varepsilon)), which satisfy

Reμf±Reμs,1±<\displaystyle\text{Re}\,\mu^{\pm}_{f}\ll\text{Re}\,\mu^{\pm}_{s,1}<  0<Reμs,2±\displaystyle\,0<\text{Re}\,\mu^{\pm}_{s,2} (35)
Δj±\displaystyle\Delta^{\pm}_{j} =(1/ε)μj±.\displaystyle=(1/\varepsilon)\mu^{\pm}_{j}.

In particular, we have μf±=𝒪(1)\mu^{\pm}_{f}=\mathcal{O}(1) and μs,j±=𝒪(ε)\mu^{\pm}_{s,j}=\mathcal{O}(\varepsilon), j=1,2j=1,2. Furthermore, let ef±(λ,ε)e_{f}^{\pm}(\lambda,\varepsilon) and es,j±(λ,ε)e_{s,j}^{\pm}(\lambda,\varepsilon) denote the eigenvectors associated with μf±\mu^{\pm}_{f} and μs,j±\mu^{\pm}_{s,j}, respectively, where j=1,2j=1,2. Then each e±(λ,ε)e^{\pm}(\lambda,\varepsilon) limits to a ‘reduced’ eigenvector r±(λ)r^{\pm}(\lambda) as ε0\varepsilon\to 0. These reduced eigenvectors are given explicitly by

rf±\displaystyle r_{f}^{\pm} =(1,0,0)\displaystyle=(1,0,0)^{\top} (36)
rs,1±\displaystyle r_{s,1}^{\pm} =(1/D(u¯±),νp,±,1)\displaystyle=(1/D(\bar{u}^{\pm}),\nu_{p,\pm},1)^{\top}
rs,2±\displaystyle r_{s,2}^{\pm} =(1/D(u¯±),νm,±,1),\displaystyle=(1/D(\bar{u}^{\pm}),\nu_{m,\pm},1)^{\top},

where

νp,±\displaystyle\nu_{p,\pm} =c0+c02+4D(u¯±)(λR(u¯±))2D(u¯±)\displaystyle=\frac{c_{0}+\sqrt{c_{0}^{2}+4D(\bar{u}^{\pm})(\lambda-R^{\prime}(\bar{u}^{\pm}))}}{2D(\bar{u}^{\pm})} (37)
νm,±\displaystyle\nu_{m,\pm} =c0c02+4D(u¯±)(λR(u¯±))2D(u¯±).\displaystyle=\frac{c_{0}-\sqrt{c_{0}^{2}+4D(\bar{u}^{\pm})(\lambda-R^{\prime}(\bar{u}^{\pm}))}}{2D(\bar{u}^{\pm})}.
Remark 4.3.

The smallness of ε\varepsilon required for Lemma 4.2 is dependent on λ\lambda, but a uniform bound εε¯\varepsilon\leq\bar{\varepsilon} over closed, bounded contours in \mathbb{C} can be extracted using compactness. \triangle

Proof of Lemma 4.2: We outline the proof as it is similar to the proof of Lemma 2.1. The characteristic polynomial of a±(λ,ε)a^{\pm}(\lambda,\varepsilon) is

p±(μ)\displaystyle p^{\pm}(\mu) =μ3μ2(D(u±)+ελc)+με+ε2λR(u±)c,\displaystyle=-\mu^{3}-\mu^{2}\left(\frac{D(u^{\pm})+\varepsilon\lambda}{c}\right)+\mu\varepsilon+\varepsilon^{2}\frac{\lambda-R^{\prime}(u^{\pm})}{c}, (38)

from which we obtain the expansion μf±=D(u¯±)/c0+𝒪(ε)\mu^{\pm}_{f}=-D(\bar{u}^{\pm})/c_{0}+\mathcal{O}(\varepsilon). Leading-order expansions of the 𝒪(ε)\mathcal{O}(\varepsilon) eigenvalues are obtained by using the scaling μ=εν\mu=\varepsilon\nu as before. Their leading order terms are given by νp,±\nu_{p,\pm} and νm,±\nu_{m,\pm}; this can be checked by direct calculation. The remaining small eigenvalues therefore fall into the hierarchy stated by the lemma at both uu^{-} and u+u^{+} provided that Reλ>R(0)\text{Re}\,\lambda>R^{\prime}(0). The expressions for the singular limits of the eigenvectors can also be checked with algebra. \Box

For fixed a>0a>0, there exists ε¯>0\bar{\varepsilon}>0 sufficiently small so that the wave x(ξ,ε)x(\xi,\varepsilon) lies near S0a,S0a,+S^{a,-}_{0}\cup S^{a,+}_{0} for |ξ|a/ε|\xi|\geq a/\varepsilon for all 0<εε¯0<\varepsilon\leq\bar{\varepsilon}. The corresponding coefficient matrix a(ξ,λ,ε)a(\xi,\lambda,\varepsilon) has three eigenvalues μi\mu_{i} satisfying the hierarchy μfμs,1<0<μs,2\mu_{f}\ll\mu_{s,1}<0<\mu_{s,2}, inherited from the asymptotic hyperbolicity described in Lemma 4.2. Let f1(ξ,λ,ε),f2(ξ,λ,ε),ff(ξ,λ,ε)f_{1}(\xi,\lambda,\varepsilon),\,f_{2}(\xi,\lambda,\varepsilon),\,f_{f}(\xi,\lambda,\varepsilon) denote a choice of eigenvectors corresponding to the eigenvalues μs,1,μs,2,μf\mu_{s,1},\,\mu_{s,2},\,\mu_{f} (note: if μs,1\mu_{s,1} and μs,2\mu_{s,2} coalesce, we let f1,f2f_{1},\,f_{2} denote the generalized eigenvectors instead). We furthermore suppose that the fif_{i} are normalized so that |fi|=1|f_{i}|_{\infty}=1 for all (ξ,λ,ε)(\xi,\lambda,\varepsilon).

Definition 4.4.

The slow subbundle σs\sigma_{s} of the linearized system is

σs(ξ,λ,ε)\displaystyle\sigma_{s}(\xi,\lambda,\varepsilon) =|ξ|a/εspan{f1(ξ,λ,ε),f2(ξ,λ,ε)}\displaystyle=\bigcup_{|\xi|\geq a/\varepsilon}\text{span}\{f_{1}(\xi,\lambda,\varepsilon),\,f_{2}(\xi,\lambda,\varepsilon)\} (39)

where the base space is |ξ|a/ε|\xi|\geq a/\varepsilon and λΩ\lambda\in\Omega. We denote the slow subbundle with respect to the timescale ζ=ξ/ε\zeta=\xi/\varepsilon by Σs\Sigma_{s}, i.e.

Σs(ζ,λ,ε)\displaystyle\Sigma_{s}(\zeta,\lambda,\varepsilon) =σs(ξ/ε,λ,ε).\displaystyle=\sigma_{s}(\xi/\varepsilon,\lambda,\varepsilon). (40)

\triangle

4.2   Complex 2-plane dynamics induced by the eigenvalue problem

We now write down a concrete representation of the evolution equations of complex 22-planes that are induced by the eigenvalue problem (28), and recording some essential properties about this induced system using the formulation described in Sec. 3. We have

(uppvvu)\displaystyle\begin{pmatrix}u\wedge p\\ p\wedge v\\ v\wedge u\end{pmatrix}^{\prime} =(1c(ελ+D(u¯))1c0εc0ε(R(u¯)λ)ε01c(ελ+D(u¯)))(uppvvu).\displaystyle=\begin{pmatrix}-\frac{1}{c}\left(\varepsilon\lambda+D(\bar{u})\right)&-\frac{1}{c}&0\\ -\varepsilon c&0&-\varepsilon(R^{\prime}(\bar{u})-\lambda)\\ \varepsilon&0&-\frac{1}{c}(\varepsilon\lambda+D(\bar{u}))\end{pmatrix}\begin{pmatrix}u\wedge p\\ p\wedge v\\ v\wedge u\end{pmatrix}. (41)

The coupled system (11)–(41) acts on the complexified tangent bundle TUpUTp(U)T^{\mathbb{C}}U\cong\cup_{p\in U}T_{p}(U)\otimes_{\mathbb{R}}\mathbb{C}, where U3U\subset\mathbb{R}^{3}. In view of Corollary (2.12) and Lemmas 4.2 and 3.4, there exists ε¯>0\bar{\varepsilon}>0 and a>0a>0 so that for each ε(0,ε¯)\varepsilon\in(0,\bar{\varepsilon}) and |ξ|>a/ε|\xi|>a/\varepsilon, the eigenvalues μ1(2)(ξ),μ2(2)(ξ),μ3(2)(ξ)\mu^{(2)}_{1}(\xi),\,\mu^{(2)}_{2}(\xi),\mu^{(2)}_{3}(\xi) of the linearization of (41) can be ordered so that

Re(μ1(2))\displaystyle\text{Re}(\mu^{(2)}_{1}) =𝒪(1)<0\displaystyle=\mathcal{O}(1)<0
Re(μ2(2))\displaystyle\text{Re}(\mu^{(2)}_{2}) =𝒪(1)<0\displaystyle=\mathcal{O}(1)<0
Re(μ3(2))\displaystyle\text{Re}(\mu^{(2)}_{3}) =𝒪(ε).\displaystyle=\mathcal{O}(\varepsilon).

Let us provide some geometric intuition to clarify these estimates. When the wave lies near the slow manifolds, the eigenvalue problem (28) provides an eigenvector lying near the direction of strong contraction onto the slow manifolds, and a pair of eigenvectors lying near to the tangent space of the slow manifold. These eigenvectors can be chosen pairwise to construct an eigenbasis for the linearisation of system (41), consisting of three complex 22-planes. Two of the constructed 2-planes have an axis lying near the fast fibers of the slow manifolds, so they strongly contract in forward time under the flow; these are the eigenvectors corresponding to the two eigenvalues of 𝒪(1)\mathcal{O}(1) negative real part. The remaining 2-plane lies near the tangent space of the slow manifold. Depending on the relative magnitudes of the singular eigenvalues along the wave, this eigenvector provides a weakly (un)stable direction.

We can further clarify the local dynamics near the tails of the wave, in view of the elementary facts about projectivized systems discussed just after (21). The projectivized system induced by (41) on 2\mathbb{CP}^{2}, written in terms of the coordinate representation (β1,β2)=(pvup,vuup)(\beta_{1},\beta_{2})=\left(\frac{p\wedge v}{u\wedge p},\frac{v\wedge u}{u\wedge p}\right), up0u\wedge p\neq 0, is given by

β1\displaystyle\beta_{1}^{\prime} =εc+(ελ+D(u¯))cβ1ε(R(u¯)λ)β2+1cβ12\displaystyle=-\varepsilon c+\frac{\left(\varepsilon\lambda+D(\bar{u})\right)}{c}\beta_{1}-\varepsilon(R^{\prime}(\bar{u})-\lambda)\beta_{2}+\frac{1}{c}\beta_{1}^{2} (42)
β2\displaystyle\beta_{2}^{\prime} =ε+1cβ1β2,\displaystyle=\varepsilon+\frac{1}{c}\beta_{1}\beta_{2},

where we remind the reader of the nonautonomous nature of the flow due to the inclusion of the phase space variable u¯=u¯(ξ)\bar{u}=\bar{u}(\xi). associated with the system (42) is a family of frozen systems, defined for each fixed ε\varepsilon by replacing the nonautonomous system by a one-parameter family of autonomous ODEs where the variable ξ\xi is formally replaced by an independent parameter γ\gamma on the right hand side (see Sec. 7.1.1 for the explicit construction). A straightforward perturbation argument shows that for ε>0\varepsilon>0 sufficiently small with a>0a>0 defined as above and for each fixed γ\gamma with |γ|>a/ε|\gamma|>a/\varepsilon, there is an (isolated) attracting fixed point of the frozen family for each such γ\gamma, denoted β0(γ,ε)\beta_{0}(\gamma,\varepsilon), which is 𝒪(ε)\mathcal{O}(\varepsilon)-close to (β1,β2)=(D(u¯(γ)),0)(\beta_{1},\beta_{2})=(-D(\bar{u}(\gamma)),0). For fixed ε>0\varepsilon>0, the function β0(γ,ε)\beta_{0}(\gamma,\varepsilon) defines curves of attracting critical points on the left and right branches of the slow manifold as γ\gamma is varied, corresponding precisely to the family of eigenvector 22-planes near the slow manifolds whose construction was just described. This is exactly the slow subbundle in Def. 4.4.

These curves of attracting critical points serve as the organizing centers for so-called relatively invariant sets (see [8]), which we construct over the wave tails in Sec. 7.1.1.

4.3   Reduced eigenvalue problems

The dynamics of the linearized systems (25) and (26) are equivalent when ε>0\varepsilon>0, but they limit to distinguished problems as ε0\varepsilon\to 0. In analogy to the definitions of the layer and reduced problems for the wave, we now define two linearized subsystems using (27)–(28). The desingularized slow eigenvalue problem will be used in Sec. 6.3 to define a slow eigenvalue. This will require the derivation of a jump condition for two disjoint solution segments, defined separately on S0a,S^{a,-}_{0} and S0a,+S^{a,+}_{0}.

Definition 4.5.

The fast eigenvalue problem is the linear subsystem defined by the singular limit of (28), along the fast fiber bundle:

u\displaystyle u^{\prime} =1cD(u¯)u\displaystyle=-\frac{1}{c}D(\bar{u})u (43)

with p=v=0p=v=0. \triangle

Remark 4.6.

The limiting system (43) is the variational equation of (11), and thus is degenerate as an eigenvalue problem. There is always a nontrivial, uniformly bounded solution to (43). \triangle

Definition 4.7.

The slow eigenvalue problem is the linear subsystem on Sεa,Sεa,+S^{a,-}_{\varepsilon}\cup S^{a,+}_{\varepsilon} defined by the singular limit of (27), subject to the constraint V=D(U¯)UV=D(\bar{U})U:

P˙\displaystyle\dot{P} =(R(U¯)λ)VD(U¯)\displaystyle=(R^{\prime}(\bar{U})-\lambda)\frac{V}{D(\bar{U})} (44)
V˙\displaystyle\dot{V} =cVD(U¯)P,\displaystyle=\frac{cV}{D(\bar{U})}-P,

or more compactly as

W˙\displaystyle\dot{W} =A0(U¯,λ,c)W.\displaystyle=A_{0}(\bar{U},\lambda,c)W. (45)

It is convenient to record here the projectivized slow eigenvalue problem on the chart S=P/VS=P/V with V0V\neq 0:

S˙\displaystyle\dot{S} =1D(U¯)(R(U¯)λcS+D(U¯)S2).\displaystyle=\frac{1}{D(\bar{U})}\left(R^{\prime}(\bar{U})-\lambda-cS+D(\bar{U})S^{2}\right). (46)

We also record here a desingularized slow eigenvalue problem, which is defined by first appending (44) to (18) and then rescaling the frame variable of the resulting autonomous system by D(U¯)D(\bar{U}):

P˙\displaystyle\dot{P} =(R(U¯)λ)V\displaystyle=(R^{\prime}(\bar{U})-\lambda)V (47)
V˙\displaystyle\dot{V} =cVD(U¯)P.\displaystyle=cV-D(\bar{U})P.

\triangle

Projectivizations of the desingularised problem (47) will be written down and used in Sec. 8 to identify the slow eigenvalues.

5   Essential & Absolute Spectrum

Our goal is to determine spectral stability of each member of the one-parameter family of waves {Γε}ε(0,ε¯]\{\Gamma_{\varepsilon}\}_{\varepsilon\in(0,\bar{\varepsilon}]} for some sufficiently small value of ε¯>0\bar{\varepsilon}>0. Following the standard approach, we will compute the essential and point spectra, defined as in [18], and compute them separately. It is more convenient to write the eigenvalue problem (24) as a first order system. Following [28], we consider the (equivalent) family \mathcal{L} of linear operators from (27) and (28)

(λ)\displaystyle\mathcal{L}(\lambda) :=ddζA(U¯,λ,ε)=ddζ(1cε(D(U¯)+ελ)01cε(R(U¯)λ)00c10)\displaystyle:=\frac{d}{d\zeta}-A(\bar{U},\lambda,\varepsilon)=\frac{d}{d\zeta}-\begin{pmatrix}-\frac{1}{c\varepsilon}(D(\bar{U})+\varepsilon\lambda)&0&\frac{1}{c\varepsilon}\\ (R^{\prime}(\bar{U})-\lambda)&0&0\\ c&-1&0\end{pmatrix} (48)

where (λ):H1(,3)L2(,3)\mathcal{L}(\lambda):H^{1}(\mathbb{R},\mathbb{C}^{3})\to L^{2}(\mathbb{R},\mathbb{C}^{3}).

The essential spectrum σe()\sigma_{e}(\mathcal{L}) of the family of linear operators (λ)\mathcal{L}(\lambda) defined in (48) is the set of λ\lambda\in\mathbb{C} such that either (λ)\mathcal{L}(\lambda) is not Fredholm, or (λ)\mathcal{L}(\lambda) is Fredholm but the (Fredholm) index is not 0. The point spectrum σp()\sigma_{p}(\mathcal{L}) is the set of values λ\lambda\in\mathbb{C} where (λ)\mathcal{L}(\lambda) is not invertible, but does have index 0. We will denote by σ(Γ)\sigma(\Gamma) or σ()\sigma(\mathcal{L}), the union σpσe\sigma_{p}\cup\sigma_{e} of the spectrum of the family \mathcal{L} associated with the travelling wave Γ\Gamma.

Definition 5.1.

(Def. 4.1.7 in [18]) A wave Γ\Gamma corresponding to a stationary solution of equation (2) is called spectrally stable if σ(Γ){λ:Re(λ)0}=\sigma(\Gamma)\cap\{\lambda\in\mathbb{C}:\text{Re}(\lambda)\geq 0\}=\emptyset, except possibly at λ=0\lambda=0. \triangle

For nonzero ε\varepsilon, because U¯\bar{U} approaches its end states exponentially in ζ\zeta (equivalently ξ\xi), determining the essential spectrum amounts to determining when the matrices A±(λ,ε)A^{\pm}(\lambda,\varepsilon) from (33) have different signatures (this is the content of Weyl’s essential spectrum theorem, Theorem 2.2.6 from [18] as well as and Lemma 3.1.10, also from [18]). The edges of the essential spectrum are called dispersion relations or the Fredholm borders as well as the continuous spectrum. To find the dispersion relations we look for values where A±A^{\pm} has a purely imaginary (spatial) eigenvalue ikik, and solve for the temporal eigenvalue parameter λ\lambda. This gives a pair of curves λ±(k;ε)\lambda_{\pm}(k;\varepsilon) in the complex plane, parametrised by the spatial modes, where the asymptotic matrices fail to be hyperbolic. Working with (27), and denoting limζ±U¯\lim_{\zeta\to\pm\infty}\bar{U} by U¯±\bar{U}^{\pm}, we have that the dispersion relations are:

λ±(k;ε)\displaystyle\lambda_{\pm}(k;\varepsilon) =ick+R(U¯±)D(U¯±)k21+k2ε\displaystyle=-ick+\frac{R^{\prime}(\bar{U}^{\pm})-D(\bar{U}^{\pm})k^{2}}{1+k^{2}\varepsilon} (49)

We have a pair of curves partitioning \mathbb{C} into five disjoint sections: the set to the right of both curves (where we’re going to look for point spectrum) which we denote Ω\Omega, together with four more regions which we denote by 𝒜j\mathcal{A}_{j} for j=1,2,3,4j=1,2,3,4. (See Fig. 4).

Since the characteristic polynomials of A±A^{\pm} are both cubics:

PA±(μ)=μ31c(λ+D(U¯±)ε)μ2+με+1cε(λR(U¯±)),\displaystyle P_{A^{\pm}}(\mu)=-\mu^{3}-\frac{1}{c}\left(\lambda+\frac{D(\bar{U}^{\pm})}{\varepsilon}\right)\mu^{2}+\frac{\mu}{\varepsilon}+\frac{1}{c\varepsilon}\left(\lambda-R^{\prime}(\bar{U}^{\pm})\right), (50)

everything can be checked explicitly. We are working in the slow variables, but everything that follows in this section can be computed for the (equivalent) fast systems, since

PA±(μ)=1ε3p±(εμ)P_{A^{\pm}}(\mu)=\frac{1}{\varepsilon^{3}}p^{\pm}(\varepsilon\mu)

from (38). Defining the transformation

η:=μ13(λ+D(U¯±)cε),\eta:=\mu-\frac{1}{3}\left(\lambda+\frac{D(\bar{U}^{\pm})}{c\varepsilon}\right),

we have

PA±(η)=η3+H±η+K±,P_{A^{\pm}}(\eta)=-\eta^{3}+H_{\pm}\eta+K_{\pm},

where

H±:=λ23c2+λ2D(U¯±)3c2ε+D(U¯±)2+3c2ε3c2ε2,H_{\pm}:=\frac{\lambda^{2}}{3c^{2}}+\lambda\frac{2D(\bar{U}^{\pm})}{3c^{2}\varepsilon}+\frac{D(\bar{U}^{\pm})^{2}+3c^{2}\varepsilon}{3c^{2}\varepsilon^{2}},

and

K±\displaystyle K_{\pm} :=227c3λ32D(U¯±)9c3ελ2+18c2ε26εD(U¯±)227c3ε3λ\displaystyle:=-\frac{2}{27c^{3}}\lambda^{3}-\frac{2D(\bar{U}^{\pm})}{9c^{3}\varepsilon}\lambda^{2}+\frac{18c^{2}\varepsilon^{2}-6\varepsilon D(\bar{U}^{\pm})^{2}}{27c^{3}\varepsilon^{3}}\lambda
9c2εD(U¯±)+D(U¯±)3+27c2ε2R(U¯±)27c3ε3.\displaystyle\qquad-\frac{9c^{2}\varepsilon D(\bar{U}^{\pm})+D(\bar{U}^{\pm})^{3}+27c^{2}\varepsilon^{2}R^{\prime}(\bar{U}^{\pm})}{27c^{3}\varepsilon^{3}}.

We note that the discriminants ΔH±\Delta_{H_{\pm}} of H±H_{\pm} are polynomials in λ\lambda that are independent of U¯±\bar{U}^{\pm}:

ΔH±=43c2ε<0\Delta_{H_{\pm}}=-\frac{4}{3c^{2}\varepsilon}<0

which means that for λ\lambda\in\mathbb{R}, we have H±>0H_{\pm}>0. Hence for each λ\lambda\in\mathbb{R},

H±327+K±240.\frac{H_{\pm}^{3}}{27}+\frac{K_{\pm}^{2}}{4}\geq 0.

We thus infer that for λ\lambda\in\mathbb{R}, PA±(μ)P_{A^{\pm}}(\mu) has three real roots (counting multiplicity in edge cases). We can then use Descartes’ rule of signs to see that for λ=0\lambda=0 (and subsequently for all λΩ\lambda\in\Omega), we have that the signature of both A+A^{+} and AA^{-} is (,,+)(-,-,+). That is A±A^{\pm} has two eigenvalues with negative real part, and one with positive real part. Checking the signatures in the rest of the regions 𝒜j\mathcal{A}_{j} similarly produces the following table:

Region sgn(A)\textrm{sgn}(A^{-}) sgn(A+)\textrm{sgn}(A^{+})
Ω\Omega (,,+)(-,-,+) (,,+)(-,-,+)
𝒜1\mathcal{A}_{1} (,+,+)(-,+,+) (,,+)(-,-,+)
𝒜2\mathcal{A}_{2} (,,+)(-,-,+) (,+,+)(-,+,+)
𝒜3\mathcal{A}_{3} (,,+)(-,-,+) (,+,+)(-,+,+)
𝒜4\mathcal{A}_{4} (,+,+)(-,+,+) (,+,+)(-,+,+)

Thus we find that the essential spectrum is the union of regions 𝒜2,3,4\mathcal{A}_{2,3,4} together with their boundaries. This set in the complex plane with a bounded real part and an unbounded imaginary component (see Fig. 4).

The dispersion relations are symmetric about kk, and they cross the real axis at R(U¯±)R^{\prime}(\bar{U}^{\pm}) when k=0k=0. The dispersion relations are vertical lines when ε=ε±:=D(U¯±)R(U¯±)\varepsilon=\varepsilon^{\pm}_{*}:=-\frac{D(\bar{U}^{\pm})}{R^{\prime}(\bar{U}^{\pm})}, and are parabolic in kk near k=0k=0, opening to the left when ε<ε±\varepsilon<\varepsilon_{*}^{\pm} and to the right when ε>ε±\varepsilon>\varepsilon_{*}^{\pm} before ‘flaring’ up to asymptote towards vertical lines.

It is worthwhile to investigate whether the family \mathcal{L} is sectorial when ε>0\varepsilon>0, since the property of sectoriality allows us to strengthen spectral stability to linear stability. (see eg. Henry [12]). When ε=0\varepsilon=0 the dispersion relations define a pair of parabolas opening leftward, and as such the linearized operator associated with the unperturbed problem is sectorial. On the other hand, for any fixed ε>0\varepsilon>0, the following asymptotics characterize the behavior of the Fredholm bolders as k±k\to\pm\infty:

λ±D(U¯±)ε+ick.\displaystyle\lambda_{\pm}\sim-\frac{D(\bar{U}^{\pm})}{\varepsilon}+-ick.\quad (51)

Hence, the essential spectrum is contained in the left half plane, but the family \mathcal{L} is not sectorial since the borders are vertically asymptotic; see Fig. 4.

We note that as ε0\varepsilon\to 0, the continuous spectrum (Fredholm borders) of the perturbed problem will converge to the continuous spectrum (Fredholm borders) of the linearised unperturbed problem, linearised about the steady-state solutions U¯±\bar{U}^{\pm}. That is we set ε=0\varepsilon=0 in (2), and consider the dispersion relations of the steady state solutions U¯±\bar{U}^{\pm}. We see that these are given by

λ±(k;0)=k2D(U¯±)+R(U¯±)+ick,\lambda_{\pm}(k;0)=-{k^{2}}{D(\bar{U}^{\pm})}+R^{\prime}(\bar{U}^{\pm})+ick,

which corresponds to setting ε=0\varepsilon=0.

We highlight the subtlety underlying the asymptotic convergence of the essential spectrum of the linearized operator as ε0\varepsilon\to 0: for any large bounded set intersecting the essential spectrum of the operator for ε>0\varepsilon>0 and for any fixed δ>0\delta>0, there exists a sufficiently small uniform bound ε¯>0\bar{\varepsilon}>0 so that for each ε(0,ε¯)\varepsilon\in(0,\bar{\varepsilon}), the Fredholm borders for the operator when ε>0\varepsilon>0 can be made δ\delta-close to the (parabolic) borders of the reduced operator within this bounded set; on the other hand, the uniform bound ε¯\bar{\varepsilon} must be made smaller and smaller as we take larger and larger bounded sets and demand δ\delta-closeness. See Fig. 4.

Remark 5.2.

In view of the nonsectoriality of the family \mathcal{L}, spectral stability does not immediately imply linear exponential stability, i.e. it is unknown whether nearby translates of the spectrally stable wave decay exponentially onto the wave in forward time. \triangle

Remark 5.3.

Lastly, we note that as the essential spectrum is entirely contained in the left half plane, any absolute spectrum will also be contained in the left half plane, and so will play no role in destabilising the underlying wave. \triangle

Refer to caption
Refer to caption
Figure 4: Left: a plot of the essential spectrum (shaded regions) of the family \mathcal{L} illustrating its qualitative features, and noting the regions Ω\Omega and the 𝒜i\mathcal{A}_{i} partitioning the complex plane via the Fredholm borders (continuous spectrum). The asymptotics of the essential spectrum for large spatial modes means that the family of operators \mathcal{L}, for non-zero ε\varepsilon is not sectorial. Right: a plot comparing the Fredholm border of A(λ,ε)A^{-}(\lambda,\varepsilon) (solid line - blue online) to the Fredholm border of the reduced problem linearised about the constant solution U¯=0\bar{U}^{-}=0 with ε=0\varepsilon=0 (dashed line - red online). As ε0\varepsilon\to 0, the solid line follows the dashed line for a longer region in the left half plane, before eventually ‘flaring’ up. For the figures, ε=0.1\varepsilon=0.1 (where it is present) and c=1c=1, with DD and RR as in (3) and (4).

6   Augmented unstable bundles

Having shown that the essential spectrum lies to the left of Ω\Omega in the left-half complex plane, we now focus on finding the point spectrum σp()\sigma_{p}(\mathcal{L}) consisting of isolated eigenvalues of finite multiplicity. To that end, we fix a a simple, closed curve KΩK\subset\Omega, such that σpK\sigma_{p}\cap K is empty. In view of the hyperbolicity of the asymptotic systems a±(λ,ε)a^{\pm}(\lambda,\varepsilon), we henceforth refer to the values in the point spectrum as eigenvalues. Specifically, we rely on the following geometric characterization of the eigenvalues of (24).

Definition 6.1.

The number λΩ\lambda\in\Omega is an eigenvalue of (λ)\mathcal{L}(\lambda) if there exists a nontrivial uniformly bounded solution y(ξ)y(\xi) of (27) (equivalently, if the bounded solution y(ξ)y(\xi) decays to zero as ξ±\xi\to\pm\infty; see [1]). \triangle

In this section we construct two augmented unstable bundles ε(K)\mathcal{E}_{\varepsilon}(K) and 0(K)\mathcal{E}_{0}(K), over a cylindrical base space formed by extending KK along a compactified time variable. As we will outline in Sec. 6.2, our goal is to use a particular correspondence theorem that relates a certain topological invariant of these complex vector bundles to the eigenvalue count of (λ)\mathcal{L}(\lambda) inside KK.

6.1   Construction of the augmented unstable bundle ε(K)\mathcal{E}_{\varepsilon}(K) for ε>0\varepsilon>0

Following [8], we now use general facts in [1] together with Lemma 4.2 to construct (un)stable complex vector bundles of solutions along the wave for each sufficiently small ε>0\varepsilon>0.

Definition 6.2.

Let λΩ\lambda\in\Omega. For ε>0\varepsilon>0 sufficiently small, the unstable bundle and stable bundle φ(ξ,λ,ε)\varphi^{-}(\xi,\lambda,\varepsilon) resp. φ+(ξ,λ,ε)\varphi^{+}(\xi,\lambda,\varepsilon) are the linear subspaces of solutions along the wave Γε\Gamma_{\varepsilon} defined by the conditions

φ(ξ,λ,ε)\displaystyle\varphi^{-}(\xi,\lambda,\varepsilon) span{es,2(λ,ε)}\displaystyle\to\text{span}\{e_{s,2}^{-}(\lambda,\varepsilon)\} as ξ\displaystyle\text{as~{}~{}}\xi\to-\infty\hskip 28.45274pt (52)
φ+(ξ,λ,ε)\displaystyle\varphi^{+}(\xi,\lambda,\varepsilon) span{es,1+(λ,ε),ef+(λ,ε)}\displaystyle\to\text{span}\{e_{s,1}^{+}(\lambda,\varepsilon),e_{f}^{+}(\lambda,\varepsilon)\} as ξ+.\displaystyle\text{as~{}~{}}\xi\to+\infty.\hskip 28.45274pt

The asymptotic convergence to the corresponding (un)stable subspaces of a±(λ,ε)a^{\pm}(\lambda,\varepsilon) is defined with respect to the topologies of the appropriate Grassmannians G1,3G_{1,3} and G2,3G_{2,3}. \triangle

The linear subspace φ\varphi^{-} (resp. φ+\varphi^{+}) can be viewed as a complex line bundle (resp. 2-plane bundle) over the base space (ξ,λ)×Ω(\xi,\lambda)\in\mathbb{R}\times\Omega. In view of Definitions 6.1 and 6.2, λΩ\lambda\in\Omega is an eigenvalue if and only if φ(ξ,λ,ε)\varphi^{-}(\xi,\lambda,\varepsilon) and φ+(ξ,λ,ε)\varphi^{+}(\xi,\lambda,\varepsilon) intersect nontrivially for some (and hence all) ξ\xi\in\mathbb{R}.

We now describe a general construction for obtaining an augmented unstable bundle ε(K)\mathcal{E}_{\varepsilon}(K) over the real 2-sphere S2S^{2}, using φ\varphi^{-}; we refer to [1] for details. The approach is to first compactify ξ\xi using the change of variable ξτ\xi\to\tau, where

τ\displaystyle\tau^{\prime} =εκ(1τ2),\displaystyle=\varepsilon\kappa(1-\tau^{2}),
τ(0)\displaystyle\tau(0) =0,\displaystyle=0,

for κ>0\kappa>0 chosen such that τ(ξ)±1\tau(\xi)\to\pm 1 as ξ±\xi\to\pm\infty. With respect to this compactified ‘timescale’, φ(τ,λ,ε)\varphi^{-}(\tau,\lambda,\varepsilon) then specifies a line bundle over (1,1)×Ω(-1,1)\times\Omega; a bundle can then be defined over the restricted base space (1,1)×K(-1,1)\times K. This base space is homeomorphic to an infinite cylinder (equivalently, a finite cylinder with its caps removed). Our first goal is to continuously extend the line bundle over the closure of the cylinder (i.e. over the base space [1,1]×K[-1,1]\times K). After this we fill in the caps, altogether producing a complex vector bundle over the closed cylinder, i.e. over S2S^{2}.

Let us explain this continuous extension. First recall that every complex vector bundle can be realized as a pullback of the universal bundle over a Grassmannian, as follows. Given a (complex) vector bundle :EB×nB\mathcal{E}:E\subset B\times\mathbb{C}^{n}\to B with projection map π\pi, there is a natural map e^:BGk,n\hat{e}:B\to G_{k,n}, which assigns to each bBb\in B the complex kk-dimensional linear fiber π1(b)\pi^{-1}(b), considered as a kk-plane in Gk,nG_{k,n}. On the other hand, the universal bundle Γk(n)Gk,n\Gamma_{k}(\mathbb{C}^{n})\to G_{k,n} is a trivial (tautological) construction: the fiber above each element in Gk,nG_{k,n} is defined to be the corresponding kk-plane. Then \mathcal{E} can be realized as e^Γk(n)\hat{e}^{*}\Gamma_{k}(\mathbb{C}^{n}):

E{E}Γk(n){\Gamma_{k}(\mathbb{C}^{n})}B{B}Gn,k{G_{n,k}}π\scriptstyle{\pi}e^\scriptstyle{\hat{e}}

Thus, we say that φ|(1,1)×K\varphi^{-}|_{(-1,1)\times K} continuously extends to a bundle over [1,1]×K[-1,1]\times K if the natural map e^:(1,1)×KG1,3\hat{e}:(-1,1)\times K\to G_{1,3} extends continuously to e~:[1,1]×KG1,3\tilde{e}:[-1,1]\times K\to G_{1,3}. The extension to τ=1\tau=-1 is clear: for each λK\lambda\in K, φ\varphi^{-} continuously extends to es,2(λ,ε)e^{-}_{s,2}(\lambda,\varepsilon) as τ1\tau\to-1 by definition. At the other end, we need to specify that λK\lambda\in K is not an eigenvalue of (27). General considerations about ω\omega-limit sets relative to hyperbolic fixed points (see [1]) then imply that φ(τ,λ,ε)\varphi^{-}(\tau,\lambda,\varepsilon) continuously extends to the corresponding unstable eigenvector es,2+(λ,ε)e^{+}_{s,2}(\lambda,\varepsilon) as τ+1\tau\to+1. We note that this limit can be made uniform in λ\lambda over the closed contour KK.

It remains to fill in the interiors of the caps C={τ=1}×K0C_{-}=\{\tau=-1\}\times K^{0} and C+={τ=+1}×K0C_{+}=\{\tau=+1\}\times K^{0} at either end of the cylinder. For each λK0\lambda\in K^{0}, this is achieved for both caps by gluing the corresponding unstable eigenvector es,2±(λ,ε)e^{\pm}_{s,2}(\lambda,\varepsilon).

Definition 6.3.

The augmented unstable bundle ε(K)\mathcal{E}_{\varepsilon}(K) is the complex line bundle over the cylindrical base space C[1,1]×KC+C_{-}\cup[-1,1]\times K\cup C_{+}, constructed from φ\varphi^{-} as described above. \triangle

This base space is homeomorphic to S2S^{2}. Our goal will be to relate the eigenvalue count to a particular characteristic class defined for complex vector bundles.

6.2   An outline of the proof of the stability theorem

We make use of the main theorem proved in [1]: the total number of eigenvalues (counted with algebraic multiplicity) inside KK is equal to the first Chern number, c1(ε(K))c_{1}(\mathcal{E}_{\varepsilon}(K)), of the complex vector bundle constructed above. The main advantage of this topological characterisation lies in the homotopy invariance of c1c_{1}.

We can now outline the steps of the proof of Theorem 2.11:

  1. 1.

    Construct a reduced augmented unstable vector bundle 0(K)\mathcal{E}_{0}(K) using a complex two-dimensional reduced (or slow) eigenvalue problem defined on the critical manifold S0S_{0}. This is accomplished in Sec. 6.4.

  2. 2.

    Show that for sufficiently small ε>0\varepsilon>0, we can continue ε(K)\mathcal{E}_{\varepsilon}(K) to 0(K)\mathcal{E}_{0}(K); i.e. we can construct a homotopy between them. By homotopy invariance, c1(ε(K))=c1(0(K))c_{1}(\mathcal{E}_{\varepsilon}(K))=c_{1}(\mathcal{E}_{0}(K)). This homotopy is constructed at the end of Sec. 7 only after several key estimates are proven; see Theorem 7.1.

  3. 3.

    Use the correspondence theorem in [1] in the ‘other direction,’ i.e. compute c1(0(K))c_{1}(\mathcal{E}_{0}(K)) by finding the slow eigenvalues using Sturm-Liouville techniques. We will verify that the reduced problem admits only a simple translational eigenvalue λ0=0\lambda_{0}=0, and another simple eigenvalue λ1\lambda_{1} with R(0)<Reλ1<0R^{\prime}(0)<\text{Re}\,\lambda_{1}<0. The slow eigenvalues are computed in Sec. 8.

Steps 1 and 3 turn out to be relatively straightforward; most of the technical issues lie in proving the uniform estimates required to construct the homotopy in step 2. To summarize, we will show that for any fixed contour KK\subset\mathbb{C} to the right of the essential spectrum, there exists ε¯>0\bar{\varepsilon}>0 so that for each ε(0,ε¯]\varepsilon\in(0,\bar{\varepsilon}], the following statements hold:

{# of eigenvalues of  inside K}\displaystyle\{\#\text{ of eigenvalues of }\mathcal{L}\text{ inside }K\} =c1(ε(K))=c1(0(K))2,\displaystyle=c_{1}(\mathcal{E}_{\varepsilon}(K))=c_{1}(\mathcal{E}_{0}(K))\leq 2,

where the first equality follows from the general theory in [1]; the second follows from the Step 2 above; and the third follows from Steps 1 & 3. By choosing smaller contours K0K_{0} and K1K_{1} surrounding only λ0\lambda_{0} resp. λ1\lambda_{1} (and shrinking ε¯\bar{\varepsilon} finitely many times if necessary so that the corresponding homotopies can be constructed for all three contours K,K0K,\,K_{0}, and K1K_{1} simultaneously), we obtain linearized stability of the one-parameter family of waves Γε\Gamma_{\varepsilon} for 0<ε<ε¯0<\varepsilon<\bar{\varepsilon}.

6.3   Slow eigenvalues and the jump map

We now proceed with step 1 of the stability proof. In order to construct the reduced bundle, we must first define the appropriate geometric criteria for a slow eigenvalue λ\lambda\in\mathbb{C} for (25). Let (U¯(ζ),P¯(ζ))(\bar{U}(\zeta),\bar{P}(\zeta)) parametrize the segments of the singular heteroclinic orbit Γ0\Gamma_{0} from Hypothesis 2.7 on S0a,S0a,+S^{a,-}_{0}\cup S^{a,+}_{0}, so that the fast jump occurs at ζ=0\zeta=0.

Definition 6.4.

The complex number λΩ\lambda\in\Omega is called a slow eigenvalue of (25) if there exists a pair of nontrivial, uniformly bounded solutions on S0a,FS^{a,-}_{0}\cup F_{-} resp. S0a,+S^{a,+}_{0}, denoted

YL(λ,ζ)\displaystyle Y_{L}(\lambda,\zeta) for ζ0 and\displaystyle\text{ for }\zeta\leq 0\text{ and} (53)
YR(λ,ζ)\displaystyle Y_{R}(\lambda,\zeta) for ζ0\displaystyle\text{ for }\zeta\geq 0

of the desingularized slow eigenvalue problem (47), such that

YR(λ,0)\displaystyle Y_{R}(\lambda,0) =Jλ(YL(λ,0)),\displaystyle=J_{\lambda}\left(Y_{L}(\lambda,0)\right), (54)

where Jλ:22J_{\lambda}:\mathbb{C}^{2}\to\mathbb{C}^{2} is the (linear) jump map defined by

Jλ(P,V)\displaystyle J_{\lambda}(P,V) =(1R(U¯J)R(U¯F)λ(U¯JU¯F)cU¯FP¯F0cU¯JP¯FcU¯FP¯F)(PV).\displaystyle=\begin{pmatrix}1&\frac{R(\bar{U}_{J})-R(\bar{U}_{F})-\lambda(\bar{U}_{J}-\bar{U}_{F})}{c\bar{U}_{F}-\bar{P}_{F}}\\ 0&\frac{c\bar{U}_{J}-\bar{P}_{F}}{c\bar{U}_{F}-\bar{P}_{F}}\end{pmatrix}\begin{pmatrix}P\\ V\end{pmatrix}. (55)

\triangle

The jump map will be derived rigorously as a singular limit in Sec. 7.3.

Remark 6.5.

Suppose that λ\lambda is a slow eigenvalue. Owing to the asymptotic hyperbolicity of (47) as ζ±\zeta\to\pm\infty, the pair of nontrivial, uniformly bounded solutions (53) must decay to zero in forward (resp. reverse) time. Given two nontrivial, uniformly bounded ‘half-solutions’ YL(λ,ζ)Y_{L}(\lambda,\zeta) and YR(λ,ζ)Y_{R}(\lambda,\zeta) as given in (53), any nontrivial complex scalar multiple of these solutions Y~L(λ,ζ)=KYL(λ,ζ)\tilde{Y}_{L}(\lambda,\zeta)=KY_{L}(\lambda,\zeta) and Y~R(λ,ζ)=KYR(λ,ζ)\tilde{Y}_{R}(\lambda,\zeta)=KY_{R}(\lambda,\zeta) also satisfy the jump condition (54) owing to the linearity of the jump map (55).

It is instructive to consider the case λ=0\lambda=0, corresponding to a reduction of the translational eigenvalue. The reduced vector field provides (complex scalar multiples of) nontrivial, uniformly bounded linear solutions on S0a,FS^{a,-}_{0}\cup F_{-} and S0a,+S^{a,+}_{0} as required in (53). Indeed, any such solution pair YL,YRY_{L},\,Y_{R}, separately defined along the segments of the wave lying in S0a,FS^{a,-}_{0}\cup F_{-} and S0a,+S^{a,+}_{0}, must satisfy

YL(λ,0)\displaystyle Y_{L}(\lambda,0) =(PF,VF)=K1(R(U¯F),cU¯FP¯F)\displaystyle=(P_{F},V_{F})=K_{1}(R(\bar{U}_{F}),c\bar{U}_{F}-\bar{P}_{F})
YR(λ,0)\displaystyle Y_{R}(\lambda,0) =(PJ,VJ)=K2(R(U¯J),cU¯JP¯F),\displaystyle=(P_{J},V_{J})=\,K_{2}(R(\bar{U}_{J}),c\bar{U}_{J}-\bar{P}_{F}),

where K1,K20K_{1},\,K_{2}\neq 0 denote complex scalars. On the other hand,

J0(PF,VF)\displaystyle J_{0}(P_{F},V_{F}) =K1(R(U¯F)+(R(U¯J)R(U¯F))(cU¯JP¯F))\displaystyle=K_{1}\begin{pmatrix}R(\bar{U}_{F})+(R(\bar{U}_{J})-R(\bar{U}_{F}))\\ (c\bar{U}_{J}-\bar{P}_{F})\end{pmatrix}
=K1(R(U¯J)cU¯JP¯F)\displaystyle=K_{1}\begin{pmatrix}R(\bar{U}_{J})\\ c\bar{U}_{J}-\bar{P}_{F}\end{pmatrix}
=(K1/K2)(PJ,VJ).\displaystyle=(K_{1}/K_{2})(P_{J},V_{J}).

The jump condition 54 is therefore satisfied by rescaling one of the solutions if necessary, and hence 0 is a slow eigenvalue. Observe that the reduced vector field provides a natural scaling K1=K2=1K_{1}=K_{2}=1 such that the jump condition is automatically satisfied, but this is a special constraint that arises from the relationship between the reduced vector field along the singular limit of the wave and the variational equations, i.e. the linearised problem when λ=0\lambda=0. We do not generally expect (or need) to find such a natural scaling when λ0\lambda\neq 0.

We end this remark by comparing our definition of a slow eigenvalue to that in [8]. Motivated by the behavior of their slow line subbundle in the fast inner layer as ε0\varepsilon\to 0, their slow eigenvalue problem is defined by asking for continuous solutions across the jump. Hence, 0 is necessarily not a slow eigenvalue in their construction, since the tangent vector of a continuous solution at the end of the jump is transverse to the reduced vector field at the jump-on point on S0a,+S^{a,+}_{0}. The translational eigenvalue is instead counted by their fast eigenvalue problem. \triangle

6.4   The construction of 0(K)\mathcal{E}_{0}(K)

We work with the projectivization of (47) on \mathbb{CP}:

Z^˙\displaystyle\dot{\hat{Z}} =W^0(Z^,ζ,λ).\displaystyle=\hat{W}_{0}(\hat{Z},\zeta,\lambda). (56)

Let Z^L(ζ,λ)\hat{Z}_{L}(\zeta,\lambda) denote the (unique) solution of (56) on S0a,S^{a,-}_{0} which tends to the image of the unstable eigenvector of the (projective) asymptotic reduced system W^0(Z^,λ)\hat{W}^{-}_{0}(\hat{Z},\lambda) as ζ\zeta\to-\infty. We write ZL(ζ,λ)=(PL(ζ,λ),VL(ζ,λ))π1(Z^L)Z_{L}(\zeta,\lambda)=(P_{L}(\zeta,\lambda),V_{L}(\zeta,\lambda))\in\pi^{-1}(\hat{Z}_{L}). Let us denote by ZR(ζ,λ)=(PR(ζ,λ),VR(ζ,λ))Z_{R}(\zeta,\lambda)=(P_{R}(\zeta,\lambda),V_{R}(\zeta,\lambda)) the corresponding solution on S0a,+S^{a,+}_{0} for ζ0\zeta\geq 0 which satisfies the jump condition (54) (note that this gives a concrete initial condition for the solution on S0a,+S^{a,+}_{0}). We find the corresponding embedded solutions in 3\mathbb{R}^{3} using

EL(ζ,λ)\displaystyle E_{L}(\zeta,\lambda) =ι0(ζ,λ)ZL(ζ,λ)\displaystyle=\iota_{0}(\zeta,\lambda)Z_{L}(\zeta,\lambda)
ER(ζ,λ)\displaystyle E_{R}(\zeta,\lambda) =ι0(ζ,λ)ZR(ζ,λ),\displaystyle=\iota_{0}(\zeta,\lambda)Z_{R}(\zeta,\lambda),

where ι0\iota_{0} denotes the inclusion map into TS0TS_{0}.

Finally, we consider the projectivizations E^L,R(ζ,λ)2\hat{E}_{L,R}(\zeta,\lambda)\in\mathbb{CP}^{2}. In analogy to the construction of ε(K)\mathcal{E}_{\varepsilon}(K), we compactify the time variable and work instead with T(ζ)T(\zeta), where

T\displaystyle T^{\prime} =κ(1τ2)\displaystyle=\kappa(1-\tau^{2}) (57)
T(0)\displaystyle T(0) =0,\displaystyle=0, (58)

with 1T1-1\leq T\leq 1. When there are no slow eigenvalues within λK\lambda\in K, we can then continuously extend each of E^L,R\hat{E}_{L,R} to the appropriate cap of the compactified cylinder by taking

E^L(1,λ)\displaystyle\hat{E}_{L}(-1,\lambda) =e^2,0(λ)\displaystyle=\hat{e}_{2,0}^{-}(\lambda)
E^R(+1,λ)\displaystyle\hat{E}_{R}(+1,\lambda) =e^2,0+(λ)\displaystyle=\hat{e}_{2,0}^{+}(\lambda)

for each λKK0\lambda\in K\cup K^{0}.

We construct 0(K)\mathcal{E}_{0}(K) by gluing together vector bundles over two hemispheres, defined separately over the singular wave on S0a,±S^{a,\pm}_{0}. In particular, let

i0±(λ)\displaystyle i^{\pm}_{0}(\lambda) =limζ0±i0(ζ,λ)\displaystyle=\lim_{\zeta\to 0^{\pm}}i_{0}(\zeta,\lambda) (59)

and take

B\displaystyle B_{-} =B{τ0}\displaystyle=B\cap\{\tau\leq 0\}
B+\displaystyle B_{+} =B{τ0},\displaystyle=B\cap\{\tau\geq 0\},

denoting the two sets splitting the base space over left and right hemispheres. By pulling back E^L\hat{E}_{L} and E^R\hat{E}_{R} with respect to these restricted base sets, we can obtain two bundles R±\mathcal{E}^{\pm}_{R}. It remains to glue these bundles together over the hemisphere BB+B_{-}\cap B_{+}.

The fibers over BB+B_{-}\cap B_{+} of each bundle 0±\mathcal{E}^{\pm}_{0} is specified by

0|(BB+)\displaystyle\mathcal{E}^{-}_{0}|(B_{-}\cap B_{+}) =span{i0(λ)ZL,0(λ)}\displaystyle=\text{span}\{i_{0}^{-}(\lambda)Z_{L,0}(\lambda)\}
0+|(BB+)\displaystyle\mathcal{E}^{+}_{0}|(B_{-}\cap B_{+}) =span{i0+(λ)ZR,0}=span{i0+(λ)Jλ(ZL,0)(λ)},\displaystyle=\text{span}\{i_{0}^{+}(\lambda)Z_{R,0}\}=\text{span}\{i_{0}^{+}(\lambda)J_{\lambda}(Z_{L,0})(\lambda)\},

where ZL,0π1Z^L(0,λ)Z_{L,0}\in\pi^{-1}\hat{Z}_{L}(0,\lambda) and ZR,0π1Z^R(0,λ)Z_{R,0}\in\pi^{-1}\hat{Z}_{R}(0,\lambda). Then there is a bundle isomorphism

φH:0|{0}×K0+|{0}×K\displaystyle\varphi^{H}:\mathcal{E}^{-}_{0}|\{0\}\times K\to\mathcal{E}^{+}_{0}|\{0\}\times K (60)

defined by (the linear extension of)

φH(λ)i0(λ)ZL,0\displaystyle\varphi^{H}(\lambda)i_{0}^{-}(\lambda)Z_{L,0} =iR+(λ)ZR,0=iR+(λ)JλZL,0.\displaystyle=i_{R}^{+}(\lambda)Z_{R,0}=i_{R}^{+}(\lambda)J_{\lambda}Z_{L,0}. (61)

The basic properties of bundle isomorphism can be checked by using the linearity of the embeddings and the jump map.

Definition 6.6.

The reduced augmented unstable bundle 0(K)\mathcal{E}_{0}(K) is defined by the clutching operation

0\displaystyle\mathcal{E}_{0} =0φH0+,\displaystyle=\mathcal{E}^{-}_{0}\cup_{\varphi^{H}}\mathcal{E}^{+}_{0}, (62)

i.e. it is the bundle over the closed cylinder (S2\cong S^{2}) obtained by gluing together the hemispheric bundles along their common boundary via the equivalence relation

y\displaystyle y_{-} φH(λ)y.\displaystyle\sim\varphi^{H}(\lambda)y_{-}.

using the clutching map (60). \triangle

Remark 6.7.

We refer the reader to the standard treatments in differential topology for a general overview of clutching operations (see e.g. [2, 11]). We will find it useful to work instead with an equivalent bundle ~0(K)0(K)\tilde{\mathcal{E}}_{0}(K)\cong\mathcal{E}_{0}(K) obtained by projecting the fiber at each point over the base set onto the corresponding two components. It is straightforward to show that these two bundles are indeed homotopy equivalent by constructing the explicit homotopy that continuously bends each fiber as a homotopy parameter goes from 0 to 11; see the remark below Lemma 6.2 of [8] for details. \triangle

7   Comparison of the bundles ε(K)\mathcal{E}_{\varepsilon}(K) and 0(K)\mathcal{E}_{0}(K)

With the reduced objects from Sec. 6.4 in hand, let E^(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon) denote the (unique) projective solution of the ε\varepsilon-dependent family of solutions tending to the unstable subspace as ζ\zeta\to-\infty. By general arguments in [1], we have that λσp()\lambda\notin\sigma_{p}(\mathcal{L}) if and only if E^(ζ,λ,ε)e^f+(λ,ε)\hat{E}(\zeta,\lambda,\varepsilon)\to\hat{e}_{f}^{+}(\lambda,\varepsilon) (defined in Lemma 4.2) as ζ+\zeta\to+\infty. We want to show that particular solutions of the projectivization of the slow eigenvalue problem (47) uniformly approximate E^(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon) when ε>0\varepsilon>0 is sufficiently small .

Let Π:32\Pi:\mathbb{C}^{3}\to\mathbb{C}^{2} denote projection onto the slow coordinates, i.e.

Π(u,p,v)=(p,v).\displaystyle\Pi(u,p,v)=(p,v). (63)

We can also define the inclusion maps ι,ιR\iota,\iota_{R} in the obvious way. If Y,YRY,Y_{R} are solutions to the full system (in the slow timescale, with ε0\varepsilon\neq 0) and the linearized slow system (i.e. with ε=0\varepsilon=0), respectively, then we use the variables Z,Z02Z,Z_{0}\in\mathbb{C}^{2} to denote the projections:

Z\displaystyle Z =Π(Y)\displaystyle=\Pi(Y) (64)
Z0\displaystyle Z_{0} =Π(Y0).\displaystyle=\Pi(Y_{0}). (65)

Owing to the linearity of the eigenvalue problem, we can also induce a dynamical system for Z^\hat{Z} on projective space:

Z^˙\displaystyle\dot{\hat{Z}} =B^0(Z^,ζ,λ).\displaystyle=\hat{B}_{0}(\hat{Z},\zeta,\lambda). (66)

The goal now is to compare two solutions, denoted Z^\hat{Z}_{*} and Z^0\hat{Z}_{0}, defined via a singular limit of a solution of the full system and a solution of the reduced linear system, respectively (strictly speaking, we compare their embeddings in 3\mathbb{CP}^{3}). The solution Z^0\hat{Z}_{0} is the unique solution of the projectivized linearized slow system (56). On the other hand, the solution Z^\hat{Z}_{*} is defined as the projection of the full solution E(ζ,λ,ε)E(\zeta,\lambda,\varepsilon) onto the slow manifolds in the singular limit ε0\varepsilon\to 0; i.e. take Z=ΠEZ=\Pi E and choose a subsequence εn0\varepsilon_{n}\to 0 so that the limits

Z^(±1,λ,εn)\displaystyle\hat{Z}(\pm 1,\lambda,\varepsilon_{n}) =Z^±\displaystyle=\hat{Z}^{\pm}

both exist, and then define two solutions Z^±(ζ,λ)\hat{Z}^{\pm}(\zeta,\lambda) of the reduced problem (56) on ζ<0\zeta<0 and ζ>0\zeta>0 with initial conditions Z^±\hat{Z}^{\pm}, setting

ζ^(ζ,λ)\displaystyle\hat{\zeta}_{*}(\zeta,\lambda) ={ζ^(ζ,λ)ifζ<0ζ^+(ζ,λ)ifζ>0.\displaystyle=\begin{cases}\hat{\zeta}^{-}(\zeta,\lambda)&\text{if}\zeta<0\\ \hat{\zeta}^{+}(\zeta,\lambda)&\text{if}\zeta>0.\end{cases}

Finally, define the following two inclusions:

E0(ζ,λ)\displaystyle E_{0}(\zeta,\lambda) =ι0(ζ,λ)Z0(ζ,λ)\displaystyle=\iota_{0}(\zeta,\lambda)Z_{0}(\zeta,\lambda) (67)
E(ζ,λ)\displaystyle E_{*}(\zeta,\lambda) =ι0(ζ,λ)Z(ζ,λ).\displaystyle=\iota_{0}(\zeta,\lambda)Z_{*}(\zeta,\lambda).

We re-emphasize that the starred data is obtained from approximation of the full solution, whereas data denoted by ‘0’ is obtained only from solutions of the reduced problem on S0a,FS^{a,-}_{0}\cup F and S0a,+S^{a,+}_{0}, concatenated by an algebraic jump condition.

An approximation theorem can now be stated.

Theorem 7.1.

There exists ε¯>0\bar{\varepsilon}>0 such that for 0<ε<ε¯0<\varepsilon<\bar{\varepsilon} the following are true:

  • (a)

    limε0E^(ζ,λ,ε)=E^(ζ,λ)\lim_{\varepsilon\to 0}\hat{E}(\zeta,\lambda,\varepsilon)=\hat{E}_{*}(\zeta,\lambda) uniformly in a|ζ|Aa\leq|\zeta|\leq A, for each 0<a<1<A0<a<1<A.

  • (b)

    Let Z0(λ):=limζ0Z(ζ,λ)Z^{0-}_{*}(\lambda):=\lim_{\zeta\to 0^{-}}Z_{*}(\zeta,\lambda). Then

    Jλ(Z0(λ))=Z(0,λ)=limζ0+Z(ζ,λ).\displaystyle J_{\lambda}(Z^{0-}_{*}(\lambda))=Z_{*}(0,\lambda)=\lim_{\zeta\to 0^{+}}Z_{*}(\zeta,\lambda). (68)
  • (c)

    Suppose that λ\lambda is not a slow eigenvalue. Then

    Z^(ζ,λ)\displaystyle\hat{Z}_{*}(\zeta,\lambda) =Z^0(ζ,λ),\displaystyle=\hat{Z}_{0}(\zeta,\lambda),

    where

    Z^0(ζ,λ)\displaystyle\hat{Z}_{0}(\zeta,\lambda) ={Z^L(ζ,λ) for ζ<0Z^R(ζ,λ) for ζ0\displaystyle=\begin{cases}\hat{Z}_{L}(\zeta,\lambda)\text{ for }\zeta<0\\ \hat{Z}_{R}(\zeta,\lambda)\text{ for }\zeta\geq 0\end{cases} (69)

    is defined by projectivizations of a pair of solutions defined on S0a,FS^{a,-}_{0}\cup F resp. S0a,+S^{a,+}_{0} that tends to the projectivization of the unstable eigenvector in either direction. The pair also satisfies the projectivized jump condition from (90):

    Z^R(0,λ)\displaystyle\hat{Z}_{R}(0,\lambda) =s(u¯J,λ).\displaystyle=s(\bar{u}_{J},\lambda).

    Furthermore, E^(ζ,λ,ε)e^f+(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon)\to\hat{e}_{f}^{+}(\zeta,\lambda,\varepsilon) as ζ+\zeta\to+\infty.

Remark 7.2.

This theorem should be directly compared to the analogous convergence theorem 5.3 in [8]. Observe that the continuity condition in part (b) is replaced by a jump condition. In general we no longer expect the projected limiting solution ZZ_{*} to be continuous at ζ=0\zeta=0 (however, it is still right-continuous). \triangle

The subsections 7.2, 7.3, and 7.4 are devoted to proving Theorem 7.1 (a),(b), and (c), respectively. In the final subsection 7.5, we use this theorem to construct a homotopy between the bundles ε(K)\mathcal{E}_{\varepsilon}(K) and 0(K)\mathcal{E}_{0}(K).

7.1   Preliminary estimates

Before proving Theorem 7.1, we need some preliminary estimates that will allow us to control the dynamics of the eigenvalue problem within the slow manifolds, and as they connect from the fast layers to the slow manifolds. The key points are that although we do not have access to an elephant trunk lemma, (i) we may still define partial relatively invariant sets over the attracting branches Sεa,±S^{a,\pm}_{\varepsilon} of the critical manifold SεS_{\varepsilon} for sufficiently small values of ε>0\varepsilon>0, and (ii) the way in which the eigenvalue parameter λ\lambda enters the equations suggests that we can retain control of the dynamics using exchange lemma-type estimates. Let us highlight that the exchange lemma estimates are apparently new relative to the techniques introduced in [8]. Furthermore, the use of relatively invariant set theory applied to the case of two-plane dynamics (i.e. the construction of a so-called slow elephant trunk over a slow subbundle) also appears to be new.

7.1.1 Fast and slow elephant trunks over Sεa,±S^{a,\pm}_{\varepsilon}

In this section, we construct relatively invariant attracting sets for (28) defined within compact neighborhoods of Sεa,±S^{a,\pm}_{\varepsilon}. Here we closely follow the treatment of Sec. IV in [8]. Let

Ω=ξIΩ(ξ)×{ξ}\Omega=\cup_{\xi\in I}\Omega(\xi)\times\{\xi\}

denote a subset of 2×\mathbb{C}^{2}\times\mathbb{R} such that II is an open interval in \mathbb{R} and Ω(ξ)\Omega(\xi) is a neighborhood in 2\mathbb{C}^{2} for each ξ\xi, such that Ω2×I\partial\Omega\cap\mathbb{C}^{2}\times I is a smooth manifold (with Ω(ξ)\Omega(\xi) varying smoothly in ξ\xi). Furthermore, consider a sufficiently smooth nonautonomous system on 2\mathbb{C}^{2} of the form

β\displaystyle\beta^{\prime} =G(β,ξ,ε).\displaystyle=G(\beta,\xi,\varepsilon). (70)
Definition 7.3.

The set Ω\Omega is (positively) invariant relative to II if for any solution β(ξ)\beta(\xi) of (70) with β(ξ0)Ω(ξ0)\beta(\xi_{0})\in\Omega(\xi_{0}) for some ξ0I\xi_{0}\in I, we have β(ξ)Ω(ζ)\beta(\xi)\in\Omega(\zeta) for all ζζ0\zeta\geq\zeta_{0} for which ζI\zeta\in I. \triangle

We will define two distinct collections of relatively invariant sets over the disjoint branches Sεa,±S^{a,\pm}_{\varepsilon}, referring to one type as fast elephant trunks, denoted Ω±f\Omega_{\pm}^{f}, and to the other as slow elephant trunks, denoted Ω±s\Omega_{\pm}^{s}.

The starting point is to consider the auxiliary autonomous family of frozen systems associated with (70):

dβdξ\displaystyle\frac{d\beta}{d\xi} =G(β,γ,ε)\displaystyle=G(\beta,\gamma,\varepsilon) (71)

and to assume that (71) admits a smooth curve of critical points β0(γ,ε)\beta_{0}(\gamma,\varepsilon) (i.e. for each ε>0\varepsilon>0, β0\beta_{0} depends smoothly on γ\gamma). Thinking of (71) as the frozen family corresponding to a projectivization of a linearized system over a reference chart, such curves arise naturally by continuing the eigenvectors of the linear system along the travelling wave, which is parametrized by γ\gamma. We will consider subsets of the following form as candidates for elephant trunks:

Ω(d,η,ε)\displaystyle\Omega(d,\eta,\varepsilon) ={(β,η):|ββ0(γ,ε)|γ<η,γI(d)},\displaystyle=\{(\beta,\eta):|\beta-\beta_{0}(\gamma,\varepsilon)|_{\gamma}<\eta,\gamma\in I(d)\}, (72)

where the real parameter dd parametrizes a nested family of subintervals I(d)I(d) (i.e. I(d1)I(d2)I(d_{1})\subset I(d_{2}) whenever d1<d2d_{1}<d_{2}), and the metric ||γ|\cdot|_{\gamma} is defined according to the construction in Sec. IV. B. in [8]. As noted there, the parameter dd can be thought of as characterizing the slowly varying character of (70) (with respect to ξ\xi when ε>0\varepsilon>0 is sufficiently small), while η\eta characterizes the width of the tube. See Fig. 5 for a sketch of the elephant trunks we construct.

(a)Refer to caption (b)Refer to caption

Figure 5: Sketches of the (a) fast resp. (b) slow elephant trunks constructed in Lemmas 7.47.5.

We begin with the construction of the fast elephant trunks which are defined for (26), i.e. (28) with the frame variable having reversed orientation ξξ\xi\mapsto-\xi. Strictly speaking, we work with the (orientation-reversed) projectivized system (31).

Lemma 7.4.

For each λΩ\lambda\in\Omega (see (34)), there exist ε0>0\varepsilon_{0}>0 and d0>0d_{0}>0 such that for each ε(0,ε0]\varepsilon\in(0,\varepsilon_{0}] and d(0,d0]d\in(0,d_{0}], there is a pair of fast elephant trunks Ωf:=Ωf(d,η,λ,ε)\Omega_{-}^{f}:=\Omega_{-}^{f}(d,\eta,\lambda,\varepsilon) (over Sεa,S^{a,-}_{\varepsilon}) and Ω+f:=Ω+f(d,η,λ,ε)\Omega_{+}^{f}:=\Omega_{+}^{f}(d,\eta,\lambda,\varepsilon) (over Sεa,+S^{a,+}_{\varepsilon}); namely, Ωf\Omega_{-}^{f} (resp. Ω+f\Omega_{+}^{f}) is positively invariant relative to the nested families of sub-intervals I(d)I_{-}(d) (resp. I+(d)I_{+}(d)), where

I(d)={ξ:ξξ(δ(d))<0}I_{-}(d)=\{\xi:\xi\leq\xi_{-}(\delta(d))<0\}

and

I+(d)={ξ:ξξ+(δ(d))>0},I_{+}(d)=\{\xi:\xi\geq\xi_{+}(\delta(d))>0\},

where δ:[0,d0)[0,)\delta:[0,d_{0})\to[0,\infty) varies smoothly in dd with δ(0)=0\delta(0)=0, and ξ±(δ(d))\xi_{\pm}(\delta(d)) are defined so that:

  • the travelling wave x¯(ξ,ε)\bar{x}(\xi,\varepsilon) lies δ\delta-close to the singular limit of the travelling wave for all ξξ(δ)ξ¯\xi\leq\xi_{-}(\delta)\leq-\bar{\xi} and for all ξξ+(δ)ξ¯\xi\geq\xi_{+}(\delta)\geq\bar{\xi}, where ξ¯\bar{\xi} is defined so that:

  • the matrix A(ξ,λ,ε):=Gβ(β0(ξ,λ,ε),ξ,λ,ε)A(\xi,\lambda,\varepsilon):=G_{\beta}(\beta_{0}(\xi,\lambda,\varepsilon),\xi,\lambda,\varepsilon) has eigenvalues μ2μ1\mu_{2}-\mu_{1} and μ3μ1\mu_{3}-\mu_{1} uniformly negative and bounded away from 0 for all |ξ|ξ¯|\xi|\geq\bar{\xi}, where GG denotes the orientation reversal of the projectivized system (31).

Proof: See Appendix A.

Lemma 7.4 provides elephant trunks about the fast unstable directions near the slow manifolds in reverse time. In fact we will only require the fast elephant trunk near the fixed point at u¯=0\bar{u}=0 (i.e. over the branch Sεa,S^{a,-}_{\varepsilon}), in order to verify one part of a uniform closeness estimate of the unique solution E(ξ,λ,ε)E(\xi,\lambda,\varepsilon) of (28) with |E(0,λ,ε)|=1|E(0,\lambda,\varepsilon)|=1 and E(ξ,λ,ε)uE(\xi,\lambda,\varepsilon)\to u^{-} as ξ\xi\to-\infty. As we track E(ξ,λ,ε)E(\xi,\lambda,\varepsilon) for ξ\xi sufficiently large, we will also require a slow elephant trunk over Sεa,+S^{a,+}_{\varepsilon} which guarantees uniform closeness of EE near the slow subbundle. We work with the projectivized system (42).

Lemma 7.5.

For each λΩ\lambda\in\Omega (see (34)), there exist ε0>0\varepsilon_{0}>0 and d0>0d_{0}>0 such that for each ε(0,ε0]\varepsilon\in(0,\varepsilon_{0}] and d(0,d0]d\in(0,d_{0}], there is a pair of slow elephant trunks Ωs:=Ωs(d,η,λ,ε)\Omega_{-}^{s}:=\Omega_{-}^{s}(d,\eta,\lambda,\varepsilon) (over Sεa,S^{a,-}_{\varepsilon}) and Ω+s:=Ω+s(d,η,λ,ε)\Omega_{+}^{s}:=\Omega_{+}^{s}(d,\eta,\lambda,\varepsilon) (over Sεa,+S^{a,+}_{\varepsilon}) for system (42); namely, Ωs\Omega_{-}^{s} (resp. Ω+s\Omega_{+}^{s}) is positively invariant relative to the nested families of sub-intervals I(d)I_{-}(d) (resp. I+(d)I_{+}(d)), where

I(d)={ξ:ξξ(δ(d))<0}I_{-}(d)=\{\xi:\xi\leq\xi_{-}(\delta(d))<0\}

and

I+(d)={ξ:ξξ+(δ(d))>0},I_{+}(d)=\{\xi:\xi\geq\xi_{+}(\delta(d))>0\},

where δ:[0,d0)[0,)\delta:[0,d_{0})\to[0,\infty) varies smoothly in dd with δ(0)=0\delta(0)=0, and ξ±(δ(d))\xi_{\pm}(\delta(d)) are defined so that:

  • the travelling wave x¯(ξ,ε)\bar{x}(\xi,\varepsilon) lies δ\delta-close to the singular limit of the travelling wave for all ξξ(δ)ξ¯\xi\leq\xi_{-}(\delta)\leq-\bar{\xi} and for all ξξ+(δ)ξ¯\xi\geq\xi_{+}(\delta)\geq\bar{\xi}, where ξ¯\bar{\xi} is defined so that:

  • the matrix A(ξ,λ,ε):=Gβ(β0(ξ,λ,ε),ξ,λ,ε)A(\xi,\lambda,\varepsilon):=G_{\beta}(\beta_{0}(\xi,\lambda,\varepsilon),\xi,\lambda,\varepsilon) has eigenvalues uniformly negative and bounded away from 0 for all |ξ|ξ¯|\xi|\geq\bar{\xi}.

Proof: The steps are essentially identical to those shown in Appendix A for Lemma (7.4). We point out that the corresponding family of frozen systems for (42) admits a curve of attracting critical points, as described in the paragraph below (42). The remaining conditions are checked with direct calculation. \Box

Remark 7.6.

The slow elephant trunks in Lemma 7.5 define attracting invariant neighborhoods relative to a metric defined in 𝐆𝐫(2,3)\mathbf{Gr}(2,3). Switching instead to the Fubini-Study metric (which gives equivalent estimates up to a constant factor depending only on the metrics), it is a rather lengthy calculation to show that these planar neighborhoods can be in turn expressed as neighborhoods of complex lines relative to the slow subbundle. \triangle

7.1.2 Exchange lemma-type estimates

Per Fenichel’s original setup [7], we consider a singularly perturbed system of differential equations in n\mathbb{R}^{n} which admits a family of kk-dimensional normally hyperbolic invariant manifolds SεS_{\varepsilon}, with 0<k<n0<k<n, for ε(0,ε¯]\varepsilon\in(0,\bar{\varepsilon}], arising from a critical manifold S0S_{0} when ε=0\varepsilon=0. We remind the reader that n=3n=3 and k=2k=2 in our model, and we restrict our interest to the case of attracting slow invariant manifolds. We use freely a Fenichel normal form defined over a common neighborhood UU of SεS_{\varepsilon}, appended with its variational equations:

b\displaystyle b^{\prime} =Γ(b,y,ε)b\displaystyle=\Gamma(b,y,\varepsilon)b (73)
y\displaystyle y^{\prime} =εf(y,ε)\displaystyle=\varepsilon f(y,\varepsilon)
db\displaystyle db^{\prime} =Γdb+DzΓ(dzb)\displaystyle=\Gamma db+D_{z}\Gamma(dz\otimes b)
dy\displaystyle dy^{\prime} =εDyfdy,\displaystyle=\varepsilon D_{y}f\,dy,

where z:=(b,y)z:=(b,y) and dz:=(db,dy)dz:=(db,dy). In terms of the geometric objects introduced in Sec. 3, the system (73) may be regarded as an induced derivation on the tangent bundle, with the dynamics on the tangent vectors coordinatized according to the Plücker embedding. We highlight a few properties of Fenichel theory and the Fenichel normal form. The slow manifolds are given by Sε={b=0}S_{\varepsilon}=\{b=0\}, so that yky\in\mathbb{R}^{k} may be viewed as the slow variables. The stable manifold WεsW^{s}_{\varepsilon} is foliated by invariant fibers, with associated projection map π:WεsSε\pi^{-}:W^{s}_{\varepsilon}\to S_{\varepsilon}.

Refer to caption
Figure 6: Setup of the exchange lemma-type estimate using system (76). Here the projection π×πλ\pi^{-}\times\pi^{-}_{\lambda} refers to the projection onto the invariant manifold {b=0}\{b=0\}, extended by the corresponding invariant manifold given by the eigenvalue problem (76).

With respect to the straightened dynamics of the normal form, this stable fibration is given by (b,y)y(b,y)\mapsto y. Each function in (73) is regarded as ‘sufficiently’ smooth for every succeeding statement to hold, and we further assume that the spectra of the matrix functions Γ\Gamma and εDyf\varepsilon D_{y}f satisfy the necessary conditions for Fenichel’s Lyapunov type numbers. Specifically, for each yUy\in U we list the spectrum

specΓ(0,y,0)\displaystyle\text{spec}\,\Gamma(0,y,0) :={γ1(y),,γnk(y)}\displaystyle:=\{\gamma_{1}(y),\cdots,\gamma_{n-k}(y)\}

in ascending order of their real parts, i.e.

Re(γ1(y))Re(γnk(y)),\displaystyle\text{Re}\,(\gamma_{1}(y))\leq\cdots\leq\text{Re}\,(\gamma_{n-k}(y)),

and we assume a spectral gap Reγnk(y)<0\text{Re}\,\gamma_{n-k}(y)<0 uniformly for yU¯y\in\bar{U}. Define

γ0\displaystyle\gamma_{0} =supyUγnk(y).\displaystyle=\text{sup}_{y\in U}\gamma_{n-k}(y). (74)

Let us summarise the geometric ideas underlying the exchange lemma. We consider families of manifolds that enter neighborhoods of normally hyperbolic slow invariant manifolds in a generic way. The primary goal is to track both the position and orientation (i.e. the corresponding evolution of tangent spaces according to the variational equations) as trajectories enter near the stable foliation and spend sufficiently long times near the slow manifold. If the entry manifolds intersect the stable foliation transversally, this generic entry is ‘exchanged’ for exponential closeness, in both position and orientation, to the unstable foliation (or in our case, to the tangent space of the slow manifold) at exit.

This geometric result can be expressed explicitly in terms of the rectified variables in (73). We follow the treatment by Jones and Tin in [17]. Fix a parameter Δ>0\Delta>0 and consider a one-parameter family of entry manifolds {Mε}\{M_{\varepsilon}\} intersecting the stable manifold transversally at some section {|b|=Δ}\{|b|=\Delta\}. We focus on initial conditions within the entry manifolds that remain within the box specified by

BΔ\displaystyle B_{\Delta} :={|y|Δ,|b|Δ},\displaystyle:=\{|y|\leq\Delta,|b|\leq\Delta\}, (75)

leaving BΔB_{\Delta} through {|y|=Δ}\{|y|=\Delta\} only after a sufficiently long time, say 𝒯ε=1/εs\mathcal{T}_{\varepsilon}=1/\varepsilon^{s} for some 0<s<10<s<1. Let z(t)z(t) parametrize an incoming trajectory γ\gamma corresponding to such an initial condition lying on MεM_{\varepsilon}. The exchange lemma assures us that for κ(0,|γ0|)\kappa\in(0,|\gamma_{0}|) (see (74)), z(t)z(t) is C1C^{1}-𝒪(eκ𝒯\mathcal{O}(e^{-\kappa\mathcal{T}})-close to SεS_{\varepsilon} at t=𝒯εt=\mathcal{T}_{\varepsilon}, i.e.

dist(z(𝒯ε),Sε)\displaystyle\text{dist}(z(\mathcal{T}_{\varepsilon}),S_{\varepsilon}) =𝒪(eκ𝒯ε)\displaystyle=\mathcal{O}(e^{-\kappa\mathcal{T}_{\varepsilon}})
dist(Tz(𝒯ε)γ,Tπ(z(𝒯ε))Sε)\displaystyle\text{dist}(T_{z(\mathcal{T}_{\varepsilon})}\gamma,T_{\pi^{-}(z(\mathcal{T}_{\varepsilon}))}S_{\varepsilon}) =𝒪(eκ𝒯ε),\displaystyle=\mathcal{O}(e^{-\kappa\mathcal{T}_{\varepsilon}}),

where the dist(,)(\cdot,\cdot) map is defined in the usual way, after specifying some appropriate metrics. In view of the rectified coordinate system for (73), TxSε={db=0}T_{x}S_{\varepsilon}=\{db=0\} (the tangent space of SεS_{\varepsilon} at xx) for each xSεx\in S_{\varepsilon}. Complete details about the formulation of the problem and a statement of the Exchange Lemma are given in [17]. This general formulation is referred to as the (k+σ)(k+\sigma)-Exchange Lemma, where kk denotes the dimension of the local unstable manifolds of the critical manifold. In the present case k=0k=0, the Exchange Lemma is referred to as an inclination lemma [5].

The first step is to write down the ‘straightened’ eigenvalue problem. Although the coordinate change—i.e. the composition of straightening diffeomorphisms—which defines the Fenichel normal form is usually highly nonlinear and difficult to write down, we highlight that the linearisation of this composition acts linearly on the dynamical system defined on the tangent spaces.666Indeed, if the eigenvalue problem in the original coordinates is written as dx=A(x,ε)dx+ελMdxdx^{\prime}=A(x,\varepsilon)dx+\varepsilon\lambda M\cdot dx, where A(x,ε,λ)dxA(x,\varepsilon,\lambda)dx denotes the original variational equations and MM is a constant square matrix, and z=φ(x,ε)z=\varphi(x,\varepsilon) is the diffeomorphism giving rise to the Fenichel normal form, then dz~=B(z,ε)dz~+ελM~dz~\widetilde{dz}^{\prime}=B(z,\varepsilon)\widetilde{dz}+\varepsilon\lambda\widetilde{M}\cdot\widetilde{dz}, where Bdz~B\,\widetilde{dz} is the transformed variational equation and M~=DφMDφ1\widetilde{M}=D\varphi\circ M\circ D\varphi^{-1}. Hence, we obtain

b\displaystyle b^{\prime} =Γ(b,y,ε)b\displaystyle=\Gamma(b,y,\varepsilon)b (76)
y\displaystyle y^{\prime} =ε(f(y,ε)+H(b,y,ε)b)\displaystyle=\varepsilon(f(y,\varepsilon)+H(b,y,\varepsilon)b)
db~\displaystyle\widetilde{db}^{\prime} =Γdb~+DzΓ(dz~b)+ελG1(b,y,ε)dz~\displaystyle=\Gamma\widetilde{db}+D_{z}\Gamma(\widetilde{dz}\otimes b)+\varepsilon\lambda G_{1}(b,y,\varepsilon)\cdot\widetilde{dz}
dy~\displaystyle\widetilde{dy}^{\prime} =ε(Dyfdy~+DzH(dz~b)+Hdb~)+ελG2(b,y,ε)dz~.\displaystyle=\varepsilon(D_{y}f\,\widetilde{dy}+D_{z}H(\widetilde{dz}\otimes b)+H\widetilde{db})+\varepsilon\lambda G_{2}(b,y,\varepsilon)\cdot\widetilde{dz}.

Here, dz~:=(db~,dy~)\widetilde{dz}:=(\widetilde{db},\widetilde{dy}) and G1,G2G_{1},\,G_{2} are smooth functions that are defined from the ε\varepsilon-family of diffeomorphisms used to derive the Fenichel coordinates, and so are independent of λ\lambda. Let us write the transformed eigenvalue problem in (76) in the more compact form

dz~\displaystyle\widetilde{dz}^{\prime} =A(z,ε)dz~+ελM~(z,ε)dz~,\displaystyle=A(z,\varepsilon)\widetilde{dz}+\varepsilon\lambda\widetilde{M}(z,\varepsilon)\widetilde{dz}, (77)

with M~Δ,ε0:=maxzBΔ,0εε0M~(z,ε)\widetilde{M}_{\Delta,\varepsilon_{0}}:=\max_{z\in B_{\Delta},0\leq\varepsilon\leq\varepsilon_{0}}\widetilde{M}(z,\varepsilon). We drop the tilde notation for the remainder of the section for ease of reading. For T>0T>0, let QTQ_{T} denote the set of initial conditions within the box of width of Δ>0\Delta>0 so that for each q:=z(0)=(b0,y0)QTq:=z(0)=(b_{0},y_{0})\in Q_{T}, we have that z(t)z(t) remains within the Δ\Delta-box for each t[0,T]t\in[0,T].

Lemma 7.7.

Assume the hypotheses of the (k+σ)(k+\sigma) Exchange Lemma in the singularly-perturbed case (Theorem 6.7 in [17]) with k=0k=0 (i.e. the normally hyperbolic critical manifold does not admit fast unstable directions). Furthermore, let (γ(t,ε),dz(t,ε))(\gamma(t,\varepsilon),dz(t,\varepsilon)) be a one-parameter family of trajectories of (76) (where the components of the tangent vector are given by dz(t,ε)=(db(t,ε),dy(t,ε)dz(t,\varepsilon)=(db(t,\varepsilon),dy(t,\varepsilon)), so that γ(t,ε)\gamma(t,\varepsilon) satisfies the hypotheses for the invariant manifolds MεM_{\varepsilon} in [17]. Furthermore, let dz(0,ε)=:(db0(ε),dy0(ε))dz(0,\varepsilon)=:(db_{0}(\varepsilon),dy_{0}(\varepsilon)) satisfy the estimate

|dy0|Mε\displaystyle|dy_{0}|\geq M\varepsilon

for some M>0M>0 independent of ε\varepsilon, when ε>0\varepsilon>0 is sufficiently small, and fix β\beta with 0<β<10<\beta<1. Then there exists ε¯>0\bar{\varepsilon}>0 so that for 0<εε¯0<\varepsilon\leq\bar{\varepsilon},

dist(dz(𝒯ε),Tγ(𝒯ε)Sε)K2εβ\displaystyle\text{dist}(dz(\mathcal{T}_{\varepsilon}),T_{\gamma(\mathcal{T}_{\varepsilon})}S_{\varepsilon})\leq K_{2}\varepsilon^{\beta}

K2K_{2} depending only on ε¯\bar{\varepsilon} and Δ\Delta.

Proof: See Appendix C.

Remark 7.8.

Let us highlight a few points about Lemma 7.7 relative to the (k+σ)(k+\sigma) Exchange Lemma in the singularly-perturbed case. The first obvious difference is that the estimate is weaker, being only linear in order of ε\varepsilon. This is because we compare a generic incoming trajectory to the tangent space of the slow manifold (i.e. the set {b=0,db=0}\{b=0,\,db=0\}) instead of the slow subbundle, where the computation becomes more difficult. For our present purposes this weaker estimate is sufficient, since it is immediate that the slow subbundle is 𝒪(ε)\mathcal{O}(\varepsilon)-close to the tangent space of the slow manifold relative to the Fubini-Study metric (note: it is crucial here that the eigenvalue problem depends ‘weakly’ on λ\lambda, i.e. through terms of 𝒪(ελ)\mathcal{O}(\varepsilon\lambda) only).

However, we highlight that the system (76) is singularly-perturbed, with the identical normally hyperbolic critical manifold {b=0,db=0}\{b=0,\,db=0\} as that of the variational equations. By standard Fenichel theory, there exists a one-parameter family of (real) six-dimensional locally invariant attracting slow manifolds ε\mathcal{M}_{\varepsilon} for sufficiently small values of ε>0\varepsilon>0, which lie 𝒪(ε)\mathcal{O}(\varepsilon)-close in Hausdorff distance to the critical manifold; two of the dimensions come from the slow directions in the phase space and the remaining four come from the slow complex directions of the eigenvalue problem. We therefore expect that the tangent spaces of incoming trajectories under the flow of the eigenvalue problem align exponentially closely to that of ε\mathcal{M}_{\varepsilon}, i.e. there is an analogue to the standard Exchange Lemma estimate for the eigenvalue problem. Furthermore, the slow subbundle should provide the 𝒪(ε)\mathcal{O}(\varepsilon) correction of the tangent bundle of the slow manifold (i.e. it is 𝒪(ε2)\mathcal{O}(\varepsilon^{2})-close with respect to the Fubini-Study metric), in analogy to the standard computations using Fenichel theory. We illustrate this in Appendix D with a toy problem. \triangle

7.2   Estimates near the slow subbundle

In this subsection we prove Theorem 7.1 (a). Our aim is to prove a similar result to Corollary 5.6 in [8], namely that the projectivization E^(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon) of any nontrivial solution E(ζ,λ,ε)E(\zeta,\lambda,\varepsilon) lies uniformly close to the slow subbundle when the wave is near to the slow manifolds. A new technical issue here is that we do not have an elephant trunk estimate over the fast layer. We will instead combine our existing ‘partial’ fast and slow elephant trunks with our exchange lemma-type estimate to achieve this uniform closeness.

In the coming analysis we will work with projectivisations of the slow subbundle, denoted σ^s\hat{\sigma}_{s} and Σ^s\hat{\Sigma}_{s} as usual. Now fix any metric ρ\rho on 3\mathbb{CP}^{3}.

Definition 7.9.

For any set S^3\hat{S}\subset\mathbb{CP}^{3} and δ>0\delta>0, a δ\delta-neighborhood of S^\hat{S} is the set

Nδ(S^)\displaystyle N_{\delta}(\hat{S}) ={y^3:ρ(s^,y^)<δ for some s^S^}.\displaystyle=\{\hat{y}\in\mathbb{CP}^{3}:\rho(\hat{s},\hat{y})<\delta\text{ for some }\hat{s}\in\hat{S}\}. (78)

\triangle

We begin by estimating the closeness of E(ζ,λ,ε)E(\zeta,\lambda,\varepsilon) to Σs(ζ,λ,ε)\Sigma_{s}(\zeta,\lambda,\varepsilon) near S0a,±S^{a,\pm}_{0}. To do so, we consider the following family of autonomous frozen systems (and their projectivizations) corresponding to the linearized problem:

y\displaystyle y^{\prime} =a(γ,λ,ε)y\displaystyle=a(\gamma,\lambda,\varepsilon)y (79)
y^\displaystyle\hat{y}^{\prime} =a^(y^,γ,λ,ε).\displaystyle=\hat{a}(\hat{y},\gamma,\lambda,\varepsilon).

We refer the reader to a series of technical lemmas in Appendix B, culminating in the uniform estimate Corollary B.4 which is used in the following proof.

Proof of Theorem 7.1(a). Fix two constants aa and AA with 0<a<1<A0<a<1<A and suppose ζ[a,A]\zeta\in[a,A]. We select a representative solution Eπ1E^E\in\pi^{-1}\hat{E} scaled so that slow components |Z|1|Z|\leq 1 on aζAa\leq\zeta\leq A for each ε(0,ε¯)\varepsilon\in(0,\bar{\varepsilon}). It quickly follows that for ζ=1\zeta=1 fixed, we have ZZ+Z\to Z^{+} as ε0\varepsilon\to 0. Our primary task is to compare a solution ZZ representing Z^\hat{Z} to a solution ZZ_{*} representing the corresponding projectivisation of the reduced problem, i.e. we show that |ZZ|0|Z-Z_{*}|\to 0 uniformly on aζAa\leq\zeta\leq A as ε0\varepsilon\to 0.

Away from the jump, the projection onto the slow components Z=(P,V)Z=(P,V) and ZR=(PR,VR)Z_{R}=(P_{R},V_{R}) of the full and reduced systems, respectively, may be written as

Z˙\displaystyle\dot{Z} =B(ζ,λ,ε)Z+GΓ\displaystyle=B(\zeta,\lambda,\varepsilon)Z+G\Gamma
Z˙R\displaystyle\dot{Z}_{R} =BR(ζ,λ)ZR,\displaystyle=B_{R}(\zeta,\lambda)Z_{R},

where (suppressing the dependence of the phase space coordinate U¯\bar{U} on ζ\zeta and ε0\varepsilon\geq 0):

B(ζ,λ,ε)\displaystyle B(\zeta,\lambda,\varepsilon) =(0R(U¯)λD(U¯)1cD(U¯))\displaystyle=\begin{pmatrix}0&\frac{R^{\prime}(\bar{U})-\lambda}{D(\bar{U})}\\ -1&\frac{c}{D(\bar{U})}\end{pmatrix}
G(ζ,λ,ε)\displaystyle G(\zeta,\lambda,\varepsilon) =R(U¯)λ\displaystyle=R^{\prime}(\bar{U})-\lambda
Γ(ζ,λ,ε)\displaystyle\Gamma(\zeta,\lambda,\varepsilon) =UVD(U¯)\displaystyle=U-\frac{V}{D(\bar{U})}

and

BR(ζ,λ)\displaystyle B_{R}(\zeta,\lambda) =(0R(U¯)λD(U¯)1cD(U¯)).\displaystyle=\begin{pmatrix}0&\frac{R^{\prime}(\bar{U})-\lambda}{D(\bar{U})}\\ -1&\frac{c}{D(\bar{U})}\end{pmatrix}.

Here the term GΓG\Gamma can be thought of as a forcing term that is ‘turned on’ when ε>0\varepsilon>0; specifically, Γ\Gamma measures how far (the slow projection of) the solution YY is from solving the reduced eigenvalue problem. Evidently B(ζ,λ,ε)BR(ζ,λ)B(\zeta,\lambda,\varepsilon)\to B_{R}(\zeta,\lambda) as ε0\varepsilon\to 0, uniformly in the interval aζAa\leq\zeta\leq A. It remains to estimate the forcing term GΓG\Gamma uniformly within this ζ\zeta-interval. The term GG has a uniform singular limit for ε0\varepsilon\to 0, so we concern ourselves with the singular limit for Γ\Gamma. For δ>0\delta>0 arbitrarily chosen, Corollary B.4 provides a sufficiently small ε¯>0\bar{\varepsilon}>0 such that

E^(ζ,λ,ε)Nδ(Σs(ζ,λ,ε))\displaystyle\hat{E}(\zeta,\lambda,\varepsilon)\in N_{\delta}(\Sigma_{s}(\zeta,\lambda,\varepsilon))

for each ε(0,ε¯]\varepsilon\in(0,\bar{\varepsilon}] and ζa\zeta\geq a. A fixed representative E(ζ,λ,ε)E(\zeta,\lambda,\varepsilon) of the projectivised solution of the full linearized system can thus be made arbitrarily close to a linear combination of the basis vectors F1,F2F_{1},\,F_{2} by varying δ\delta (see the definitions of f1,f2f_{1},\,f_{2} in (40); we use capital letters to denote the appropriate timescale). Each of these basis vectors in turn has a singular limit. Indeed, we have

E(ζ,λ,ε)α1(ζ,λ)Rs,1(ζ,λ)+α2(ζ,λ)Rs,2(ζ,λ)\displaystyle E(\zeta,\lambda,\varepsilon)\to\alpha_{1}(\zeta,\lambda)R_{s,1}(\zeta,\lambda)+\alpha_{2}(\zeta,\lambda)R_{s,2}(\zeta,\lambda) (80)

as ε0\varepsilon\to 0, where Rs,1(ζ,λ)R_{s,1}(\zeta,\lambda) and Rs,2(ζ,λ)R_{s,2}(\zeta,\lambda) are the eigenvectors spanning the reduced slow subbundle. These are the required singular limits of the basis vectors F1,F2F_{1},\,F_{2}, and they admit explicit formulas:

F1Rs,1\displaystyle F_{1}\to R_{s,1} =(1/D(U¯),Vp,1)\displaystyle=(1/D(\bar{U}),V_{p},1)^{\top} (81)
F2Rs,2\displaystyle F_{2}\to R_{s,2} =(1/D(U¯),Vm,1),\displaystyle=(1/D(\bar{U}),V_{m},1)^{\top},

and α1(ζ,λ)\alpha_{1}(\zeta,\lambda) and α2(ζ,λ)\alpha_{2}(\zeta,\lambda) are uniformly bounded coefficients. Note that the basis vectors Rs,iR_{s,i}, i=1,2i=1,2 are derived similarly to the asymptotic case in (36), and the auxiliary quantities Vp,mV_{p,m} are defined analogously to νj,±\nu_{j,\pm} in (37). Finally, we write Γ\Gamma in a convenient form by including and rearranging terms:

Γ(ζ,λ,ε)\displaystyle\Gamma(\zeta,\lambda,\varepsilon) =U(ζ,ε)VD(U¯(ζ,0))+(1D(U¯(ζ,ε))1D(U¯(ζ,0)))V.\displaystyle=U(\zeta,\varepsilon)-\frac{V}{D(\bar{U}(\zeta,0))}+\left(\frac{1}{D(\bar{U}(\zeta,\varepsilon))}-\frac{1}{D(\bar{U}(\zeta,0))}\right)V. (82)

The term in parentheses converges uniformly to 0 and |V|1|V|\leq 1 by construction, and the first to terms also converge uniformly to zero by 81; hence |Γ|0|\Gamma|\to 0 uniformly on the required ζ\zeta-interval, and thus ZZZ\to Z_{*} uniformly on this interval by Gronwall’s inequality. Finally, the UU-components of EE also converge uniformly to those of EE_{*} by (80), (82), and Corollary B.4. Altogether, the uniform convergence of EE to EE_{*} on aζAa\leq\zeta\leq A follows from Gronwall’s inequality. The case ζ<0\zeta<0 proceeds identically, so we omit the proof. \Box

7.3   The jump map as a singular limit

We now construct the jump map used to define the slow eigenvalue problem in Def. 6.4. Our goal is to determine the fate of the slow components ys(ξ,λ,ε):=(p(ξ,λ,ε),v(ξ,λ,ε))y_{s}(\xi,\lambda,\varepsilon):=(p(\xi,\lambda,\varepsilon),v(\xi,\lambda,\varepsilon)) of y(ξ,λ,ε)y(\xi,\lambda,\varepsilon) across the fast layer. The difficulty in tracking the slow data in this inner layer is that ys(ξ,λ,ε)y_{s}(\xi,\lambda,\varepsilon) remains 𝒪(ε)\mathcal{O}(\varepsilon) while u(ξ,λ,ε)u(\xi,\lambda,\varepsilon) grows to 𝒪(1)\mathcal{O}(1) for a time interval that is 𝒪(1/ε)\mathcal{O}(1/\varepsilon); in other words, the linearised solution can be made to align arbitrarily closely to the fast fibres after crossing the fold, and they remain close throughout most of the jump. The directional information carried by the slow variables is not annihilated as ε0\varepsilon\to 0, however. The natural approach from the point of view of GSPT is to perform an ε\varepsilon-dependent rescaling of the fast linearized equations: for ε>0\varepsilon>0, let

εβ1\displaystyle\varepsilon\beta_{1} =p\displaystyle=p (83)
εβ2\displaystyle\varepsilon\beta_{2} =v.\displaystyle=v.

Then we have

u\displaystyle u^{\prime} =1c(εβ2(ελ+D(u¯))u)\displaystyle=\frac{1}{c}(\varepsilon\beta_{2}-(\varepsilon\lambda+D(\bar{u}))u) (84)
β1\displaystyle\beta_{1}^{\prime} =(R(u¯)λ)u\displaystyle=(R^{\prime}(\bar{u})-\lambda)u
β2\displaystyle\beta_{2}^{\prime} =cuεβ1,\displaystyle=cu-\varepsilon\beta_{1},

where a factor of ε\varepsilon has been cancelled from the latter two equations. The resulting equations limit to the following linear system as ε0\varepsilon\to 0:

u\displaystyle u^{\prime} =1cD(u¯)u\displaystyle=-\frac{1}{c}D(\bar{u})u (85)
β1\displaystyle\beta_{1}^{\prime} =(R(u¯)λ)u\displaystyle=(R^{\prime}(\bar{u})-\lambda)u
β2\displaystyle\beta_{2}^{\prime} =cu.\displaystyle=cu.

The first equation in (85) is the variational equation of the linearized layer problem (43), and thus has a family of nontrivial bounded solutions u(ξ)=K(v¯0F(u¯(ξ)))=K(du¯/dξ)u(\xi)=K(\bar{v}_{0}-F(\bar{u}(\xi)))=K(d\bar{u}/d\xi), where v¯0\bar{v}_{0} is the v¯\bar{v}-component of FF_{-}. It is then possible to calculate β1(ξ)\beta_{1}(\xi) and β2(ξ)\beta_{2}(\xi) explicitly. We find it more convenient to perform the intermediate calculations in the projective space 2\mathbb{CP}^{2}. This allows us to fix a free parameter by applying a smoothness condition across the fold.

We consider the projection of the linear system (85) on a copy of 2\mathbb{CP}^{2}, choosing the chart (s,g)=(β1/β2,u/β2)(s,g)=(\beta_{1}/\beta_{2},u/\beta_{2}), β20\beta_{2}\neq 0:

s\displaystyle s^{\prime} =(R(u¯)λ)gcsg\displaystyle=(R^{\prime}(\bar{u})-\lambda)g-csg (86)
g\displaystyle g^{\prime} =D(u¯)cg(R(u¯λ))g2.\displaystyle=-\frac{D(\bar{u})}{c}g-(R^{\prime}(\bar{u}-\lambda))g^{2}.

The gg-equation decouples, so we solve it directly to find

g(ξ)\displaystyle g(\xi) =v¯FF(u¯(ξ))cu¯(ξ)+C,\displaystyle=\frac{\bar{v}_{F}-F(\bar{u}(\xi))}{c\bar{u}(\xi)+C},

where CC\in\mathbb{C} is a constant. The ss equation is then given by

s\displaystyle s^{\prime} =((R(u¯)λ)cs)v¯FF(u¯(ξ))cu¯(ξ)+C.\displaystyle=((R^{\prime}(\bar{u})-\lambda)-cs)\frac{\bar{v}_{F}-F(\bar{u}(\xi))}{c\bar{u}(\xi)+C}. (87)
Refer to caption
Figure 7: Demonstration of the convergence of the projectivised dynamics (blueish curves) onto the reduced dynamics (red curve) as ε0\varepsilon\to 0 for λ=15\lambda=15. Solutions of reduced dynamical system are defined by concatenating the solution of the projectivized flow of dS/dζdS/d\zeta (see (89)), defined for U¯(0,U¯F)(U¯J,1)\bar{U}\in(0,\bar{U}_{F})\cup(\bar{U}_{J},1), with the graph of the map (90) defined for u¯[u¯F,u¯J]\bar{u}\in[\bar{u}_{F},\bar{u}_{J}]. The left dotted line denotes the fold u¯=u¯F\bar{u}=\bar{u}_{F} and the right dotted line denotes the jump curve u¯=u¯J\bar{u}=\bar{u}_{J} (and hence the fast dynamics takes place in between these dotted lines). The right dotted line is the jump curve, where the jump condition is used to concatenate the appropriate solutions. The wave speed was held at c=c00.199362c=c_{0}\approx 0.199362 for each value of ε\varepsilon.

Equation (87) is now a one-dimensional nonautonomous problem and can be solved explicitly, but we must first fix the constant CC to define an unambiguous jump condition. Such an ambiguity arises even at the projective level. For each ε>0\varepsilon>0, the scale of the linearized solution along the wave is set by one free parameter, whereas in the singular limit, we split the linearized dynamics into subsystems defined along the singular heteroclinic orbit. We are able to choose freely the scale of each corresponding linearized solution segment. It follows that these free parameters of the reduced systems may be constrained so that they are compatible with the scalings of the ‘full’ linearized solutions as ε0\varepsilon\to 0.

A suitable compatibility condition is the C1C^{1}-differentiability of the desingularized linearized flow across the fold. We compare (87) with the projectivization of the desingularized slow eigenvalue problem (47), which we write over the chart S=P/VS=P/V, with V0V\neq 0, as

dSdζ\displaystyle\frac{dS}{d\zeta} =(R(U¯)λcS+D(U¯)S2).\displaystyle=(R^{\prime}(\bar{U})-\lambda-cS+D(\bar{U})S^{2}). (88)

Using the chain rule, we write

SP¯dP¯dζ+SU¯dSdζ=dSdζ\displaystyle\frac{\partial S}{\partial\bar{P}}\frac{d\bar{P}}{d\zeta}+\frac{\partial S}{\partial\bar{U}}\frac{dS}{d\zeta}=\frac{dS}{d\zeta} =(R(U¯)λcS+D(U¯)S2)\displaystyle=(R^{\prime}(\bar{U})-\lambda-cS+D(\bar{U})S^{2}) (89)
dsdu¯\displaystyle\frac{ds}{d\bar{u}} =1cu¯C(R(U¯)λcS).\displaystyle=\frac{1}{c\bar{u}-C}(R^{\prime}(\bar{U})-\lambda-cS).

At the point Γ0F={X¯F=(u¯F,p¯F,v¯F)}\Gamma_{0}\cap F_{-}=\{\bar{X}_{F}=(\bar{u}_{F},\bar{p}_{F},\bar{v}_{F})\} where the singular orbit intersects the fold, we have D(U¯F)=0D(\bar{U}_{F})=0 and thus (dS/dζ)|X¯F=(S/U¯)(cu¯Fp¯F)(dS/d\zeta)|_{\bar{X}_{F}}=(\partial S/\partial\bar{U})(c\bar{u}_{F}-\bar{p}_{F}). By matching (S/U¯)(\partial S/\partial\bar{U}) and ds/du¯ds/d\bar{u} at X¯F\bar{X}_{F} we find

C\displaystyle C =p¯F.\displaystyle=\bar{p}_{F}.

At the projective level, the jump map uJs(uJ¯)u_{J}\mapsto s(\bar{u_{J}}) is defined from the solution s(u¯)s(\bar{u}) of the (complex) one-dimensional initial-value problem

dsdu¯\displaystyle\frac{ds}{d\bar{u}} =1cu¯p¯f(R(u¯)λcs)\displaystyle=\frac{1}{c\bar{u}-\bar{p}_{f}}(R^{\prime}(\bar{u})-\lambda-cs)
s(u¯F)\displaystyle s(\bar{u}_{F}) =s0.\displaystyle=s_{0}.

Note that cu¯p¯F>0c\bar{u}-\bar{p}_{F}>0 is bounded away from 0 for u¯[u¯F,u¯J]\bar{u}\in[\bar{u}_{F},\bar{u}_{J}] since cu¯Fp¯F>0c\bar{u}_{F}-\bar{p}_{F}>0 by Hypothesis 2.8 and u¯u¯f\bar{u}\geq\bar{u}_{f} across the jump. Explicitly, we have

s(u¯,λ)\displaystyle s(\bar{u},\lambda) =R(u¯)R(u¯F)λ(u¯u¯F)+s0(cu¯Fp¯F)cu¯p¯F\displaystyle=\frac{R(\bar{u})-R(\bar{u}_{F})-\lambda(\bar{u}-\bar{u}_{F})+s_{0}(c\bar{u}_{F}-\bar{p}_{F})}{c\bar{u}-\bar{p}_{F}} (90)

for u¯Fu¯u¯J\bar{u}_{F}\leq\bar{u}\leq\bar{u}_{J}. See Fig. 7 for a demonstration of the approximation of the ‘full’ linearized flow to that of the hybrid reduced problem for a nonzero value of λ\lambda.

The linear jump map (55) can be extracted from (90) by using the chart map s=p/vs=p/v. For a,bna,b\in\mathbb{C}^{n}, we define the equivalence relation aba\sim b if a=γba=\gamma b for some complex number γ0\gamma\neq 0. We then have

(p0,v0)(p,v)\displaystyle(p_{0},v_{0})\mapsto(p,v) =((p/v)v,v)=(sv,v)\displaystyle=((p/v)v,v)=(sv,v) (91)
(s,1)=(R(u¯)R(u¯F)λ(u¯u¯F)+s0(cu¯Fp¯F)cu¯p¯F,1)\displaystyle\sim(s,1)=\left(\frac{R(\bar{u})-R(\bar{u}_{F})-\lambda(\bar{u}-\bar{u}_{F})+s_{0}(c\bar{u}_{F}-\bar{p}_{F})}{c\bar{u}-\bar{p}_{F}},1\right)
(v0(R(u¯)R(u¯F)λ(u¯u¯F))+p0(cu¯Fp¯F)cu¯p¯F,v0)\displaystyle\sim\left(\frac{v_{0}(R(\bar{u})-R(\bar{u}_{F})-\lambda(\bar{u}-\bar{u}_{F}))+p_{0}(c\bar{u}_{F}-\bar{p}_{F})}{c\bar{u}-\bar{p}_{F}},v_{0}\right)
:=J(p0,v0,λ,u¯).\displaystyle:=J(p_{0},v_{0},\lambda,\bar{u}).

We can now define Jλ(P,V):=J(P,V,λ,U¯J)J_{\lambda}(P,V):=J(P,V,\lambda,\bar{U}_{J}). Our jump map is defined up to linear equivalence and is therefore clearly nonunique; however, any two linear choices induce the same projective jump map J^λ:=s(u¯J,λ)\hat{J}_{\lambda}:=s(\bar{u}_{J},\lambda) on \mathbb{CP}. This essentially completes the proof of Theorem 7.1(b).

7.4   Control from the reduced dynamics

Proof of Theorem 7.1(c). By Corollary B.4, the unique solution E^(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon) which tends to the unstable eigenvector at u^=0\hat{u}=0 as ζ\zeta\to-\infty remains uniformly close to the (projectivized) slow subbundle Σ^s\hat{\Sigma}_{s}.

Fix an interval a|ζ|Aa\leq|\zeta|\leq A, where 0<a<A0<a<A with AA\leq\infty. For fixed λ,ε\lambda,\varepsilon, let (β1(ξ),β2(ξ))(\beta_{1}(\xi),\beta_{2}(\xi)) denote the local coordinates of the solution E^(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon) which coincide with the coordinates used to define the slow projectivized system (32). Then by Lemma 4.2, for any δ>0\delta>0 we can find some sufficiently small ε¯=ε¯(δ)\bar{\varepsilon}=\bar{\varepsilon}(\delta) so that |β11/D(U¯)|δ|\beta_{1}-1/D(\bar{U})|\leq\delta for all a|ζ|Aa\leq|\zeta|\leq A and for all 0<εε¯0<\varepsilon\leq\bar{\varepsilon}. This estimate can be read off from the expressions for the reduced eigenvectors in (36).

Now consider the solution Z^(ζ,λ)\hat{Z}_{*}(\zeta,\lambda) written with the coordinate S(ζ)=P(ζ)/V(ζ)S_{*}(\zeta)=P(\zeta)/V(\zeta) with V0V\neq 0. The dynamics of S(ζ)S(\zeta) is determined by the system (46). At either limit ζ\zeta\to-\infty and ζ+\zeta\to+\infty, the corresponding asymptotic systems admit a pair of hyperbolic (saddle) fixed points, interpreted in the projectivized system as an attractor and a repeller. Let us consider the case ζ\zeta\to-\infty and denote the projectivized attractor (corresponding to the unstable eigenvector of the saddle) by u^(λ)\hat{u}^{-}(\lambda) and the projectivized repeller (stable eigenvector) by s^(λ)\hat{s}^{-}(\lambda). We seek to verify that the only possibility is that Z^(ζ,λ)u^(λ)\hat{Z}_{*}(\zeta,\lambda)\to\hat{u}^{-}(\lambda) as ζ\zeta\to-\infty (i.e. that E^(ζ,λ)r^s,1\hat{E}_{*}(\zeta,\lambda)\to\hat{r}_{s,1}^{-}, with rs,1r_{s,1}^{-} as defined in (36)).

So suppose that E^(ζ,λ)r^s,2\hat{E}_{*}(\zeta,\lambda)\to\hat{r}_{s,2}^{-} (the weak stable eigenvector) instead. Since the corresponding critical point s^\hat{s}^{-} is a repeller, let NN denote a repelling neighborhood of radius η>0\eta>0. Then by hypothesis, there exists some ζ1=ζ1(η)\zeta_{1}=\zeta_{1}(\eta) so that S(ζ,λ,ε)NS_{*}(\zeta,\lambda,\varepsilon)\in N for all ζζ1\zeta\leq\zeta_{1}. As long as V0V\neq 0 remains bounded away from zero, for each sufficiently small ε>0\varepsilon>0 we may choose a section Eπ1E^E\in\pi^{-1}\hat{E} so that

E(ζ,λ,ε)\displaystyle E(\zeta,\lambda,\varepsilon) =(1/D(u^),ac0/D(u^)+(ba)νm,,1)+𝒪(δ)\displaystyle=(1/D(\hat{u}),ac_{0}/D(\hat{u})+(b-a)\nu_{m,-},1)+\mathcal{O}(\delta)

where a+b=1a+b=1. Thus, |β1||\beta_{1}| and |β2||\beta_{2}| (the norms of the coordinates of E^\hat{E} with respect to the chart specified by (32)) remain uniformly bounded while SNS\in N. Now, observe that the β2\beta_{2} equation in (32) may be written as follows:

β2˙\displaystyle\dot{\beta_{2}} =(R(U¯)λ)cβ+D(U¯)S2+d(ζ,λ,ε),\displaystyle=(R^{\prime}(\bar{U}^{-})-\lambda)-c\beta+D(\bar{U}^{-})S^{2}+d(\zeta,\lambda,\varepsilon),

where d(ζ,λ,ε)d(\zeta,\lambda,\varepsilon) consists of terms characterizing the perturbation from the asymptotic system together with terms of the form (β11/D(U¯))(\beta_{1}-1/D(\bar{U})), which can be uniformly bounded as stated in the beginning of the proof. Hence, NN remains a repelling neighborhood of s^\hat{s}^{-} for the system above. But this implies that E^(ζ,λ,ε)\hat{E}(\zeta,\lambda,\varepsilon) can be chosen to remain in any small neighborhood of r^s,1(λ)\hat{r}^{-}_{s,1}(\lambda) as ζ\zeta\to-\infty, which contradicts the fact that E^(ζ,λ,ε)r^s,2\hat{E}(\zeta,\lambda,\varepsilon)\to\hat{r}^{-}_{s,2} as ζ\zeta\to-\infty. Hence, E^\hat{E}_{*} and E^0=ι0Z^0\hat{E}_{0}=\iota_{0}\hat{Z}_{0} coincide for ζ<0\zeta<0.

For the matching on the right hand-side, we have by Theorem 7.1(b) that the jump condition is identical and uniquely determined, so in fact E^\hat{E}_{*} and E^0\hat{E}_{0} coincide for ζ>0\zeta>0 as well. It remains to check that E^\hat{E} tends uniformly to r^s,1+\hat{r}_{s,1}^{+}. This step proceeds identically to the preceding argument, so we omit it. \Box

7.5   Continuing ε(K)\mathcal{E}_{\varepsilon}(K) to 0(K)\mathcal{E}_{0}(K)

We now prove the following corollary of Theorem 7.1:

Corollary 7.10.

There exists ε0>0\varepsilon_{0}>0 such that for each ε(0,ε0]\varepsilon\in(0,\varepsilon_{0}], ε(K)0(K)\mathcal{E}_{\varepsilon}(K)\cong\mathcal{E}_{0}(K). In particular, c1(ε(K))=c1(0(K))c_{1}(\mathcal{E}_{\varepsilon}(K))=c_{1}(\mathcal{E}_{0}(K)).

We follow the approach of Sec. VI-D in [8]. The method is quite natural. With respect to the compactified time TT-scaling (see (57)), the fast dynamics over the jump occurs over an 𝒪(ε)\mathcal{O}(\varepsilon)-interval straddling the midpoint of the interval [1,1][-1,1]. The approach is to first define two hemispheric bundles over the complement of this small interval in [1,1][-1,1], using the flow itself to construct the gluing map. We then construct a homotopy which closes this gap continuously as the homotopy parameter is varied. Theorem 7.1 is then used to show that the gluing map and bundles have nice singular limits: they are precisely the ones used to construct the reduced augmented unstable bundle 0(K)\mathcal{E}_{0}(K).

Proof of Corollary 7.10: Fix the pair of real parameters Sεa,,Sεa,+S^{a,-}_{\varepsilon},\,S^{a,+}_{\varepsilon} with 1<Sεa,<0<Sεa,+<1-1<S^{a,-}_{\varepsilon}<0<S^{a,+}_{\varepsilon}<1. We work with the parameter pair S=(Sεa,,σ)S=(S^{a,-}_{\varepsilon},\sigma), where σ[Sεa,,Sεa,+]\sigma\in[S^{a,-}_{\varepsilon},S^{a,+}_{\varepsilon}] is the homotopy parameter that we will continuously vary.

For ε>0\varepsilon>0 fixed, define the bundle (S,ε)\mathcal{E}(S,\varepsilon) as follows. The base space BSB_{S} of (S,ε)\mathcal{E}(S,\varepsilon) is a sphere obtained by gluing the following two hemispheres along their boundaries:

b(Sεa,)\displaystyle b_{-}(S^{a,-}_{\varepsilon}) =B{TSεa,}\displaystyle=B\cap\{T\leq S^{a,-}_{\varepsilon}\}
b+(σ)\displaystyle b_{+}(\sigma) =B{Tσ}.\displaystyle=B\cap\{T\geq\sigma\}.

We define two hemispheric bundles over these base spaces as follows:

(Sεa,,ε)\displaystyle\mathcal{E}^{-}(S^{a,-}_{\varepsilon},\varepsilon) =ε(K)|b(Sεa,)\displaystyle=\mathcal{E}_{\varepsilon}(K)|_{b_{-}(S^{a,-}_{\varepsilon})}
+(σ,ε)\displaystyle\mathcal{E}^{+}(\sigma,\varepsilon) =ε(K)|b+(σ).\displaystyle=\mathcal{E}_{\varepsilon}(K)|_{b_{+}(\sigma)}.

We complete the construction by defining a gluing map φS,ε:(Sεa,,ε)|bb++(σ,ε)\varphi_{S,\varepsilon}:\mathcal{E}^{-}(S^{a,-}_{\varepsilon},\varepsilon)|_{b_{-}\cap\,b_{+}}\to\mathcal{E}^{+}(\sigma,\varepsilon). Here “bb+b_{-}\cap b_{+}” refers to the intersection curve of the hemisphere boundaries, corresponding to a copy of KK in BSB_{S}. Let ζL\zeta_{L} and ζσ\zeta_{\sigma} be the values of ζ\zeta corresponding to T=Sεa,T=S^{a,-}_{\varepsilon} and T=σT=\sigma, respectively, defined using the initial value problem (57). Let E(ζ,λ,ε)E(\zeta,\lambda,\varepsilon) be the solution, which has data ELE_{L} at ζ=ζL\zeta=\zeta_{L} (by a time translation, if necessary), where ELE_{L} lies in the fiber of (Sεa,,ε)\mathcal{E}^{-}(S^{a,-}_{\varepsilon},\varepsilon) over (Sεa,,λ)(S^{a,-}_{\varepsilon},\lambda). A suitable gluing map is then the obvious one which is induced by the flow:

φS,εEL\displaystyle\varphi_{S,\varepsilon}E_{L} =E(ζσ,λ,ε).\displaystyle=E(\zeta_{\sigma},\lambda,\varepsilon). (92)

The bundle (S,ε)\mathcal{E}(S,\varepsilon) is then defined as

(S,ε)\displaystyle\mathcal{E}(S,\varepsilon) :=(Sεa,,ε)φS,ε+(σ,ε).\displaystyle:=\mathcal{E}^{-}(S^{a,-}_{\varepsilon},\varepsilon)\cup_{\varphi_{S,\varepsilon}}\mathcal{E}^{+}(\sigma,\varepsilon). (93)

Note that at σ=Sεa,\sigma=S^{a,-}_{\varepsilon}, the map φS,ε\varphi_{S,\varepsilon} is the identity (since ζL=ζσ\zeta_{L}=\zeta_{\sigma}). Furthermore, the flow map is continuous for each ε>0\varepsilon>0, implying that φS,ε\varphi_{S,\varepsilon} forms a homotopy of isomorphisms. These facts imply (see eg. Atiyah [2]) that

ε(K)\displaystyle\mathcal{E}_{\varepsilon}(K) (S,ε)).\displaystyle\cong\mathcal{E}(S,\varepsilon)). (94)

Note that we have fixed ε>0\varepsilon>0 up to now. Let us now fix Sεa,S^{a,-}_{\varepsilon} and Sεa,+S^{a,+}_{\varepsilon} and take ε>0\varepsilon>0 sufficiently small that Theorem 7.1 holds. By Theorem 7.1 (a) and (c),

limε0E^(T,λ,ε)\displaystyle\lim_{\varepsilon\to 0}\hat{E}(T,\lambda,\varepsilon) =E^0(T,λ)\displaystyle=\hat{E}_{0}(T,\lambda)

uniformly for (T,λ)b(Sεa,)b+(Sεa,+)(T,\lambda)\in b_{-}(S^{a,-}_{\varepsilon})\cup b_{+}(S^{a,+}_{\varepsilon}). Observe here that the information about the jump map is already encoded in the solution E^0(T,λ)\hat{E}_{0}(T,\lambda), as described in Theorem 7.1(c). Let us now determine the singular limit of the gluing map φS,ε\varphi_{S,\varepsilon}.

Let ζL\zeta_{L} and ζR\zeta_{R} correspond (as described earlier) to T=Sεa,T=S^{a,-}_{\varepsilon} and T=Sεa,+T=S^{a,+}_{\varepsilon}, respectively, and choose a solution Z0(ζ,λ)Z_{0}(\zeta,\lambda) of the reduced problem so that

E^0\displaystyle\hat{E}_{0} =\savestack\tmpbox\stretchto\scaleto\scalerel[width("ι0(ζ,λ)Z0(ζ,λ)")] 0.5ex\stackon[1pt]ι0(ζ,λ)Z0(ζ,λ)\tmpbox\displaystyle=\savestack{\tmpbox}{\stretchto{\scaleto{\scalerel*[width("\iota_{0}(\zeta,\lambda)Z_{0}(\zeta,\lambda)")]{\kern-0.6pt\bigwedge\kern-0.6pt}{\rule[-505.89pt]{4.30554pt}{505.89pt}}}{}}{0.5ex}}\stackon[1pt]{\iota_{0}(\zeta,\lambda)Z_{0}(\zeta,\lambda)}{\tmpbox}

It follows from Theorem 7.1(a) that

φS,0[ι0(Sεa,,λ)Z0(Sεa,,λ)]\displaystyle\varphi_{S,0}[\iota_{0}(S^{a,-}_{\varepsilon},\lambda)Z_{0}(S^{a,-}_{\varepsilon},\lambda)] =ι0(ζR,λ)Z0(ζR,λ).\displaystyle=\iota_{0}(\zeta_{R},\lambda)Z_{0}(\zeta_{R},\lambda). (95)

We now define the ‘singular’ limit of the glued hemispheric bundles (S,ε)\mathcal{E}(S,\varepsilon) in the obvious way:

(S,0)\displaystyle\mathcal{E}(S,0) :=0|b(Sεa,)φS,00|b+(Sεa,+).\displaystyle:=\mathcal{E}_{0}|b_{-}(S^{a,-}_{\varepsilon})\cup_{\varphi_{S,0}}\mathcal{E}_{0}|b_{+}(S^{a,+}_{\varepsilon}). (96)

We emphasize here that this is a topological limit, and so (S,ε)(S,0)\mathcal{E}(S,\varepsilon)\cong\mathcal{E}(S,0) directly—this is the advantage of using the augmented unstable bundle. It thus follows from (94) that ε(K)(S,0)\mathcal{E}_{\varepsilon}(K)\cong\mathcal{E}(S,0) for each ε>0\varepsilon>0 sufficiently small.

We have so far held Sεa,S^{a,-}_{\varepsilon} and Sεa,+S^{a,+}_{\varepsilon} fixed. Noting that the required smallness of ε\varepsilon from the above argument is independent of these parameters, we are free to send Sεa,S^{a,-}_{\varepsilon} to 0 from below and Sεa,+S^{a,+}_{\varepsilon} to 0 from above. By Theorem 7.1(b), φS,0\varphi_{S,0} approaches φH\varphi^{H} as defined in (60). \Box

8   Counting the slow eigenvalues

We now compute the slow eigenvalues λΩ\lambda\in\Omega; see Def. 6.4. We will adapt the technique presented in Section E of [8]; we split the projectivized equation (88) for (47) into its real and imaginary parts, with the spatial eigenvalue parameter written as λ=μ+iω\lambda=\mu+i\omega, and we consider separately the cases ω0\omega\neq 0 and ω=0\omega=0. Afterwards we may restrict our analysis to the real line; using comparison of solutions together with a Gronwall estimate, we demonstrate that there are no positive real eigenvalues. Here we need the monotonicity hypothesis 2.8 to ensure that we can work with the chart defined by V0V\neq 0 for λ0\lambda\geq 0. Generally speaking, the jump condition does not introduce any new complications with respect to these arguments.

Theorem 8.1.

There are exactly two slow eigenvalues in Ω\Omega, given by λ0=0\lambda_{0}=0 and λ1\lambda_{1}, with Im(λ1)=0\text{Im}(\lambda_{1})=0 and R(0)<λ1<0R^{\prime}(0)<\lambda_{1}<0. Both λ0\lambda_{0} and λ1\lambda_{1} are simple.

Remark 8.2.

As far as spectral stability of the wave for ε>0\varepsilon>0 is concerned, it suffices to verify that the eigenvalue with the largest real part in the point spectrum is the simple translational eigenvalue λ0\lambda_{0}. We resort to a numerical calculation using a Riccati-Evans function to determine the existence of the secondary eigenvalue λ1\lambda_{1}, but the simplicity of both eigenvalues are verified rigorously. \triangle

Proof that λ0=0\lambda_{0}=0 is the eigenvalue of largest real part:

We will prove that all slow eigenvalues are real, and furthermore there is no slow eigenvalue λ\lambda with Re(λ)>0\text{Re}(\lambda)>0. The asymptotic systems associated with the projectivized linearized slow flow (56) admit the fixed points u^,s^\hat{u}^{-},\,\hat{s}^{-} at U¯=0\bar{U}=0 and u^+,s^+\hat{u}^{+},\hat{s}^{+} at U¯=1\bar{U}=1, defined as follows:

u^(λ)\displaystyle\hat{u}^{-}(\lambda) =cc2+D(0)(R(0)λ)2D(0)\displaystyle=\frac{c-\sqrt{c^{2}+D(0)(R^{\prime}(0)-\lambda)}}{2D(0)} (97)
s^(λ)\displaystyle\hat{s}^{-}(\lambda) =c+c2+D(0)(R(0)λ)2D(0)\displaystyle=\frac{c+\sqrt{c^{2}+D(0)(R^{\prime}(0)-\lambda)}}{2D(0)}
u^+(λ)\displaystyle\hat{u}^{+}(\lambda) =cc2+D(1)(R(1)λ)2D(1)\displaystyle=\frac{c-\sqrt{c^{2}+D(1)(R^{\prime}(1)-\lambda)}}{2D(1)}
s^+(λ)\displaystyle\hat{s}^{+}(\lambda) =c+c2+D(1)(R(1)λ)2D(1).\displaystyle=\frac{c+\sqrt{c^{2}+D(1)(R^{\prime}(1)-\lambda)}}{2D(1)}.

Linear analysis verifies that u^±\hat{u}^{\pm} are attractors and s^±\hat{s}^{\pm} are repellers with respect to the projectivized asymptotic dynamics, corresponding to the asymptotic unstable resp. stable eigendirections. We remind the reader of the geometric characterisation of eigenvalues of (47) in terms of their asymptotic behavior; if λ\lambda is not a slow eigenvalue, it suffices to show that ‘the’ unique solution S0(ζ,λ)S_{0}(\zeta,\lambda) which tends to u^(λ)\hat{u}^{-}(\lambda) as ζ\zeta\to-\infty does not tend to s^+(λ)\hat{s}^{+}(\lambda) as ζ+\zeta\to+\infty.777Strictly speaking, S0S_{0} refers to a pair of solutions Sεa,S^{a,-}_{\varepsilon} and Sεa,+S^{a,+}_{\varepsilon}, defined for ζ0\zeta\leq 0 and ζ0\zeta\geq 0, which are uniquely defined by two constraints: the aforementioned asymptotic constraint at ζ=\zeta=-\infty, and the jump condition defined using (90).

We now write (56) in terms of its real and imaginary parts. Specifically, writing S=X+iYS=X+iY and λ=μ+iω\lambda=\mu+i\omega, we have

X˙\displaystyle\dot{X} =R(U¯)μcX+(X2Y2)D(U¯)\displaystyle=R^{\prime}(\bar{U})-\mu-cX+(X^{2}-Y^{2})D(\bar{U}) (98)
Y˙\displaystyle\dot{Y} =ωcY+2XYD(U¯).\displaystyle=-\omega-cY+2XYD(\bar{U}).

We consider the two subcases ω0\omega\neq 0 and ω=0\omega=0.

The subcase ω0\omega\neq 0. Let us focus on the case ω>0\omega>0; the case ω<0\omega<0 is similar. We have Im(u^±(λ))<0\text{Im}\,(\hat{u}^{\pm}(\lambda))<0 and Im(s^±(λ))>0\text{Im}\,(\hat{s}^{\pm}(\lambda))>0 for each λΩ\lambda\in\Omega, by applying the inequality R(1)<R(0)<Re(λ)R^{\prime}(1)<R^{\prime}(0)<\text{Re}\,(\lambda) on this set to the expressions (97) for the asymptotic fixed points. On the other hand, the half-plane {Y0}\{Y\leq 0\} is forward invariant since Y˙=ω<0\dot{Y}=-\omega<0 along {Y=0}\{Y=0\}. Furthermore, the jump condition is monotone decreasing in YY, i.e. Sεa,+(0,λ)Y=(Jλ(Sεa,(0,λ)))YSεa,(0,λ)YS^{a,+}_{\varepsilon}(0,\lambda)_{Y}=(J_{\lambda}(S^{a,-}_{\varepsilon}(0,\lambda)))_{Y}\leq S^{a,-}_{\varepsilon}(0,\lambda)_{Y}. Thus, if S(ζ,λ)=X(ζ,λ)+iY(ζ,λ)S(\zeta,\lambda)=X(\zeta,\lambda)+iY(\zeta,\lambda) remains on the chart specified by V0V\neq 0, then it is impossible that S(ζ,λ)s+(λ)S(\zeta,\lambda)\to s^{+}(\lambda) as ζ+\zeta\to+\infty.

It could happen that the solution S0(ζ,λ)S_{0}(\zeta,\lambda) leaves the chart by blowing up in the YY\to-\infty direction, emerging ‘on the other side’ of the half-plane {Y>0}\{Y>0\} from Y=+Y=+\infty to make a connection to s+(λ)s^{+}(\lambda). Let us show that any such blow-up leads to a contradiction. Following the style of Lemma 6.6 in [8], we observe that if the blow-up happens at ζ0=+\zeta_{0}=+\infty, then S0(ζ,λ)S_{0}(\zeta,\lambda) remains inside the closure of the image of {Y0}\{Y\leq 0\} on the Riemann sphere, and s+(λ)s^{+}(\lambda) is bounded away from this closed set; hence, we suppose that ζ0\zeta_{0}\in\mathbb{R}, and that ζ0\zeta_{0} is the smallest real number so that S0(ζ,λ)S_{0}(\zeta,\lambda) becomes unbounded as ζζ0\zeta\to\zeta_{0} but for which S0(ζ,λ)S_{0}(\zeta,\lambda) remains finite within the lower half-plane for all ζ<ζ0\zeta<\zeta_{0}. Then T0(ζ,λ):=S0(ζ,λ)1T_{0}(\zeta,\lambda):=S_{0}(\zeta,\lambda)^{-1} remains well-defined for all ζ\zeta sufficiently close to ζ0\zeta_{0}. This motivates the corresponding change in chart

s=XX2+Y2,\displaystyle s=\frac{X}{X^{2}+Y^{2}}, t=YX2+Y2.\displaystyle\hskip 28.45274ptt=\frac{-Y}{X^{2}+Y^{2}}. (99)

The corresponding dynamical system (98) expressed in the new chart is

s˙\displaystyle\dot{s} =D(U¯)+cs2ωst(R(U¯)μ)(s2t2)\displaystyle=-D(\bar{U})+cs-2\omega st-(R^{\prime}(\bar{U})-\mu)(s^{2}-t^{2}) (100)
t˙\displaystyle\dot{t} =ct2st(R(U¯)μ)+ω(s2t2),\displaystyle=ct-2st(R^{\prime}(\bar{U})-\mu)+\omega(s^{2}-t^{2}),

with the solution written as T0(ζ)=s(ζ)+it(ζ)T_{0}(\zeta)=s(\zeta)+it(\zeta). If YY\to-\infty, then both ss and tt tend to 0 as ζζ0\zeta\to\zeta_{0}; furthermore, t(ζ)>0t(\zeta)>0 for values of ζ<ζ0\zeta<\zeta_{0} sufficiently close to the blow-up value according to (99). The contradiction will arise by considering the behavior of t(ζ)t(\zeta) for ζ\zeta near ζ0\zeta_{0} using Taylor expansions. In the following argument, we suppose that ζ00\zeta_{0}\neq 0, i.e. we are not exactly at the jump and all two-sided limits of the relevant functions exist; we discuss the case ζ0=0\zeta_{0}=0 later. By Taylor expanding s(ζ)s(\zeta) around ζ=ζ0\zeta=\zeta_{0} and using the first equation in (100), we find that

s(ζ)\displaystyle s(\zeta) =D(U¯(ζ0))(ζζ0)+𝒪(ζζ0)2.\displaystyle=-D(\bar{U}(\zeta_{0}))(\zeta-\zeta_{0})+\mathcal{O}(\zeta-\zeta_{0})^{2}. (101)

Using the second equation in (100), we note that t˙=t¨=0\dot{t}=\ddot{t}=0 at ζ=ζ0\zeta=\zeta_{0}; however, we have t˙˙˙(ζ0)=2ωD(U¯(ζ0))2\dddot{t}(\zeta_{0})=2\omega D(\bar{U}(\zeta_{0}))^{2}, and so the Taylor expansion of t(ζ)t(\zeta) gives

t(ζ)\displaystyle t(\zeta) =ωD(U¯(ζ0))23(ζζ0)3+𝒪(ζζ0)4.\displaystyle=\frac{\omega D(\bar{U}(\zeta_{0}))^{2}}{3}(\zeta-\zeta_{0})^{3}+\mathcal{O}(\zeta-\zeta_{0})^{4}. (102)

But this implies that t(ζ)<0t(\zeta)<0 for ζ<ζ0\zeta<\zeta_{0} and ζ\zeta sufficiently close to ζ0\zeta_{0}, which produces a contradiction.

If the blow-up occurs exactly at the point of discontinuity ζ0=0\zeta_{0}=0, the argument above survives by constructing continuous extensions of D(U¯(ζ))D(\bar{U}(\zeta)) and R(U¯(ζ))R^{\prime}(\bar{U}(\zeta)), since their right limits still exist. Altogether, we have shown that there are no slow eigenvalues with nonzero imaginary part.

The subcase ω=0\omega=0. The (un)stable eigenvectors associated with the asymptotic subspaces are real, and by (98) the subset {Y=0}\{Y=0\} is invariant when ω=0\omega=0; hence, we are able to focus on the real one-dimensional problem given by the first equation in (98)

X˙\displaystyle\dot{X} =f(X,U¯;μ):=R(U¯)μcX+X2D(U¯).\displaystyle=f(X,\bar{U};\mu):=R^{\prime}(\bar{U})-\mu-cX+X^{2}D(\bar{U}). (103)

By Remark 6.5, μ=0\mu=0 is a slow eigenvalue. Now suppose μ>0\mu>0. Noting that f/μ<0\partial f/\partial\mu<0 and that u^/μ<0\partial\hat{u}^{-}/\partial\mu<0 from (97), any solution X(ζ,λ)X(\zeta,\lambda) which tends to u^(λ)\hat{u}^{-}(\lambda) as ζ\zeta\to-\infty satisfies the comparison

X(ζ,μ)X(ζ,μ)\displaystyle X(\zeta,\mu^{\prime})\leq X(\zeta,\mu) (104)

for all ζ\zeta\in\mathbb{R} whenever 0<μμ0<\mu\leq\mu^{\prime} (note that this comparison is preserved across the jump). Each such solution for any given μ>0\mu>0 has a common upper bound, given by the solution for λ=μ=0\lambda=\mu=0, which does not blow up by Hypothesis 2.8. We also have s^+/μ>0\partial\hat{s}^{+}/\partial\mu>0 for μ>0\mu>0 from (97); thus, s^+(μ)\hat{s}^{+}(\mu) lies on the other side of the connection for μ=0\mu=0 (relative to the solution X(ζ,μ)X(\zeta,\mu)) . As long as we can show that the solution for μ>0\mu>0 does not blow up (potentially allowing X(ζ,μ)X(\zeta,\mu) to connect to s+s^{+} by ‘looping around’ the Riemann sphere), it is impossible for the solution X(ζ,λ)X(\zeta,\lambda) to approach s^+(μ)\hat{s}^{+}(\mu) as ζ\zeta\to\infty, and we will be done.

According to the comparison to the common upper bound stated above, the solution X(ζ,λ)X(\zeta,\lambda) can only possibly blow up in the direction XX\to-\infty. This does not happen since for each μ>0\mu>0, a specific lower bound is given by Gronwall’s inequality and a comparison to the system X˙=min0U¯1R(U¯)μcX\dot{X}=\min_{0\leq\bar{U}\leq 1}R^{\prime}(\bar{U})-\mu-cX, whose solutions do not blow up. \Box

The existence of λ1\lambda_{1}. We now verify the remaining statements in Theorem 8.1. It is a straightforward task to numerically integrate the one-dimensional projection of the system (98) along the invariant subspace of real solutions {Y=0}\{Y=0\}. The solution can blow up by leaving the chart given by S=P/VS=P/V, V0V\neq 0. If this happens, the system (100) can be numerically integrated without issue on the chart T=V/PT=V/P, P0P\neq 0, around a small interval surrounding the pole, and then we can return to the original chart.

(a)Refer to caption (b)

Figure 8: Numerical integration of (98) along {Y=0}\{Y=0\} for (a) λ=0.1\lambda=0.1, (b) λ=λ0=0\lambda=\lambda_{0}=0, (c) λ=0.1\lambda=-0.1, (d) λ=0.3\lambda=-0.3, (e) λ=0.8\lambda=-0.8, (f) λ=λ10.80925\lambda=\lambda_{1}\approx-0.80925, (g) λ=0.85\lambda=-0.85. Red resp. black dots: (projectivisation of) the unstable resp. stable eigenvectors of the saddle-point at U¯=1\bar{U}=1. Magenta segments: concatenated fast jumps defined by the map (90).

(c)[Uncaptioned image] (d)[Uncaptioned image] (e)[Uncaptioned image] (f)[Uncaptioned image] (g)[Uncaptioned image]

These numerics are depicted in Fig. 8 for seven values of λ\lambda within the range [0.85,0.1][-0.85,0.1]. As shown in Fig. 8(a)–(c) and (e)–(g), a connection is formed between the unstable eigendirection of the saddle-point at U¯=0\bar{U}=0 and the stable eigendirection of the saddle-point at U¯=1\bar{U}=1 for the distinguished values λ=λ0=0\lambda=\lambda_{0}=0 and λ10.80925\lambda_{1}\approx-0.80925. Simultaneously, we observe the winds that are necessarily generated upon crossing these eigenvalues. These winds can be continued in the parameter λ\lambda—such coordinate singularities can of course be characterised as zeroes of the flow with respect to a suitably chosen chart. They propagate to the left and are preserved across the jump, as is shown in Fig. 8(c)–(d).

The simplicity of the eigenvalues in the reduced problem. We have now determined the existence of eigenfunctions as solutions to the Riccati formulation (103) of the linearised problem (47). Recall that the jump conditions for XX are given in Def. 6.4, and boundary conditions for XX given in (97), i.e.

limζ±X=u^(μ),s^+(μ).\lim_{\zeta\to\pm\infty}X=\hat{u}^{-}(\mu),\hat{s}^{+}(\mu).

In terms of the original reduced problem (47),

this means that we have a value λ=μ\lambda=\mu, and a solution to (47) satisfying the boundary conditions

limζ±P,V=0,\lim_{\zeta\to\pm\infty}P,V=0,

and appropriate jump conditions.

Differentiating the equation for VV with respect to ζ\zeta, and substituting in for P˙\dot{P}, and multiplying through by

g(ζ):=exp(ζ(log(D(U¯)))s+cds)=ecζD(U¯)g(\zeta):=\exp\left(-\int^{\zeta}(\log(D(\bar{U})))_{s}+cds\right)=\frac{e^{-c\zeta}}{D(\bar{U})}

we are led to the ‘Sturm-Liouville’ form of the reduced problem on the slow manifold:

ecζddζ(ecζD(U¯)ddζV)+Q~(ζ)V=μV.\displaystyle e^{c\zeta}\frac{d}{d\zeta}\left(\frac{e^{-c\zeta}}{D(\bar{U})}\frac{d}{d\zeta}V\right)+\tilde{Q}(\zeta)V=\mu V. (105)

where Q~(ζ)\tilde{Q}(\zeta) is defined as:

Q~(ζ):=R(U¯)+cD(U¯)U¯ζD(U¯)2\tilde{Q}(\zeta):=R^{\prime}(\bar{U})+\frac{cD^{\prime}(\bar{U})\bar{U}_{\zeta}}{D(\bar{U})^{2}}

Letting LL be the symmetric linear operator :

L[y]:=ecζddζ(ecζD(U¯)dydζ)+Q~(ζ)yL[y]:=e^{c\zeta}\frac{d}{d\zeta}\left(\frac{e^{-c\zeta}}{D(\bar{U})}\frac{dy}{d\zeta}\right)+\tilde{Q}(\zeta)y

we are interested in establishing the simplicity of the eigenvalues of such an operator where U(ζ)U(\zeta) is given as the solution on the slow manifold with the appropriate boundary conditions, and jump at ζ=0\zeta=0.

Remark 8.3.

As D(U¯)D(\bar{U}) is discontinuous at ζ=0\zeta=0, the domain of the linear operator LL is a function space that will necessarily incorporate a jump condition which is compatible with the one from Def 6.4. Further, the weighting function ecζe^{c\zeta} will define the inner product space on which LL is symmetric and well-defined. Here we only aim to establish simplicity of any eigenvalues/eigenfunctions, which a priori exist. \triangle

First we note that because LL is symmetric on eigenfunctions, the Fredholm alternative means that the geometric multiplicity of any eigenvalues must be the same as the algebraic multiplicity, and in particular if (Lμ)[y1]=0(L-\mu)[y_{1}]=0, there can be no non-zero solutions to (Lμ)[y]=y1(L-\mu)[y]=y_{1}.

Now, suppose that v1v_{1} and v2v_{2} are eigenfunctions of LL with the same eigenvalue μ\mu, so L[vj]=μvjL[v_{j}]=\mu v_{j}. On the one hand, we have

v1L[v2]v2L[v1]=μ(v1v2v2v1)=0,\displaystyle v_{1}L[v_{2}]-v_{2}L[v_{1}]=\mu(v_{1}v_{2}-v_{2}v_{1})=0, (106)

while on the other we have

v1L[v2]v2L[v1]\displaystyle v_{1}L[v_{2}]-v_{2}L[v_{1}] =ecζddζ(g(ζ)(v1dv2dζdv1dζv2)).\displaystyle=e^{c\zeta}\frac{d}{d\zeta}\left(g(\zeta)(v_{1}\frac{dv_{2}}{d\zeta}-\frac{dv_{1}}{d\zeta}v_{2})\right). (107)

Thus, the quantity

g(ζ)(v1dv2dζdv1dζv2)=:ecζD(U¯)W(v1,v2)\displaystyle g(\zeta)(v_{1}\frac{dv_{2}}{d\zeta}-\frac{dv_{1}}{d\zeta}v_{2})=:\frac{e^{-c\zeta}}{D(\bar{U})}W(v_{1},v_{2}) (108)

is a constant. Evaluating at ζ=+\zeta=+\infty we have

limζ+ecζD(U¯)W(v1,v2)=0.\lim_{\zeta\to+\infty}\frac{e^{-c\zeta}}{D(\bar{U})}W(v_{1},v_{2})=0.

We observe that

v1,2(ζ)e12(c+c2+4D(0)(μR(0)))ζv_{1,2}(\zeta)\sim e^{\frac{1}{2}\left(c+\sqrt{c^{2}+4D(0)(\mu-R^{\prime}(0))}\right)\zeta}

as ζ<0\zeta<0 grows very large in magnitude, and so in particular

limζecζD(U¯)W(v1,v2)=0\lim_{\zeta\to-\infty}\frac{e^{-c\zeta}}{D(\bar{U})}W(v_{1},v_{2})=0

when μ>R(0)\mu>R^{\prime}(0). Thus ecζD(U¯)W(v1,v2)0\frac{e^{-c\zeta}}{D(\bar{U})}W(v_{1},v_{2})\equiv 0 on both sides of the jump and hence we have that W(v1,v2)0W(v_{1},v_{2})\equiv 0. Since the Wronskian of the two eigenfunctions is identically zero, they must be linearly dependent. We conclude that the slow eigenvalues λ0\lambda_{0} and λ1\lambda_{1} are both simple. \Box

Refer to caption
Figure 9: Evaluation of the Riccati-Evans function (109) along circles CzC_{z} of radius 0.03 centered at the point zz\in\mathbb{C}. The winding numbers along each contour are given in the legend and are also depicted by the arrowheads. Note that λ0\lambda_{0} and λ1\lambda_{1} denote the two eigenvalues and λp=0.08\lambda_{p}=-0.08 is the approximate location of the pole of EΣ(λ)E^{\Sigma}(\lambda).

We supplement our proofs with a numerical demonstration of the existence of the simple eigenvalues λ0,λ1\lambda_{0},\,\lambda_{1}, by means of a Riccati-Evans function. Define the cross section Σ={U¯=0.95}.\Sigma=\{\bar{U}=0.95\}. The Riccati-Evans function associated with the Riccati equation (98) on the coordinate chart {V0}\{V\neq 0\} is

EΣ(λ)\displaystyle E^{\Sigma}(\lambda) :=s1Σ(λ)u0Σ(λ),\displaystyle:=s^{\Sigma}_{1}(\lambda)-u^{\Sigma}_{0}(\lambda), (109)

where s1Σ(λ)s^{\Sigma}_{1}(\lambda) denotes the (first) intersection of the unique nontrivial solution s1(ζ,λ)s_{1}(\zeta,\lambda) of (98) which converges asymptotically to the stable eigenvector of the saddle point at U¯=1\bar{U}=1 as ζ+\zeta\to+\infty, and similarly for u0Σ(λ)u^{\Sigma}_{0}(\lambda), which connects to the unstable eigendirection of the saddle point as ζ\zeta\to-\infty. Compare our definition to the general construction in [10].

We highlight a few key points about our function (109). The function is meromorphic (hence satisfying the argument principle), and it vanishes on eigenvalues λ\lambda. Its zeroes are intrinsic, depending neither on the choice of section nor on the choice of chart. On the other hand, as we can anticipate from the dynamics depicted in Fig. 8, the poles arise for values of λ\lambda where the solution leaves the chart by winding. In other words, the poles are an artifact of the choice of coordinate chart as well as the choice of section, and can usually be moved (or even removed entirely) by a judicious selection of the chart and the section (with our choice of Σ\Sigma, we locate a pole near λp=0.08\lambda_{p}=-0.08). See [10] for a general exposition.

In this analysis we find it instructive to work with the ‘naive’ chart S=P/VS=P/V, V0V\neq 0, for all values of λ\lambda—the dimensionality of the slow eigenvalue problem is low enough that we can easily demonstrate the utility of the argument principle. As shown in Fig. 9, the multiplicity of each eigenvalue can be readily computed by evaluating the corresponding winding number W(EΣ,C)W(E^{\Sigma},C) along simple closed contours CC. Contours which surround poles, corresponding to blow-up of solutions off the chosen coordinate chart, will map under EE to contours with a negative winding number (corresponding to a clockwise orientation). Here we demonstrate that the poles are also simple. This completes our numerical verification of Theorem 8.1.

9   Concluding remarks

We have given a complete characterisation of the spectral stability problem for shock-fronted travelling waves of the regularized system (1). But other types of high-order regularization terms can be applied to the underlying system that exhibits shock solutions; it can further be shown that these nonequivalent regularizations pick out distinct one-parameter families of shock-fronted travelling waves, which limit to singular solutions satisfying different rules.

Consider the following system which applies to both a viscous relaxation and a fourth-order nonlocal regularization, where the new parameter a0a\geq 0 characterises the relative weighting of the two regularizations.

U¯t\displaystyle\frac{\partial\bar{U}}{\partial t} =x(D(U¯)U¯x)+R(U¯)+εa3U¯x2tε24U¯x4.\displaystyle=\frac{\partial}{\partial x}\left(D(\bar{U})\frac{\partial\bar{U}}{\partial x}\right)+R(\bar{U})+\varepsilon a\frac{\partial^{3}\bar{U}}{\partial x^{2}\partial t}-\varepsilon^{2}\frac{\partial^{4}\bar{U}}{\partial x^{4}}. (110)

Setting a=0a=0 recovers a regularized system with ‘purely’ nonlocal regularization. In this case, it can be shown that the corresponding one-parameter family of travelling waves solutions has a singular limit as a shock-fronted travelling wave which satisfies a so-called ‘equal area rule’– the shock connection across disjoint branches of the potential function F(U¯)F(\bar{U}) is selected so that the area bounded above and below the shock height are exactly balanced. Recent work [4] has shown that families of shock-fronted waves persist robustly for a>0a>0, with each such family satisfying a generalized equal area rule in the singular limit. Furthermore, there is a finite value a=aV>0a=a_{V}>0 for which the shock connection is formed at the fold, again resulting in viscous-type shocks. The problem in this paper can be thought of as the regularization in the (scaled) limit as aa\to\infty.

Refer to caption
Figure 10: Plot of the essential spectrum (shaded regions) of the of the linearised operator L in (112), which shows Ω\Omega and the 𝒜i\mathcal{A}_{i} partitioning the complex plane via the Fredholm borders (continuous spectrum). The shape of the essential spectrum means that the operator LL, for non-zero ε\varepsilon is sectorial. The dashed line (red online) is one of the Fredholm borders coming from the linearisation of the reduced problem about the (constant) steady-state U¯\bar{U}^{-} c.f. the right figure in Fig 4. For the figure, ε=0.1\varepsilon=0.1 and c=a=1c=a=1, with DD and RR as in (3) and (4).

Thus, let a>0a>0 be given and assume the existence of such a family of travelling waves for (110). As before, we can find such waves as standing wave solutions U¯(z,t)U¯(z)\bar{U}(z,t)\equiv\bar{U}(z) to:

(U¯εaU¯zz)t=ε2U¯zzzzεacU¯zzz+(F(U¯))zz+cU¯z+R(U¯).\displaystyle(\bar{U}-\varepsilon a\bar{U}_{zz})_{t}=-\varepsilon^{2}\bar{U}_{zzzz}-\varepsilon ac\bar{U}_{zzz}+(F(\bar{U}))_{zz}+c\bar{U}_{z}+R(\bar{U}). (111)

We now discuss how the stability problem changes in this more general case. Linearising about a standing wave solution U¯(z)\bar{U}(z) to (111) leads to the eigenvalue problem

λpεaλpzz=ε2pzzzzεacpzzz+(D(U¯)p)zz+cpz+R(U¯)p.\displaystyle\lambda p-\varepsilon a\lambda p_{zz}=-\varepsilon^{2}p_{zzzz}-\varepsilon acp_{zzz}+(D(\bar{U})p)_{zz}+cp_{z}+R^{\prime}(\bar{U})p. (112)

Defining the variables

s\displaystyle s :=ε2pzzz+εacpzz((D(U)+εaλ)p)zcp\displaystyle:=\varepsilon^{2}p_{zzz}+\varepsilon acp_{zz}-((D(U)+\varepsilon a\lambda)p)_{z}-cp (113)
r\displaystyle r :=ε2pzz+εacpz(D(U)+εaλ)p\displaystyle:=\varepsilon^{2}p_{zz}+\varepsilon acp_{z}-(D(U)+\varepsilon a\lambda)p (114)
q\displaystyle q =εpz+acp,\displaystyle=\varepsilon p_{z}+acp, (115)

we can write the closed system

(εpεqrs)z\displaystyle\begin{pmatrix}\varepsilon p\\ \varepsilon q\\ r\\ s\end{pmatrix}_{z} =(ac100D(U¯)+εaλ010c001R(U¯)λ000)(pqrs),\displaystyle=\begin{pmatrix}-ac&1&0&0\\ D(\bar{U})+\varepsilon a\lambda&0&1&0\\ c&0&0&1\\ R^{\prime}(\bar{U})-\lambda&0&0&0\end{pmatrix}\begin{pmatrix}p\\ q\\ r\\ s\end{pmatrix}, (116)

with two fast and two slow variables. The calculations for the essential (and absolute) spectrum follow the same ideas as in Section 5. Presuming the existence of a wave U¯\bar{U} which exponentially approaches its end states U¯±\bar{U}^{\pm}, we have the dispersion relations are the pair of parametrised equations given by kk\in\mathbb{R}:

λ±=ε2k4D(U¯±)k2+R(U¯±)1+aεk2+ick.\lambda_{\pm}=\frac{-\varepsilon^{2}k^{4}-D(\bar{U}^{\pm})k^{2}+R^{\prime}(\bar{U}^{\pm})}{1+a\varepsilon k^{2}}+ick.

These are a pair of curves in the complex plane which are always opening leftward and intersecting the real axis at the points (R(U¯±),0)(R^{\prime}(\bar{U}^{\pm}),0) in the left half plane. The dispersion relations again form the Fredholm borders and make up the continuous spectrum. As before, \mathbb{C} is partitioned into five regions: the region Ω\Omega which contains the right half plane, together with 𝒜j\mathcal{A}_{j} for j=1,2,3,4j=1,2,3,4. The essential spectrum in this case can again be determined by considering the signatures of the asymptotic matrices, and can be seen to consist of the regions 𝒜1,2,3\mathcal{A}_{1,2,3}, which is the region “between” the Fredholm borders. We have

Region sgn(A)\textrm{sgn}(A^{-}) sgn(A+)\textrm{sgn}(A^{+})
Ω\Omega (,,++)(-,-,++) (,,+,+)(-,-,+,+)
𝒜1\mathcal{A}_{1} (,,,+)(-,-,-,+) (,,+,+)(-,-,+,+)
𝒜2\mathcal{A}_{2} (,,++)(-,-,++) (,,,+)(-,-,-,+)
𝒜3\mathcal{A}_{3} (,,+,+)(-,-,+,+) (,,,+)(-,-,-,+)
𝒜4\mathcal{A}_{4} (,,,+)(-,-,-,+) (,,,+)(-,-,-,+)

Again, the Fredholm borders will be close to the Fredholm borders of the linearised reduced problem, linearised about the steady-states corresponding to the end states of the unperturbed problem. Eventually, as with the third order perturbation problem, the higher order modes in the dispersion relations will dominate, and the essential spectrum will diverge from the appropriate continuous spectrum of the unperturbed problem (see Fig: 10). In contrast to the case of ‘pure’ viscous relaxation, however, the Fredholm borders do not asymptote to vertical lines in the complex plane and so the operator is sectorial, and spectral stability in this case would indeed imply linearised stability of the perturbation problem.

Having shown that the essential spectrum remains remarkably well-behaved in this more general context, let us now highlight key differences (and difficulties) in calculating the point spectrum. Here, the problem is now four-dimensional, and it can be verified with direct calculation that the two-dimensional slow manifolds on which lie saddle-type equilibria at U¯=0\bar{U}=0 and U¯=1\bar{U}=1 are now saddle-type (i.e. there is now both a fast stable and fast unstable direction). The augmented unstable bundle construction for ε>0\varepsilon>0 now defines a complex 2-plane bundle over the sphere.

It is natural to ask whether the present problem is amenable to the splitting techniques introduced in [8]: namely, is it possible to decompose this 2-plane bundle into ‘fast unstable’ and ‘slow unstable’ line bundles which are controlled by separated reduced eigenvalue problems?

We conjecture that such a separation is not possible: as in the ‘pure’ viscous case, it can be directly calculated that the eigenvalue problem again introduces the eigenvalue parameter λ\lambda only ‘weakly,’ i.e. through 𝒪(ελ)\mathcal{O}(\varepsilon\lambda) terms. Thus, the reduced fast eigenvalue problem again degenerates. As mentioned in the introduction, this poses issues for the construction of a fast elephant trunk over the entire wave; consequently, it does not appear to be feasible with the present techniques to uniformly separate a fast unstable line bundle from a slow unstable one. Furthermore, any such reduced line bundle over the fast layer is governed by the layer flow, which connects an unstable eigendirection to a stable one in the singular limit (i.e. we are not able to construct an augmented unstable line bundle for the reduced problem).

Nonetheless, we assert that estimates similar to Lemma 7.7 can be brought to bear to regain control of the unstable 2-plane bundle as we track it across the fast shock layer. It would be of interest to write down sharper (i.e. exponential closeness) estimates to the invariant manifold near the slow subbundle, as discussed in Remark 7.8. The calculation in Appendix D gives a concrete example of such an exponential closeness result in an elementary problem. Such calculations appear to be very cumbersome in the general case. It would also be interesting to situate the construction of such invariant manifolds within the context of general invariant manifold results in nonautonomous dynamical systems (see e.g. [19]). These are topics of ongoing work.

Acknowledgement

Both authors acknowledge support from the Australian Research Council Discovery Project grant DP200102130. The authors thank Martin Wechselberger at the University of Sydney for several insightful discussions. The first author thanks Martin Rasmussen at Imperial College London for a clarifying discussion about invariant manifolds of nonautonomous dynamical systems.

Appendix A Construction of the fast and slow elephant trunks

Proof of Lemma 7.4: We follow the strategy to the proof of Lemma 4.2 in [8]. The goal is to verify the following four conditions for the projectivization of the linearized system (26)\eqref{eq:visclinfast}:

  • ((4.6)GJ(4.6)_{GJ} in Sec. IV of [8]): There exists α>0\alpha>0 independent of ε\varepsilon and ξI\xi\in I so that the curve of critical points β0(γ,ε)\beta_{0}(\gamma,\varepsilon) associated with the frozen family (71) satisfies

    Reσ[Gβ(β0(γ,ε),γ,ε)]<α;\text{Re}\,\sigma[G_{\beta}(\beta_{0}(\gamma,\varepsilon),\gamma,\varepsilon)]<-\alpha;
  • ((4.7)GJ(4.7)_{GJ} in Sec. IV of [8]): for any d>0d>0 there exists ε0=ε0(d)\varepsilon_{0}=\varepsilon_{0}(d) and a nested family of subintervals I(d)II(d)\subset I (i.e. I(d1)I(d2)I(d_{1})\subset I(d_{2}) for d1<d2d_{1}<d_{2}) with

    sup(β,ξ,ε)C{|G|ξ,Gβ,ε,|β0ξ|}<d,\text{sup}_{(\beta,\xi,\varepsilon)\in C}\{|G|_{\xi},||G_{\beta,\varepsilon}||,|\beta_{0\xi}|\}<d,

    where the set CC can be taken as

    C={(β,ξ,ε):|ββ0(ξ,ε)|<c0,ξI(d),0<εε0}C=\{(\beta,\xi,\varepsilon):|\beta-\beta_{0}(\xi,\varepsilon)|<c_{0},\xi\in I(d),0<\varepsilon\leq\varepsilon_{0}\}

    for some c0>0c_{0}>0;

  • ((4.8)GJ(4.8)_{GJ} in Sec. IV of [8]): we have

    K=supCD2G(β,ξ,ε)<; andK=\text{sup}_{C}||D^{2}G(\beta,\xi,\varepsilon)||<\infty;\text{~{}and}
  • ((4.9)GJ(4.9)_{GJ} in Sec. IV of [8]): let A(γ):=Gβ(β0(γ,ε),γ,ε)A(\gamma):=G_{\beta}(\beta_{0}(\gamma,\varepsilon),\gamma,\varepsilon). Then there exists some invertible matrix Y(γ)Y(\gamma) depending smoothly on γ\gamma, such that the following holds for some a>0a>0 depending only on α\alpha (from condition (4.6)GJ(4.6)_{GJ}):

    Re(A(γ)β,β)γ\displaystyle\text{Re}(A(\gamma)\beta,\beta)_{\gamma} <a|β|γ2,\displaystyle<-a|\beta|_{\gamma}^{2},
    Y(γ)γ\displaystyle||Y(\gamma)||_{\gamma} =1,\displaystyle=1,
    Yγ(γ)γ\displaystyle||Y_{\gamma}(\gamma)||_{\gamma} d,\displaystyle\leq d,

    where we define the inner product (β1,β2)γ:=Y(γ)β1Y(γ)β2¯(\beta_{1},\beta_{2})_{\gamma}:=Y(\gamma)\beta_{1}\cdot\overline{Y(\gamma)\beta_{2}} and the norm |β|γ=(β,β)γ1/2|\beta|_{\gamma}=(\beta,\beta)^{1/2}_{\gamma}.

Lemma 4.1 in [8] then gives the result. By Lemma 4.2, there exists ξ¯>0\bar{\xi}>0 such that the second bullet point in the Lemma above holds for all |ξ|ξ¯|\xi|\geq\bar{\xi} and all sufficiently small ε>0\varepsilon>0. This verifies the condition (4.6)GJ(4.6)_{GJ}. Furthermore, the branch of equilibria β0(ξ,λ,ε)\beta_{0}(\xi,\lambda,\varepsilon) for the corresponding frozen family is uniformly bounded for all |ξ|>ξ¯|\xi|>\bar{\xi} and all λK\lambda\in K, with KΩK\subset\Omega a fixed contour. Hence condition (4.8)GJ(4.8)_{GJ} holds for some constant KK, with c0c_{0} in the definition equal to 1.

Let us check the remaining conditions (4.7)GJ(4.7)_{GJ} and (4.9)GJ(4.9)_{GJ}. Take the interval II to be either I={ξ:<ξ<ξ¯}I_{-}=\{\xi\in\mathbb{R}:-\infty<\xi<-\bar{\xi}\} or I+={ξ:ξ¯<ξ<+}I_{+}=\{\xi\in\mathbb{R}:\bar{\xi}<\xi<+\infty\}. On either interval, the projectivized vector field G(β,ξ,λ,ε)G(\beta,\xi,\lambda,\varepsilon) depends on ξ\xi through components of the (nonautonomous) linearisation matrix, in particular through the diffusion and reaction terms (i.e. through u¯\bar{u}). Hence for each constant L>0L>0 there exists ε0:=ε0(L)\varepsilon_{0}:=\varepsilon_{0}(L), so that for 0<εε00<\varepsilon\leq\varepsilon_{0} we have

max{|Gξ|,Gξ,β,|β0,ξ|}<Lmax{|u(ξ,ε)|}.\max\{|G_{\xi}|,||G_{\xi,\beta}||,|\beta_{0,\xi}|\}<L\max\{|u^{\prime}(\xi,\varepsilon)|\}.

Now let n(δ)n(\delta) denote a δ\delta-tube around the singular limit of the travelling wave and assume that I=II=I_{-}. Resetting ε0\varepsilon_{0} again if necessary, we can assume that the travelling wave x(ξ,ε)x(\xi,\varepsilon) lies within n(δ)n(\delta) for each ξξL(δ)\xi\leq\xi_{L}(\delta) . Then there exists a constant M>0M>0 so that |u¯(ξ,ε)|Mδ|\bar{u}^{\prime}(\xi,\varepsilon)|\leq M\delta for all ξξL(δ)\xi\leq\xi_{L}(\delta), since u¯=v¯Φ(u¯)\bar{u}^{\prime}=\bar{v}-\Phi(\bar{u}) is uniformly bounded along the entire wave. Hence we may take δ(d)=d/M\delta(d)=d/M and define

I(d)={ξ:ξL(δ(d))}.I_{-}(d)=\{\xi:\xi_{L}(\delta(d))\}.

Note that δ(d)\delta(d) depends smoothly on dd and that δ(0)=0\delta(0)=0, and furthermore that the definition for I(d)I_{-}(d) provides nested intervals. This verifies condition (4.7)GJ(4.7)_{GJ}.

To verify the remaining condition (4.9)GJ(4.9)_{GJ}, it is enough to note that the linearization A(ξ,λ,ε)=A0(ξ,λ,ε)+𝒪(δ)A(\xi,\lambda,\varepsilon)=A_{0}(\xi,\lambda,\varepsilon)+\mathcal{O}(\delta) of GG (the projectivization of the linear system), evaluated along the strong unstable eigendirection, is a 2×22\times 2 matrix which has two 𝒪(1)\mathcal{O}(1) negative eigenvalues. This matrix A(ξ,λ,ε)A(\xi,\lambda,\varepsilon), is necessarily negative definite for IL(d)I_{L}(d), and hence we can take Y=A01Y=A_{0}^{-1} such that Y1AYY^{-1}AY produces a matrix with diagonal entries of uniformly negative real part and remaining entries of 𝒪(δ)\mathcal{O}(\delta). This immediately implies the needed inequalities. Applying Lemma 4.1 in [8], we have constructed the necessary elephant trunk Ωf\Omega_{-}^{f} over Sεa,S^{a,-}_{\varepsilon}.

The preceding discussion applies identically for the case I=I+I=I_{+} to produce a fast elephant trunk along Sεa,+S^{a,+}_{\varepsilon}. \Box

Appendix B Lemmas for estimates near the slow subbundle

Lemma B.1.

Let δ,a>0\delta,\,a>0 be fixed. There exists ε¯>0\bar{\varepsilon}>0 sufficiently small and T>0T>0, both depending only on δ\delta and aa, so that for each 0<εε¯0<\varepsilon\leq\bar{\varepsilon} and |γ|a/ε|\gamma|\geq a/\varepsilon, each solution y^(s)\hat{y}(s) of (79) must satisfy at least one of the following:

  • (ii)

    y^(T)Nδ(f^1(γ,λ,ε))\hat{y}(-T)\in N_{\delta}(\hat{f}_{1}(\gamma,\lambda,\varepsilon)).

    Remark B.2.

    Fix some γ\gamma as above and let y^(s)\hat{y}(s) denote a solution to the corresponding member of the family of projectivized frozen systems in (79), and let y(s)π1(y^(s))y(s)\in\pi^{-1}(\hat{y}(s)) be chosen so that |y(0)|=1|y(0)|_{\infty}=1. The corresponding frozen linearized system in (79) is autonomous and linear, and thus its solution can be written explicitly as a linear combination of eigenvectors

    y(s)\displaystyle y(s) =g1f1esμ1,s+g2f2esμ2,s+gfffesμf.\displaystyle=g_{1}f_{1}e^{s\mu_{1,s}}+g_{2}f_{2}e^{s\mu_{2,s}}+g_{f}f_{f}e^{s\mu_{f}}. (123)

    It thus follows from invariance of (generalized) eigenspaces that if (i) in Lemma B.1 above does not hold for some fixed γ\gamma, then gf0g_{f}\neq 0, and so there is some T>0T>0 so that (ii) holds. The point of this lemma is to show that for all sufficiently small ε\varepsilon, such a T>0T>0 exists which can be chosen depending only on δ\delta and aa. \triangle

    Proof of Lemma B.1: Assume that item (i) does not hold for some γ\gamma as given in the Lemma, so that there exists TT so that item (ii) holds for that value of γ\gamma. Using the normalization of y(s)y(s) in Remark B.2, we note that |gi|1|g_{i}|\leq 1. By Lemma 3.1, there exists some K>0K>0, depending only on the metric ρ\rho, so that

    |gf|Kδ.\displaystyle|g_{f}|\geq K\delta. (124)

    From (123) we may write

    y(T)\displaystyle y(-T) =eμfT(gfef+R(T)),\displaystyle=e^{-\mu_{f}T}(g_{f}e_{f}+R(T)), (125)

    where

    R(T)\displaystyle R(T) =g1f1eT(μ1,sμf)+g2f2eT(μ2,sμf).\displaystyle=g_{1}f_{1}e^{-T(\mu_{1,s}-\mu_{f})}+g_{2}f_{2}e^{-T(\mu_{2,s}-\mu_{f})}.

    At this stage we remind the reader that the μi\mu_{i} are still γ\gamma- and ε\varepsilon-dependent. We now apply the asymptotic estimate in Lemma 4.2 to find ε¯0\bar{\varepsilon}\geq 0, and α>0\alpha>0 depending only on ε¯\bar{\varepsilon} and aa, so that for each ε(0,ε¯]\varepsilon\in(0,\bar{\varepsilon}] and |γ|a/ε|\gamma|\geq a/\varepsilon, we extract the uniform bound min{Re(μ1,sμf),Re(μ2,sμf)}>α\min\{\text{Re}(\mu_{1,s}-\mu_{f}),\,\text{Re}(\mu_{2,s}-\mu_{f})\}>\alpha. It thus follows from the triangle inequality and the estimates |gi|1,|fi|=1|g_{i}|\leq 1,\,|f_{i}|=1 that

    |R(T)|\displaystyle|R(T)| <2eαT.\displaystyle<2e^{-\alpha T}.

    For each T>TT^{\prime}>T, we have |R(T)|<|R(T)||R(T^{\prime})|<|R(T)|. This estimate and (124) applied to the factored form (125) then implies that

    y^(T)Nδ(f^f(γ,λ,ε))\displaystyle\hat{y}(-T)\in N_{\delta}(\hat{f}_{f}(\gamma,\lambda,\varepsilon))

    for TT depending only on α,δ,a,and ρ\alpha,\,\delta,\,a,\,\text{and }\rho. \Box

    We now turn to the dynamics on the slow timescale. Our objective here is to strengthen the estimate in Lemma B.1 slightly, so that a(ε)aa(\varepsilon)\equiv a can be chosen independently of ε>0\varepsilon>0 such that the solution E^\hat{E} remains uniformly near the slow subbundle for |ζ|a|\zeta|\geq a. This will be crucial in making the comparison to the linearized reduced flow defined on Sεa,,Sεa,+S^{a,-}_{\varepsilon},S^{a,+}_{\varepsilon}.

    From the construction in Lemmas 7.4 and 7.5, there exists δ1>0\delta_{1}>0 such that for each sufficiently small ε\varepsilon,

    Nδ1(f^f(ξ,λ,ε))Ω^f(ξ)\displaystyle N_{\delta_{1}}(\hat{f}_{f}(\xi,\lambda,\varepsilon))\subset\hat{\Omega}_{f}(\xi)  for ξa/ε and\displaystyle\text{~{}for~{}}\xi\leq-a/\varepsilon\text{~{}and}
    Nδ1(σ^s(ξ,λ,ε))Ω^s(ξ)\displaystyle N_{\delta_{1}}(\hat{\sigma}_{s}(\xi,\lambda,\varepsilon))\subset\hat{\Omega}_{s}(\xi)  for ξa/ε,\displaystyle\text{~{}for~{}}\xi\geq a/\varepsilon,

    where Ω^f(ξ)\hat{\Omega}_{f}(\xi) and Ω^s(ξ)\hat{\Omega}_{s}(\xi) denote the projectivizations of the slices of the corresponding elephant trunks within 2×{ξ}\mathbb{CP}^{2}\times\{\xi\}.

    Lemma B.3.

    Set δ>0\delta>0 with δ<δ1\delta<\delta_{1}, with δ1\delta_{1} as above, and fix a>0a>0. Suppose that Y^\hat{Y} is a solution to the projectivization of the slow linearized equations

    Y^˙\displaystyle\dot{\hat{Y}} =A^(Y^,ζ,λ,ε)\displaystyle=\hat{A}(\hat{Y},\zeta,\lambda,\varepsilon) (126)

    so that for each ε\varepsilon sufficiently small, there exists A(ε)>aA(\varepsilon)>a with the property that

    Y^(ζ,λ,ε)Nδ(Σs(ζ,λ,ε)).\displaystyle\hat{Y}(\zeta,\lambda,\varepsilon)\in N_{\delta}(\Sigma_{s}(\zeta,\lambda,\varepsilon)). (127)

    Then there exists ε¯>0\bar{\varepsilon}>0 such that for each 0<ε<ε¯0<\varepsilon<\bar{\varepsilon} and |ζ|a|\zeta|\geq a we have

    Y^(ζ,λ,ε)Nδ(Σs(ζ,λ,ε)).\displaystyle\hat{Y}(\zeta,\lambda,\varepsilon)\in N_{\delta}(\Sigma_{s}(\zeta,\lambda,\varepsilon)). (128)

    Proof of Lemma B.3:

    Our strategy will be to verify the uniform closeness estimate separately over the slow branches Sεa,S^{a,-}_{\varepsilon} and Sεa,+S^{a,+}_{\varepsilon}. The argument for Sεa,S^{a,-}_{\varepsilon} follows the indirect proof in [8] closely. With our modified exchange lemma in hand, the remaining closeness estimate over Sεa,+S^{a,+}_{\varepsilon} is direct.

    Suppose the lemma were false over Sεa,S^{a,-}_{\varepsilon}; then there would exist sequences ζn<a\zeta_{n}<-a and εn0\varepsilon_{n}\to 0 so that (128) fails to hold for all nn. By passing to subsequences, we can assume that the following sequences converge simultaneously:

    ζn\displaystyle\zeta_{n} ζ¯ where aζ¯<,\displaystyle\to\bar{\zeta}\text{ ~{}~{}~{}where~{}~{} }a\leq-\bar{\zeta}<\infty,
    X(ζn,εn)\displaystyle X(\zeta_{n},\varepsilon_{n}) x¯Sεa,, and\displaystyle\to\bar{x}~{}~{}~{}\in S^{a,-}_{\varepsilon},\text{ and}
    Y^(ζn,λ,εn)=Y^n\displaystyle\hat{Y}(\zeta_{n},\lambda,\varepsilon_{n})=\hat{Y}_{n} Y^2.\displaystyle\to\hat{Y}_{*}~{}~{}\in\mathbb{CP}^{2}.

    Let us first suppose that x¯Sεa,\bar{x}\in S^{a,-}_{\varepsilon}. After centering the linearized equations near ζ¯\bar{\zeta} by making the change of variables s=(ζ¯ζn)/εs=(\bar{\zeta}-\zeta_{n})/\varepsilon and γn=ζn/εn\gamma_{n}=\zeta_{n}/\varepsilon_{n}, we arrive at the following recentered equations on the fast timescale:

    dz^ds\displaystyle\frac{d\hat{z}}{ds} =a^(z^,γn+s,λ,εn)\displaystyle=\hat{a}(\hat{z},\gamma_{n}+s,\lambda,\varepsilon_{n})
    z^(0)\displaystyle\hat{z}(0) =Y^n.\displaystyle=\hat{Y}_{n}.

    Proceeding as before, we define an associated family of frozen systems. Let z^(s,λ,n)\hat{z}_{*}(s,\lambda,n) denote the solution of the corresponding frozen system

    dz^ds\displaystyle\frac{d\hat{z}}{ds} =a^(z^,γn,λ,εn)\displaystyle=\hat{a}(\hat{z}_{*},\gamma_{n},\lambda,\varepsilon_{n})
    z^(0,λ,n)\displaystyle\hat{z}_{*}(0,\lambda,n) =Y^n.\displaystyle=\hat{Y}_{n}.

    Now fix δ2>0\delta_{2}>0 such that δ<δ2<δ1\delta<\delta_{2}<\delta_{1}. By assumption, for each nn we have Y^nNδ(σ^s(γn,λ,εn))\hat{Y}_{n}\notin N_{\delta}(\hat{\sigma}_{s}(\gamma_{n},\lambda,\varepsilon_{n})), and so it must be true by Lemma B.1 that

    z^(γnT,λ,n)\displaystyle\hat{z}(\gamma_{n}-T,\lambda,n) Nδ2(f^f(γnT,λ,εn)).\displaystyle\in N_{\delta_{2}}(\hat{f}_{f}(\gamma_{n}-T,\lambda,\varepsilon_{n})).

    Since δ2<δ1\delta_{2}<\delta_{1}, y^\hat{y} must enter the fast elephant trunk about f^f\hat{f}_{f} over Sεa,S^{a,-}_{\varepsilon}, which contradicts the assumed behavior of y^\hat{y} as ζ\zeta\to-\infty.

    We now know that Y^(ζ,λ,ε)Nδ(Σ^(ζ,λ,ε))\hat{Y}(\zeta,\lambda,\varepsilon)\in N_{\delta}(\hat{\Sigma}(\zeta,\lambda,\varepsilon)) for each ζa\zeta\leq-a. Then set δ>0\delta>0 small enough that for each sufficiently small ε>0\varepsilon>0, Lemma 7.7 applies to Y^(ζ,λ,ε)\hat{Y}(\zeta,\lambda,\varepsilon) as the wave enters the vicinity of x¯:=X(a,ε)Sεa,+\bar{x}:=X(a,\varepsilon)\in S^{a,+}_{\varepsilon} from the fast layer. In particular, we have the asymptotics that Y^\hat{Y} lies 𝒪(ε)\mathcal{O}(\varepsilon)-close to Σ^\hat{\Sigma} at the time ζ=a\zeta=a, with respect to the Fubini-Study metric, and so for each sufficiently small ε>0\varepsilon>0, we have Y^Nδ(Σ^s(a,λ,ε))\hat{Y}\in N_{\delta}(\hat{\Sigma}_{s}(a,\lambda,\varepsilon)). Simultaneously, Y^\hat{Y} enters the slow elephant trunk over Sεa,+S^{a,+}_{\varepsilon} since δ<δ1\delta<\delta_{1}, and thus remains δ\delta-close to the slow subbundle for each ζ>a\zeta>a. This completes the proof. \Box

    Corollary B.4.

    Fix λ\lambda\in\mathbb{C} and a,δ>0a,\delta>0. Then there exists ε¯>0\bar{\varepsilon}>0 sufficiently small so that for each ε(0,ε¯]\varepsilon\in(0,\bar{\varepsilon}], the unique solution E^(ζ,λ)\hat{E}(\zeta,\lambda) of the projectivized linearized equations for which E^(ζ,λ)e^s,2\hat{E}(\zeta,\lambda)\to\hat{e}_{s,2}^{-} (the unstable eigenvector) as ζ\zeta\to-\infty, also satisfies

    E^(ζ,λ,ε)Nδ(Σ^s(ζ,λ,ε)).\displaystyle\hat{E}(\zeta,\lambda,\varepsilon)\in N_{\delta}(\hat{\Sigma}_{s}(\zeta,\lambda,\varepsilon)). (129)

    for |ζ|a|\zeta|\geq a.

  • Appendix C Proof of Lemma 7.7

    In this proof we focus on clarifying the essential steps in the estimate, omitting unwieldy calculations while noting that they can be traced from [17]. The key step is to write down a slightly modified version of the estimate given in Proposition 8 in [17]; namely, there exist positive constants B1,B2,B3B_{1},\,B_{2},\,B_{3}, and T0T_{0} depending only on the width Δ\Delta of the defining box (75), so that for all TT0T\geq T_{0} and for each qQTq\in Q_{T}, we have the upper bounds

    |dy(q(t))|\displaystyle|dy(q(t))| B1eαt\displaystyle\leq B_{1}e^{\alpha t} (130)
    |db(q(t))|\displaystyle|db(q(t))| B2e(ακ)t+B3ελeαt\displaystyle\leq B_{2}e^{(\alpha-\kappa)t}+B_{3}\varepsilon\lambda e^{\alpha t}

    for all t[0,T]t\in[0,T], where α>0\alpha>0 and 0<κ<|γ0|0<\kappa<|\gamma_{0}| are growth rate constants characterizing respectively slow growth versus fast exponential contraction (see Proposition 8 and Corollary 2 in [17]). We highlight the essential modification: whereas in the standard exchange lemma, the component db(t)db(t) of the tangent vector that is aligned along the fast fibers can be arranged to shrink exponentially quickly over long times, i.e. with growth rate (ακ)<0(\alpha-\kappa)<0 (in the case of a slow manifold with only attracting directions), there is now an extra, slowly-growing error term, arising from the new ελ\varepsilon\lambda term in the eigenvalue problem.

    The strategy to prove (130) proceeds essentially as in the proof of Proposition 8 in [17]. By smoothness of solutions to the ODEs, there exists some T>0T^{\prime}>0 such that the upper bound for |db(t)||db(t)| holds for t[0,T]t\in[0,T^{\prime}] if we choose B2B_{2} and B3B_{3} sufficiently large. We seek to show that we can take T=TT^{\prime}=T for choices B2,B3B_{2},\,B_{3} which are independent of the initial condition. Assume that there is a maximal T>0T^{\prime}>0 such that the error estimate is attained (otherwise there is nothing to prove). We show that this leads to a contradiction if TT is large enough.

    By a similar calculation to that in the proof of Proposition 8 in [17], we find that

    |dy(t)|\displaystyle|dy(t)| B¯1eαt,\displaystyle\leq\bar{B}_{1}e^{\alpha t},

    where the new constant B¯1>0\bar{B}_{1}>0 depends upon the width of the defining box Δ>0\Delta>0, as well as the growth rate constants. There is also an extra term of the form e(ακ)te^{(\alpha-\kappa)t} arising from applying the Duhamel principle (see Lemma 5.1 in [17]) to the extra term of the form ελM2db\varepsilon\lambda M_{2}db, where M2M_{2} is a smooth bounded function defined by compositions of the Fenichel straightening diffeomorphism and its inverse. We subsume this extra error term into the constant B¯1\bar{B}_{1}.

    By the Duhamel principle and the bounding estimates on points in QTQ_{T} together with the basic flow estimates for slowly-varying nonautonomous linear systems (see Proposition 2, Corollary 1, Proposition 6, and Proposition 7 in [17]), we then have

    |db(t)|\displaystyle|db(t)| K¯Xe(ακ)tM¯2+ελ0teμ(tζ)|M2(b(ζ),y(ζ),ε)||dy(ζ)|𝑑ζ\displaystyle\leq\bar{K}_{X}e^{(\alpha-\kappa)t}\bar{M}_{2}+\varepsilon\lambda\int_{0}^{t}e^{-\mu(t-\zeta)}|M_{2}(b(\zeta),y(\zeta),\varepsilon)||dy(\zeta)|d\zeta
    K¯Xe(ακ)tM¯2+ελMΔ,ε¯eμtB¯1e(μ+α)tμ+α\displaystyle\leq\bar{K}_{X}e^{(\alpha-\kappa)t}\bar{M}_{2}+\varepsilon\lambda M_{\Delta,\bar{\varepsilon}}e^{-\mu t}\bar{B}_{1}\frac{e^{(\mu+\alpha)t}}{\mu+\alpha}
    <B2e(ακ)t+B3eαt\displaystyle<B_{2}e^{(\alpha-\kappa)t}+B_{3}e^{\alpha t}

    if TT is chosen large enough and for B2,B3B_{2},\,B_{3} defined from the coefficients as in the above calculation. This contradicts the assumed maximality of TT^{\prime}, and so we conclude that db(t)db(t) satisfies the required inequality for all sufficiently large tt.

    The remainder of the proof follows the outlines of the main Theorem 6.5 in [17]. By assumption we have that |dy(0)|Mε|dy(0)|\geq M\varepsilon for some M>0M>0, and let β<0\beta<0 be fixed. It follows from general considerations about slowly varying nonautonomous linear systems (see Proposition 2 in [17]) together with the Duhamel principle that

    |dy(𝒯ε)|Kyεeβ𝒯q.\displaystyle|dy(\mathcal{T}_{\varepsilon})|\geq K_{y}\varepsilon e^{\beta\mathcal{T}_{q}}.

    The result follows from applying the estimate for |dy(t)||dy(t)| above and calculating the distance between (db(𝒯ε),dy(𝒯ε))(db(\mathcal{T}_{\varepsilon}),dy(\mathcal{T}_{\varepsilon})) and (0,dy(𝒯ε))(0,dy(\mathcal{T}_{\varepsilon})) in the Fubini-Study metric. \Box

    Remark C.1.

    An elephant trunk lemma is used in Gardner and Jones to prove an analogous result. We point out that only partial elephant trunk-type estimates over Sεa,S^{a,-}_{\varepsilon} and Sεa,+S^{a,+}_{\varepsilon} are required to provide an estimate of the type proven above. \triangle

    Appendix D Example for modified exchange lemma estimates

    We illustrate the assertions in Remark 7.8 with the following toy problem defined in a suitable box Δ\Delta in 2×2\mathbb{R}^{2}\times\mathbb{C}^{2}, where λ\lambda\in\mathbb{C} is taken to be a fixed constant:

    b\displaystyle b^{\prime} =b\displaystyle=-b (131)
    y\displaystyle y^{\prime} =εy\displaystyle=\varepsilon y
    db\displaystyle db^{\prime} =db+ελydy\displaystyle=-db+\varepsilon\lambda y\,dy
    dy\displaystyle dy^{\prime} =εdy.\displaystyle=\varepsilon\,dy.

    The critical manifold 0\mathcal{M}_{0} of this system is given by {b=0,db=0}\{b=0,\,db=0\}. This manifold is normally hyperbolic and attracting since the real parts of the nontrivial eigenvalues of the corresponding layer problem of (131) along 0\mathcal{M}_{0} are both negative in Δ\Delta, and hence there exists a one-parameter family ε\mathcal{M}_{\varepsilon} of attracting slow manifolds for sufficiently small values of ε>0\varepsilon>0. Without loss of generality we think of this family as parametrized by the slow variables (y,dy)(y,\,dy), and evidently b=0b=0 specifies one of the defining equations for ε\mathcal{M}_{\varepsilon}. The slow subbundle along ε\mathcal{M}_{\varepsilon} in terms of this coordinate representation can be computed by calculating the eigenvector corresponding to the 𝒪(ε)\mathcal{O}(\varepsilon) eigenvalue of the 2×22\times 2 Jacobian of the latter two equations in (131). It is given by the span of the vector

    (ελy1+ε1)\displaystyle\begin{pmatrix}\frac{\varepsilon\lambda y}{1+\varepsilon}\\ 1\end{pmatrix}

    as yy varies. On the other hand, the system (131) can be explicitly solved for (db(t),dy(t))(db(t),dy(t)) given initial values (b(0),y(0),db(0),dy(0))=(b0,y0,db0,dy0)(b(0),y(0),db(0),dy(0))=(b_{0},y_{0},db_{0},dy_{0}) to find

    db(t)\displaystyle db(t) =(11+2ε)(et[db0(1+2ε)ελy0dy0]+e2εtελy0dy0)\displaystyle=\left(\frac{1}{1+2\varepsilon}\right)\left(e^{-t}[db_{0}(1+2\varepsilon)-\varepsilon\lambda y_{0}dy_{0}]+e^{2\varepsilon t}\varepsilon\lambda y_{0}dy_{0}\right)
    dy(t)\displaystyle dy(t) =dy0eεt.\displaystyle=dy_{0}e^{\varepsilon t}.

    From this calculation, it follows that ε\mathcal{M}_{\varepsilon} is given by {b=0,db=ελydy/(1+2ε)}\{b=0,\,db=\varepsilon\lambda y\,dy/(1+2\varepsilon)\}.

    For times t=𝒯=𝒪(1/ε)t=\mathcal{T}=\mathcal{O}(1/\varepsilon), We find that the tangent vector (db(t),dy(t))(db(t),dy(t)) lies exponentially close to

    (ελy0dy01+2εe2ε𝒯dy0eε𝒯).\displaystyle\begin{pmatrix}\frac{\varepsilon\lambda y_{0}dy_{0}}{1+2\varepsilon}e^{2\varepsilon\mathcal{T}}\\ dy_{0}e^{\varepsilon\mathcal{T}}\end{pmatrix}.

    Evidently, this vector is 𝒪(ε)\mathcal{O}(\varepsilon) close to the tangent bundle of the slow manifold, which has the local representation

    (01).\begin{pmatrix}0\\ 1\end{pmatrix}.

    Further comparing the angle of this vector to the slow subbundle at the point y=y0eε𝒯y=y_{0}e^{\varepsilon\mathcal{T}} by using the distance estimate Lemma 3.1 in [17], we find that the angle is no larger than

    2ε2λy0(1+2ε)(1+ε),\displaystyle\frac{2\varepsilon^{2}\lambda y_{0}}{(1+2\varepsilon)(1+\varepsilon)},

    i.e. the angle scales as 𝒪(ε2)\mathcal{O}(\varepsilon^{2}). This error estimate is sharper than that given by the tangent bundle of the slow manifold.

    References

    • [1] J. Alexander, R. Gardner, and C. Jones, A topological invariant arising in the stability analysis of travelling waves, J. reine angew. Math. 410 (1990), 167–212.
    • [2] M.F. Atiyah, K-Theory, W.A. Benjamin, Inc. (1967).
    • [3] A. Bose, A Geometric Approach to Singularly Perturbed Nonlocal Reaction-Diffusion Equations, SIAM J. Math. Anal. 31 (2000), 431–454.
    • [4] Bronwyn Bradshaw-Hajek, Ian Lizarraga, Robert Marangell, and Martin Wechselberger. A geometric singular perturbation analysis of regularised reaction-nonlinear diffusion models including shocks, Proceedings of 47th Sapporo Symposium on Partial Differential Equations, (2022), pp. 53–64
    • [5] P. Brunovsky, Tracking invariant manifolds without differential forms, Acta Math. Univ. Comenion. 65 (1996), 23–32.
    • [6] B. de Rijk, A. Doelman, J.D.M. Rademacher, Spectra and stability of spatially periodic pulse patterns: Evans function factorization via Riccati transformation, SIAM J. Math. Anal. 48 (2016), 61–121.
    • [7] N. Fenichel, Geometric singular perturbation theory for ordinary differential equations, J. Differential Equations 31 (1979), pp. 53–98.
    • [8] JR. Gardner and C. Jones, Stability of Travelling Wave Solutions of Diffusive Predator-Prey Systems, Trans. AMS 327 (1991), pp. 465–524.
    • [9] K.E. Harley, P. van Heijster, R. Marangell, G.J. Pettet, and M. Wechselberger, Novel Solutions for a model of wound healing angiogenesis, Nonlinearity 27 (2014), pp. 2975-3003.
    • [10] K.E. Harley, P. van Heijster, R. Marangell, G.J. Pettet, T.V. Roberts, and M. Wechselberger, (In)stability of Travelling Waves in a Model of Haptotaxis, SIAM J. Appl. Math. 80 (2020), pp. 1629–1653.
    • [11] A. Hatcher, Vector Bundles and K-Theory, unpublished manuscript version 2.2 available online at https://pi.math.cornell.edu/~hatcher/VBKT/VB.pdf (2017).
    • [12] D. Henry, Geometric Theory of Semilinear Parabolic Equations, Lecture Notes in Mathematics, Springer-Verlag, Berlin (1981).
    • [13] K. Holling, Existence of Infinitely Many Solutions for a Forward Backward Heat Equation, Trans. AMS 278 (1983), pp. 299–316.
    • [14] C.K.R.T. Jones, Stability of the Travelling Wave Solution of the Fitzhugh-Nagumo System, Trans. AMS 286 (1984), pp. 431–469.
    • [15] C.K.R.T Jones, Geometric Singular Perturbation Theory, Lecture Notes in Mathematics, Dynamical Systems (Montecatini Terme), Springer, Berlin (2015).
    • [16] C.K.R.T. Jones and N. Kopell, Tracking Invariant Manifolds With Differential Forms in Singularly Perturbed Systems, J. Differential Equations (1994), pp. 64–88.
    • [17] C.K.R.T. Jones and S.K. Tin, Generalized Exchange Lemmas and Orbits Heteroclinic to Invariant Manifolds, DCDS-S (2009), pp. 967–1023.
    • [18] T. Kapitula and K. Promislow, Spectral and Dynamical Stability of Nonlinear Waves, Springer. Applied Mathematical Sciences Vol 185 (2013).
    • [19] P.E. Kloeden and M. Rasmussen, Nonautonomous Dynamical Systems, Mathematical Surveys and Monographs, Volume 176 (2011).
    • [20] M. Krupa and P. Szmolyan, Extending Geometric Singular Perturbation Theory to Nonhyperbolic Points–Fold and Canard Points in Two Dimensions, SIAM. J. Math. Anal. 33 (2001), pp. 286–314.
    • [21] C. Kuehn, Multiple Time Scale Dynamics, Springer International Publishing Switzerland (2015).
    • [22] Y. Li, P. van Heijster, M.J. Simpson, and M. Wechselberger, Shock-fronted travelling waves in a reaction-diffusion model with nonlinear forward-backward-forward diffusion, Physica D 423 (2021), 132916.
    • [23] Y. Li, P. van Heijster, R. Marangell, and M.J. Simpson, Travelling wave solutions in a negative nonlinear diffusion–reaction model, J. Math. Biol. 81 (2020), pp. 1495–1522.
    • [24] X.B. Lin and M. Wechselberger, Transonic evaporation waves in a spherically symmetric nozzle, SIAM J. Math. Anal. 46 (2014), pp. 1472–1504.
    • [25] A. Novick-Cohen, R.L. Pego, Stable patterns in a viscous diffusion equation, Trans. AMS 324 (1991), pp. 331–351.
    • [26] V. Padrón, Effect of aggregation on population recovery modeled by a forward-backward pseudoparabolic equation, Trans. AMS 356 (2004), pp. 2739–2756.
    • [27] R.L. Pego, O. Penrose, Front migration in the nonlinear Cahn-Hilliard equation, P. Roy. Soc. A-Math. Phys. 422 (1989), pp. 261–278.
    • [28] B. Sandstede, Stability of travelling waves, Handbook of Dynamical Systems II, North-Holland (2002), 983–1055.
    • [29] K. Scharnhorst, Angles in Complex Vector Spaces, Acta Appl. Math. 69 (2001), 95–103.
    • [30] S. Schecter, Exchange lemmas 2: General Exchange Lemma, J. Diff. Equations 245 (2008), pp. 411–441.
    • [31] M.J. Simpson, K.A. Landman, and B.D. Hughes, Cell invasion with proliferation mechanisms motivated by time-lapse data, Physica A 389 (2010), pp. 3779–3790.
    • [32] M.J. Simpson, K.A. Landman, B.D. Hughes, and A.E. Fernando, A model for mesoscale patterns in motile populations, Physica A 389 (2010), pp. 1412–1424.
    • [33] P. Szmolyan, Transversal heteroclinic and homoclinic orbits in singular perturbation problems, J. Differential Equations 92 (1991), pp. 252–281.
    • [34] P. Szmolyan and M. Wechselberger, Relaxation Oscillations in 3\mathbb{R}^{3}, J. Differential Equations 200 (2004), pp. 69–104.
    • [35] M. Wechselberger, À Propos De Canards, Trans. AMS 364 (2012), pp. 3289–3309.
    • [36] M. Wechselberger and G. J. Pettet, Folds, canards and shocks in advection-reaction-diffusion models, Nonlinearity 23 (2010) pp. 1949–1969.
    • [37] T.P. Witelski, Shocks in nonlinear diffusion, Appl. Math. Lett. 8 (1995), pp. 27–32.
    • [38] T.P. Witelski, The structure of internal layers for unstable nonlinear diffusion equations, Stud. Appl. Math. 97 (1996), pp. 277–300.