This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Melnikov method for non-conservative perturbations of the three-body problem

Marian Gidea Yeshiva University, Department of Mathematical Sciences, New York, NY 10016, USA [email protected] Rafael de la Llave School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, USA [email protected]  and  Maxwell Musser Yeshiva University, Department of Mathematical Sciences, New York, NY 10016, USA [email protected]
Abstract.

We consider the planar circular restricted three-body problem (PCRTBP), as a model for the motion of a spacecraft relative to the Earth-Moon system. We focus on the Lagrange equilibrium points L1L_{1} and L2L_{2}. There are families of Lyapunov periodic orbits around either L1L_{1} or L2L_{2}, forming Lyapunov manifolds. There also exist homoclinic orbits to the Lyapunov manifolds around either L1L_{1} or L2L_{2}, as well as heteroclinic orbits between the Lyapunov manifold around L1L_{1} and the one around L2L_{2}. The motion along the homoclinic/heteroclinic orbits can be described via the scattering map, which gives the future asymptotic of a homoclinic orbit as a function of the past asymptotic. In contrast with the more customary Melnikov theory, we do not need to assume that the asymptotic orbits have a special nature (periodic, quasi-periodic, etc.).

We add a non-conservative, time-dependent perturbation, as a model for a thrust applied to the spacecraft for some duration of time, or for some other effect, such as solar radiation pressure. We compute the first order approximation of the perturbed scattering map, in terms of fast convergent integrals of the perturbation along homoclinic/heteroclinic orbits of the unperturbed system. As a possible application, this result can be used to determine the trajectory of the spacecraft upon using the thrust.

Key words and phrases:
Melnikov method; homoclinic and heteroclinic orbits; three-body problem; astrodynamics.
2010 Mathematics Subject Classification:
Primary, 37J40; 37D05 37C29; 34C37; 70F07 Secondary, 70H08.
Research of M.G. was partially supported by NSF grant DMS-1515851, and the Alfred P. Sloan Foundation grant G-2016-7320.
Research of R.L. was partially supported by NSF grant DMS-1800241, and H2020-MCA-RISE #734577
Research of M.M. was partially supported by NSF grant DMS-1515851, and the Alfred P. Sloan Foundation grant G-2016-7320.

1. Introduction

1.1. Motivation

A motivation for this work is the following situation from astrodynamics. Consider a spacecraft traveling between the Earth and the Moon. Assume that the spacecraft is coasting along the stable/unstable hyperbolic invariant manifolds associated to the periodic orbits near one of the center-saddle equilibrium points, at some fixed energy level. Such an orbit is driven by the gravitational fields of the Earth and the Moon, and does not require using the thrusters. The total energy is preserved along the orbit. One can describe the motion of the spacecraft in terms of some geometrically defined coordinates: an ‘action’ coordinate describing the size of the periodic orbit associated to the stable/unstable invariant manifold, an ‘angle’ coordinate describing the asymptotic phase of the motion, and a pair of ‘hyperbolic’ coordinates describing the position of the spacecraft relative to the corresponding stable/unstable manifold.

Suppose now that we want to make a maneuver in order to jump from the hyperbolic invariant manifold on the given energy level to another hyperbolic invariant manifold on a different energy level. Mathematically, turning on the thrusters amounts to adding a small, non-conservative, time-dependent perturbation to the original system. If such a perturbation is given, we would like to estimate its effect on the orbit of the spacecraft. More precisely, we would like to compute the change in the action and angle coordinates associated to the orbit as a result of applying the perturbation. We want to obtain such an estimate in terms of the original trajectory of the unperturbed system, and of the particular perturbation.

Other non-Hamiltonian perturbations, for instance, due to solar radiation pressure and solar wind, can be considered.

In this paper, we will investigate the general problem of adding a non-Hamiltonian perturbation to a homoclinic/heteroclinic trajectory and computing the effect on the homoclinic/heteroclinic orbits.

Note that adding a non-Hamiltonian perturbation (e.g., a small dissipation) may destroy all the periodic orbits. Nevertheless, the normally hyperbolic manifold, formed by the collection of periodic orbits, persists (see details later). In contrast with the most customary versions of the Melnikov theory, which assume that the asymptotic orbits are periodic or quasiperiodic, and that they preserve their nature under perturbation, we consider homoclinic excursions to a normally hyperbolic manifold. The asymptotic orbits could change their nature under the perturbations. For example, a family of Lyapunov periodic orbits subject to a small dissipation could get transformed into a family of orbits that converge to a critical point.

1.2. Brief description of the main results and methodology

In the model that we consider, the family of periodic orbits about either L1L_{1} or L2L_{2} forms a normally hyperbolic invariant manifold (NHIM). Each NHIM has hyperbolic stable and unstable manifolds. We will assume that the stable and unstable manifolds of the NHIM corresponding to either L1L_{1} or L2L_{2} intersect transversally, and also that the unstable (stable) manifold of the NHIM corresponding to L1L_{1} intersects transversally the stable (unstable) manifold of the NHIM corresponding to L2L_{2}. This assumption can be verified numerically for a wide range of energy levels and mass parameters in the PCRTBP (see, e.g., [KLMR00]). Thus, there exist homoclinic orbits to either one of the NHIM’s, as well as heteroclinic orbits between the two NHIM’s. There exist scattering maps associated to the transverse homoclinic intersections, as well as to the transverse heteroclinic intersections.

There exist some neighborhoods of L1L_{1} and of L2L_{2} where the Hamiltonian of the PCRTBP can be written in a normal form, via some suitable symplectic action-angle and hyperbolic coordinates (I,θ,y,x)(I,\theta,y,x). In particular, each NHIM can be parametrized in terms of the action-angle coordinates (I,θ)(I,\theta). Therefore, the scattering map can also be described in terms of these coordinates. In the unperturbed case, the scattering map is particularly simple: it is a shift in the angle coordinate (a phase shift).

The fact that we use normal form coordinates to express the Hamiltonian, and we subsequently estimate the scattering map in terms of the action-angle coordinates, is a matter of practical convenience. Normal forms are often used to compute numerically, with high precision, the periodic orbits and the NHIM’s around the equilibrium points, as well as the corresponding stable and unstable manifolds, e.g., [Jor99, GJMS01].

For applications, it is important to note that the scattering map for the PCRTBP can be computed numerically with high precision; see [CDMR06, DGR16, CGdlL16].

In this paper we study the effect of a small, non-Hamiltonian, time-dependent perturbation that is added to the system. Provided that the perturbation is small enough, the NHIM’s will persist [Fen72], although periodic orbits inside the NHIM’s may disappear. Also, the transverse homoclinic/heteroclinic orbits, hence the scattering map, will survive in the perturbed system.

The main contribution is that we compute the effect of the perturbation on the action and angle components of the scattering map. More precisely, we use Melnikov theory to provide explicit estimates – up to first order with respect to the size of the perturbation – for the difference between the perturbed scattering map and the unperturbed one, relative to the action and angle coordinates. The resulting expressions are given in terms of convergent improper integrals of the perturbation evaluated along segments of homoclinic/heteroclinic orbits of the unperturbed system. One important aspect in the computation is that, in the perturbed system, the action is a slow variable, while the angle is a fast variable.

We stress that, unlike the usual treatments of Melnikov theory, when one studies orbits homoclinic to hyperbolic fixed points, periodic or quasi-periodic orbits, here we study orbits homoclinic to NHIM’s. The asymptotic dynamics inside the NHIM’s may change under the perturbation.

The effect of the perturbation on the action-angle components of the scattering map can be interpreted, in the context of astrodynamics, as follows. The difference in the action coordinates between the perturbed and the unperturbed scattering map can be interpreted as the change in energy due to the maneuver, or equivalently, the change in the ‘size’ of the periodic orbit which the homoclinic/heteroclinic orbit is asymptotic to. The difference in the angle coordinates between the perturbed and the unperturbed scattering map can be interpreted as the change in asymptotic phase due to the maneuver.

We mention here that there are numerous works on using hyperbolic invariant manifolds to design low-energy space mission, see, for example [KLMR00, GLMS01, Bel04, BGT10, PA14], and the references listed there. We hope that our results can be used to optimize the thrust that needs to be applied in order to maneuver between hyperbolic invariant manifolds on different energy levels.

1.3. Related works

The study of Hamiltonian systems subject to non-conservative perturbations is of practical interest in physical models, such as in celestial mechanics, where dissipation leads to migration of satellites and spacecrafts [MNF87, GEPC17, dlLK19, CCdlL20].

Computation of the scattering map, similar to the one in this paper, have been done in the case of the pendulum-rotator model subject to Hamiltonian perturbations, e.g., [DdlLS08, GdlL18]. The rotator-pendulum model is a product system and is naturally endowed with action-angle and hyperbolic coordinates. It has two conserved quantities: the action of the rotator and the total energy of the pendulum. The effect of the perturbation on the action component of the scattering map is relatively easy to compute directly. On the other hand, the effect on the angle component of the scattering map is more complicated to compute, since this is a fast variable. To circumvent this difficulty, the papers [DdlLS08, GdlL18] use the symplecticity of the scattering map to estimate indirectly the effect of the perturbation on the angle component of the scattering map.

The perturbed scattering map has been computed in the case of the pendulum-rotator model subject to non-conservative perturbations in[GdlLM21]. Since the perturbations are not Hamiltonian, the symplectic argument used in [DdlLS08] can no longer be applied. Therefore, to determine the effect of the perturbation on the angle component of the scattering map, a direct computation is performed in [GdlLM21].

The PCRTBP model considered in this paper presents some significant differences from the pendulum-rotator model. First, it has only one conserved quantity, namely the total energy. Second, the PCRTBP is not a product system, and does not carry a globally defined system of action-angle and hyperbolic coordinates. Third, in the unperturbed case the stable and unstable manifolds associated to a NHIM do not coincide. For these reasons, we construct locally defined systems of action-angle and hyperbolic coordinates along the unstable manifold as well as along the stable manifold, respectively. At the intersection of the unstable and stable manifolds, the two coordinate systems do not agree in general. So the computation of the perturbed scattering map has to take into account the ‘mismatch’ between these coordinate systems. The dynamics in these coordinate systems fails to be of product type, as there is a coupling between the action-angle and the hyperbolic coordinates, which also needs to be taken into account in the computation. All of these features make the computation of the perturbed scattering map for the planar circular restricted three-body problem more intricate than for the rotator-pendulum system. Some of the calculations are simplified taking advantage that some of the variables in the unperturbed system are slow variables, but we can deal with perturbation theory for fast variables by observing that, near the NHIM’s, the difference between the variables and their asymptotic values is slow (a technique already used in [GdlLM21].

1.4. Structure of the paper

The main result, Theorem 2.1, is given in Section 2. It provides an expansion of the perturbed scattering map in terms of the unperturbed scattering map, where the first order term in the expansion is given explicitly in Section 5, in Proposition 5.2 and Proposition 5.7. Section 3 describes how to verify the hypotheses of Theorem 2.1 in the context of the PCRTBP. Section 4 defines some suitable coordinate systems, which we use to describe the geometric objects of interest. The proof of the main result is given in Section 4.

2. Set-up and main result

Consider a (real analytic) 𝒞ω\mathcal{C}^{\omega}, symplectic manifold (M,Ω)(M,\Omega) of dimension (2m)(2m). Each point zMz\in M is described via a system of local coordinates z=z(p,q)2mz=z(p,q)\in\mathbb{R}^{2m} such that Ω\Omega relative to these coordinates is the standard symplectic form

(2.1) Ω=dpdq=i=1mdpidqi.\Omega=dp\wedge dq=\sum_{i=1}^{m}dp_{i}\wedge dq_{i}.

On MM we consider a non-autonomous system of differential equations

(2.2) ddtz=𝒳0(z)+ε𝒳1(z,t;ε),\frac{d}{dt}z=\mathcal{X}^{0}(z)+\varepsilon\mathcal{X}^{1}(z,t;\varepsilon),

where 𝒳0:MTM\mathcal{X}^{0}:M\to TM is a 𝒞ω\mathcal{C}^{\omega}-smooth vector field on MM, 𝒳1:M××(ε0,ε0)TM\mathcal{X}^{1}:M\times\mathbb{R}\times(-\varepsilon_{0},\varepsilon_{0})\to TM is a time-dependent, parameter dependent 𝒞r\mathcal{C}^{r}-smooth vector field on MM, with rr sufficiently large, and ϵ\epsilon\in\mathbb{R} is a ‘smallness’ parameter, taking values in the interval (ε0,ε0)(-\varepsilon_{0},\varepsilon_{0}) around 0. The dependence of 𝒳1(z,t;ε)\mathcal{X}^{1}(z,t;\varepsilon) on the time tt is assumed to be of a general type – not necessarily periodic or quasi-periodic.

The flow of (2.2) will be denoted by Φεt\Phi^{t}_{\varepsilon}.

The assumption that 𝒳0\mathcal{X}^{0} is analytic is motivated by applications to celestial mechanics. We will consider the case when 𝒳0\mathcal{X}^{0} is Hamiltonian vector field for the PCRTBP. We will use this assumption only to be able to quote several normal form theorems. We believe it could be weakened to finite differentiability at the price of providing some new normal form theorems.

The assumption that 𝒳1\mathcal{X}^{1} is only 𝒞r\mathcal{C}^{r} and possibly not Hamiltonian is also motivated by applications, as 𝒳1\mathcal{X}^{1} can model the thrust applied to a spacecraft for some time. In particular 𝒳1\mathcal{X}^{1} can have compact support in space and in time. Note that even if the perturbation were analytic, the NHIM’s which play a role in our treatment can only be assumed to be finite differentiable. The regularity is limited by ratios between the tangential and normal contraction rates, as well as by the regularity of the perturbation. Since the motion on the unperturbed manifold is integrable, we have that for ε\varepsilon small enough, the tangential rates are close to zero, so that the limitations of regularity due to the rates become irrelevant.

Below, we will require that the vector fields 𝒳0\mathcal{X}^{0}, 𝒳1\mathcal{X}^{1} satisfy additional assumptions.

2.1. The unperturbed system

We assume that the vector field 𝒳0\mathcal{X}^{0} represents an autonomous Hamiltonian vector field, that is, 𝒳0=JH0\mathcal{X}^{0}=J\nabla H_{0} for some 𝒞ω\mathcal{C}^{\omega} Hamiltonian function H0:MH_{0}:M\to\mathbb{R}, where JJ is an almost complex structure compatible with the standard symplectic form given by (2.1), and the gradient :=z\nabla:=\nabla_{z} is with respect to the associated Riemannian metric. The Hamilton equation for the unperturbed system is:

(2.3) ddtz=JH0(z).\frac{d}{dt}z=J\nabla H_{0}(z).

2.1.1. Homoclinic connections

We assume that the Hamiltonian flow associated to H0H_{0} satisfies the conditions (A-i), (A-ii) and (A-iii), below. In Section 3, we will see that these conditions can be verified in the planar circular restricted three-body problem.

  • (A-i)

    There exists an equilibrium point LL of saddle-center type, that is, the linearized system DJH0DJ\nabla H_{0} at LL has eigenvalues of the type ±λ,±iω\pm\lambda,\pm i\omega, with λ,ω0\lambda,\omega\neq 0.

Consequently, by the Lyapunov Center Theorem [Mos58], there exists a 11-parameter family of periodic orbits λ0(h)\lambda_{0}(h), parametrized by the energy level H=hH=h, for h(H(L),h1)h\in(H(L),h_{1}) with h1h_{1} sufficiently small, such that λ0(h)\lambda_{0}(h) shrinks to LL as hH(L)h\to H(L). This family of periodic orbits determines a 22-dimensional manifold Λ0D×𝕋M\Lambda_{0}\simeq D\times\mathbb{T}\subseteq M which is a normally hyperbolic invariant manifold (NHIM) with boundary for the Hamiltonian flow Φ0t\Phi^{t}_{0} of H0H_{0}, where DD is closed interval with non-empty interior contained in (H(L),h1)(H(L),h_{1}). The notion of a NHIM is recalled in Definition A.1, Appendix A.

The NHIM Λ0\Lambda_{0} is symplectic when endowed with the form ΩΛ0\Omega_{\mid\Lambda_{0}}, where Ω\Omega is given by (2.1). Moreover, Λ0\Lambda_{0} is foliated by the periodic orbits λ0(h)\lambda_{0}(h), i.e., Λ0=hDλ0(h)\Lambda_{0}=\bigcup_{h\in D}\lambda_{0}(h). The flow Φ0t\Phi^{t}_{0} on each λ0(h)\lambda_{0}(h) is a constant speed flow. In particular, the dynamics restricted to the NHIM is integrable. Therefore Λ0\Lambda_{0} can be parametrized in terms of symplectic action-angle variables (I,θ)(I,\theta), so that each periodic orbit λ0(h)\lambda_{0}(h) represents a level set IhI_{h} of the action variable. The action IhI_{h} is uniquely determined by the energy level H0=hH_{0}=h. In fact, as we will see in Section 4.1, there exist a system of action-angle and hyperbolic variables (I,θ,y,x)(I,\theta,y,x) in a neighborhood of LL such the Hamiltonian H0H_{0} can be written in a normal form.

  • (A-ii)

    There exists a relatively compact open set 𝒦\mathscr{K} in MM such that the unstable manifold W𝒦u(Λ0)W^{\rm u}_{\mathscr{K}}(\Lambda_{0}) and the stable manifold W𝒦s(Λ0)W^{\rm s}_{\mathscr{K}}(\Lambda_{0}) inside 𝒦\mathscr{K} intersect transversally along a homoclinic channel Γ0𝒦\Gamma_{0}\subset\mathscr{K}.

The definition of a homoclinic channel is given in Appendix B, Definition B.1.

As a consequence of (A-ii), there exist transverse homoclinic orbits to Λ0\Lambda_{0}. The unstable and stable manifolds of λ0(h)\lambda_{0}(h) are contained in the same energy level as λ0(h)\lambda_{0}(h), i.e., Wu(λ0(h)),Ws(λ0(h))MhW^{\rm u}(\lambda_{0}(h)),W^{\rm s}(\lambda_{0}(h))\subseteq M_{h}, where Mh={zM|H=h}M_{h}=\{z\in M\,|\,H=h\}. Hence, each homoclinic orbit to Λ0\Lambda_{0} is asymptotic, in both forward and backwards time, to the periodic orbit λ0(h)\lambda_{0}(h) corresponding to its energy level hh.

In Section 4.1, we will see that the normal form coordinates (I,θ,y,x)(I,\theta,y,x) can be extended via the flow Φ0t\Phi^{t}_{0} along neighborhoods of Wu(Λ0)W^{\rm u}(\Lambda_{0}) and Ws(Λ0)W^{\rm s}(\Lambda_{0}), yielding two systems of action-angle and hyperbolic variables (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}), (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}), respectively. Relative to these two systems of coordinates Wu(Λ0)W^{\rm u}(\Lambda_{0}) can be described locally by yu=0y^{\rm u}=0, and Ws(Λ0)W^{\rm s}(\Lambda_{0}) can be described locally by xs=0x^{\rm s}=0. The two coordinate systems (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) and (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) do not agree with one another in a neighborhood of the homoclinic channel Γ0\Gamma_{0}.

In addition, we require that H0H_{0} satisfies a non-degeneracy condition (A-iii), which is formulated in terms of the normal form from Section 4.1. Condition (A-iii) consists of two parts: (A-iii-a), and (A-iii-b). Condition (A-iii-a) will be given in Section 4.1, and condition (A-iii-b) will be given in Section 4.2. These are explicit and verifiable conditions that the derivatives of certain functions are non-zero.

2.1.2. Unperturbed scattering map associated to a homoclinic channel

Let Ω:Wu(Λ0)Λ0\Omega^{-}:W^{\rm u}(\Lambda_{0})\to\Lambda_{0} be the projection mapping defined by Ω(z0)=z0\Omega^{-}(z_{0})=z^{-}_{0}, where z0Λ0z_{0}^{-}\in\Lambda_{0} is the footpoint of the unstable fiber through z0Wu(Λ0)z_{0}\in W^{\rm u}(\Lambda_{0}). Similarly, let Ω+:Ws(Λ0)Λ0\Omega^{+}:W^{\rm s}(\Lambda_{0})\to\Lambda_{0} be the projection mapping defined by Ω+(z0)=z0+\Omega^{+}(z_{0})=z^{+}_{0}, where z0+Λ0z_{0}^{+}\in\Lambda_{0} is the footpoint of the stable fiber through z0Ws(Λ0)z_{0}\in W^{\rm s}(\Lambda_{0}). The stable and unstable fibers are defined in Appendix A, equation (A.4).

Consider the homoclinic channel Γ0\Gamma_{0} from condition (A-ii). By the definition of a homoclinic channel, Ω±\Omega^{\pm} restricted to Γ0\Gamma_{0} is a diffeomorphism onto its image. To any homoclinic channel we can associate a scattering map, which is defined in Appendix B, Definition B.2. Specifically, the scattering map σ0\sigma_{0} associated to Γ0\Gamma_{0} is given by:

(2.4) σ0:Ω(Γ0)Λ0Ω+(Γ0)Λ0,σ0(z0)=z0+,\begin{split}&\sigma_{0}:\Omega^{-}(\Gamma_{0})\subseteq\Lambda_{0}\to\Omega^{+}(\Gamma_{0})\subseteq\Lambda_{0},\\ &\sigma_{0}(z_{0}^{-})=z_{0}^{+},\end{split}

provided that there exists a homoclinic point z0Γ0z_{0}\in\Gamma_{0} such that Ω(z0)=z0\Omega^{-}(z_{0})=z^{-}_{0} and Ω+(z0)=z0+\Omega^{+}(z_{0})=z^{+}_{0}. For more details on the scattering map, see Appendix B.

The energy preservations along the stable and unstable manifolds of periodic orbits implies that σ0\sigma_{0} leaves each periodic orbit λ0(h)\lambda_{0}(h) invariant, that is, σ0(λ0(h))λ0(h)\sigma_{0}(\lambda_{0}(h))\subseteq\lambda_{0}(h).

Fixing a point z0Γz_{0}\in\Gamma, we have that σ0(z0)=z0+=z0+Δ\sigma_{0}(z_{0}^{-})=z_{0}^{+}=z_{0}^{-}+\Delta, for some Δ\Delta depending on z0z_{0}. The invariance property of the scattering map (B.2), and the fact that Φ0t\Phi_{0}^{t} restricted to λ0(h)\lambda_{0}(h) is linear implies that

σ0(Φ0t(z0))=Φt(z0+)=Φt(z0+Δ)=Φt(z0)+Δ,\sigma_{0}(\Phi^{t}_{0}(z_{0}^{-}))=\Phi^{t}(z_{0}^{+})=\Phi^{t}(z_{0}^{-}+\Delta)=\Phi^{t}(z_{0}^{-})+\Delta,

for all tt for which Φ0t(z0)\Phi^{t}_{0}(z_{0}) remains in Γ0\Gamma_{0}. This implies that, in terms of the action-angle coordinates (I,θ)(I,\theta), σ0\sigma_{0} is given by a shift in the angle

σ0(I,θ)=(I,θ+Δ(I)),\sigma_{0}(I,\theta)=(I,\theta+\Delta(I)),

for some function Δ\Delta that depends differentiably on II. We stress that Δ\Delta also depends on the choice of the homoclinic channel Γ0\Gamma_{0}.

2.2. The perturbation.

The vector field 𝒳1\mathcal{X}^{1} is a time-dependent, parameter-dependent vector field on MM.

  • (A-iv)

    We assume that 𝒳1=𝒳1(z,t;ε)\mathcal{X}^{1}=\mathcal{X}^{1}(z,t;\varepsilon) is uniformly 𝒞r\mathcal{C}^{r}-differentiable in all variables on 𝒦××(ε0,ε0)\mathscr{K}\times\mathbb{R}\times(-\varepsilon_{0},\varepsilon_{0}), where the set 𝒦\mathscr{K} is as in the condition (A-ii).

Above, we assume that rr is suitably large. We will not assume that 𝒳1\mathcal{X}^{1} is Hamiltonian. Thus, our setting can be used to model dissipation or forcing applied to a Hamiltonian system. Note that non-Hamiltonian perturbations are very singular, in the sense that periodic and homoclinic orbits may disappear. On the other hand, the NHIM’s and their stable and unstable manifolds persist and can be used as the basis for perturbative calculations.

As a particular case, we will also write our results for the case when the perturbation 𝒳1\mathcal{X}^{1} in (2.2) is Hamiltonian, that is, it is given by

(2.5) 𝒳1(z,t;ε)=JzH1(z,t;ε),\mathcal{X}^{1}(z,t;\varepsilon)=J\nabla_{z}H_{1}(z,t;\varepsilon),

where H1H_{1} is a time-dependent, parameter-dependent 𝒞r\mathcal{C}^{r}-smooth Hamiltonian function on MM.

When the perturbation is added to the system, as we will see in Section 5.1, the NHIM for the unperturbed system can be continued to a NHIM for the perturbed system, and the transverse homoclinic/heteroclinic orbits for the unperturbed system can be continued to transverse homoclinic/heteroclinic orbits for the perturbed system, provided that the perturbation is sufficiently small. Hence there exists an associated scattering map for the perturbed system.

The goal is to quantify the effect of the perturbation on the corresponding scattering map.

2.3. Extended system

We consider the system (2.2) under the conditions (A-i), (A-ii), and (A-iii). We associate to it the extended system

(2.6) ddτz=𝒳0(z)+ε𝒳1(z,t;ε),ddτt=1,\begin{split}&\frac{d}{d\tau}z=\mathcal{X}^{0}(z)+\varepsilon\mathcal{X}^{1}(z,t;\varepsilon),\\ &\frac{d}{d\tau}t=1,\\ \end{split}

which is defined on the extended phase space M~=M×\widetilde{M}=M\times\mathbb{R}. We denote z~=(z,t)M~\tilde{z}=(z,t)\in\widetilde{M}. The independent variable will be denoted by τ\tau from now on. We will denote by Φ~ετ\tilde{\Phi}^{\tau}_{\varepsilon} the extended flow of (2.6). We have

Φ~ετ(z,t)=(Φετ(z),t+τ).\tilde{\Phi}^{\tau}_{\varepsilon}(z,t)=({\Phi}^{\tau}_{\varepsilon}(z),t+\tau).

In the extended phase space we have the following:

Λ~0=Λ0×{\tilde{\Lambda}}_{0}=\Lambda_{0}\times\mathbb{R}

is a NHIM with boundary for the extended unperturbed flow Φ~0τ\tilde{\Phi}^{\tau}_{0}.

Γ~0=Γ0×\tilde{\Gamma}_{0}=\Gamma_{0}\times\mathbb{R}

is a homoclinic channel for Φ~0τ\tilde{\Phi}^{\tau}_{0}.

The scattering map associated to Γ~0\tilde{\Gamma}_{0} is given by

σ~0(I,θ,t)=(I,θ+Δ(I),t).\tilde{\sigma}_{0}(I,\theta,t)=(I,\theta+\Delta(I),t).

2.4. Main result

Theorem 2.1.

Consider the system (2.2).

Assume that the unperturbed system 𝒳0\mathcal{X}^{0} satisfies the conditions (A-i), (A-ii) and (A-iii), and that the perturbation 𝒳1\mathcal{X}^{1} satisfies the condition (A-iv).

Then, there exists ε1\varepsilon_{1}, with 0<ε1<ε00<\varepsilon_{1}<\varepsilon_{0}, such that the following hold true:

  • (i)

    For all ε(ε1,ε1)\varepsilon\in(-\varepsilon_{1},\varepsilon_{1}), there is a 𝒞\mathcal{C}^{\ell}-family of NHIMs Λ~ε\tilde{\Lambda}_{\varepsilon} for the extended flow Φ~εt\tilde{\Phi}^{t}_{\varepsilon}, for some 1\ell\geq 1, which form a continuation of the NHIM Λ~0\tilde{\Lambda}_{0} of Φ~0t\tilde{\Phi}^{t}_{0};

  • (ii)

    For all ε(ε1,ε1)\varepsilon\in(-\varepsilon_{1},\varepsilon_{1}), the unstable and stable manifolds of Λ~ε\tilde{\Lambda}_{\varepsilon}, Wu(Λ~ε)W^{\rm u}(\tilde{\Lambda}_{\varepsilon}) and Ws(Λ~ε)W^{\rm s}(\tilde{\Lambda}_{\varepsilon}), respectively, intersect transversally, in the extended phase space M~\widetilde{M}, along a homoclinic channel Γ~ε\tilde{\Gamma}_{\varepsilon};

  • (iii)

    The perturbed scattering map σ~ε\tilde{\sigma}_{\varepsilon} associated to Γ~ε\tilde{\Gamma}_{\varepsilon} can be written as

    (2.7) σ~ε(I,θ,t)=σ~0(I,θ,t)+ε𝒮~(I,θ,t)+O𝒞(ε2),\tilde{\sigma}_{\varepsilon}(I,\theta,t)=\tilde{\sigma}_{0}(I,\theta,t)+\varepsilon\tilde{\mathcal{S}}(I,\theta,t)+O_{\mathcal{C}^{\ell}}(\varepsilon^{2}),

    where 𝒮~=(𝒮~I,𝒮~θ,𝒮~t)\tilde{\mathcal{S}}=(\tilde{\mathcal{S}}^{I},\tilde{\mathcal{S}}^{\theta},\tilde{\mathcal{S}}^{t}) is a mapping from some domain in D×𝕋×D\times\mathbb{T}\times\mathbb{R} to its image in ×𝕋×\mathbb{R}\times\mathbb{T}\times\mathbb{R} as follows:

    • (iii-a)

      the components 𝒮~I\tilde{\mathcal{S}}^{I} and 𝒮~θ\tilde{\mathcal{S}}^{\theta} are given by (5.17) and (5.40), respectively, and

    • (iii-b)

      the component 𝒮~t\tilde{\mathcal{S}}^{t} is given by 𝒮~t(I,θ,t)=t\tilde{\mathcal{S}}^{t}(I,\theta,t)=t.

We recall the notation O𝒞k()O_{\mathcal{C}^{k}}(\cdot) used above: f=O𝒞k(g)f=O_{\mathcal{C}^{k}}(g) means that f𝒞kMg𝒞k\|f\|_{\mathcal{C}^{k}}\leq M\|g\|_{\mathcal{C}^{k}} for some M>0M>0, where k0k\geq 0, and 𝒞k\|\cdot\|_{\mathcal{C}^{k}} is the 𝒞k{\mathcal{C}^{k}}-norm. In the sequel, to simplify the notation we will write O()O(\cdot) without the subscript indicating the function space topology, whenever this can be inferred from the context.

2.5. Heteroclinic connections

Instead of the conditions (A-i), (A-ii), (A-iii) we assume that the Hamiltonian flow associated to H0H_{0} satisfies the conditions (A’-i), (A’-ii), (A’-iii) from below.

Condition (A’-i) has two parts (A’-i-a) and (A’-i-b).

  • (A’-i-a)

    There exists two equilibrium points L1L^{1}, L2L^{2} of saddle-center type.

We do not assume that the two equilibrium points are on the same energy level, that is H(L1)H(L2)H(L^{1})\neq H(L^{2}) in general. Consequently, for each equilibrium point L1L^{1}, L2L^{2} there exists a 11-parameter family of closed orbits λ01(h)\lambda^{1}_{0}(h), for hD1h\in D^{1}, and λ02(h)\lambda^{2}_{0}(h) for hD2h\in D^{2}, where D1D^{1}, D2D^{2} are some closed intervals contained in some neighborhoods of H(L1)H(L^{1}), H(L2)H(L^{2}), respectively.

  • (A’-i-b)

    There exists an interval of energies DD1D2D\subseteq D^{1}\cap D^{2}, with non-empty interior, such that there are periodic orbits λ01(h)\lambda^{1}_{0}(h), λ02(h)\lambda^{2}_{0}(h) for all hDh\in D. Moreover, there exist normal form coordinates (I1,θ1,y1,x1)(I^{1},\theta^{1},y^{1},x^{1}) and (I2,θ2,y2,x2)(I^{2},\theta^{2},y^{2},x^{2}) defined around λ01(h)\lambda^{1}_{0}(h), λ02(h)\lambda^{2}_{0}(h), respectively, for all hDh\in D. These normal form coordinates are as in Section 4.1.

This condition implies that each family of periodic orbits determines a 22-dimensional NHIM Λ01=hDλ0(h)\Lambda^{1}_{0}=\bigcup_{h\in D}\lambda_{0}(h), Λ02=hDλ02(h)\Lambda^{2}_{0}=\bigcup_{h\in D}\lambda^{2}_{0}(h) in MM. Moreover, there are neighborhood of Λ01\Lambda^{1}_{0} and Λ02\Lambda^{2}_{0} where H0H_{0} can be written in a normal form as in Section 4.1.

  • (A’-ii)

    There exists a relatively compact open set 𝒦\mathscr{K} in MM such that the unstable manifold W𝒦u(Λ01)W^{\rm u}_{\mathscr{K}}(\Lambda^{1}_{0}) and the stable manifold W𝒦s(Λ02)W^{\rm s}_{\mathscr{K}}(\Lambda^{2}_{0}) inside 𝒦\mathscr{K} intersect transversally along a heteroclinic channel Γ0𝒦\Gamma_{0}\subset\mathscr{K}.

The definition of a heteroclinic channel is given in Definition B.3, Appendix B. As a consequence, there exist transverse heteroclinic orbits from Λ01\Lambda^{1}_{0} to Λ02\Lambda^{2}_{0}. Each such heteroclinic orbit is asymptotic in backwards time to a periodic orbit λ01(h)\lambda^{1}_{0}(h), and is asymptotic in forward time to a periodic orbit λ02(h)\lambda^{2}_{0}(h).

As in the case of homoclinic connections, we require some non-degeneracy condition (A’-iii), formulated in terms of normal forms, which is the analogue of condition (A-iii) in Section 2. For the sake of brevity, we will not formulate this condition explicitly.

2.5.1. Unperturbed scattering map associated to a heteroclinic channel

As before, we define Ω,1:Wu(Λ01)Λ01\Omega^{-,1}:W^{\rm u}(\Lambda^{1}_{0})\to\Lambda^{1}_{0} by Ω,1(z0)=z0\Omega^{-,1}(z_{0})=z^{-}_{0}, where z0Λ01z_{0}^{-}\in\Lambda^{1}_{0} is the footpoint of the unstable fiber through z0Wu(Λ01)z_{0}\in W^{\rm u}(\Lambda^{1}_{0}), and Ω+,2:Ws(Λ02)Λ02\Omega^{+,2}:W^{\rm s}(\Lambda^{2}_{0})\to\Lambda^{2}_{0} by Ω+,2(z0)=z0+\Omega^{+,2}(z_{0})=z^{+}_{0}, where z0+Λ02z_{0}^{+}\in\Lambda^{2}_{0} is the footpoint of the stable fiber through z0Ws(Λ02)z_{0}\in W^{\rm s}(\Lambda^{2}_{0}).

Associated to the heteroclinic channel Γ0\Gamma_{0} we can define the scattering map as in Definition B.4 in Appendix B. Specifically,

σ0:Ω,1(Γ0)Λ01Ω+,2(Γ0)Λ02,\sigma_{0}:\Omega^{-,1}(\Gamma_{0})\subseteq\Lambda^{1}_{0}\to\Omega^{+,2}(\Gamma_{0})\subseteq\Lambda^{2}_{0},

is given by

σ0(z0)=z0+,\sigma_{0}(z_{0}^{-})=z_{0}^{+},

provided that there exists a z0Γ0z_{0}\in\Gamma_{0} such that Ω,1(z0)=z0\Omega^{-,1}(z_{0})=z^{-}_{0} and Ω+,2(z0)=z0+\Omega^{+,2}(z_{0})=z^{+}_{0}.

Since H0H_{0} is constant along heteroclinic orbits, we have that I(z0)=I(z0+)I(z^{-}_{0})=I(z^{+}_{0}). Then the scattering map, expressed in terms of the action-angle coordinates, is given by

σ0(I,θ)=(I,θ+Δ(I)),\sigma_{0}(I,\theta)=(I,\theta+\Delta(I)),

for some function Δ\Delta that depends smoothly on II.

In this case, we can obtain a result similar to Theorem 2.1. For brevity, we will not provide the precise formulas for the components of the corresponding expansion of the perturbed scattering map, which is analogous to (2.7). Such formulas are analogues of (5.17) and (5.40).

3. Geometric structures in the planar circular restricted three-body problem.

In this section we survey the status of the verification of the conditions (A-i), (A-ii), (A-iii) from Section 2.1, or the conditions (A’-i), (A’-ii), (A’-iii) from Section 2.5, in the concrete model of the PCRTBP. Some of the verifications in the literature are rigorous and some of them are numerical.

Of course, the verification of the hypothesis of Theorem 2.1 in a concrete model does not affect the validity of the rigorous arguments establishing Theorem 2.1, and the reader interested only in rigorous results may safely skip this section.

We note that, since our hypothesis are mainly transversality conditions, they can be verified with finite precision calculations, which seem to be safe calculations for today’s standard and could well be accessible to “computer assisted proofs”. We hope that this work could stimulate more extensive verifications.

The PCRTBP is a model describing the motion of an infinitesimal body under the Newtonian gravity exerted by two heavy bodies moving on circular orbits about their center of mass, under the assumption that these orbits are not affected by the gravity of the infinitesimal body.

We can think of the heavy bodies (referred to as primaries) representing the Earth and Moon, and the infinitesimal mass representing a spaceship.

It is convenient to study the motion of the infinitesimal body relative to a co-rotating frame which rotates with the primaries around the center of mass, and to use normalized units. Henceforth, the masses of the heavy bodies are 1μ1-\mu and μ\mu, where μ(0,1/2]\mu\in(0,1/2]. Relative to the co-rotating frame, the heavier mass 1μ1-\mu is located at (μ,0)(\mu,0), and the lighter mass μ\mu is located at (1+μ,0)(-1+\mu,0). The motion of the infinitesimal body relative to the co-rotating frame is described via the autonomous Hamiltonian

(3.1) H0(p1,p2,q1,q2)=(p1+q2)2+(p2q1)22V(q1,q2),H_{0}(p_{1},p_{2},q_{1},q_{2})=\frac{(p_{1}+q_{2})^{2}+(p_{2}-q_{1})^{2}}{2}-V(q_{1},q_{2}),

where (p,q)=(p1,p2,q1,q2)4\left(p,q\right)=\left(p_{1},p_{2},q_{1},q_{2}\right)\in\mathbb{R}^{4} represents the momenta and the coordinates of the infinitesimal body with respect to the co-rotating frame,

(3.2) V(q1,q2)=q12+q222+1μr1+μr2,r1=((q1μ)2+q22)1/2,r2=((q1+1μ)2+q22)1/2.\begin{split}V(q_{1},q_{2})&=\frac{q_{1}^{2}+q_{2}^{2}}{2}+\frac{1-\mu}{r_{1}}+\frac{\mu}{r_{2}},\\ r_{1}&=\left((q_{1}-\mu)^{2}+q_{2}^{2}\right)^{1/2},\\ r_{2}&=\left((q_{1}+1-\mu)^{2}+q_{2}^{2}\right)^{1/2}.\end{split}

Above V(q1,q2)V(q_{1},q_{2}) represents the effective potential, and r1(t)r_{1}(t), r2(t)r_{2}(t) represent the distances from the infinitesimal body to the masses 1μ1-\mu and μ\mu, respectively. The phase space

M={(p,q)4|q(μ,0), and q(1+μ,0)}M=\{(p,q)\in\mathbb{R}^{4}\,|\,q\neq(\mu,0),\textrm{ and }q\neq(-1+\mu,0)\}

is endowed with the symplectic form

Ω=dp1dq1+dp2dq2.\Omega=dp_{1}\wedge dq_{1}+dp_{2}\wedge dq_{2}.

Note that the phase space MM is not compact.

The equations of motion of the infinitesimal body are given by the Hamilton equations corresponding to (3.1), that is

(3.3) ddtz=JH0(z),\frac{d}{dt}{z}=J\nabla H_{0}(z),

where z=z(p1,p2,q1,q2)z=z(p_{1},p_{2},q_{1},q_{2}) and JJ represents the standard almost complex structure.

The Hamiltonian H0H_{0} is an integral of motion, so the flow Φ0t\Phi^{t}_{0} of (3.3) leaves invariant each energy hyper-surface

(3.4) Mh:=H01(h)={(p1,p2,q1,q2)M|H0(p1,p2,q1,q2)=h}.M_{h}:=H_{0}^{-1}(h)=\{(p_{1},p_{2},q_{1},q_{2})\in M\,|\,H_{0}(p_{1},p_{2},q_{1},q_{2})=h\}.

The system has three equilibrium points L1,L2,L3L_{1},L_{2},L_{3} located on the q1q_{1}-axis, and two other equilibrium points L4L_{4}, L5L_{5}, each lying in the (q1,q2)(q_{1},q_{2})-plane and forming an equilateral triangle with the primaries. These are known as the Lagrange points. In our convention, L1L_{1} is located between the primaries, L2L_{2} is on the side of the lighter primary, and L3L_{3} is on the side of the heavier primary. The linearized stability of L1,L2,L3L_{1},L_{2},L_{3} is of center-saddle type. The linearized stability of L4L_{4}, L5L_{5} is of center-center type, for μ\mu less than some critical value μcr\mu_{\textrm{cr}}.

We note that condition (A-i) is satisfied for each of the equilibrium points L1,L2,L3L_{1},L_{2},L_{3}.

For i=1,2,3i=1,2,3, for each energy level hh, with H(Li)<hH(L_{i})<h and hh sufficiently close to H(Li)H(L_{i}), there exists a unique periodic orbit λ0(h)\lambda_{0}(h) near the equilibrium point LiL_{i}, which is referred to as a Lyapunov orbit. The existence of such periodic orbits follows from the Lyapunov Center Theorem (see, e.g., [Mos58]). Moreover, there exists a neighborhood of LiL_{i} in the phase space where the Hamiltonian H0H_{0} can be written in a normal form relative to some suitable coordinates (I,θ,y,x)(I,\theta,y,x); see Section 4.1.

Each Lyapunov orbit is hyperbolic in the energy surface, so it has associated 22-dimensional unstable and stable manifolds denoted Wu(λ0(h))W^{\rm u}(\lambda_{0}(h)) and Ws(λ0(h))W^{\rm s}(\lambda_{0}(h)).

Numerical evidence shows that these periodic orbits can be continued for energy levels h>H(Li)h>H(L_{i}) that are not necessarily close to H(Li)H(L_{i}) (see, e.g., [Bro68]).

Normally hyperbolic invariant manifold for the unperturbed system. For an energy range hDh\in D sufficiently close to the energy of LiL_{i}, the family of Lyapunov orbits

(3.5) Λ0=hDλ0(h),\Lambda_{0}=\bigcup_{h\in D}\lambda_{0}(h),

defines a 22-dimensional NHIM with boundary for the Hamiltonian flow of (3.3). The NHIM carries the symplectic structure ΩΛ0\Omega_{\mid\Lambda_{0}}, and it can be described in terms of the action-angle coordinates (I,θ)(I,\theta). The action I=IhI=I_{h} is uniquely defined by the energy hh, and θ\theta is symplectically conjugate with II with respect to ΩΛ0\Omega_{\mid\Lambda_{0}}. The variable II is a first integral along the trajectories of the flow on Λ0\Lambda_{0}, and the action level sets are in fact the Lyapunov orbits λ0(h)\lambda_{0}(h). The motion restricted to each Lyapunov orbit is a rigid rotation in the variable θ\theta, with the frequency depending on the energy level.

The NHIM Λ0\Lambda_{0} and its unstable and stable manifolds Wu(Λ0)W^{\rm u}(\Lambda_{0}) and Ws(Λ0)W^{\rm s}(\Lambda_{0}) have simple descriptions in terms of the normal form coordinates (I,θ,y,x)(I,\theta,y,x) in a neighborhood of L1L_{1}: Λ0\Lambda_{0} corresponds to x=y=0x=y=0, Wu(Λ0)W^{\rm u}(\Lambda_{0}) corresponds to y=0y=0, and Ws(Λ0)W^{\rm s}(\Lambda_{0}) corresponds to x=0x=0.

Homoclinic connections. We first focus on the dynamics around the equilibrium point L1L_{1}. There are analytic arguments (see [LMS85]) showing that, for a discrete set of values of μ\mu that are sufficiently small, and for each hh sufficiently close to H(L1)H(L_{1}), the branches of Wu(λ0(h))W^{\rm u}(\lambda_{0}(h)) and Ws(λ0(h))W^{\rm s}(\lambda_{0}(h)) on the side of the heavier primary do not collide with the primary and intersect transversally along some homoclinic orbit γ0(h)\gamma_{0}(h), not necessarily unique. Numerical evidence shows that, for a large range of values of masses μ\mu and energies hh, the branches of Wu(λ0(h))W^{\rm u}(\lambda_{0}(h)) and Ws(λ0(h))W^{\rm s}(\lambda_{0}(h)) on the side of each primary do not collide with either primary and intersect transversally; see, e.g. [KLMR00, Cap12].

Each choice of a transverse homoclinic orbit γ0(h)\gamma_{0}(h) can be continued in energy hh to a family of such homoclinic orbits, which forms a homoclinic manifold hDγ0(h)\bigcup_{h\in D}\gamma_{0}(h). Moreover, we can ensure that Wu(λ0(h))W^{\rm u}(\lambda_{0}(h)) and Ws(λ0(h))W^{\rm s}(\lambda_{0}(h)) are contained inside some compact subset 𝒦\mathscr{K} of the phase space. One can always restrict to a submanifold

(3.6) Γ0hDγ0(h),\Gamma_{0}\subseteq\bigcup_{h\in D}\gamma_{0}(h),

that is a homoclinic channel.

In this way, we can ensure condition (A-ii).

The remaining condition is (A-iii), which consists of (A-iii-a) and (A-iii-b). These are explicit non-degeneracy conditions that can separately be verified numerically.

In this way, for the PCRTBP, we can verify the existence of the geometric structures of interest and the corresponding conditions (A-i), (A-ii), (A-iii) from Section 2.1.

Heteroclinic connections. We now focus on the dynamics around the equilibrium points L1L_{1} and L2L_{2}. They satisfy condition (A’-i). For energy levels hh with H(L1)hH(L_{1})\lesssim h, there exists a family λ01(h)\lambda^{1}_{0}(h) of Lyapunov orbits near L1L_{1}, and for H(L2)hH(L_{2})\lesssim h there exists a family λ02(h)\lambda^{2}_{0}(h) of Lyapunov orbits near L2L_{2}. Moreover, there exist normal form coordinates (I1,θ1,y1,x1)(I^{1},\theta^{1},y^{1},x^{1}) and (I2,θ2,y2,x2)(I^{2},\theta^{2},y^{2},x^{2}) defined around L1L_{1} and L2L_{2}, respectively, for some suitable energy ranges. These normal form coordinates are as in Section 4.1.

Numerical evidence shows that families of periodic orbits near L1L_{1} and L2L_{2} can exist simultaneously, for some energy range. Therefore, we consider an interval of energies DD such that, for hDh\in D we have the following: there exists families of periodic orbits λ01(h)\lambda^{1}_{0}(h) near L1L_{1}, and λ02(h)\lambda^{2}_{0}(h) near L2L_{2}, the following sets

(3.7) Λ01=hDλ01(h),Λ02=hDλ02(h),\begin{split}\Lambda^{1}_{0}&=\bigcup_{h\in D}\lambda^{1}_{0}(h),\\ \Lambda^{2}_{0}&=\bigcup_{h\in D}\lambda^{2}_{0}(h),\end{split}

are NHIM’s for the Hamiltonian flow of (3.3), and the normal form coordinates (I1,θ1,y1,x1)(I^{1},\theta^{1},y^{1},x^{1}) and (I2,θ2,y2,x2)(I^{2},\theta^{2},y^{2},x^{2}) are defined in some neighborhoods of Λ01\Lambda^{1}_{0} and Λ02\Lambda^{2}_{0}, respectively.

Numerical evidence also shows that there exist transverse heteroclinic connections determined by Wu(λ01(h))Ws(λ02(h))W^{{\rm u}}(\lambda^{1}_{0}(h))\cap W^{{\rm s}}(\lambda^{2}_{0}(h)) and Wu(λ02(h))Ws(λ01(h))W^{{\rm u}}(\lambda^{2}_{0}(h))\cap W^{{\rm s}}(\lambda^{1}_{0}(h)) for certain ranges of energies. See, e.g., [KLMR00, WZ03, CM06, BGT10]. In either case, we denote the corresponding family of heteroclinic orbits by γ0(h)\gamma_{0}(h). We consider a range of energies hDh\in D for which this additional condition on the existence of transverse heteroclinic connections is satisfied.

The transverse intersection of the unstable manifold Wu(Λ01)W^{\rm u}(\Lambda^{1}_{0}) with the stable manifold Ws(Λ02)W^{\rm s}(\Lambda^{2}_{0}) define a heteroclinic manifold hDγ0(h)\bigcup_{h\in D}\gamma_{0}(h), which depends on the choice of the family of heteroclinic orbits γ0(h)\gamma_{0}(h). One can always restrict to a submanifold

(3.8) Γ0hDγ0(h),\Gamma_{0}\subseteq\bigcup_{h\in D}\gamma_{0}(h),

that is a heteroclinic channel.

If that is the case, the condition (A’-ii) is verified.

As in the case of homoclinic connections, the remaining condition (A’-iii) consists of two explicit non-degeneracy conditions that can be verified numerically.

Thus, one can verify numerically the conditions (A’-i), (A’-ii), (A’-iii), from Section 2.5.

It would be of interest to verify if Theorem 2.1 can be applied when the Lagrange point L3L_{3} is also considered; some numerical results concerning the dynamics around L3L_{3} can be found in [BO06, CCP16].

In summary, in this section we have outlined how the conditions of the Theorem 2.1 can be verified in the PCRTBP. The theoretical results of Theorem 2.1 are independent on the application to the PCRTBP.

4. Coordinate systems and evolution equations

4.1. New coordinate systems for the unperturbed system

We consider the case of homoclinic connections described by conditions (A-i), (A-ii), (A-iii). Under these assumptions, the manifolds Wu(Λ0)W^{\rm u}(\Lambda_{0}) and Ws(Λ0)W^{\rm s}(\Lambda_{0}) intersect transversally along the homoclinic channel Γ0\Gamma_{0}.

The next proposition states that, in a neighborhood of each Wu(Λ0)W^{\rm u}(\Lambda_{0}) and Ws(Λ0)W^{\rm s}(\Lambda_{0}), there exists a system of symplectic coordinates such that Λ0\Lambda_{0}, Wu(Λ0)W^{\rm u}(\Lambda_{0}) and Ws(Λ0)W^{\rm s}(\Lambda_{0}) have very simple descriptions, and moreover the unperturbed Hamiltonian H0H_{0} can be written in a normal form relative to the corresponding coordinates. As before, for zWs,u(Λ0)z\in W^{{\rm s},{\rm u}}(\Lambda_{0}), we denote z±=Ω±(z)z^{\pm}=\Omega^{\pm}(z).

Proposition 4.1 (Normal Forms).

There exist three systems of symplectic coordinates111coordinates obtained from (p,q)(p,q) via a canonical transformation, defined for some range of energies hDh\in D, as follows:

  • (N)

    A coordinate system (I,θ,y,x)(I,\theta,y,x) in a neighborhood 𝒩\mathscr{N} of Λ0\Lambda_{0} such that for z𝒩z\in\mathcal{N} we have:

    • (N-i)

      zΛ0z\in\Lambda_{0} if and only if x(z)=y(z)=0x(z)=y(z)=0;

    • (N-ii)

      zWu(Λ0)z\in W^{\rm u}(\Lambda_{0}) if and only if y=0y=0, and zWs(Λ0)z\in W^{\rm s}(\Lambda_{0}) if and only if x=0x=0;

    • (N-iii)

      for zWu(Λ0)z\in W^{\rm u}(\Lambda_{0}) we have I(z)=I(z)I(z)=I(z^{-}) and θ(z)=θ(z)\theta(z)=\theta(z^{-}), and for zWs(Λ0)z\in W^{\rm s}(\Lambda_{0}) we have I(z)=I(z+)I(z)=I(z^{+}) and θ(z)=θ(z+)\theta(z)=\theta(z^{+});

    • (N-iv)

      H0H_{0} restricted to 𝒩\mathscr{N} can be written in a normal form

      (4.1) H0(p,q)=H0(I,xy)=h0(I)+(xy)g1(I)+(xy)2g2(I,xy)\begin{split}H_{0}(p,q)=&H_{0}(I,xy)\\ =&h_{0}\left(I\right)+(xy)g_{1}\left(I\right)+\left(xy\right)^{2}g_{2}(I,xy)\end{split}

      for some analytic functions h0=h0(I)h_{0}=h_{0}(I), g1=g1(I)g_{1}=g_{1}(I), and g2=g2(I,xy)g_{2}=g_{2}(I,xy).

  • (U)

    A coordinate system (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) in a neighborhood 𝒩u\mathscr{N}^{\rm u} of Wu(Λ0)W^{\rm u}(\Lambda_{0}) such that for z𝒩uz\in\mathcal{N}^{\rm u} we have:

    • (U-i)

      zΛ0z\in\Lambda_{0} if and only if xu(z)=yu(z)=0x^{\rm u}(z)=y^{\rm u}(z)=0;

    • (U-ii)

      zWu(Λ0)z\in W^{\rm u}(\Lambda_{0}) if and only if yu=0y^{{\rm u}}=0;

    • (U-iii)

      for zWu(Λ0)z\in W^{\rm u}(\Lambda_{0}) we have Iu(z)=Iu(z)I^{\rm u}(z)=I^{\rm u}(z^{-}), and θu(z)=θu(z)\theta^{\rm u}(z)=\theta^{\rm u}(z^{-});

    • (U-iv)

      H0H_{0} restricted to 𝒩u\mathscr{N}^{\rm u} can be written in a normal form

      (4.2) H0(p,q)=H0u(Iu,xuyu)=h0(Iu)+(xuyu)g1(Iu)+(xuyu)2g2(Iu,xuyu)\begin{split}H_{0}(p,q)=&H_{0}^{\rm u}(I^{\rm u},x^{\rm u}y^{\rm u})\\ =&h_{0}\left(I^{u}\right)+(x^{u}y^{u})g_{1}\left(I^{{\rm u}}\right)+\left(x^{{\rm u}}y^{{\rm u}}\right)^{2}g_{2}(I^{\rm u},x^{\rm u}y^{\rm u})\end{split}

      for some analytic functions h0=h0(Iu)h_{0}=h_{0}(I^{\rm u}), g1=g1(Iu)g_{1}=g_{1}(I^{\rm u}), and g2=g2(Iu,xuyu)g_{2}=g_{2}(I^{\rm u},x^{\rm u}y^{\rm u}).

  • (S)

    A coordinate system (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) in a neighborhood 𝒩s\mathscr{N}^{\rm s} of Ws(Λ0)W^{\rm s}(\Lambda_{0}) such that for z𝒩sz\in\mathcal{N}^{\rm s} we have:

    • (S-i)

      zΛ0z\in\Lambda_{0} if and only if xs(z)=ys(z)=0x^{\rm s}(z)=y^{\rm s}(z)=0;

    • (S-ii)

      zWs(Λ0)z\in W^{\rm s}(\Lambda_{0}) if and only if xs=0x^{{\rm s}}=0;

    • (S-iii)

      for zWs(Λ0)z\in W^{\rm s}(\Lambda_{0}) we have Is(z)=Is(z+)I^{\rm s}(z)=I^{\rm s}(z^{+}), and θs(z)=θs(z+)\theta^{\rm s}(z)=\theta^{\rm s}(z^{+});

    • (S-iv)

      H0H_{0} restricted to 𝒩s\mathscr{N}^{\rm s} can be written in a normal form

      (4.3) H0(p,q)=H0s(Is,xsys)=h0(Is)+(xsys)g1(Is)+(xsys)2g2(Is,xsys)\begin{split}H_{0}(p,q)=&H_{0}^{\rm s}(I^{\rm s},x^{\rm s}y^{\rm s})\\ =&h_{0}\left(I^{{\rm s}}\right)+(x^{{\rm s}}y^{{\rm s}})g_{1}\left(I^{{\rm s}}\right)+\left(x^{{\rm s}}y^{{\rm s}}\right)^{2}g_{2}(I^{\rm s},x^{\rm s}y^{\rm s})\end{split}

      for some analytic functions h0=h0(Is)h_{0}=h_{0}(I^{\rm s}), g1=g1(Is)g_{1}=g_{1}(I^{\rm s}), and g2=g2(Is,xsys)g_{2}=g_{2}(I^{\rm s},x^{\rm s}y^{\rm s}).

The coordinate systems (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) and (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) coincide with (I,θ,y,x)(I,\theta,y,x) on 𝒩\mathscr{N}, i.e.,

(4.4) (Iu,θu,yu,xu)=(Is,θs,ys,xs)=(I,θ,y,x) on 𝒩.(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u})=(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s})=(I,\theta,y,x)\textrm{ on }\mathscr{N}.

The function h0,g1,g2h_{0},g_{1},g_{2} that appear in (4.1), (4.2) and (4.3) are the same.

Proof.

Part (N) follows from [Gio01], so we will not give a detailed proof.

We only summarize the procedure to obtain the normal form in a neighborhood of a center-saddle equilibrium point for a 22-degrees of freedom Hamiltonian H0H_{0}. First, the Hamiltonian H0H_{0} is expanded in a Taylor series around that equilibrium point (shifted to the origin) as

H0(p,q)=H2(p,q)+H3(p,q)+H4(p,q)+,H_{0}(p,q)=H_{2}(p,q)+H_{3}(p,q)+H_{4}(p,q)+\ldots,

where Hj(p,q)H_{j}(p,q) is an homogeneous polynomial of degree jj in the variables (p1,p2,q1,q2)(p_{1},p_{2},q_{1},q_{2}). Then, by making a linear canonical change of coordinates (p,q)(x,y)(p,q)\mapsto(x,y), with the eigenvectors of the linearized system given by JH2(0,0)J\nabla H_{2}(0,0) as the axes of the new system, the quadratic part H2H_{2} of H0H_{0} can be written in the new coordinates (x,y)4(x,y)\in\mathbb{R}^{4} as

H2(x,y)=λx1y1+ω2(x22+y22),H_{2}(x,y)=\lambda x_{1}y_{1}+\frac{\omega}{2}(x_{2}^{2}+y_{2}^{2}),

where ±λ1=±λ\pm\lambda_{1}=\pm\lambda and ±λ2=±iω\pm\lambda_{2}=\pm i\omega are the eigenvalues of JH2(0,0)J\nabla H_{2}(0,0). Then, via another linear canonical change of coordinates

x1=ξ1,y1=η1,x2=ξ2+iη22,y2=iξ2+η22,x_{1}=\xi_{1},\quad y_{1}=\eta_{1},\quad x_{2}=\frac{\xi_{2}+i\eta_{2}}{\sqrt{2}},\quad\quad y_{2}=\frac{i\xi_{2}+\eta_{2}}{\sqrt{2}},

we obtain H2H_{2} written in complex variables as

H2N(ξ,η)=λξ1η1+iωξ2η2=λ1ξ1η1+λ2ξ2η2.H^{N}_{2}(\xi,\eta)=\lambda\xi_{1}\eta_{1}+i\omega\xi_{2}\eta_{2}=\lambda_{1}\xi_{1}\eta_{1}+\lambda_{2}\xi_{2}\eta_{2}.

The next step is to apply a sequence of changes of coordinates to kill all monomials for which the exponent of ξj\xi_{j} is different from the exponent of ηj\eta_{j}. Since the eigenvalue λ1=λ\lambda_{1}=\lambda is real and λ2=iω\lambda_{2}=i\omega is imaginary, there are no small divisors. The only possible source of divergence is due to the use of Cauchy’s estimates for the derivatives required by the normalization procedure. In [Gio01] the accumulation of derivatives is controlled via a KAM technique. The process can be continued to any order.

Thus, in the limit H0H_{0} can be written as an expansion

H0N(ξ,η)=H2N(ξ1η1,ξ2η2)+H3N(ξ1η1,ξ2η2)+H4N(ξ1η1,ξ2η2)+,H^{N}_{0}(\xi,\eta)=H^{N}_{2}(\xi_{1}\eta_{1},\xi_{2}\eta_{2})+H^{N}_{3}(\xi_{1}\eta_{1},\xi_{2}\eta_{2})+H^{N}_{4}(\xi_{1}\eta_{1},\xi_{2}\eta_{2})+\ldots,

where HjH_{j} is an homogeneous polynomial in ξ1η1,ξ2η2\xi_{1}\eta_{1},\xi_{2}\eta_{2} of degree jj. The series expansion of H0NH_{0}^{N} is convergent in a neighborhood 𝒩¯\bar{\mathscr{N}} of the origin, and the coordinate change x=x(ξ,η)x=x(\xi,\eta), y=y(ξ,η)y=y(\xi,\eta) is canonical and given in terms of convergent series.

There exist periodic orbits λ0(h)\lambda_{0}(h) around the equilibrium point for all energy levels hh sufficiently close to the energy level of the equilibrium point. This implies that the NHIM Λ¯0=hD¯λ0(h)\bar{\Lambda}_{0}=\bigcup_{h\in\bar{D}}\lambda_{0}(h) is contained in 𝒩¯\bar{\mathscr{N}}, for some suitable energy range hD¯h\in\bar{D}.

To express H0NH_{0}^{N} in action-angle coordinates one applies the canonical transformation

ξ2=I¯exp(iθ¯),η2=iI¯exp(iθ¯).\xi_{2}=\sqrt{\bar{I}}\exp(i\bar{\theta}),\quad\eta_{2}=-i\sqrt{\bar{I}}\exp(-i\bar{\theta}).

Finally, denote x¯=ξ1\bar{x}=\xi_{1}, y¯=η1\bar{y}=\eta_{1}. We obtain the normal form

(4.5) H0N(I¯,x¯y¯)=λx¯y¯+ωI¯+H3(I¯,x¯y¯)+H4(I¯,x¯y¯)+.H_{0}^{N}(\bar{I},\bar{x}\bar{y})=\lambda\bar{x}\bar{y}+{\omega}\bar{I}+H_{3}(\bar{I},\bar{x}\bar{y})+H_{4}(\bar{I},\bar{x}\bar{y})+\ldots.

Moreover, in these coordinates the following hold:

  • (i)

    The normally hyperbolic invariant manifold Λ¯0\bar{\Lambda}_{0} is given by x¯=y¯=0\bar{x}=\bar{y}=0, and each periodic orbit in Λ¯0\bar{\Lambda}_{0} corresponds to a level set of I¯\bar{I};

  • (ii)

    The local unstable invariant manifold W𝒩¯u(Λ¯0)W^{\rm u}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0}) is given by y¯=0\bar{y}=0;

  • (iii)

    The local stable invariant manifold W𝒩¯s(Λ¯0)W^{\rm s}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0}) is given by x¯=0\bar{x}=0.

The equations of motion are

(4.6) ddtI¯=0,ddtθ¯=ω+H3NI¯+H4NI¯+,ddty¯=λy¯H3Nx¯H4Nx¯+,ddtx¯=λx¯+H3Ny¯+H4Ny¯+.\begin{split}\frac{d}{dt}\bar{I}=&0,\\ \frac{d}{dt}\bar{\theta}=&\omega+\frac{\partial H_{3}^{N}}{\partial\bar{I}}+\frac{\partial H_{4}^{N}}{\partial\bar{I}}+\ldots,\\ \frac{d}{dt}\bar{y}=&-\lambda\bar{y}-\frac{\partial H_{3}^{N}}{\partial\bar{x}}-\frac{\partial H_{4}^{N}}{\partial\bar{x}}+\ldots,\\ \frac{d}{dt}\bar{x}=&\lambda\bar{x}+\frac{\partial H_{3}^{N}}{\partial\bar{y}}+\frac{\partial H_{4}^{N}}{\partial\bar{y}}+\ldots.\end{split}

Note that Hamiltonian H0NH_{0}^{N} on 𝒩¯\bar{\mathscr{N}} has two first integrals I¯\bar{I} and x¯y¯\bar{x}\bar{y}, which are independent and in involution.

This implies that, if zW𝒩¯u(Λ¯0)z\in W^{\rm u}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0}) (resp. zW𝒩¯s(Λ¯0)z\in W^{\rm s}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0})), since x¯y¯=0\bar{x}\bar{y}=0, we have I¯(z)=I¯(z)\bar{I}(z)=\bar{I}(z^{-}) (resp. I¯(z)=I¯(z+)\bar{I}(z)=\bar{I}(z^{+})).

The Hamiltonian H0NH_{0}^{N} restricted to Λ¯0\bar{\Lambda}_{0} is given by

h0(I¯):=ωI¯+H3N(I¯)+H4N(I¯)+,h_{0}(\bar{I}):={\omega}\bar{I}+H_{3}^{N}(\bar{I})+H_{4}^{N}(\bar{I})+\ldots,

where we denote H0N(I¯,0)=h0(I¯)H^{N}_{0}(\bar{I},0)=h_{0}(\bar{I}), and HjN(I¯,0)=HjN(I¯)H^{N}_{j}(\bar{I},0)=H^{N}_{j}(\bar{I}) for all jj. Each Lyapunov orbit λh\lambda_{h} in Λ¯0\bar{\Lambda}_{0} corresponds to a unique level set of I¯\bar{I}, so we can write λ0(h)=λ0(I¯)\lambda_{0}(h)=\lambda_{0}(\bar{I}).

By truncating the expansion (4.5) at the first order we can write

(4.7) H0N(I¯,θ¯,y¯,x¯)=h0(I¯)+(x¯y¯)g1(I¯)+(x¯y¯)2g2(I¯,x¯y¯),H_{0}^{N}(\bar{I},\bar{\theta},\bar{y},\bar{x})=h_{0}(\bar{I})+(\bar{x}\bar{y})g_{1}(\bar{I})+(\bar{x}\bar{y})^{2}g_{2}(\bar{I},\bar{x}\bar{y}),

for some analytic functions h0=h0(I¯)h_{0}=h_{0}(\bar{I}), g1=g1(I¯,x¯y¯)g_{1}=g_{1}(\bar{I},\bar{x}\bar{y}), and g2=g2(I¯,x¯y¯)g_{2}=g_{2}(\bar{I},\bar{x}\bar{y}).

For points zW𝒩¯u(Λ¯0)z\in W^{\rm u}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0}) or zW𝒩¯s(Λ¯0)z\in W^{\rm s}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0}), since x¯y¯=0\bar{x}\bar{y}=0, we have

ddtθ(z)=h0I¯(I¯(z)).\frac{d}{dt}\theta(z)=\frac{\partial h_{0}}{\partial\bar{I}}(\bar{I}(z)).

This implies that if zW𝒩¯u(Λ¯0)z\in W^{\rm u}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0}) (resp. zW𝒩¯s(Λ¯0)z\in W^{\rm s}_{\bar{\mathscr{N}}}(\bar{\Lambda}_{0})), we have θ¯(z)=θ¯(z)\bar{\theta}(z)=\bar{\theta}(z^{-}) (resp. θ¯(z)=θ¯(z+)\bar{\theta}(z)=\bar{\theta}(z^{+})).

The coordinate system (I¯,θ¯,y¯,x¯)(\bar{I},\bar{\theta},\bar{y},\bar{x}) constructed above is not yet the coordinate system from part (N). The desired coordinate system (I,θ,y,x)(I,\theta,y,x) will be constructed below by flowing in time the coordinate system (I¯,θ¯,y¯,x¯)(\bar{I},\bar{\theta},\bar{y},\bar{x}).

Now we construct the coordinate system claimed in part (U).

We extend the coordinate system (I¯,θ¯,y¯,x¯)(\bar{I},\bar{\theta},\bar{y},\bar{x}) along the flow to a neighborhood 𝒩u\mathscr{N}^{\rm u} of Wu(Λ¯0)W^{\rm u}(\bar{\Lambda}_{0}), up to a neighborhood of the homoclinic manifold, as follows. Let T>0T>0 be a time such that Φ0T(𝒩¯)Γ\Phi^{T}_{0}(\bar{\mathscr{N}})\supseteq\Gamma. Let 𝒩u:=Φ0T(𝒩¯)\mathscr{N}^{\rm u}:=\Phi^{T}_{0}(\bar{\mathscr{N}}). Each point z𝒩uz\in\mathscr{N}^{\rm u} is of the form z=Φ0T(ζ)z=\Phi^{T}_{0}(\zeta) with ζ𝒩¯\zeta\in\bar{\mathscr{N}}. We define the coordinates (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) of zz to be equal to the coordinates (I¯,θ¯,y¯,x¯)(\bar{I},\bar{\theta},\bar{y},\bar{x}) of ζ\zeta, or equivalently

(4.8) (Iu,θu,yu,xu)(z)=(I¯,θ¯,y¯,x¯)(Φ0T(z)).(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u})(z)=(\bar{I},\bar{\theta},\bar{y},\bar{x})(\Phi^{-T}_{0}(z)).

The restriction of the coordinates (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) to 𝒩¯𝒩u\bar{\mathscr{N}}\cap\mathscr{N}^{\rm u} is given by (I¯,θ¯,,y¯x¯)Φ0T(\bar{I},\bar{\theta},,\bar{y}\bar{x})\circ\Phi^{-T}_{0}.

Since the coordinates (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) of a point zz are the coordinates (I¯,θ¯,y¯,x¯)(\bar{I},\bar{\theta},\bar{y},\bar{x}) of Φ0T(z)\Phi^{-T}_{0}(z), then they are symplectic, and they yield the same normal form expansion of H0H_{0} as (4.7). More precisely,

H(Iu,θu,yu,xu)=H(I¯Φ0T,θ¯Φ0T,y¯Φ0T,x¯Φ0T)=h0(I¯Φ0T)+((x¯Φ0T)(y¯Φ0T))g1(I¯Φ0T)+((x¯Φ0T)(y¯Φ0T))2g2((I¯Φ0T,x¯Φ0T)(y¯Φ0T))=h0(Iu)+(xuyu)g1(I)+(xuyu)2g2(Iu,xuyu).\begin{split}H(I^{\rm u},\theta^{\rm u},&y^{\rm u},x^{\rm u})\\ =&H(\bar{I}\circ\Phi^{-T}_{0},\bar{\theta}\circ\Phi^{-T}_{0},\bar{y}\circ\Phi^{-T}_{0},\bar{x}\circ\Phi^{-T}_{0})\\ =&h_{0}(\bar{I}\circ\Phi^{-T}_{0})\\ &+\left((\bar{x}\circ\Phi^{-T}_{0})\cdot(\bar{y}\circ\Phi^{-T}_{0})\right)g_{1}(\bar{I}\circ\Phi^{-T}_{0})\\ &+\left((\bar{x}\circ\Phi^{-T}_{0})\cdot(\bar{y}\circ\Phi^{-T}_{0})\right)^{2}g_{2}\left((\bar{I}\circ\Phi^{-T}_{0},\bar{x}\circ\Phi^{-T}_{0})\cdot(\bar{y}\circ\Phi^{-T}_{0})\right)\\ =&h_{0}(I^{\rm u})+(x^{\rm u}y^{\rm u})g_{1}(I)+(x^{\rm u}y^{\rm u})^{2}g_{2}(I^{\rm u},x^{\rm u}y^{\rm u}).\end{split}

Now we construct the coordinate system claimed in part (S).

We extend the coordinate system (I,θ,y,x)(I,\theta,y,x) along the flow to a neighborhood 𝒩s\mathscr{N}^{\rm s} of Ws(Λ¯0)W^{\rm s}(\bar{\Lambda}_{0}), up to the homoclinic manifold, as follows. Start with the coordinates (I¯,θ¯,y¯,x¯)Φ02T(\bar{I},\bar{\theta},\bar{y},\bar{x})\circ\Phi^{-2T}_{0} defined in the neighborhood Φ02T(𝒩¯)\Phi^{-2T}_{0}(\bar{\mathscr{N}}) of Λ¯0\bar{\Lambda}_{0}. Let 𝒩s:=Φ03T(𝒩¯)\mathscr{N}^{\rm s}:=\Phi^{-3T}_{0}(\bar{\mathscr{N}}). Each point z𝒩sz\in\mathscr{N}^{\rm s} is of the form z=Φ0T(ζ)z=\Phi^{-T}_{0}(\zeta) with ζΦ02T(𝒩¯)\zeta\in\Phi^{-2T}_{0}(\bar{\mathscr{N}}). We define the coordinates (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) of zz to be equal to the coordinates (I¯,θ¯,y¯,x¯)Φ02T(\bar{I},\bar{\theta},\bar{y},\bar{x})\circ\Phi^{-2T}_{0} of ζ\zeta, or equivalently

(4.9) (Is,θs,ys,xs)(z)=(I¯,θ¯,y¯,x¯)(Φ02T(Φ0T(z)))=(I¯,θ¯,y¯,x¯)(Φ0T(z)).(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s})(z)=(\bar{I},\bar{\theta},\bar{y},\bar{x})(\Phi^{-2T}_{0}(\Phi^{T}_{0}(z)))=(\bar{I},\bar{\theta},\bar{y},\bar{x})(\Phi^{-T}_{0}(z)).

Since the coordinates (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) of a point zz are the coordinates (I¯,θ¯,y¯,x¯)Φ02T(\bar{I},\bar{\theta},\bar{y},\bar{x})\circ\Phi^{-2T}_{0} of Φ0T(z)\Phi^{T}_{0}(z), then they are symplectic, and they yield the same normal for expansion of H0H_{0} as in (4.7), that is

H𝒩sN(Is,θs,ys,xs)=h0(Is)+(xsys)g1(I)+(xsys)2g2(Is,xsys).H^{N}_{\mathscr{N}^{\rm s}}(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s})=h_{0}(I^{\rm s})+(x^{\rm s}y^{\rm s})g_{1}(I)+(x^{\rm s}y^{\rm s})^{2}g_{2}(I^{\rm s},x^{\rm s}y^{\rm s}).

The restriction of the coordinates (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) to 𝒩¯𝒩s\bar{\mathscr{N}}\cap\mathscr{N}^{\rm s} is given by (I¯,θ¯,y¯,x¯)Φ0T(\bar{I},\bar{\theta},\bar{y},\bar{x})\circ\Phi^{-T}_{0}.

Now we define the coordinate system claimed in part (N).

We construct the coordinate system (I,θ,y,x)(I,\theta,y,x) in a neighborhood 𝒩:=𝒩¯𝒩u𝒩s\mathscr{N}:=\bar{\mathscr{N}}\cap\mathscr{N}^{\rm u}\cap\mathscr{N}^{\rm s} by

(4.10) (I,θ,y,x)(z)=(I¯,θ¯,y¯,x¯)(Φ0T(z)).(I,\theta,y,x)(z)=(\bar{I},\bar{\theta},\bar{y},\bar{x})(\Phi^{-T}_{0}(z)).

In terms of the coordinates (I,θ,y,x)(I,\theta,y,x), the Hamiltonian has the same normal form expansion as in (4.7), that is

H𝒩N(I,θ,y,x)=h0(I)+(xy)g1(I)+(xy)2g2(I,xy).H^{N}_{\mathscr{N}}(I,\theta,y,x)=h_{0}(I)+(xy)g_{1}(I)+(xy)^{2}g_{2}(I,xy).

We note that 𝒩\mathscr{N} is a small neighborhood of the equilibrium point in the phase space. Thus we restrict the Hamiltonian to an energy range hDh\in D such that Λ0=hDλh𝒩\Lambda_{0}=\bigcup_{h\in D}\lambda_{h}\subseteq\mathscr{N}. Moreover, we choose DD such that Wu(Λ0)W^{\rm u}(\Lambda_{0}), Ws(Λ0)W^{\rm s}(\Lambda_{0}) are contained in 𝒦\mathscr{K}, where the set 𝒦\mathscr{K} is as in condition (A-ii).

By the above constructions, the coordinates (I,θ,y,x)(I,\theta,y,x), (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) and (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) satisfy the properties listed in Proposition 4.1. ∎

We now formulate some non-degeneracy condition, in terms of the above normal form coordinates, that H0H_{0} must satisfy for our results.

  • (A-iii-a)

    The Hamiltonian H0H_{0}, written in the normal form given by Proposition 4.1 satisfies:

    (4.11) h0I(I0)0 for all I0=I0(h) with hD,g1(I0)0 for all I0=I0(h) with hD.\begin{split}\frac{\partial h_{0}}{\partial I}(I_{0})\neq&0\textrm{ for all }I_{0}=I_{0}(h)\textrm{ with }h\in D,\\ g_{1}(I_{0})\neq&0\textrm{ for all }I_{0}=I_{0}(h)\textrm{ with }h\in D.\end{split}
Remark 4.2.

It is important to note that the coordinates (Iu,θu,xu,yu)(I^{\rm u},\theta^{\rm u},x^{\rm u},y^{\rm u}) and (Is,θs,xs,ys)(I^{\rm s},\theta^{\rm s},x^{\rm s},y^{\rm s}) do not generally agree at homoclinic points away from the Lyapunov orbit, where both coordinate systems are well defined. Nevertheless, for any homoclinic point zMhWu(Λ0)Ws(Λ0)z\in M_{h}\cap W^{\rm u}(\Lambda_{0})\cap W^{\rm s}(\Lambda_{0}), we have that Iu(z)=Is(z)=IhI^{\rm u}(z)=I^{\rm s}(z)=I_{h}.

Remark 4.3.

The above result on the existence of a convergent normal form in a neighborhood of a center-saddle point, obtained via a convergent canonical coordinate transformation, is valid for 2-degrees of freedom Hamiltonian systems. For higher degree of freedom Hamiltonian systems, the same result is true under some additional non-resonance conditions (see [Gio01]). A related approach to the normal form that we use here can be found in [Mos58]. A numerical methodology for the effective computations of normal forms is developed in [Jor99].

Remark 4.4.

Now we discuss the case when we have heteroclinic connections between two NHIMs Λ01\Lambda^{1}_{0} around L1L^{1} and Λ02\Lambda^{2}_{0} around L2L^{2}, as in Section 2.5. The manifolds Wu(Λ01)W^{\rm u}(\Lambda^{1}_{0}) and Ws(Λ02)W^{\rm s}(\Lambda^{2}_{0}) are assumed to intersect transversally.

The construction of the normal form coordinates from Proposition 4.1 only works in a small neighborhood of the equilibrium point. In the case of heteroclinic connections, since L1L^{1} and L2L^{2} are on different energy level, the theory does not guarantee the simultaneous existence two normal form coordinate system around L1L^{1} and L2L^{2} respectively, for some common energy range.

In this case, we need to make a separate assumption that there exist two systems of normal form coordinates around L1L^{1} and L2L^{2}, for some common energy range. Indeed, this assumption is already made in (A’-i-b).

Based on this assumption, we can construct, as in the proof of Proposition 4.1, two systems of coordinates

  • (I1,u,θ1,u,y1,u,x1,u)(I^{1,{\rm u}},\theta^{1,{\rm u}},y^{1,{\rm u}},x^{1,{\rm u}}) in a neighborhood 𝒩1,u𝒦\mathscr{N}^{1,{\rm u}}\subseteq\mathscr{K} of Wu(Λ01)W^{\rm u}(\Lambda^{1}_{0}),

  • (I2,s,θ2,s,y2,s,x2,s)(I^{2,{\rm s}},\theta^{2,{\rm s}},y^{2,{\rm s}},x^{2,{\rm s}}) in a neighborhood 𝒩2,s𝒦\mathscr{N}^{2,{\rm s}}\subseteq\mathscr{K} of Ws(Λ02)W^{\rm s}(\Lambda^{2}_{0}),

so that they satisfy properties similar to those in the case of homoclinic connections.

4.2. The scattering map for the unperturbed system

Consider the scattering map σ0{\sigma}_{0} associated to Γ0{\Gamma}_{0}. We will express the scattering map in terms of the coordinates (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) and (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}).

Consider a homoclinic point z0Wu(λh)Ws(λh)z_{0}\in W^{\rm u}(\lambda_{h})\cap W^{\rm s}(\lambda_{h}). Both coordinate systems (Iu,θu,yu,xu)(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u}) and (Is,θs,ys,xs)(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s}) are defined in the neighborhood of z0z_{0}.

By Proposition 4.1 the action coordinate of z0Γ0z_{0}\in{\Gamma}_{0} is the same as the action of the unstable and stable foot-points z0,z0+Λ0z_{0}^{-},z_{0}^{+}\in{\Lambda}_{0}, that is I(z0)=Iu(z0)=Iu(z0)=Is(z0)=Is(z0+)=I(z0+)I(z_{0}^{-})=I^{\rm u}(z_{0}^{-})=I^{\rm u}(z_{0})=I^{\rm s}(z_{0})=I^{\rm s}(z_{0}^{+})=I(z_{0}^{+}). Therefore the scattering map σ0{\sigma}_{0} preserves the II-coordinate. Hence σ0{\sigma}_{0} is a phase shift on II-level-sets in Λ0{\Lambda}_{0} wherever it is defined:

(4.12) σ0(I,θ)=(I,θ+Δ(I)).{\sigma}_{0}(I,\theta)=(I,\theta+\Delta(I)).

In general, θu(z0)θs(z0)\theta^{\rm u}(z_{0})\neq\theta^{\rm s}(z_{0}). It is easy to see that the phase-shift determined by the unperturbed scattering map is given by the ‘mismatch’ between the two angle coordinates.

Proposition 4.5.

Let hh be a fixed energy level and let z0Γ0Mhz_{0}\in\Gamma_{0}\cap M_{h}.

(i) The angle mismatch θs(z0)θu(z0)\theta^{\rm s}(z_{0})-\theta^{\rm u}(z_{0}) is a constant that depends only on hh, so we write it as θs(h)θu(h)\theta^{\rm s}(h)-\theta^{\rm u}(h).

(ii) The scattering map σ0\sigma_{0} is given by (I,θ)σ0(I,θ)=(I,θ+Δ(I))(I,\theta)\mapsto\sigma_{0}(I,\theta)=(I,\theta+\Delta(I)), where Δ(I)=θs(h)θu(h)\Delta(I)=\theta^{\rm s}(h)-\theta^{\rm u}(h) for I=IhI=I_{h}.

Proof.

(i) If z0z_{0} is a point in Γ0Mh\Gamma_{0}\cap M_{h}, then Γ0Mh\Gamma_{0}\cap M_{h} consists of points of the form Φ0t(z)\Phi^{t}_{0}(z), t[t1,t2]t\in[t_{1},t_{2}], for some interval [t1,t2][t_{1},t_{2}] containing 0.

From Ω+(z0)=z0+\Omega^{+}(z_{0})=z_{0}^{+} and Ω(z0)=z0\Omega^{-}(z_{0})=z_{0}^{-}, by the equivariance property (A.5) we have Ω+(Φ0t(z0))=Φ0t(z0+)\Omega^{+}(\Phi^{t}_{0}(z_{0}))=\Phi^{t}_{0}(z_{0}^{+}) and Ω(Φ0t(z0))=Φ0t(z0)\Omega^{-}(\Phi^{t}_{0}(z_{0}))=\Phi^{t}_{0}(z_{0}^{-}) for t[t1,t2]t\in[t_{1},t_{2}]. By By Proposition 4.1 (U-iii) and (S-iii), and by the fact that the dynamics restricted to λh\lambda_{h} is a rigid rotation in θ\theta, we have

θs(Φ0t(z0))θu(Φ0t(z0))=θ(Φ0t(z0+))θ(Φ0t(z0))=θ(z0+)θ(z0),\begin{split}\theta^{\rm s}(\Phi^{t}_{0}(z_{0}))-\theta^{\rm u}(\Phi^{t}_{0}(z_{0}))=&\theta(\Phi^{t}_{0}(z_{0}^{+}))-\theta(\Phi^{t}_{0}(z_{0}^{-}))\\ =&\theta(z_{0}^{+})-\theta(z_{0}^{-}),\end{split}

which depends only on the energy level hh.

(ii) In action-angle coordinates, if z0=(I,θ)z_{0}^{-}=(I,\theta), then σ0(z0)=z0+=(I,θ+Δ(I))\sigma_{0}(z_{0}^{-})=z_{0}^{+}=(I,\theta+\Delta(I)). From (i) we immediately deduce that Δ(I)=Δ(I(h))=θs(h)θu(h)\Delta(I)=\Delta(I(h))=\theta^{\rm s}(h)-\theta^{\rm u}(h). ∎

Turning now the attention to the unperturbed homoclinic channel Γ~0\tilde{\Gamma}_{0} in the extended phase space, this can be parametrized in terms of the coordinates (Iu,θu,t)(I^{\rm u},\theta^{\rm u},t), as well as in terms of the coordinates (Is,θs,t)(I^{\rm s},\theta^{\rm s},t):

(4.13) Γ~0={(Iu,θu,yu,xu,t)|yu=0,xu=x0u(Iu,θu,t)},={(Is,θs,xs,ys,t)|y0s=ys(Is,θs,t),xs=0}.\begin{split}\tilde{\Gamma}_{0}=&\{(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u},t)\,|\,y^{\rm u}=0,\,x^{\rm u}=x^{\rm u}_{0}(I^{\rm u},\theta^{\rm u},t)\},\\ =&\{(I^{\rm s},\theta^{\rm s},x^{\rm s},y^{\rm s},t)\,|\,y^{\rm s}_{0}=y^{\rm s}(I^{\rm s},\theta^{\rm s},t),\,x^{\rm s}=0\}.\end{split}

That is, Γ~0\tilde{\Gamma}_{0} is a graph over the (Iu,θu,t)(I^{\rm u},\theta^{\rm u},t)-variables, as well as a graph over the (Is,θs,t)(I^{\rm s},\theta^{\rm s},t)-variables. Each homoclinic point z~0Γ~0\tilde{z}_{0}\in\tilde{\Gamma}_{0} is associated to unique Iu=Is=I0I^{\rm u}=I^{\rm s}=I_{0}, θu\theta^{\rm u}, θs\theta^{\rm s} with θs(z~0)θu(z~0)=Δ(I0)\theta^{\rm s}(\tilde{z}_{0})-\theta^{\rm u}(\tilde{z}_{0})=\Delta(I_{0}). Moreover, for the points on the homoclinic orbit Φ~τ(z~0)\tilde{\Phi}^{\tau}(\tilde{z}_{0}), we also have that θs(Φ~τ(z~0))θu(Φ~τ(z~0))=Δ(I0)\theta^{\rm s}(\tilde{\Phi}^{\tau}(\tilde{z}_{0}))-\theta^{\rm u}(\tilde{\Phi}^{\tau}(\tilde{z}_{0}))=\Delta(I_{0}).

The corresponding scattering map is given by

σ~0(I,θ,t)=(I,θ+Δ(I),t).\tilde{\sigma}_{0}(I,\theta,t)=(I,\theta+\Delta(I),t).

Define

(4.14) 𝒟I0={z~0|θs(z~0)θu(z~0)=Δ(I0),Is(z~0)Iu(z~0)=0}.\begin{split}\mathscr{D}_{I_{0}}=&\{\tilde{z}_{0}\,|\,\theta^{\rm s}(\tilde{z}_{0})-\theta^{\rm u}(\tilde{z}_{0})=\Delta(I_{0}),\,I^{\rm s}(\tilde{z}_{0})-I^{\rm u}(\tilde{z}_{0})=0\}.\end{split}

We require the following condition

  • (A-iii-b)

    For all I0=I0(h)I_{0}=I_{0}(h) with hDh\in D, (Δ(I0),0)(\Delta(I_{0}),0) is a regular value of the function

    z~0(θs(z~0)θu(z~0),Is(z~0)Iu(z~0)).\tilde{z}_{0}\mapsto(\theta^{\rm s}(\tilde{z}_{0})-\theta^{\rm u}(\tilde{z}_{0}),I^{\rm s}(\tilde{z}_{0})-I^{\rm u}(\tilde{z}_{0})).

Condition (A-iii-b) implies that 𝒟\mathscr{D} is a codimension-22 surface in M~\widetilde{M}, which intersects transversally Γ~0\tilde{\Gamma}_{0} along a 11-dimensional curve.

At this point, we have completed the formulation of condition (A-iii) in Theorem 2.1, consisting of (A-iii-a), given in Section 4.1 and (A-iii-b), given above.

4.3. Perturbed evolution equations

In the sequel, we will identify the vector fields 𝒳0\mathcal{X}^{0} and 𝒳1\mathcal{X}^{1} with derivative operators acting on functions. In general, given a smooth vector field 𝒳\mathcal{X} and a smooth function ff on a manifold MM, and (zj)j{1,,dim(M)}(z_{j})_{j\in\{1,\ldots,\dim(M)\}} a system of local coordinates, then

(4.15) (𝒳f)(z)=j(𝒳)j(z)(zjf)(z).(\mathcal{X}f)(z)=\sum_{j}(\mathcal{X})_{j}(z)(\partial_{z_{j}}f)(z).

Consider one of the coordinate systems defined in Section 4.1. To simplify notation, we will denote such a coordinate system by (I,θ,x,y)(I,\theta,x,y). Below we provide evolution equations of these coordinates, expressing the time-derivative of each coordinate along a solution of the perturbed system. We include the expression for a general perturbation, as well as for the case when the perturbation is Hamiltonian:

(4.16) ddtI=(𝒳0+ε𝒳1)(I)=H0θ+ε𝒳1(I)=H0θεH1θ.\begin{split}\frac{d}{dt}I&=(\mathcal{X}^{0}+\varepsilon\mathcal{X}^{1})(I)=-\frac{\partial{H_{0}}}{\partial\theta}+\varepsilon\mathcal{X}^{1}(I)\\ &=-\frac{\partial{H_{0}}}{\partial\theta}-\varepsilon\frac{\partial{H_{1}}}{\partial\theta}.\end{split}
(4.17) ddtθ=(𝒳0+ε𝒳1)(θ)=H0I+ε𝒳1(θ)=H0I+εH1I.\begin{split}\frac{d}{dt}\theta&=(\mathcal{X}^{0}+\varepsilon\mathcal{X}^{1})(\theta)=\frac{\partial{H_{0}}}{\partial I}+\varepsilon\mathcal{X}^{1}(\theta)\\ &=\frac{\partial{H_{0}}}{\partial I}+\varepsilon\frac{\partial{H_{1}}}{\partial I}.\end{split}
(4.18) ddty=𝒳0(y)+ε𝒳1(y)=H0x+ε𝒳1(y)=H0xεH1x.\begin{split}\frac{d}{dt}y&=\mathcal{X}^{0}(y)+\varepsilon\mathcal{X}^{1}(y)=-\frac{\partial{H_{0}}}{\partial x}+\varepsilon\mathcal{X}^{1}(y)\\ &=-\frac{\partial{H_{0}}}{\partial x}-\varepsilon\frac{\partial{H_{1}}}{\partial x}.\end{split}
(4.19) ddtx=𝒳0(x)+ε𝒳1(x)=H0y+ε𝒳1(x)=H0y+εH1y.\begin{split}\frac{d}{dt}x&=\mathcal{X}^{0}(x)+\varepsilon\mathcal{X}^{1}(x)=\frac{\partial{H_{0}}}{\partial y}+\varepsilon\mathcal{X}^{1}(x)\\ &=\frac{\partial{H_{0}}}{\partial y}+\varepsilon\frac{\partial{H_{1}}}{\partial y}.\end{split}

5. Proof of the main result

In this section we prove Theorem 2.1.

5.1. Perturbed normally hyperbolic invariant manifolds

In this section we prove the assertions (i) and (ii) of Theorem 2.1.

We only give the details in the case when H0H_{0} satisfies the conditions (A-i), (A-ii), (A-iii). The case when H0H_{0} satisfies (A’-i), (A’-ii), (A’-iii) follows similarly.

5.1.1. Persistence of the normally hyperbolic invariant manifold under perturbation

We have that Λ0\Lambda_{0} is a NHIM for the flow Φ0t\Phi^{t}_{0} of 𝒳0\mathcal{X}^{0}. Then DΦ0t(z)D\Phi^{t}_{0}(z) satisfies expansion/contraction rates as in Appendix A, for all zΛ0z\in\Lambda_{0}, where we denote the constant and the expansion and contraction rates by CC, λ\lambda_{-}, λ+\lambda_{+}, λc\lambda_{c}, μc\mu_{c}, μ\mu_{-}, μ+\mu_{+}, respectively.

It is immediate that Λ~0=Λ0×{\tilde{\Lambda}}_{0}=\Lambda_{0}\times\mathbb{R} is a NHIM for the flow Φ~0τ\tilde{\Phi}^{\tau}_{0} of the extended system (2.6).

Under the assumptions of Theorem 2.1, 𝒳1=𝒳1(z,t;ε)\mathcal{X}^{1}=\mathcal{X}^{1}(z,t;\varepsilon) is uniformly differentiable in all variables. The theory of normally hyperbolic invariant manifolds, [Fen72, HPS77, Pes04] (a handy summary of the results of the theory is [DdlLS06]), asserts that there exists ε1\varepsilon_{1} such that the manifold Λ~0{\tilde{\Lambda}}_{0} persists as a normally hyperbolic manifold Λ~ε{\tilde{\Lambda}}_{\varepsilon}, for all |ε|<ε1|\varepsilon|<\varepsilon_{1}, which is locally invariant under the flow Φ~ετ\tilde{\Phi}^{\tau}_{\varepsilon}. The persistent NHIM Λ~ε{\tilde{\Lambda}}_{\varepsilon} is O(ε)O(\varepsilon) close in the 𝒞\mathcal{C}^{\ell}-topology to Λ~0{\tilde{\Lambda}}_{0}, where \ell is as in A.3. The locally invariant manifolds are in fact invariant manifolds for an extended system, and they depend on the extension. Hence, they do not need to be unique. Nevertheless, we point out that, given a family of systems, it is possible to choose the invariant manifolds in such a way that the invariant manifods depend smoothly on parameters, as well as the stable and unstable bundles and the stable and unstable manifolds.

For the perturbed NHIM Λ~ε{\tilde{\Lambda}}_{\varepsilon}, |ε|<ε1|\varepsilon|<\varepsilon_{1}, there exists an invariant splitting of the tangent bundle TΛ~εT{\tilde{\Lambda}}_{\varepsilon}, similar to that in (A.1), so that DΦ~ετ(z~)D\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}) satisfies expansion/contraction relations similar to those in (A.2) for all z~Λ~ε\tilde{z}\in\tilde{\Lambda}_{\varepsilon}, for some constants C~\tilde{C}, λ~\tilde{\lambda}_{-}, λ~+\tilde{\lambda}_{+}, λ~c\tilde{\lambda}_{c}, μ~c\tilde{\mu}_{c}, μ~\tilde{\mu}_{-}, μ~+\tilde{\mu}_{+}. These constants are independent of ε\varepsilon, and can be chosen as close as desired to the unperturbed ones, that is, to CC, λ\lambda_{-}, λ+\lambda_{+}, λc\lambda_{c}, μc\mu_{c}, μ\mu_{-}, μ+\mu_{+}, respectively, by choosing ε1\varepsilon_{1} suitably small.

There exist unstable and stable manifolds Wu(Λ~ε)W^{\rm u}({\tilde{\Lambda}}_{\varepsilon}), Ws(Λ~ε)W^{\rm s}({\tilde{\Lambda}}_{\varepsilon}) associated to Λ~ε{\tilde{\Lambda}}_{\varepsilon}, and there exist corresponding projection maps Ω:Wu(Λ~ε)Λ~ε\Omega^{-}:W^{\rm u}({\tilde{\Lambda}}_{\varepsilon})\to{\tilde{\Lambda}}_{\varepsilon}, and Ω+:Ws(Λ~ε)Λ~ε\Omega^{+}:W^{\rm s}({\tilde{\Lambda}}_{\varepsilon})\to{\tilde{\Lambda}}_{\varepsilon}.

For z~+=Ω+(z~)\tilde{z}^{+}=\Omega^{+}(\tilde{z}), with z~Ws(Λ~ε)\tilde{z}\in W^{\rm s}({\tilde{\Lambda}}_{\varepsilon}) we have

(5.1) d(Φ~ετ(z~),Φ~ετ(z~+))Cz~eτλ~+, for all τ0,d(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}),\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}))\leq C_{\tilde{z}}e^{\tau\tilde{\lambda}_{+}},\quad\textrm{ for all }\tau\geq 0,

and for z~=Ω(z~)\tilde{z}^{-}=\Omega^{-}(\tilde{z}), with z~Wu(Λ~ε)\tilde{z}\in W^{\rm u}({\tilde{\Lambda}}_{\varepsilon}) we have

(5.2) d(Φ~ετ(z~),Φ~ετ(z~)Cz~eτμ~, for all τ0,d(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}),\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{-})\leq C_{\tilde{z}}e^{\tau\tilde{\mu}_{-}},\quad\textrm{ for all }\tau\leq 0,

for some Cz~>0{C}_{\tilde{z}}>0. The constant C~z~\tilde{C}_{\tilde{z}} can be chosen uniformly bounded, provided that we restrict to zz to the compact neighborhood 𝒦\mathscr{K} given by (A-ii), and we use the fact that 𝒳1=𝒳1(z,t;ε)\mathcal{X}^{1}=\mathcal{X}^{1}(z,t;\varepsilon) is uniformly differentiable in all variables.

To simplify notation, from now on we will drop the symbol ~\tilde{} from C~\tilde{C}, C~z~\tilde{C}_{\tilde{z}} λ~\tilde{\lambda}_{-}, λ~+\tilde{\lambda}_{+}, μ~\tilde{\mu}_{-}, μ~+\tilde{\mu}_{+}, λ~c\tilde{\lambda}_{c}, μ~c\tilde{\mu}_{c}.

In the sequel, we will fix a choice Λ~ε{\tilde{\Lambda}}_{\varepsilon}, and all computations will be performed relative to that choice. Nevertheless the estimate for the perturbed scattering map σ~ε\tilde{\sigma}_{\varepsilon} are independent of the choice of the locally invariant manifold Λ~ε{\tilde{\Lambda}}_{\varepsilon}.

5.1.2. Persistence of the transverse intersection of the hyperbolic invariant manifolds under perturbation

For the unperturbed system the unstable and stable manifolds Wu(Λ~0)W^{\rm u}({\tilde{\Lambda}}_{0}), Ws(Λ~0)W^{\rm s}({\tilde{\Lambda}}_{0}) intersect transversally along the 33-dimensional homoclinic channel Γ~0\tilde{\Gamma}_{0}. By the persistence of transversality under small perturbations, it follows that Wu(Λ~ε)W^{\rm u}({\tilde{\Lambda}}_{\varepsilon}), Ws(Λ~ε)W^{\rm s}({\tilde{\Lambda}}_{\varepsilon}) intersect transversally along Γ~ε\tilde{\Gamma}_{\varepsilon}, for all |ε|<ε1|\varepsilon|<\varepsilon_{1}, provided ε1\varepsilon_{1} is chosen small enough. The condition (B.1) in the definition of a homoclinic/heteroclinic channel is also a transversality-type condition, so it is also persistent under small perturbations. We conclude that (i) and (ii) from Theorem 2.1 hold true for all |ε|<ε1|\varepsilon|<\varepsilon_{1}, provided ε1\varepsilon_{1} is chosen small enough.

Recall from Section 4.2 that the unperturbed homoclinic channel Γ~0\tilde{\Gamma}_{0} can be described as a graph over the (Iu,θu,t)(I^{\rm u},\theta^{\rm u},t)-variables, as well as a graph over the (Is,θs,t)(I^{\rm s},\theta^{\rm s},t)-variables. Therefore, the perturbed homoclinic channel Γ~ε\tilde{\Gamma}_{\varepsilon}, for |ε|<ε1|\varepsilon|<\varepsilon_{1}, can also be described as a graph over the (Iu,θu,t)(I^{\rm u},\theta^{\rm u},t)-variables, as well as a graph over the (Is,θs,t)(I^{\rm s},\theta^{\rm s},t)-variables. Therefore, each homoclinic point z~εΓ~ε\tilde{z}_{\varepsilon}\in\tilde{\Gamma}_{\varepsilon} is associated to unique coordinate triples (Iu,θu,t)(I^{\rm u},\theta^{\rm u},t), and (Is,θs,t)(I^{\rm s},\theta^{\rm s},t). More precisely, we have

(5.3) Γ~ε={(Iu,θu,yu,xu,t)|yu=yεu(Iu,θu,t),xu=xεu(Iu,θu,t)},Γ~ε={(Is,θs,ys,xs,t)|ys=yεs(Is,θs,t),xs=xεs(Is,θs,t)},\begin{split}\tilde{\Gamma}_{\varepsilon}=&\{(I^{\rm u},\theta^{\rm u},y^{\rm u},x^{\rm u},t)\,|\,y^{\rm u}=y^{\rm u}_{\varepsilon}(I^{\rm u},\theta^{\rm u},t),\,x^{\rm u}=x^{\rm u}_{\varepsilon}(I^{\rm u},\theta^{\rm u},t)\},\\ \tilde{\Gamma}_{\varepsilon}=&\{(I^{\rm s},\theta^{\rm s},y^{\rm s},x^{\rm s},t)\,|\,y^{\rm s}=y^{\rm s}_{\varepsilon}(I^{\rm s},\theta^{\rm s},t),\,x^{\rm s}=x^{\rm s}_{\varepsilon}(I^{\rm s},\theta^{\rm s},t)\},\end{split}

with yεu(Iu,θu,t)=O(ε)y^{\rm u}_{\varepsilon}(I^{\rm u},\theta^{\rm u},t)=O(\varepsilon) and xεs(Is,θs,t)=O(ε)x^{\rm s}_{\varepsilon}(I^{\rm s},\theta^{\rm s},t)=O(\varepsilon).

By condition (A-iii-b), since 𝒟I0\mathscr{D}_{I_{0}} intersects Γ~0\tilde{\Gamma}_{0} transversally, it follows that 𝒟I0\mathscr{D}_{I_{0}} intersects Γ~ε\tilde{\Gamma}_{\varepsilon} for ε\varepsilon sufficiently small. Therefore, given z~0\tilde{z}_{0} in Γ~0\tilde{\Gamma}_{0} we will associate to it a homoclinic point z~ε𝒟I0\tilde{z}_{\varepsilon}\in\mathscr{D}_{I_{0}}, that is, satisfying the conditions

(5.4) θs(z~ε)θu(z~ε)=θs(z~0)θu(z~0),Is(z~ε)Iu(z~ε)=0.\begin{split}\theta^{\rm s}(\tilde{z}_{\varepsilon})-\theta^{\rm u}(\tilde{z}_{\varepsilon})=&\theta^{\rm s}(\tilde{z}_{0})-\theta^{\rm u}(\tilde{z}_{0}),\\ I^{\rm s}(\tilde{z}_{\varepsilon})-I^{\rm u}(\tilde{z}_{\varepsilon})=&0.\end{split}

Note that a homoclinic point z~ε\tilde{z}_{\varepsilon} satisfying these conditions is not uniquely defined. We can impose on z~ε\tilde{z}_{\varepsilon} an additional condition, for instance t(z~ε)=t(z~0)t(\tilde{z}_{\varepsilon})=t(\tilde{z}_{0}), or Is(z~ε)=Iu(z~ε)=I0I^{\rm s}(\tilde{z}_{\varepsilon})=I^{\rm u}(\tilde{z}_{\varepsilon})=I_{0}. Such an extra condition can be useful for applications.

In the sequel we will compare the scattering map associated to z~0\tilde{z}_{0} with the scattering map associated to z~ε\tilde{z}_{\varepsilon} satisfying (5.4).

5.2. Perturbed scattering map

In this section we prove the assertion (iii) of Theorem 2.1.

We start with the unperturbed system (2.3). We recall that for a given homoclinic channel Γ0\Gamma_{0}, the corresponding scattering map σ0\sigma_{0}, is a phase-shift of the form

σ0(I,θ)=(I,θ+Δ(I)).\sigma_{0}(I,\theta)=(I,\theta+\Delta(I)).

We choose and fix an energy level hh of H0H_{0}, and a point z0Γ0Mhz_{0}\in\Gamma_{0}\cap M_{h}. In the (Iu,s,θu,s,yu,s,xu,s)(I^{{\rm u},{\rm s}},\theta^{{\rm u},{\rm s}},y^{{\rm u},{\rm s}},x^{{\rm u},{\rm s}})-coordinates, z0z_{0} is given by

(5.5) z0=(I0s,θ0s,y0s,0)=(I0u,θ0u,0,x0u),z_{0}=(I^{\rm s}_{0},\theta^{\rm s}_{0},y^{\rm s}_{0},0)=(I^{\rm u}_{0},\theta^{\rm u}_{0},0,x^{\rm u}_{0}),

where I0s=I0u=I0I^{\rm s}_{0}=I^{\rm u}_{0}=I_{0}. The effect of the flow Φ0τ\Phi^{\tau}_{0} on z0z_{0} in these coordinates is given by

(5.6) Φ0τ(z0)=(I0s,θ0s+ω(I0)τ,ys(τ),0)=(I0u,θ0u+ω(I0)τ,0,xu(τ)),\Phi^{\tau}_{0}(z_{0})=(I^{\rm s}_{0},\theta^{\rm s}_{0}+\omega(I_{0})\tau,y^{\rm s}(\tau),0)=(I^{\rm u}_{0},\theta^{\rm u}_{0}+\omega(I_{0})\tau,0,x^{\rm u}(\tau)),

where ys(τ)y^{\rm s}(\tau) and xu(τ)x^{\rm u}(\tau) are the ysy^{\rm s}-component and the xux^{\rm u}-component, respectively, of Φ0τ(z0)\Phi^{\tau}_{0}(z_{0}) evaluated at time τ\tau, and ω(I0)=h0I\omega(I_{0})=\frac{\partial h_{0}}{\partial I}.

There exist uniquely defined points z0z^{-}_{0}, z0+z^{+}_{0} in λ0(h)\lambda_{0}(h) such that Wu(z0)(Γ0Mh)=Ws(z0+)(Γ0Mh)={z0}W^{\rm u}(z^{-}_{0})\cap(\Gamma_{0}\cap M_{h})=W^{\rm s}(z^{+}_{0})\cap(\Gamma_{0}\cap M_{h})=\{z_{0}\}. In the (I,θ,y,x)(I,\theta,y,x)-coordinates, the foot-points z0±z_{0}^{\pm} are given by

(5.7) z0=(I0,θ0,0,0),z0+=(I0s,θ0+,0,0),\begin{split}z^{-}_{0}=(I_{0},\theta^{-}_{0},0,0),\quad z^{+}_{0}=(I^{\rm s}_{0},\theta^{+}_{0},0,0),\end{split}

where θ0=θ0u\theta^{-}_{0}=\theta^{\rm u}_{0} and θ0+=θ0s\theta^{+}_{0}=\theta^{\rm s}_{0}. The effect of the flow Φ0τ\Phi^{\tau}_{0} on z0±z_{0}^{\pm} in these coordinates is given by

(5.8) Φ0τ(z0)=(I0,θ0+ω(I0)τ,0,0),Φ0τ(z0+)=(I0,θ0++ω(I0)τ,0,0).\begin{split}\Phi^{\tau}_{0}(z^{-}_{0})=(I_{0},\theta^{-}_{0}+\omega(I_{0})\tau,0,0),\\ \Phi^{\tau}_{0}(z^{+}_{0})=(I_{0},\theta^{+}_{0}+\omega(I_{0})\tau,0,0).\end{split}

The scattering map σ0\sigma_{0} takes z0λ0(h)z^{-}_{0}\in\lambda_{0}(h) into z0+λ0(h)z^{+}_{0}\in\lambda_{0}(h).

In the extended system (2.6), the corresponding homoclinic point is z~0=(z0,t0)\tilde{z}_{0}=(z_{0},t_{0}) for some t0t_{0}\in\mathbb{R}. The scattering map σ~0\tilde{\sigma}_{0} takes z~0=(z0,t0)\tilde{z}^{-}_{0}=(z^{-}_{0},t_{0}) into z~0+=(z0+,t0)\tilde{z}^{+}_{0}=(z^{+}_{0},t_{0}).

We will compute the effect of the perturbation on the scattering map σ~0\tilde{\sigma}_{0}.

When we add the perturbation there exists a homoclinic point z~εΓ~ε\tilde{z}_{\varepsilon}\in\tilde{\Gamma}_{\varepsilon} corresponding to z~0=(z0,t0)\tilde{z}_{0}=(z_{0},t_{0}) from the unperturbed case, such that z~ε\tilde{z}_{\varepsilon} satisfies the condition (A-iii-b). Associated to z~εΓ~ε\tilde{z}_{\varepsilon}\in\tilde{\Gamma}_{\varepsilon} we have the points z~ε\tilde{z}^{-}_{\varepsilon}, z~ε+\tilde{z}^{+}_{\varepsilon} in Λ~ε\tilde{\Lambda}_{\varepsilon} such that Wu(z~ε)Γ~ε={z~ε}W^{\rm u}(\tilde{z}^{-}_{\varepsilon})\cap\tilde{\Gamma}_{\varepsilon}=\{\tilde{z}_{\varepsilon}\}, and Ws(z~ε+)Γ~ε={z~ε}W^{\rm s}(\tilde{z}^{+}_{\varepsilon})\cap\tilde{\Gamma}_{\varepsilon}=\{\tilde{z}_{\varepsilon}\}. The scattering map σ~ε\tilde{\sigma}_{\varepsilon} takes z~εΛ~ε\tilde{z}^{-}_{\varepsilon}\in{\tilde{\Lambda}}_{\varepsilon} into z~ε+Λ~ε\tilde{z}^{+}_{\varepsilon}\in{\tilde{\Lambda}}_{\varepsilon}.

In the sequel we will make a quantitative comparison between

z~0σ~0(z0):=z~0+,\tilde{z}^{-}_{0}\mapsto\tilde{\sigma}_{0}({z}^{-}_{0}):=\tilde{z}^{+}_{0},

and

z~εσ~ε(z~ε):=z~ε+.\tilde{z}^{-}_{\varepsilon}\mapsto\tilde{\sigma}_{\varepsilon}(\tilde{z}^{-}_{\varepsilon}):=\tilde{z}^{+}_{\varepsilon}.

5.2.1. Estimates

Below we will refer to the notation in (5.5), (5.6), (5.7), and (5.8). To simplify notation, we denote Iεs=Is(z~ε)I^{{\rm s}}_{\varepsilon}=I^{\rm s}(\tilde{z}_{\varepsilon}), Iεu=Iu(z~ε)I^{{\rm u}}_{\varepsilon}=I^{\rm u}(\tilde{z}_{\varepsilon}), Iεs+=Is(z~ε+)I^{{\rm s}+}_{\varepsilon}=I^{\rm s}(\tilde{z}^{+}_{\varepsilon}), Iεu=Iu(z~ε)I^{{\rm u}-}_{\varepsilon}=I^{\rm u}(\tilde{z}^{-}_{\varepsilon}), ξεs=(xsys)(z~ε)\xi^{{\rm s}}_{\varepsilon}=(x^{\rm s}y^{\rm s})(\tilde{z}_{\varepsilon}), ξs+=(xsys)(z~ε+)\xi^{{\rm s}+}=(x^{\rm s}y^{\rm s})(\tilde{z}^{+}_{\varepsilon}), ξεu=(xuyu)(z~ε)\xi^{{\rm u}}_{\varepsilon}=(x^{\rm u}y^{\rm u})(\tilde{z}_{\varepsilon}), ξu=(xuyu)(z~ε)\xi^{{\rm u}-}=(x^{\rm u}y^{\rm u})(\tilde{z}^{-}_{\varepsilon}).

Note that in the following, the coordinates of the scattering map can be considered as functions of the point. Hence, the symbols O(ε)O(\varepsilon) can be interpreted as relative to the CrC^{r} norm.

Lemma 5.1.
  • (i)

    Estimates on II:

    (5.9) Iεs+Iεs=O(ε),IεuIεu=O(ε),Iεs+Iεu=O(ε).\begin{split}I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}}_{\varepsilon}=O(\varepsilon),\,I^{{\rm u}-}_{\varepsilon}-I^{{\rm u}}_{\varepsilon}=O(\varepsilon),\,I^{{\rm s}+}_{\varepsilon}-I^{{\rm u}-}_{\varepsilon}=O(\varepsilon).\end{split}
  • (ii)

    Estimates on h0h_{0}:

    (5.10) h0(Iεs+)h0(Iεs)=(Iεs+Iεs)(h0I(I0))+O(ε2),h0(Iεu)h0(Iεu)=(IεuIεu)(h0I(I0))+O(ε2).\begin{split}h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm s}}_{\varepsilon})=&(I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}}_{\varepsilon})\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)+O(\varepsilon^{2}),\\ h_{0}(I^{{\rm u}-}_{\varepsilon})-h_{0}(I^{{\rm u}}_{\varepsilon})=&(I^{{\rm u}-}_{\varepsilon}-I^{{\rm u}}_{\varepsilon})\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)+O(\varepsilon^{2}).\end{split}
  • (iii)

    Estimates on ξ\xi:

    (5.11) ξεs=O(ε),ξεu=O(ε),ξεs+=O(ε2),ξεu=O(ε2).\xi^{{\rm s}}_{\varepsilon}=O(\varepsilon),\,\xi^{{\rm u}}_{\varepsilon}=O(\varepsilon),\,\xi^{{\rm s}+}_{\varepsilon}=O(\varepsilon^{2}),\,\xi^{{\rm u}-}_{\varepsilon}=O(\varepsilon^{2}).
  • (iv)

    Estimates on g1g_{1}:

    (5.12) g1(Iεs+)=g1(I0)+g1I(I0)(Iεs+I0)+O(ε2),g1(Iεu)=g1(I0)+g1I(I0)(IεuI0)+O(ε2),\begin{split}g_{1}(I^{{\rm s}+}_{\varepsilon})=&g_{1}(I_{0})+\frac{\partial g_{1}}{\partial I}(I_{0})(I^{{\rm s}+}_{\varepsilon}-I_{0})+O(\varepsilon^{2}),\\ g_{1}(I^{{\rm u}-}_{\varepsilon})=&g_{1}(I_{0})+\frac{\partial g_{1}}{\partial I}(I_{0})(I^{{\rm u}-}_{\varepsilon}-I_{0})+O(\varepsilon^{2}),\end{split}
  • (v)

    Estimates on g1I\frac{\partial g_{1}}{\partial I}:

    (5.13) g1I(Iεs+)=g1I(I0)+2g1I2(I0)(Iεs+I0)+O(ε2),g1I(Iεu)=g1I(I0)+2g1I2(I0)(IεuI0)+O(ε2),\begin{split}\frac{\partial g_{1}}{\partial I}(I^{{\rm s}+}_{\varepsilon})=&\frac{\partial g_{1}}{\partial I}(I_{0})+\frac{\partial^{2}g_{1}}{\partial I^{2}}(I_{0})(I^{{\rm s}+}_{\varepsilon}-I_{0})+O(\varepsilon^{2}),\\ \frac{\partial g_{1}}{\partial I}(I^{{\rm u}-}_{\varepsilon})=&\frac{\partial g_{1}}{\partial I}(I_{0})+\frac{\partial^{2}g_{1}}{\partial I^{2}}(I_{0})(I^{{\rm u}-}_{\varepsilon}-I_{0})+O(\varepsilon^{2}),\end{split}
Proof.

(i) Since z~ε=z~0+O(ε)\tilde{z}_{\varepsilon}=\tilde{z}_{0}+O(\varepsilon) and z~ε±=z~0±+O(ε)\tilde{z}^{\pm}_{\varepsilon}=\tilde{z}^{\pm}_{0}+O(\varepsilon), we have Is,u(z~ε)=Is,u(z~0)+O(ε)I^{{\rm s},{\rm u}}(\tilde{z}_{\varepsilon})=I^{{\rm s},{\rm u}}(\tilde{z}_{0})+O(\varepsilon), and Is,u(z~ε±)=Is,u(z~0±)+O(ε)I^{{\rm s},{\rm u}}(\tilde{z}^{\pm}_{\varepsilon})=I^{{\rm s},{\rm u}}(\tilde{z}^{\pm}_{0})+O(\varepsilon). The fact that Is,u(z~0)=Is,u(z~0±)=I0I^{{\rm s},{\rm u}}(\tilde{z}_{0})=I^{{\rm s},{\rm u}}(\tilde{z}^{\pm}_{0})=I_{0} yields the estimates in (i).

(ii) We estimate the term h0(Iεs+)h0(Iεs)h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm s}}_{\varepsilon}). Applying the integral form of the Mean Value Theorem we have

(5.14) h0(Iεs+)h0(Iεs)=(Iεs+Iεs)01h0I(tIεs++(1t)Iεs)𝑑t.\begin{split}h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm s}}_{\varepsilon})=&(I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}}_{\varepsilon})\int_{0}^{1}\frac{\partial h_{0}}{\partial I}(tI^{{\rm s}+}_{\varepsilon}+(1-t)I^{{\rm s}}_{\varepsilon})dt.\end{split}

We write the integrand of (5.14) as a Taylor expansion

(5.15) h0I(tIεs++(1t)Iεs)=h0I(I0)+2h0I2(I0)(t(Iεs+I0s+)+(1t)(IεsI0s))+O(ε2)\begin{split}\frac{\partial h_{0}}{\partial I}(tI^{{\rm s}+}_{\varepsilon}+(1-t)I^{{\rm s}}_{\varepsilon})=&\frac{\partial h_{0}}{\partial I}(I_{0})\\ &+\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})(t(I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}+}_{0})+(1-t)(I^{{\rm s}}_{\varepsilon}-I^{{\rm s}}_{0}))\\ &+O(\varepsilon^{2})\end{split}

where we used

h0I(tI0s++(1t)I0s)=h0I(I0) and 2h0I2(tI0s++(1t)I0s)=2h0I2(I0).\frac{\partial h_{0}}{\partial I}(tI^{{\rm s}+}_{0}+(1-t)I^{{\rm s}}_{0})=\frac{\partial h_{0}}{\partial I}(I_{0})\textrm{ and }\frac{\partial^{2}h_{0}}{\partial I^{2}}(tI^{{\rm s}+}_{0}+(1-t)I^{{\rm s}}_{0})=\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0}).

Since Iεs+I0s+=O(ε)I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}+}_{0}=O(\varepsilon) and IεsI0s=O(ε)I^{{\rm s}}_{\varepsilon}-I^{{\rm s}}_{0}=O(\varepsilon), we have

(5.16) h0I(tI0s++(1t)I0s)=h0I(I0)+O(ε).\frac{\partial h_{0}}{\partial I}(tI^{{\rm s}+}_{0}+(1-t)I^{{\rm s}}_{0})=\frac{\partial h_{0}}{\partial I}(I_{0})+O(\varepsilon).

Since Iεs+Iεs=O(ε)I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}}_{\varepsilon}=O(\varepsilon) from (5.14) we obtain the first estimate in (5.10).

The other estimate follows similarly.

(iii) The fact that ξs,u=O(ε)\xi^{{\rm s},{\rm u}}=O(\varepsilon) follows from z~ε=z~0+O(ε)\tilde{z}_{\varepsilon}=\tilde{z}_{0}+O(\varepsilon), and ξs,u(z~0)=0\xi^{{\rm s},{\rm u}}(\tilde{z}_{0})=0, hence ξs,u(z~ε)=ξs,u(z~0)+O(ε)=O(ε)\xi^{{\rm s},{\rm u}}(\tilde{z}_{\varepsilon})=\xi^{{\rm s},{\rm u}}(\tilde{z}_{0})+O(\varepsilon)=O(\varepsilon).

In the same way we obtain ξs+=ξu=O(ε)\xi^{{\rm s}+}=\xi^{{\rm u}-}=O(\varepsilon). To prove that in fact ξs+=ξu=O(ε2)\xi^{{\rm s}+}=\xi^{{\rm u}-}=O(\varepsilon^{2}), we proceed as follows. For ξεs+\xi^{{\rm s}+}_{\varepsilon}, we use the Taylor expansion:

ξεs+=ξ0s++Dξ0s+(ξεs+ξ0s+)+O(ε2),\begin{split}\xi^{{\rm s}+}_{\varepsilon}=&\xi^{{\rm s}+}_{0}+D\xi^{{\rm s}+}_{0}\cdot(\xi^{{\rm s}+}_{\varepsilon}-\xi^{{\rm s}+}_{0})+O(\varepsilon^{2}),\\ \end{split}

where \cdot denotes the dot product, and we used that ξεs+ξ0s+=O(ε)\xi^{{\rm s}+}_{\varepsilon}-\xi^{{\rm s}+}_{0}=O(\varepsilon). We have

ξ0s+=0\xi^{{\rm s}+}_{0}=0

and

Dξ0s+(ξεs+ξ0s+)=xs(z~0+)(yεs+y0s+)+ys(z~0+)(xεs+x0s+)=0,D\xi^{{\rm s}+}_{0}\cdot(\xi^{{\rm s}+}_{\varepsilon}-\xi^{{\rm s}+}_{0})=x^{\rm s}(\tilde{z}^{+}_{0})(y^{{\rm s}+}_{\varepsilon}-y^{{\rm s}+}_{0})+y^{\rm s}(\tilde{z}^{+}_{0})(x^{{\rm s}+}_{\varepsilon}-x^{{\rm s}+}_{0})=0,

since ys(z~0+)=xs(z~0+)=0y^{\rm s}(\tilde{z}^{+}_{0})=x^{\rm s}(\tilde{z}^{+}_{0})=0 on Λ~0{\tilde{\Lambda}}_{0}. Therefore (5.2.1) implies

ξεs+=O(ε2).\begin{split}\xi^{{\rm s}+}_{\varepsilon}=&O(\varepsilon^{2}).\end{split}

Similarly we obtain

ξεu=O(ε2).\begin{split}\xi^{{\rm u}-}_{\varepsilon}=&O(\varepsilon^{2}).\end{split}

(iv) We write g1(Iεs+)g_{1}(I^{{\rm s}+}_{\varepsilon}) as a Taylor expansion, using that Iεs+I0s+=O(ε)I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}+}_{0}=O(\varepsilon), obtaining

g1(Iεs+)=g1(I0s+)+g1I(I0s+)(Iεs+I0s+)+O(ε2).g_{1}(I^{{\rm s}+}_{\varepsilon})=g_{1}(I^{{\rm s}+}_{0})+\frac{\partial g_{1}}{\partial I}(I^{{\rm s}+}_{0})(I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}+}_{0})+O(\varepsilon^{2}).

Since I0s+=I0I^{{\rm s}+}_{0}=I_{0}, the first equation in (5.12) follows.

Similarly

g1(Iεu)=g1(I0u)+g1I(I0u)(IεuI0u)+O(ε2),g_{1}(I^{{\rm u}-}_{\varepsilon})=g_{1}(I^{{\rm u}-}_{0})+\frac{\partial g_{1}}{\partial I}(I^{{\rm u}-}_{0})(I^{{\rm u}-}_{\varepsilon}-I^{{\rm u}-}_{0})+O(\varepsilon^{2}),

and I0u=I0I^{{\rm u}-}_{0}=I_{0} yield the second equation in (5.12).

(v) The proof is similar to that of (iv). ∎

5.2.2. Change in action by the scattering map

We now give the expression of the action-component 𝒮~I\tilde{\mathcal{S}}^{I} of the mapping 𝒮~\tilde{\mathcal{S}} in (2.7). Below we will refer to the notation in (5.5), (5.6), (5.7), and (5.8).

Proposition 5.2.

The change in II by the scattering map σ~ε\tilde{\sigma}_{\varepsilon} is given by the following formula

(5.17) I(z~ε+)I(z~ε)=ε(h0I(I0))10+((𝒳1H0)(Φ0τ(z~0+)(𝒳1H0)(Φ~0τ(z~0)))dτε(h0I(I0))10((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))𝑑τ+O(ε2).\begin{split}I\left(\tilde{z}^{+}_{\varepsilon}\right)&-I\left(\tilde{z}^{-}_{\varepsilon}\right)\\ =&-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})({\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0})-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &+O\left(\varepsilon^{2}\right).\end{split}

When the perturbation is Hamiltonian 𝒳1=JH1\mathcal{X}^{1}=J\nabla H_{1}, in (5.17) we have 𝒳1H0={H0,H1}\mathcal{X}^{1}H_{0}=\{H_{0},H_{1}\}, where {,}\{\cdot,\cdot\} denotes the Poisson bracket.

Proposition 5.2 implies that

σ~εI(I,θ,t)=σ~0I(I,θ,t)+ε𝒮~I(I,θ,t)+O(ε2),\tilde{\sigma}^{I}_{\varepsilon}(I,\theta,t)=\tilde{\sigma}^{I}_{0}(I,\theta,t)+\varepsilon\tilde{\mathcal{S}}^{I}(I,\theta,t)+O(\varepsilon^{2}),

where ε𝒮~I\varepsilon\tilde{\mathcal{S}}^{I} is given by the first two terms on the right hand-side of (5.17). The expression of ε𝒮~I\varepsilon\tilde{\mathcal{S}}^{I} is particularly simple since II is a slow variable.

We note that we can express the right-hand side of (5.17) in terms of the (Iu,s,θu,s,yu,s,xu,s)(I^{{\rm u},{\rm s}},\theta^{{\rm u},{\rm s}},y^{{\rm u},{\rm s}},x^{{\rm u},{\rm s}}) coordinates, by making the following substitutions:

Φ~0τ(z~0+)=(I0s,θ0s+ω(I0)τ,0,0,t0+τ),Φ~0τ(z~0)=(I0s,θ0u+ω(I0)τ,0,0,t0+τ),Φ~0τ(z~0)=(I0s,θ0s+ω(I0)τ,ys(τ),0,t0+τ)=(I0u,θ0u+ω(I0)τ,0,xu(τ),t0+τ).\begin{split}\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0})=&\left(I^{\rm s}_{0},\theta^{\rm s}_{0}+\omega(I_{0})\tau,0,0,t_{0}+\tau\right),\\ \tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0})=&\left(I^{\rm s}_{0},\theta^{\rm u}_{0}+\omega(I_{0})\tau,0,0,t_{0}+\tau\right),\\ \tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0})=&\left(I^{\rm s}_{0},\theta^{\rm s}_{0}+\omega(I_{0})\tau,y^{\rm s}(\tau),0,t_{0}+\tau\right)\\ =&\left(I^{\rm u}_{0},\theta^{\rm u}_{0}+\omega(I_{0})\tau,0,x^{\rm u}(\tau),t_{0}+\tau\right).\end{split}

To prove Proposition 5.2 we will use the following:

Lemma 5.3.

The change in H0H_{0} by the scattering map σ~ε\tilde{\sigma}_{\varepsilon} is given by the following equation

(5.18) H0(z~ε+)H0(z~ε)=ε0+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))𝑑τε0((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))𝑑τ+O(ε2).\begin{split}H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}^{-}_{\varepsilon})=&-\varepsilon\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &-\varepsilon\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &+O\left(\varepsilon^{2}\right).\end{split}
Proof of Lemma 5.3.

First, note that

(𝒳0+ε𝒳1)H0=𝒳0H0+ε𝒳1H0={H0,H0}+ε𝒳1H0=ε𝒳1H0.(\mathcal{X}^{0}+\varepsilon\mathcal{X}^{1})H_{0}=\mathcal{X}^{0}H_{0}+\varepsilon\mathcal{X}^{1}H_{0}=\{H_{0},H_{0}\}+\varepsilon\mathcal{X}^{1}H_{0}=\varepsilon\mathcal{X}^{1}H_{0}.

Second, applying Lemma D.1 and Lemma D.2 from Appendix D, for 𝐅=H0\mathbf{F}=H_{0}, we have

(5.19) H0(z~ε+)H0(z~ε)=ε0+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))𝑑τ+O(ε1+ϱ)\begin{split}H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}_{\varepsilon})=&-\varepsilon\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &+O\left(\varepsilon^{1+\varrho}\right)\end{split}
(5.20) H0(z~ε)H0(z~ε)=ε0((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))𝑑τ+O(ε1+ϱ).\begin{split}H_{0}(\tilde{z}^{-}_{\varepsilon})-H_{0}(\tilde{z}_{\varepsilon})=&\varepsilon\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &+O\left(\varepsilon^{1+\varrho}\right).\end{split}

Subtracting the two equations from above, after cancelling out the common term H0(z~ε)H_{0}(\tilde{z}_{\varepsilon}) representing the value of H0H_{0} at the homoclinic point z~0\tilde{z}_{0}, we obtain (5.18) with an error term of order O(ε1+ϱ)O(\varepsilon^{1+\varrho}). Since the function H0(z~ε+)H0(z~ε)H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}^{-}_{\varepsilon}) can be expanded as a Taylor series in ε\varepsilon, by matching the corresponding terms of this Taylor expansion with the terms in (5.19) minus (5.20), it follows that the error term O(ε1+ϱ)O(\varepsilon^{1+\varrho}) must equal O(ε2)O(\varepsilon^{2}). ∎

Proof of Proposition 5.2.

By Proposition 4.1 we have

(5.21) H0(z~ε+)=h0(Iεs+)+(ξεs+)g1(Iεs+)+(ξεs+)2g2(Iεs+,ξεs+),\begin{split}H_{0}(\tilde{z}^{+}_{\varepsilon})=&h_{0}(I^{{\rm s}+}_{\varepsilon})+(\xi^{{\rm s}+}_{\varepsilon})g_{1}(I^{{\rm s}+}_{\varepsilon})+(\xi^{{\rm s}+}_{\varepsilon})^{2}g_{2}(I^{{\rm s}+}_{\varepsilon},\xi^{{\rm s}+}_{\varepsilon}),\end{split}
(5.22) H0(z~ε)=h0(Iεu)+(ξεu)g1(Iεu)+(ξεu)2g2(Iεu,ξεu).\begin{split}H_{0}(\tilde{z}^{-}_{\varepsilon})=&h_{0}(I^{{\rm u}-}_{\varepsilon})+(\xi^{{\rm u}-}_{\varepsilon})g_{1}(I^{{\rm u}-}_{\varepsilon})+(\xi^{{\rm u}-}_{\varepsilon})^{2}g_{2}(I^{{\rm u}-}_{\varepsilon},\xi^{{\rm u}-}_{\varepsilon}).\end{split}

Subtracting we obtain

(5.23) H0(z~ε+)H0(z~ε)=(h0(Iεs+)h0(Iεu))+((ξεs+)g1(Iεs+)(ξεu)g1(Iεu))+O(ε2)\begin{split}H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}^{-}_{\varepsilon})=&(h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm u}-}_{\varepsilon}))\\ &+((\xi^{{\rm s}+}_{\varepsilon})g_{1}(I^{{\rm s}+}_{\varepsilon})-(\xi^{{\rm u}-}_{\varepsilon})g_{1}(I^{{\rm u}-}_{\varepsilon}))\\ &+O(\varepsilon^{2})\end{split}

where the error term O(ε2)O(\varepsilon^{2}) in the above is due to Lemma 5.1 equation (5.11).

The term h0(Iεs+)h0(Iεu)h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm u}-}_{\varepsilon}) in (5.23) is given, by Lemma 5.1 equation (5.10), as

(5.24) h0(Iεs+)h0(Iεu)=(Iεs+Iεu)(h0I(I0))+O(ε2).h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm u}-}_{\varepsilon})=(I^{{\rm s}+}_{\varepsilon}-I^{{\rm u}-}_{\varepsilon})\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)+O(\varepsilon^{2}).

Since, by Lemma 5.1 equation (5.11), we have

(5.25) ξεs+=O(ε2),ξεu=O(ε2).\begin{split}\xi^{{\rm s}+}_{\varepsilon}=O(\varepsilon^{2}),\,\xi^{{\rm u}-}_{\varepsilon}=O(\varepsilon^{2}).\end{split}

we obtain

(5.26) (ξεs+)g1(Iεs+)(ξεu)g1(Iεu)=O(ε2).(\xi^{{\rm s}+}_{\varepsilon})g_{1}(I^{{\rm s}+}_{\varepsilon})-(\xi^{{\rm u}-}_{\varepsilon})g_{1}(I^{{\rm u}-}_{\varepsilon})=O(\varepsilon^{2}).

Thus, from (5.23) and using (5.24) we have

(5.27) H0(z~ε+)H0(z~ε)=(Iεs+Iεu)(h0I(I))+O(ε2).H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}^{-}_{\varepsilon})=(I^{{\rm s}+}_{\varepsilon}-I^{{\rm u}-}_{\varepsilon})\left(\frac{\partial h_{0}}{\partial I}(I)\right)+O(\varepsilon^{2}).

As the left-hand side of (5.27) is given by (5.18), since h0I0\frac{\partial h_{0}}{\partial I}\neq 0 by condition (A-iii-a), solving for Iεs+IεuI^{{\rm s}+}_{\varepsilon}-I^{{\rm u}-}_{\varepsilon} yields

(5.28) Iεs+Iεu=(h0I(I))1(H0(z~ε+)H0(z~ε))=ε(h0I(I))10+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))𝑑τε(h0I(I))10((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))𝑑τ+O(ε1+ϱ).\begin{split}I^{{\rm s}+}_{\varepsilon}-I^{{\rm u}-}_{\varepsilon}=&\left(\frac{\partial h_{0}}{\partial I}(I)\right)^{-1}\left(H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}^{-}_{\varepsilon})\right)\\ =&-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I)\right)^{-1}\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau}_{0}\left(\tilde{z}^{+}_{0}\right)\right)-(\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\tau\\ &-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I)\right)^{-1}\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau}_{0}\left(\tilde{z}^{-}_{0}\right)\right)-(\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\tau\\ &+O\left(\varepsilon^{1+\varrho}\right).\end{split}

By the same argument as in the proof of Lemma 5.3, the error term O(ε1+ϱ)O(\varepsilon^{1+\varrho}) in (5.28) must equal O(ε2)O(\varepsilon^{2}). This shows (5.17). ∎

Proposition 5.4.
(5.29) Iεs+Iεs=ε(h0I(I0)))1+0((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))dτε(h0I(I0)))1g1(I0)+0((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))dτ+O(ε2).\begin{split}I^{{\rm s}+}_{\varepsilon}-&I^{{\rm s}}_{\varepsilon}=\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}\int^{+\infty}_{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int^{+\infty}_{0}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &+O\left(\varepsilon^{2}\right).\end{split}
(5.30) IεuIεu=ε(h0I(I0)))10((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))dτε(h0I(I0)))1g1(I0)0((𝒳1ξu)(Φ~0τ(z~0))(𝒳1ξu)(Φ~0τ(z~0)))dτ+O(ε2).\begin{split}I^{{\rm u}-}_{\varepsilon}-&I^{{\rm u}}_{\varepsilon}=\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{-\infty}^{0}\left((\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &+O\left(\varepsilon^{2}\right).\end{split}
Proof.

The formula for H0(z~ε+)H0(z~ε)H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}_{\varepsilon}) is given by (5.19). The normal form expansion of H0H_{0} at a homoclinic point z~ε\tilde{z}_{\varepsilon} can be written with respect to the two sets of coordinates as

(5.31) H0(z~ε)=h0(Iεs)+(ξεs)g1(Iεs)+(ξεs)2g2(Iεs,ξεs)H_{0}(\tilde{z}_{\varepsilon})=h_{0}(I^{{\rm s}}_{\varepsilon})+(\xi^{{\rm s}}_{\varepsilon})g_{1}(I^{{\rm s}}_{\varepsilon})+(\xi^{{\rm s}}_{\varepsilon})^{2}g_{2}(I^{{\rm s}}_{\varepsilon},\xi^{{\rm s}}_{\varepsilon})
(5.32) H0(z~ε)=h0(Iεu)+(ξεu)g1(Iεu)+(ξεu)2g2(Iεu,ξεu)H_{0}(\tilde{z}_{\varepsilon})=h_{0}(I^{{\rm u}}_{\varepsilon})+(\xi^{{\rm u}}_{\varepsilon})g_{1}(I^{{\rm u}}_{\varepsilon})+(\xi^{{\rm u}}_{\varepsilon})^{2}g_{2}(I^{{\rm u}}_{\varepsilon},\xi^{{\rm u}}_{\varepsilon})

Subtracting (5.31) from (5.21) we obtain

(5.33) H0(z~ε+)H0(z~ε)=h0(Iεs+)h0(Iεs)+(ξεs+)g1(Iεs+)(ξεs)g1(Iεs)+O(ε2)=(h0I(I0))(Iεs+Iεs)+(ξεs+)(g1(Iεs+)g1(Iεs))+(ξεs+ξεs)g1(Iεs)+O(ε2)=(h0I(I0))(Iεs+Iεs)+(ξεs+ξεs)g1(I0)+O(ε2)\begin{split}H_{0}(\tilde{z}^{+}_{\varepsilon})-H_{0}(\tilde{z}_{\varepsilon})=&h_{0}(I^{{\rm s}+}_{\varepsilon})-h_{0}(I^{{\rm s}}_{\varepsilon})+(\xi^{{\rm s}+}_{\varepsilon})g_{1}(I^{{\rm s}+}_{\varepsilon})-(\xi^{{\rm s}}_{\varepsilon})g_{1}(I^{{\rm s}}_{\varepsilon})+O(\varepsilon^{2})\\ =&\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)(I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}}_{\varepsilon})\\ &+(\xi^{{\rm s}+}_{\varepsilon})(g_{1}(I^{{\rm s}+}_{\varepsilon})-g_{1}(I^{{\rm s}}_{\varepsilon}))\\ &+(\xi^{{\rm s}+}_{\varepsilon}-\xi^{{\rm s}}_{\varepsilon})g_{1}(I^{{\rm s}}_{\varepsilon})+O(\varepsilon^{2})\\ =&\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)(I^{{\rm s}+}_{\varepsilon}-I^{{\rm s}}_{\varepsilon})\\ &+(\xi^{{\rm s}+}_{\varepsilon}-\xi^{{\rm s}}_{\varepsilon})g_{1}(I_{0})+O(\varepsilon^{2})\end{split}

In the above we have used Lemma 5.1, equations (5.10), (5.11), and (5.12).

Applying Lemma D.1 and Lemma D.2 from Appendix D, for 𝐅=ξs\mathbf{F}=\xi^{\rm s}, we have

(5.34) ξεs+ξεs=ε0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))𝑑τ+O(ε1+ϱ).\begin{split}\xi^{{\rm s}+}_{\varepsilon}-\xi^{{\rm s}}_{\varepsilon}=&-\varepsilon\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})\left(\tilde{\Phi}^{\tau}_{0}\left(\tilde{z}^{+}_{0}\right)\right)-(\mathcal{X}^{1}\xi^{\rm s})\left(\tilde{\Phi}^{\tau}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\tau\\ &+O\left(\varepsilon^{1+\varrho}\right).\end{split}

Thus, using (5.33), (5.19), and (5.34) we obtain (5.29).

Equation (5.30) follows similarly.

The argument that the error term O(ε1+ϱ)O(\varepsilon^{1+\varrho}) can be replaced by O(ε2)O(\varepsilon^{2}) follows in the same way as in the proof of Lemma 5.3. ∎

Let τ\tau\in\mathbb{R} be some value of the time variable. Applying the formula (5.29) to Φ~ετ(z~ε+)\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}) and Φ~ετ(z~ε)\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}) instead z~ε+\tilde{z}^{+}_{\varepsilon} and z~ε\tilde{z}_{\varepsilon}, respectively, and the formula (5.30) to Φ~ετ(z~ε)\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{-}_{\varepsilon}) and Φ~ετ(z~ε)\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}) instead z~ε\tilde{z}^{-}_{\varepsilon} and z~ε\tilde{z}_{\varepsilon}, respectively, we obtain:

Corollary 5.5.

For any time τ\tau\in\mathbb{R} we have

(5.35) Iεs(Φ~τ(z~ε+))Iεs(Φ~τ(z~ε))=ε(h0I(I0)))10+((𝒳1H0)(Φ~0τ+ς(z~0+))(𝒳1H0)(Φ~0τ+ς(z~0)))dς+ε(h0I(I0)))1g1(I0)0+((𝒳1ξs)(Φ~0τ+ς(z~0+))(𝒳1ξs)(Φ~0τ+ς(z~0)))dς+O(ε2).\begin{split}I^{{\rm s}}_{\varepsilon}(\tilde{\Phi}^{\tau}(\tilde{z}_{\varepsilon}^{+}))-&I^{{\rm s}}_{\varepsilon}(\tilde{\Phi}^{\tau}(\tilde{z}_{\varepsilon}))\\ =&-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}^{+}_{0}\right)\right)\right.\\ &\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\varsigma\\ &+\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}^{+}_{0}\right)\right)\right.\\ &\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}\xi^{\rm s})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\varsigma\\ &+O\left(\varepsilon^{2}\right).\end{split}
(5.36) Iεu(Φ~τ(z~ε))Iεu(Φ~τ(z~ε))=ε(h0I(I0)))10((𝒳1H0)(Φ~0τ+ς(z~0))(𝒳1H0)(Φ~0τ+ς(z~0)))dςε(h0I(I0)))1g1(I0)0((𝒳1ξu)(Φ~0τ+ς(z~0))(𝒳1ξu)(Φ~0τ+ς(z~0)))dς+O(ε2).\begin{split}I^{{\rm u}}_{\varepsilon}(\tilde{\Phi}^{\tau}(\tilde{z}_{\varepsilon}^{-}))-&I^{{\rm u}}_{\varepsilon}(\tilde{\Phi}^{\tau}(\tilde{z}_{\varepsilon}))\\ =&\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}^{-}_{0}\right)\right)\right.\\ &\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\varsigma\\ &-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{-\infty}^{0}\left((\mathcal{X}^{1}\xi^{\rm u})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}^{-}_{0}\right)\right)\right.\\ &\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}\xi^{\rm u})\left(\tilde{\Phi}^{\tau+\varsigma}_{0}\left(\tilde{z}_{0}\right)\right)\right)d\varsigma\\ &+O\left(\varepsilon^{2}\right).\end{split}
Corollary 5.6.
(5.37) 0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))𝑑τ0((𝒳1ξu)(Φ~0τ(z~0))(𝒳1ξu)(Φ~0τ(z~0)))𝑑τ=0.\begin{split}&\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau\\ &\qquad-\int_{-\infty}^{0}\left((\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau=0.\end{split}
Proof.

Let us denote

(5.38) J+=0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))𝑑τ,J=0((𝒳1ξu)(Φ~0τ(z~0))(𝒳1ξu)(Φ~0τ(z~0)))𝑑τ.\begin{split}J^{+}=&\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau,\\ J^{-}=&-\int_{-\infty}^{0}\left((\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)d\tau.\end{split}

Recall that, by condition (A-iii-b) and (5.4), we have Iεu=IεsI^{{\rm u}}_{\varepsilon}=I^{{\rm s}}_{\varepsilon}. Subtracting (5.36) from (5.35), and comparing with (5.17), we should have

(h0I(I0)))1g1(I0)(J++J)=0.\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})(J^{+}+J^{-})=0.

By condition (A-iii-a), we have that h0I(I0)0\frac{\partial h_{0}}{\partial I}(I_{0})\neq 0 and g1(I0)0g_{1}(I_{0})\neq 0, therefore

(5.39) J++J=0.J^{+}+J^{-}=0.

5.2.3. Change in angle by the scattering map

We now give the expression of the angle-component 𝒮~θ\tilde{\mathcal{S}}^{\theta} of the mapping 𝒮~\tilde{\mathcal{S}} in (2.7).

Proposition 5.7.

The change in θ\theta by the scattering map σ~ε\tilde{\sigma}_{\varepsilon} is given by the following equation

(5.40) θs(z~ε+)θu(z~ε)=Δ(I0)ε0+𝒳1θs(Φ~0τ(z~0+))𝒳1θs(Φ~0τ(z~0))dτε0𝒳1θu(Φ~0τ(z~0))𝒳1θu(Φ~0τ(z~0))dτ+ε(2h0I2(I0))(h0I(I0))10+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))τdτ+ε(2h0I2(I0))(h0I(I0))10((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))τdτ+O(ε2),\begin{split}\theta^{\rm s}\left(\tilde{z}^{+}_{\varepsilon}\right)-&\theta^{\rm u}\left(\tilde{z}_{\varepsilon}^{-}\right)\\ =&\Delta(I_{0})-\varepsilon\int^{+\infty}_{0}\mathcal{X}^{1}\theta^{\rm s}(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-\mathcal{X}^{1}\theta^{\rm s}(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))d\tau\\ &\qquad\,\,-\varepsilon\int_{-\infty}^{0}\mathcal{X}^{1}\theta^{\rm u}(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-\mathcal{X}^{1}\theta^{\rm u}(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))d\tau\\ &+\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int^{+\infty}_{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+O(\varepsilon^{2}),\end{split}

where Δ(I0)\Delta(I_{0}) is the phase-shift on the action level set I0I_{0} that defines the unperturbed scattering map σ~0\tilde{\sigma}_{0} (see Proposition 4.5).

Proposition 5.7 implies that

σ~εθ(I,θ,t)=σ~0θ(I,θ,t)+Δ(I0)+ε𝒮~θ(I,θ,t)+O(ε2),\tilde{\sigma}^{\theta}_{\varepsilon}(I,\theta,t)=\tilde{\sigma}^{\theta}_{0}(I,\theta,t)+\Delta(I_{0})+\varepsilon\tilde{\mathcal{S}}^{\theta}(I,\theta,t)+O(\varepsilon^{2}),

where ε𝒮~θ\varepsilon\tilde{\mathcal{S}}^{\theta} is given by the first four terms on the right hand-side of (5.40). The expression of ε𝒮~θ\varepsilon\tilde{\mathcal{S}}^{\theta} is more complicated than the one for ε𝒮~I\varepsilon\tilde{\mathcal{S}}^{I} since θ\theta is a fast variable.

Proof.

We will begin by computing the difference of the θs\theta^{{\rm s}} evaluated at a homoclinic point and at the footpoint of the stable fiber through the homoclinic point. By Lemma D.2 from Appendix D, we have

θs(z~ε+)θs(z~ε)=0+ddτ[θs(Φ~ετ(z~ε+))θs(Φ~ετ(z~ε))]𝑑τ\theta^{{\rm s}}\left(\tilde{z}^{+}_{\varepsilon}\right)-\theta^{{\rm s}}\left(\tilde{z}_{\varepsilon}\right)=-\int^{+\infty}_{0}\frac{d}{d\tau}\left[\theta^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-\theta^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))\right]d\tau

Now by equation (4.17) we have:

dθsdτ=H0Is+ε𝒳1θs.\frac{d\theta^{{\rm s}}}{d\tau}=\frac{\partial H_{0}}{\partial I^{{\rm s}}}+\varepsilon\mathcal{X}^{1}\theta^{{\rm s}}.

We can break the integral up into two parts:

(5.41) A=0+H0Is(Φ~ετ(z~ε+))H0Is(Φ~ετ(z~ε))dτA=-\int^{+\infty}_{0}\frac{\partial H_{0}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-\frac{\partial H_{0}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))d\tau

and

(5.42) B=ε0+𝒳1θs(Φ~ετ(z~ε+))𝒳1θs(Φ~ετ(z~ε))dτ.B=-\varepsilon\int^{+\infty}_{0}\mathcal{X}^{1}\theta^{\rm s}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-\mathcal{X}^{1}\theta^{\rm s}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))d\tau.

As for the integral BB, ε𝒳1θs\varepsilon\mathcal{X}^{1}\theta^{\rm s} is O(ε)O(\varepsilon). So, by Lemma D.4 from Appendix D, we can express the integral in terms of the unperturbed system plus an error term:

B=ε0+𝒳1θs(Φ~0τ(z~0+))𝒳1θs(Φ~0τ(z~0))dτ+O(ε1+ϱ).B=-\varepsilon\int^{+\infty}_{0}\mathcal{X}^{1}\theta^{\rm s}(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-\mathcal{X}^{1}\theta^{\rm s}(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))d\tau+O(\varepsilon^{1+\varrho}).

Returning to the integral (5.41), we now use the normal form (4.3) of H0H_{0} given by Proposition 4.1, yielding

(5.43) H0Is=h0Is(Is)+(ξs)g1Is(Is)+(ξs)2g2Is(Is,ξs),\frac{\partial H_{0}}{\partial I^{{\rm s}}}=\frac{\partial h_{0}}{\partial I^{{\rm s}}}(I^{{\rm s}})+(\xi^{\rm s})\frac{\partial g_{1}}{\partial I^{{\rm s}}}(I^{{\rm s}})+(\xi^{\rm s})^{2}\frac{\partial g_{2}}{\partial I^{{\rm s}}}(I^{{\rm s}},\xi^{\rm s}),

where ξs=xsys\xi^{\rm s}=x^{\rm s}y^{\rm s}.

Thus, the integral AA given by (5.41) breaks into three parts

(5.44) A1=\displaystyle A_{1}= 0+h0Is(Φ~ετ(z~ε+))h0Is(Φ~ετ(z~ε))dτ,\displaystyle\displaystyle-\int^{+\infty}_{0}\frac{\partial h_{0}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-\frac{\partial h_{0}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))d\tau,
(5.45) A2=\displaystyle A_{2}= 0+ξsg1sIs(Φ~ετ(z~ε+))ξsg1sIs(Φ~ετ(z~ε))dτ,\displaystyle\displaystyle-\int^{+\infty}_{0}\xi^{{\rm s}}\frac{\partial g_{1}^{\rm s}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-\xi^{{\rm s}}\frac{\partial g_{1}^{\rm s}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))d\tau,
(5.46) A3=\displaystyle A_{3}= 0+(ξs)2g2sIs(Φ~ετ(z~ε+))(ξs)2g2sIs(Φ~ετ(z~ε))dτ.\displaystyle\displaystyle-\int^{+\infty}_{0}(\xi^{{\rm s}})^{2}\frac{\partial g_{2}^{\rm s}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-(\xi^{{\rm s}})^{2}\frac{\partial g_{2}^{\rm s}}{\partial I^{{\rm s}}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))d\tau.

From Lemma 5.1 equation (5.11) we have ξs(Φ~(z~ε+))=O(ε2)\xi^{\rm s}(\tilde{\Phi}(\tilde{z}^{+}_{\varepsilon}))=O(\varepsilon^{2}) and ξs(Φ~(z~ε))=O(ε)\xi^{\rm s}(\tilde{\Phi}(\tilde{z}_{\varepsilon}))=O(\varepsilon) in (5.46). Thus, we can immediately obtain that A3A_{3} is O(ε2)O(\varepsilon^{2}).

We use the integral form of the Mean Value Theorem to rewrite the integral A1A_{1}. Recall that

Fx(b)Fx(a)=(ba)012Fx2(a+t(ba))𝑑t\frac{\partial F}{\partial x}(b)-\frac{\partial F}{\partial x}(a)=(b-a)\int^{1}_{0}\frac{\partial^{2}F}{\partial x^{2}}\left(a+t(b-a)\right)dt

Applying this result to A1A_{1} for b=Is(Φ~ετ(z~ε+))b=I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon})), a=Is(Φ~ετ(z~ε))a=I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon})), and F=h0IsF=\frac{\partial h_{0}}{\partial I^{{\rm s}}}, yields

(5.47) A1=0+(Is(Φ~ετ(z~ε+))Is(Φ~ετ(z~ε)))Cε(τ)𝑑τ,A_{1}=-\int^{+\infty}_{0}(I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon})))C_{\varepsilon}(\tau)\,d\tau,

where CεC_{\varepsilon} stands for the integral

(5.48) Cε(τ)=012h0(Is)2[Is(Φ~ετ(z~ε))+t(Is(Φ~ετ(z~ε+))Is(Φ~ετ(z~ε)))]𝑑t.C_{\varepsilon}(\tau)=\int^{1}_{0}\frac{\partial^{2}h_{0}}{\partial(I^{{\rm s}})^{2}}\left[I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))+t\left(I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))\right)\right]dt.

We evaluate the expression Is(Φ~ετ(z~ε+))Is(Φ~ετ(z~ε))I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-I^{{\rm s}}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon})) in (5.47) by invoking Corollary 5.5, obtaining

(5.49) ε(h0I(I0)))10+((𝒳1H0)(Φ~0τ+ς(z~0+))(𝒳1H0)(Φ~0τ+ς(z~0)))dς+ε(h0Is(I0)))1g1(I0)0+((𝒳1ξs)(Φ~0τ+ς(z~0+))(𝒳1ξs)(Φ~0τ+ς(z~0)))dς+O(ε2).\begin{split}&-\varepsilon\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}_{0}))\right)d\varsigma\\ &+\varepsilon\left(\frac{\partial h_{0}}{\partial I^{\rm s}}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}_{0}))\right)d\varsigma\\ &+O\left(\varepsilon^{2}\right).\end{split}

The next part of the integrand is

Cε(τ)=012h0(Is)2[Is(Φ~ετ(z~ε))+t(Is(Φ~ετ(z~ε+))Is(Φ~ετ(z~ε)))]𝑑t.C_{\varepsilon}(\tau)=\int^{1}_{0}\frac{\partial^{2}h_{0}}{\partial(I^{{\rm s}})^{2}}\left[I^{{\rm s}}\left(\tilde{\Phi}^{\tau}_{\varepsilon}\left(\tilde{z}_{\varepsilon}\right)\right)+t\left(I^{{\rm s}}\left(\tilde{\Phi}^{\tau}_{\varepsilon}\left(\tilde{z}^{+}_{\varepsilon}\right)\right)-I^{{\rm s}}\left(\tilde{\Phi}^{\tau}_{\varepsilon}\left(\tilde{z}_{\varepsilon}\right)\right)\right)\right]dt.

Using Gronwall’s inequality – Lemma (C.1) –, we can write

Cε(τ)=C0(τ)+O(εϱ),C_{\varepsilon}(\tau)=C_{0}(\tau)+O(\varepsilon^{\varrho}),

where 0<ϱ<10<\varrho<1. However, when ε=0\varepsilon=0, IsI^{{\rm s}} along the flow of the footpoint z~0+\tilde{z}^{+}_{0} is equal to IsI^{{\rm s}} along the flow of the homoclinic point z~0\tilde{z}_{0}. Since Is(z~0)=I0I^{{\rm s}}\left(\tilde{z}_{0}\right)=I_{0} we obtain

C0(τ)=2h0I2(I0),C_{0}(\tau)=\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0}),

which is a constant.

Putting these expressions together, we can write A1A_{1} as

(5.50) ε(2h0I2(I0))(h0I(I0)))1+00+((𝒳1H0)(Φ~0τ+ς(z~0+))(𝒳1H0)(Φ~0τ+ς(z~0)))dςdτε(2h0I2(I0))(h0I(I0)))1g1(I0)0+0+((𝒳1ξs)(Φ~0τ+ς(z~0+))(𝒳1ξs)(Φ~0τ+ς(z~0)))dςdτ+O(ε1+ϱ).\begin{split}&\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}\left(I_{0}\right)\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}\int^{+\infty}_{0}\int_{0}^{+\infty}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}_{0}))\right)d\varsigma d\tau\\ &-\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{0}^{+\infty}\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}_{0}))\right)d\varsigma d\tau\\ &+O\left(\varepsilon^{1+\varrho}\right).\end{split}

We will now write the double integrals in (5.50) in a simpler form. We show the details of the computation from the first double integral that appears in (5.50), as the second double integral can be treated in a similar fashion. Denote by s\mathscr{I}^{\rm s} the following improper integral

(5.51) s(τ)=τ+((𝒳1H0)(Φ~0υ(z~0+))(𝒳1H0)(Φ~0υ(z~0)))𝑑υ.\begin{split}\mathscr{I}^{\rm s}(\tau)=&-\int_{\tau}^{+\infty}((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\upsilon}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\upsilon}_{0}(\tilde{z}_{0})))d\upsilon.\end{split}

Since (𝒳1H0)(Φ~0υ(z~0+))(𝒳1H0)(Φ~0υ(z~0)))(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\upsilon}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\upsilon}_{0}(\tilde{z}_{0}))) approaches 0 exponentially as υ+\upsilon\to+\infty, the above integral is convergent and moreover

ddτ(s(τ))=(𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)).\frac{d}{d\tau}(\mathscr{I}^{\rm s}(\tau))=(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0})).

That is, s(τ)\mathscr{I}^{\rm s}(\tau) is the antiderivative of τ(𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0))\tau\mapsto(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0})) satisfying the condition that it equals 0 at ++\infty.

Making the change of variable υ=σ+τ\upsilon=\sigma+\tau with dυ=dσd\upsilon=d\sigma the double integral in (5.50) becomes

(5.52) 0+τ+((𝒳1H0)(Φ~0υ(z~0+))(𝒳1H0)(Φ~0υ(z~0)))𝑑υ𝑑τ=0+s(τ)𝑑τ.\begin{split}\int_{0}^{+\infty}\int_{\tau}^{+\infty}((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\upsilon}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\upsilon}_{0}(\tilde{z}_{0})))d\upsilon d\tau=-\int_{0}^{+\infty}\mathscr{I}^{\rm s}(\tau)d\tau.\end{split}

Using Integration by Parts we obtain

(5.53) 0+s(τ)𝑑τ=τs(τ)|0++0+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))τ𝑑τ=0+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))τ𝑑τ.\begin{split}-\int_{0}^{+\infty}\mathscr{I}^{\rm s}(\tau)d\tau=&-\tau\mathscr{I}^{\rm s}(\tau)\biggr{|}_{0}^{+\infty}\\ &+\int_{0}^{+\infty}((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0})))\tau d\tau\\ =&\int_{0}^{+\infty}((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0})))\tau d\tau.\end{split}

In the above, the quantity τs(τ)\tau\mathscr{I}^{\rm s}(\tau) obviously equals 0 at τ=0\tau=0, and equals 0 when τ+\tau\to+\infty since, by l’Hospital Rule

limτ+s(τ)τ1=limτ+(𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0))τ2=0,\lim_{\tau\to+\infty}\frac{\mathscr{I}^{\rm s}(\tau)}{\tau^{-1}}=\lim_{\tau\to+\infty}-\frac{(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))}{\tau^{-2}}=0,

since (𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0))(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0})) approaches 0 at exponential rate as τ+\tau\to+\infty.

A similar computation can be done to write the second double integral that appears in (5.50) as a single integral.

Thus, we obtain the following expression of A1A_{1}:

(5.54) ε(2h0I2(I0))(h0I(I0))10+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))τdτε(2h0I2(I0))(h0I(I0)))1g1(I0)0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))τdτ+O(ε1+ϱ).\begin{split}&\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int^{+\infty}_{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &-\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+O\left(\varepsilon^{1+\varrho}\right).\end{split}

Finally, we turn to the integral A2A_{2} given by (5.45). Using Lemma 5.1 equations (5.13) and (5.11), (5.34), as well as integration by parts similarly to above, we express A2A_{2} as

(5.55) 0+(ξsg1I)(Φ~ετ(z~ε+))(ξsg1I)(Φ~ετ(z~ε))dτ=(g1I(I0))0+(ξs((Φ~ετ(z~ε+)))ξs((Φ~ετ(z~ε))))𝑑τ+O(ε1+ϱ)=ε(g1I(I0))0+0+((𝒳1ξs)(Φ~0τ+ς(z~0+))(𝒳1ξs)(Φ~0τ+ς(z~0)))𝑑ς𝑑τ+O(ε1+ϱ)=ε(g1I(I0))0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))τ𝑑τ+O(ε1+ϱ)\begin{split}-&\int_{0}^{+\infty}(\xi^{{\rm s}}\frac{\partial g_{1}}{\partial I})(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon}))-(\xi^{{\rm s}}\frac{\partial g_{1}}{\partial I})(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))d\tau\\ =&-\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)\int_{0}^{+\infty}(\xi^{\rm s}((\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{+}_{\varepsilon})))-\xi^{\rm s}((\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))))d\tau\\ &+O(\varepsilon^{1+\varrho})\\ =&\varepsilon\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)\int_{0}^{+\infty}\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau+\varsigma}_{0}(\tilde{z}_{0}))\right)d\varsigma d\tau\\ &+O(\varepsilon^{1+\varrho})\\ =&\varepsilon\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+O(\varepsilon^{1+\varrho})\end{split}

In the above, we have used that

(𝒳0+ε𝒳1)ξs=𝒳0ξs+ε𝒳1ξs={ξs,H0}+ε𝒳1ξs=ε𝒳1ξs+O(ε2).(\mathcal{X}^{0}+\varepsilon\mathcal{X}^{1})\xi^{s}=\mathcal{X}^{0}\xi^{s}+\varepsilon\mathcal{X}^{1}\xi^{s}=\{\xi^{s},H_{0}\}+\varepsilon\mathcal{X}^{1}\xi^{s}=\varepsilon\mathcal{X}^{1}\xi^{s}+O(\varepsilon^{2}).

Combining (5.54) and (5.55) we obtain

(5.56) θs(z~ε+)θs(z~ε)=ε(2h0I2(I0))(h0I(I0))10+((𝒳1H0)(Φ~0τ(z~0+))(𝒳1H0)(Φ~0τ(z~0)))τdτε(2h0I2(I0))(h0I(I0)))1g1(I0)0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))τdτ+ε(g1I(I0))0+((𝒳1ξs)(Φ~0τ(z~0+))(𝒳1ξs)(Φ~0τ(z~0)))τ𝑑τ+O(ε2)\begin{split}\theta^{\rm s}\left(\tilde{z}^{+}_{\varepsilon}\right)-&\theta^{\rm s}\left(\tilde{z}_{\varepsilon}\right)\\ =&\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int^{+\infty}_{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &-\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+\varepsilon\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)\int_{0}^{+\infty}\left((\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{+}_{0}))-(\mathcal{X}^{1}\xi^{\rm s})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+O(\varepsilon^{2})\end{split}

In the above, we have also replaced the error term O(ε1+ϱ)O(\varepsilon^{1+\varrho}) by O(ε2)O(\varepsilon^{2}), by calling the same argument as in the proof of Proposition 5.2.

A similar computation yields:

(5.57) θu(z~ε)θu(z~ε)=ε(2h0I2(I0))(h0I(I0))10((𝒳1H0)(Φ~0τ(z~0))(𝒳1H0)(Φ~0τ(z~0)))τdτ+ε(2h0I2(I0))(h0I(I0)))1g1(I0)0((𝒳1ξu)(Φ~0τ(z~0))(𝒳1ξu)(Φ~0τ(z~0)))τdτε(g1I(I0))0((𝒳1ξu)(Φ~0τ(z~0))(𝒳1ξu)(Φ~0τ(z~0)))τ𝑑τ+O(ε2)\begin{split}\theta^{\rm u}\left(\tilde{z}^{-}_{\varepsilon}\right)-&\theta^{\rm u}\left(\tilde{z}_{\varepsilon}\right)\\ =&-\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0})\right)^{-1}\int_{-\infty}^{0}\left((\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}H_{0})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})\int^{0}_{-\infty}\left((\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))\right.\\ &\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\left.-(\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &-\varepsilon\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)\int^{0}_{-\infty}\left((\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}^{-}_{0}))-(\mathcal{X}^{1}\xi^{\rm u})(\tilde{\Phi}^{\tau}_{0}(\tilde{z}_{0}))\right)\tau d\tau\\ &+O(\varepsilon^{2})\end{split}

Subtracting (5.57) from (5.56) yields θs(z~ε+)θu(z~ε)=θs(z~ε)θu(z~ε)\theta^{\rm s}\left(\tilde{z}^{+}_{\varepsilon}\right)-\theta^{\rm u}\left(\tilde{z}^{-}_{\varepsilon}\right)=\theta^{\rm s}\left(\tilde{z}_{\varepsilon}\right)-\theta^{\rm u}\left(\tilde{z}_{\varepsilon}\right), plus an epsilon order term consisting of the sum of six integrals, plus an error term of order O(ε2)O(\varepsilon^{2}). Using the notation (5.38), four of these integrals are

(5.58) ε(2h0I2(I0))(h0I(I0)))1g1(I0)J+,ε(2h0I2(I0))(h0I(I0)))1g1(I0)J,ε(g1I(I0))J+,ε(g1I(I0))J.\begin{split}&-\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})J^{+},\\ &-\varepsilon\left(\frac{\partial^{2}h_{0}}{\partial I^{2}}(I_{0})\right)\left(\frac{\partial h_{0}}{\partial I}(I_{0}))\right)^{-1}g_{1}(I_{0})J^{-},\\ &\varepsilon\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)J^{+},\\ &\varepsilon\left(\frac{\partial g_{1}}{\partial I}(I_{0})\right)J^{-}.\end{split}

By Corollary 5.6, since J++J=0J^{+}+J^{-}=0, the sum of the first two expressions in (5.58) equals 0, and the sum of the last two expressions in (5.58) equals 0.

Also, we recall from Section 5.1.2, that for a given unperturbed homoclinic point z~0\tilde{z}_{0} we selected a perturbed homoclinic point z~ε\tilde{z}_{\varepsilon} satisfying condition (A-iii-b) (5.4), that is θs(z~ε)θu(z~ε)=θs(z~0)θu(z~0)=Δ(I(z~0))\theta^{\rm s}(\tilde{z}_{\varepsilon})-\theta^{\rm u}(\tilde{z}_{\varepsilon})=\theta^{\rm s}(\tilde{z}_{0})-\theta^{\rm u}(\tilde{z}_{0})=\Delta(I(\tilde{z}_{0})).

Combining these results we obtain (5.40). ∎

Acknowledgements

We are grateful to Rodney Anderson and Angel Jorba for discussions and comments.

Appendix A Normally hyperbolic invariant manifolds

We briefly recall the notion of a normally hyperbolic invariant manifold.

Definition A.1.

Let MM be a 𝒞r\mathcal{C}^{r}-smooth manifold, Φt\Phi^{t} a 𝒞r\mathcal{C}^{r}-flow on MM. A submanifold (with or without boundary) Λ\Lambda of MM is a normally hyperbolic invariant manifold (NHIM) for Φt\Phi^{t} if it is invariant under Φt\Phi^{t}, and there exists a splitting of the tangent bundle of TMTM into sub-bundles over Λ\Lambda

(A.1) TzM=EzuEzsTzΛ,zΛT_{z}M=E^{{\rm u}}_{z}\oplus E^{{\rm s}}_{z}\oplus T_{z}\Lambda,\quad\forall z\in\Lambda

that are invariant under DΦtD\Phi^{t} for all tt\in\mathbb{R}, and there exist rates

λλ+<λc<0<μc<μμ+\lambda_{-}\leq\lambda_{+}<\lambda_{c}<0<\mu_{c}<\mu_{-}\leq\mu_{+}

and a constant C>0{C}>0, such that for all xΛx\in\Lambda we have

(A.2) CetλvDΦt(z)(v)Cetλ+v for all t0, if and only if vEzs,Cetμ+vDΦt(z)(v)Cetμv for all t0, if and only if vEzu,Ce|t|λcvDΦt(z)(v)Ce|t|μcv for all t, if and only if vTzΛ.\begin{split}{C}e^{t\lambda_{-}}\|v\|\leq\|D\Phi^{t}(z)(v)\|\leq{C}e^{t\lambda_{+}}\|v\|\textrm{ for all }t\geq 0,&\textrm{ if and only if }v\in E^{{\rm s}}_{z},\\ {C}e^{t\mu_{+}}\|v\|\leq\|D\Phi^{t}(z)(v)\|\leq{C}e^{t\mu_{-}}\|v\|\textrm{ for all }t\leq 0,&\textrm{ if and only if }v\in E^{{\rm u}}_{z},\\ {C}e^{|t|\lambda_{c}}\|v\|\leq\|D\Phi^{t}(z)(v)\|\leq{C}e^{|t|\mu_{c}}\|v\|\textrm{ for all }t\in\mathbb{R},&\textrm{ if and only if }v\in T_{z}\Lambda.\end{split}

In the case when Φt\Phi^{t} is a Hamiltonian flow, the rates can be chosen so that

λ=μ+,λ+=μ, and λc=μc.\lambda_{-}=-\mu_{+},\,\lambda_{+}=-\mu_{-},\,\textrm{ and }\lambda_{c}=-\mu_{c}.

The regularity of the manifold Λ\Lambda depends on the rates λ\lambda^{-}, λ+\lambda^{+}, μ\mu^{-}, μ+\mu^{+}, λc\lambda_{c}, and μc\mu_{c}. More precisely, Λ\Lambda is 𝒞\mathcal{C}^{\ell}-differentiable, with r1\ell\leq r-1, provided that

(A.3) μc+λ+<0,λc+μ>0.\begin{split}&\ell{\mu}_{c}+{\lambda}_{+}<0,\\ &\ell{\lambda}_{c}+{\mu}_{-}>0.\end{split}

The manifold Λ\Lambda has associated unstable and stable manifolds of Λ\Lambda, denoted Wu(Λ)W^{{\rm u}}(\Lambda) and Ws(Λ)W^{{\rm s}}(\Lambda), respectively, which are 𝒞1\mathcal{C}^{\ell-1}-differentiable. They are foliated by 11-dimensional unstable and stable manifolds (fibers) of points, Wu(z)W^{{\rm u}}(z), Ws(z)W^{{\rm s}}(z), zΛz\in\Lambda, respectively, which are as smooth as the flow.

These manifolds are defined by:

(A.4) Ws(Λ)={y|d(Φεt(y),Λ)0 as t+ }={y|d(Φεt(y),Λ)Cyetλ+,t0},Wu(Λ)={y|d(Φεt(y),Λ)0 as t }={y|d(Φεt(y),Λ)Cyetμ,t0},Ws(x)={y|d(Φt(y),Φt(x))<Cyetλ+,t0},Wu(x)={y|d(Φt(y),Φt(x))<Cyetμ,t0}.\begin{split}W^{\rm s}(\Lambda)=&\{y\,|\,d({\Phi}^{t}_{\varepsilon}(y),\Lambda)\rightarrow 0\textrm{ as {$t\to+\infty$} }\}\\ =&\{y\,|\,d({\Phi}^{t}_{\varepsilon}(y),\Lambda)\leq C_{y}e^{t\lambda_{+}},t\geq 0\},\\ W^{\rm u}(\Lambda)=&\{y\,|\,d({\Phi}^{t}_{\varepsilon}(y),\Lambda)\rightarrow 0\textrm{ as {$t\to-\infty$} }\}\\ =&\{y\,|\,d({\Phi}^{t}_{\varepsilon}(y),\Lambda)\leq C_{y}e^{t\mu_{-}},t\geq 0\},\\ W^{{\rm s}}(x)=&\{y\,|\,d(\Phi^{t}(y),\Phi^{t}(x))<C_{y}e^{t\lambda_{+}},\,t\geq 0\},\\ W^{{\rm u}}(x)=&\{y\,|\,d(\Phi^{t}(y),\Phi^{t}(x))<C_{y}e^{t\mu_{-}},\,t\leq 0\}.\end{split}

The fibers Wu(x)W^{{\rm u}}(x), Ws(x)W^{{\rm s}}(x) are not invariant by the flow, but equivariant in the sense that

(A.5) Φt(Wu(z))=Wu(Φt(z)),Φt(Ws(z))=Ws(Φt(z)).\begin{split}\Phi^{t}(W^{\rm u}(z))&=W^{\rm u}(\Phi^{t}(z)),\\ \Phi^{t}(W^{\rm s}(z))&=W^{\rm s}(\Phi^{t}(z)).\end{split}

Since Ws,u(Λ)=zΛWs,u(z)W^{{\rm s},{\rm u}}(\Lambda)=\bigcup_{z\in\Lambda}W^{s,u}(z), we can define the projections along the fibers

(A.6) Ω+:Ws(Λ)Λ,Ω+(z)=z+ iff zWs(z+),Ω:Wu(Λ)Λ,Ω(z)=z iff zWu(z).\begin{split}\Omega^{+}:W^{{\rm s}}(\Lambda)\to\Lambda,\quad&\Omega^{+}(z)=z^{+}\textrm{ iff }z\in W^{{\rm s}}(z^{+}),\\ \Omega^{-}:W^{{\rm u}}(\Lambda)\to\Lambda,\quad&\Omega^{-}(z)=z^{-}\textrm{ iff }z\in W^{{\rm u}}(z^{-}).\end{split}

The point z+Λz^{+}\in\Lambda is characterized by

(A.7) d(Φt(z),Φt(z+))Czetλ+, for all t0.d({\Phi}^{t}(z),{\Phi}^{t}(z^{+}))\leq C_{z}e^{t\lambda_{+}},\quad\textrm{ for all }t\geq 0.

and the point zΛz^{-}\in\Lambda by

(A.8) d(Φt(z),Φt(z))Czetμ, for all t0,d({\Phi}^{t}(z),{\Phi}^{t}(z^{-}))\leq C_{z}e^{t\mu_{-}},\quad\textrm{ for all }t\leq 0,

for some Cz>0C_{z}>0.

For our applications, the most important result about NHIM’s is that they persist when we perturb the flow. This is the fundamental result of [Fen72, HPS77, Pes04]. In the case when the manifold has a boundary, the persistence result requires a step of extending the flow. This makes that the persistent manifold is not invariant but only locally invariant and not unique (it depends on the extension).

When we are given a family of flows, it is possible to choose the extensions depending smoothly on parameters and obtain that the manifolds depend smoothly on parameters.

The precise meaning of the smooth dependence is that the we can find diffeomorphisms kε:Λ0Λεk_{\varepsilon}:\Lambda_{0}\rightarrow\Lambda_{\varepsilon}. The maps kε(x)k_{\varepsilon}(x) are jointly CrC^{r} as functions of x,εx,\varepsilon. The proof of this well known result is not very difficult. It suffices to consider an extended flow Φ~t(x,ε)=(Φεt(x),ε)\tilde{\Phi}^{t}(x,\varepsilon)=(\Phi_{\varepsilon}^{t}(x),\varepsilon), which is a small perturbation of Φ~0t(x,ε)=(Φ0t(x),ε)\tilde{\Phi}^{t}_{0}(x,\varepsilon)=(\Phi_{0}^{t}(x),\varepsilon). The regularity of the NHIM of Φ~t\tilde{\Phi}^{t} gives the claimed regularity of the NHIM of Φt\Phi^{t} with respect to parameters.

From the same proof (using the invariant objects of the extended flow), it easily follows the regularity with respect to parameters of the stable and unstable bundles and the stable and unstable manifolds.

Appendix B Scattering map

Assume that Wu(Λ)W^{\rm u}(\Lambda), Ws(Λ)W^{\rm s}(\Lambda) have a transverse intersection along a manifold Γ\Gamma satisfying:

(B.1) TzΓ=TzWs(Λ)TzWu(Λ), for all zΓ,TzM=TzΓTzWu(z)TzWs(z+), for all zΓ.\begin{split}&T_{z}\Gamma=T_{z}W^{\rm s}(\Lambda)\cap T_{z}W^{\rm u}(\Lambda),\textrm{ for all }z\in\Gamma,\\ &T_{z}M=T_{z}\Gamma\oplus T_{z}W^{\rm u}(z^{-})\oplus T_{z}W^{\rm s}(z^{+}),\textrm{ for all }z\in\Gamma.\end{split}

Under these conditions the projection mappings Ω±\Omega^{\pm} restricted to Γ\Gamma are local diffeomorphisms. We can restrict Γ\Gamma if necessary so that Ω±\Omega^{\pm} are diffeomorphisms from Γ\Gamma onto open subsets U±U^{\pm} in Λ\Lambda.

Definition B.1.

A homoclinic channel is a homoclinic manifold Γ\Gamma satisfying the strong transversality condition B.1, and such that

ΩΓ±:ΓU±:=Ω±(Γ)\Omega^{\pm}_{\mid\Gamma}:\Gamma\to U^{\pm}:=\Omega^{\pm}(\Gamma)

is a 𝒞1\mathcal{C}^{\ell-1}-diffeomorphism.

Definition B.2.

Given a homoclinic channel Γ\Gamma, the scattering map associated to Γ\Gamma is defined as

σ:=σΓ:UΛU+Λ,σ=Ω+(Ω)1.\begin{split}\sigma:=&\sigma^{\Gamma}:U^{-}\subseteq\Lambda\to U^{+}\subseteq\Lambda,\\ \sigma=&\Omega^{+}\circ(\Omega^{-})^{-1}.\end{split}

Equivalently, σ(z)=z+\sigma(z^{-})=z^{+}, provided that Wu(z)W^{\rm u}(z^{-}) intersects Ws(z+)W^{\rm s}(z^{+}) at a unique point zΓz\in\Gamma.

The meaning of the scattering map is that, given a homoclinic excursion, it has two orbits in the manifold is asymptotic to. It is asymptotic to an orbit in the past and to another orbit in the future. The scattering map considers the future asymptotic orbit as a function of the asymptotic in the past. When we consider all the homoclinic orbits in a homoclinic channel we obtain a scattering map from an open domain. The intuition of the scattering map is that if we observe the orbit for long times, we just measure the effect of the homoclinic excursion on the asymptotic behavior. The scattering map is a very economical way of studying these excursions since it is a map only on the NHIM. Furthermore, as we will see now, it satisfies remarkable geometric properties.

Due to (A.5), the scattering map satisfies the following property

(B.2) ΦTσΓ=σΦT(Γ)ΦT\Phi^{T}\circ\sigma^{\Gamma}=\sigma^{\Phi^{T}(\Gamma)}\circ\Phi^{T}

for any TT\in\mathbb{R}.

If MM is a symplectic manifold, Φt\Phi^{t} is a Hamiltonian flow on MM, and ΛM\Lambda\subseteq M is symplectic, then the scattering map is symplectic. If the flow is exact Hamiltonian, the scattering map is exact symplectic. For details see [DdlLS08].

In a similar fashion, we can define heteroclinic channels and associated scattering maps.

Given two NHIM’s Λ1\Lambda^{1} and Λ2\Lambda^{2}, we can define the projection mappings Ω±,i:Ws,u(Λi)Λi\Omega^{\pm,i}:W^{{\rm s},{\rm u}}(\Lambda^{i})\to\Lambda^{i} for i=1,2i=1,2. Assume that Wu(Λ1)W^{\rm u}(\Lambda^{1}) intersects transversally Ws(Λ2)W^{\rm s}(\Lambda^{2}) along a heteroclinic manifold Γ\Gamma so that:

(B.3) TzΓ=TzWu(Λ1)TzWs(Λ2), for all zΓ,TzM=TzΓTzWu(z)TzWs(z+), for all zΓ,\begin{split}&T_{z}\Gamma=T_{z}W^{\rm u}(\Lambda^{1})\cap T_{z}W^{\rm s}(\Lambda^{2}),\textrm{ for all }z\in\Gamma,\\ &T_{z}M=T_{z}\Gamma\oplus T_{z}W^{\rm u}(z^{-})\oplus T_{z}W^{\rm s}(z^{+}),\textrm{ for all }z\in\Gamma,\end{split}

where z=Ω,1(z)Λ1z^{-}=\Omega^{-,1}(z)\in\Lambda^{1} and z+=Ω+,2(z)Λ2z^{+}=\Omega^{+,2}(z)\in\Lambda^{2}.

We can restrict Γ\Gamma so that Ω,1:ΓΛ1\Omega^{-,1}:\Gamma\to\Lambda^{1} and Ω+,2:ΓΛ2\Omega^{+,2}:\Gamma\to\Lambda^{2} are diffeomorphisms onto their corresponding images.

Definition B.3.

A heteroclinic channel is a heteroclinic manifold Γ\Gamma satisfying the strong transversality condition B.3, and such that

ΩΓ,1:ΓU:=Ω,1(Γ)Λ1,ΩΓ+,2:ΓU+:=Ω+,2(Γ)Λ2,\begin{split}\Omega^{-,1}_{\mid\Gamma}:\Gamma\to U^{-}:=\Omega^{-,1}(\Gamma)\subseteq\Lambda^{1},\\ \Omega^{+,2}_{\mid\Gamma}:\Gamma\to U^{+}:=\Omega^{+,2}(\Gamma)\subseteq\Lambda^{2},\end{split}

are 𝒞l1\mathcal{C}^{l-1}-diffeomorphisms.

Definition B.4.

Given a heteroclinic channel Γ\Gamma, the scattering map associated to Γ\Gamma is defined as

σ:=σΓ:UΛ1U+Λ2,σ=Ω+,2(Ω,1)1.\begin{split}\sigma:=&\sigma^{\Gamma}:U^{-}\subseteq\Lambda^{1}\to U^{+}\subseteq\Lambda^{2},\\ \sigma=&\Omega^{+,2}\circ(\Omega^{-,1})^{-1}.\end{split}

From the result of the regularity with respect to parameters of the stable and unstable manifolds and the fact that the scattering map is expressed in terms of transverse intersections, we obtain that the scattering map depends smoothly on parameters. Thus, the goal of this paper is not to prove the derivative of the scattering map with respect to the perturbation parameter exists, only to give explicit formulas knowing that the derivative exists.

Appendix C Gronwall’s inequality

In this section we apply Gronwall’s Inequality to estimate the error between the solution of an unperturbed system and the solution of the perturbed system, over a time of logarithmic order with respect to the size of the perturbation.

Lemma C.1.

Consider the following differential equations:

(C.1) ddtz(t)\displaystyle\frac{d}{dt}{z}(t) =\displaystyle= 𝒳0(z,t)\displaystyle\mathcal{X}^{0}(z,t)
(C.2) ddtz(t)\displaystyle\frac{d}{dt}{z}(t) =\displaystyle= 𝒳0(z,t)+ε𝒳1(z,t,ε)\displaystyle\mathcal{X}^{0}(z,t)+\varepsilon\mathcal{X}^{1}(z,t,\varepsilon)

Assume that 𝒳0,𝒳1\mathcal{X}^{0},\mathcal{X}^{1} are uniformly Lipschitz continuous in the variable zz, C0C_{0} is the Lipschitz constant of 𝒳0\mathcal{X}^{0}, and 𝒳1\mathcal{X}^{1} is bounded with 𝒳1C1\|\mathcal{X}^{1}\|\leq C_{1}, for some C0,C1>0C_{0},C_{1}>0. Let z0z_{0} be a solution of the equation (C.1) and zεz_{\varepsilon} be a solution of the equation (C.2) such that

(C.3) z0(t0)zε(t0)<cε.\|z_{0}(t_{0})-z_{\varepsilon}(t_{0})\|<c\varepsilon.

Then, for 0<ϱ0<10<\varrho_{0}<1, k1ϱ0C0k\leq\frac{1-{\varrho_{0}}}{C_{0}}, and K=c+C1C0K=c+\frac{C_{1}}{C_{0}}, we have

(C.4) z0(t)zε(t)<Kεϱ0, for 0tt0kln(1/ε).\|z_{0}(t)-z_{\varepsilon}(t)\|<K\varepsilon^{\varrho_{0}},\textrm{ for }0\leq t-t_{0}\leq k\ln(1/\varepsilon).

For a proof, see [GdlLM21].

Appendix D Master lemmas

In this section we recall some abstract Melnikov-type integral operators and some of their properties from [GdlLM21].

Consider a system as in (2.2) and the extended systems as in (2.6).

Assume that, for some ε1>0\varepsilon_{1}>0, and for each ε(ε1,ε1)\varepsilon\in(-\varepsilon_{1},\varepsilon_{1}), there exists a normally hyperbolic invariant manifold Λ~ε{\tilde{\Lambda}}_{\varepsilon} for Φ~ετ\tilde{\Phi}^{\tau}_{\varepsilon}, as well as a homoclinic channel Γ~ε\tilde{\Gamma}_{\varepsilon}, which depend 𝒞\mathcal{C}^{\ell}-smoothly on ε\varepsilon. Associated to Γ~ε\tilde{\Gamma}_{\varepsilon} we have projections Ω±:Γ~εΩ±(Γ~ε)Λ~ε\Omega^{\pm}:\tilde{\Gamma}_{\varepsilon}\to\Omega^{\pm}(\tilde{\Gamma}_{\varepsilon})\subseteq{\tilde{\Lambda}}_{\varepsilon}, which are local diffeomorphisms. We are thinking of Φ~ετ\tilde{\Phi}^{\tau}_{\varepsilon}, Λ~ε{\tilde{\Lambda}}_{\varepsilon}, Γ~ε\tilde{\Gamma}_{\varepsilon} as perturbations of Φ~0τ\tilde{\Phi}^{\tau}_{0}, Λ~0{\tilde{\Lambda}}_{0}, Γ~0\tilde{\Gamma}_{0}.

For z~0Γ~0\tilde{z}_{0}\in\tilde{\Gamma}_{0} let z~εΓ~ε\tilde{z}_{\varepsilon}\in\tilde{\Gamma}_{\varepsilon} be the corresponding homoclinic point satisfying (5.4). Because of the smooth dependence of the normally hyperbolic manifold and of its stable and unstable manifolds on the perturbation, z~ε\tilde{z}_{\varepsilon} is O(ε)O(\varepsilon)-close to z~0\tilde{z}_{0} in the 𝒞\mathcal{C}^{\ell}-topology, that is

(D.1) z~ε=z~0+O(ε).\tilde{z}_{\varepsilon}=\tilde{z}_{0}+O(\varepsilon).

Let (z~ε,ε)M~𝐅(z~ε,ε)k(\tilde{z}_{\varepsilon},\varepsilon)\in\widetilde{M}\mapsto{\bf F}(\tilde{z}_{\varepsilon},\varepsilon)\in\mathbb{R}^{k} be a uniformly 𝒞1\mathcal{C}^{1}-smooth mapping on M~×\widetilde{M}\times\mathbb{R}.

We define the integral operators

(D.2) +(𝐅,Φ~ετ,z~ε)=0+(𝐅(Φ~ετ(z~ε+))𝐅(Φ~ετ(z~ε)))𝑑τ,(𝐅,Φ~ετ,z~ε)=0(𝐅(Φ~ετ(z~ε))𝐅(Φ~ετ(z~ε))dτ.\begin{split}\mathfrak{I}^{+}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},\tilde{z}_{\varepsilon})=&\int_{0}^{+\infty}\left(\mathbf{F}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}^{+}))-\mathbf{F}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon}))\right)d\tau,\\ \mathfrak{I}^{-}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},\tilde{z}_{\varepsilon})=&\int_{-\infty}^{0}\left(\mathbf{F}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}^{-}_{\varepsilon}))-\mathbf{F}(\tilde{\Phi}^{\tau}_{\varepsilon}(\tilde{z}_{\varepsilon})\right)d\tau.\end{split}
Lemma D.1 (Master Lemma 1).

The improper integrals (D.2) are convergent. The operators +(𝐅,Φ~ετ,zε)\mathfrak{I}^{+}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},z_{\varepsilon}) and (𝐅,Φ~ετ,zε)\mathfrak{I}^{-}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},z_{\varepsilon}) are linear in 𝐅\mathbf{F}.

Lemma D.2 (Master Lemma 2).
(D.3) 𝐅(z~ε+)𝐅(z~ε)=+((𝒳0+ε𝒳1)𝐅,Φ~ετ,z~ε),𝐅(z~ε)𝐅(z~ε)=((𝒳0+ε𝒳1)𝐅,Φ~ετ,z~ε).\begin{split}\mathbf{F}(\tilde{z}^{+}_{\varepsilon})-\mathbf{F}(\tilde{z}_{\varepsilon})=&-\mathfrak{I}^{+}((\mathcal{X}^{0}+\varepsilon\mathcal{X}^{1})\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},\tilde{z}_{\varepsilon}),\\ \mathbf{F}(\tilde{z}^{-}_{\varepsilon})-\mathbf{F}(\tilde{z}_{\varepsilon})=&\mathfrak{I}^{-}((\mathcal{X}^{0}+\varepsilon\mathcal{X}^{1})\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},\tilde{z}_{\varepsilon}).\end{split}
Lemma D.3 (Master Lemma 3).
(D.4) +(𝐅,Φ~ετ,z~ε)=+(𝐅,Φ~0τ,z~0)+O(εϱ),(𝐅,Φ~ετ,z~ε)=(𝐅,Φ~0τ,z~0)+O(εϱ),\begin{split}\mathfrak{I}^{+}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},\tilde{z}_{\varepsilon})=&\mathfrak{I}^{+}(\mathbf{F},\tilde{\Phi}^{\tau}_{0},\tilde{z}_{0})+O(\varepsilon^{\varrho}),\\ \mathfrak{I}^{-}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},\tilde{z}_{\varepsilon})=&\mathfrak{I}^{-}(\mathbf{F},\tilde{\Phi}^{\tau}_{0},\tilde{z}_{0})+O(\varepsilon^{\varrho}),\end{split}

for 0<ϱ<10<\varrho<1. The integrals on the right-hand side are evaluated with 𝒳1=𝒳1(;0)\mathcal{X}^{1}=\mathcal{X}^{1}(\cdot;0).

Lemma D.4 (Master Lemma 4).

If F=O𝒞1(ε)\textbf{F}=O_{\mathcal{C}^{1}}(\varepsilon) then

(D.5) +(𝐅,Φ~ετ,zε)=+(𝐅,Φ~0τ,z0)+O(ε1+ϱ),(𝐅,Φ~ετ,zε)=(𝐅,Φ~0τ,z0)+O(ε1+ϱ),\begin{split}\mathfrak{I}^{+}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},z_{\varepsilon})=&\mathfrak{I}^{+}(\mathbf{F},\tilde{\Phi}^{\tau}_{0},z_{0})+O(\varepsilon^{1+\varrho}),\\ \mathfrak{I}^{-}(\mathbf{F},\tilde{\Phi}^{\tau}_{\varepsilon},z_{\varepsilon})=&\mathfrak{I}^{-}(\mathbf{F},\tilde{\Phi}^{\tau}_{0},z_{0})+O(\varepsilon^{1+\varrho}),\end{split}

for 0<ϱ<10<\varrho<1. The integrals on the right-hand side are evaluated with 𝒳1=𝒳1(;0)\mathcal{X}^{1}=\mathcal{X}^{1}(\cdot;0).

The proofs of the above lemmas can be found in [GdlLM21], and similar arguments can be found in [GdlL18].

References

  • [Bel04] Edward Belbruno. Capture Dynamics and Chaotic Motions in Celestial Mechanics: With Applications to the Construction of Low Energy Transfers. Princeton University Press, 2004.
  • [BGT10] Edward Belbruno, Marian Gidea, and Francesco Topputo. Weak stability boundary and invariant manifolds. SIAM Journal on Applied Dynamical Systems, 9(3):1061–1089, 2010.
  • [BO06] Esther Barrabés and Merce Ollé. Invariant manifolds of L3 and horseshoe motion in the restricted three-body problem. Nonlinearity, 19(9):2065, 2006.
  • [Bro68] RA Broucke. Periodic orbits in the restricted three body problem with earth-moon masses. Jet Propulsion Laboratory, California Institute of Technology, 1968.
  • [Cap12] Maciej J Capinski. Computer Assisted Existence Proofs of Lyapunov Orbits at L2 and Transversal Intersections of Invariant Manifolds in the Jupiter–Sun PCR3BP. SIAM Journal on Applied Dynamical Systems, 11(4):1723–1753, 2012.
  • [CCdlL20] Renato Calleja, Alessandra Celletti, and Rafael de la Llave. Kam theory for some dissipative systems. arXiv preprint arXiv:2007.08394, 2020.
  • [CCP16] Marta Ceccaroni, Alessandra Celletti, and Giuseppe Pucacco. Halo orbits around the collinear points of the restricted three-body problem. Physica D: Nonlinear Phenomena, 317:28–42, 2016.
  • [CDMR06] E. Canalias, A. Delshams, J. J. Masdemont, and P. Roldán. The scattering map in the planar restricted three body problem. Celestial Mech. Dynam. Astronom., 95(1-4):155–171, 2006.
  • [CGdlL16] Maciej J Capiński, Marian Gidea, and Rafael de la Llave. Arnold diffusion in the planar elliptic restricted three-body problem: mechanism and numerical verification. Nonlinearity, 30(1):329, 2016.
  • [CM06] E Canalias and Josep J Masdemont. Homoclinic and heteroclinic transfer trajectories between Lyapunov orbits in the Sun-Earth and Earth-Moon systems. Discrete & Continuous Dynamical Systems-A, 14(2):261, 2006.
  • [DdlLS06] Amadeu Delshams, Rafael de la Llave, and Tere M. Seara. A geometric mechanism for diffusion in Hamiltonian systems overcoming the large gap problem: heuristics and rigorous verification on a model. Mem. Amer. Math. Soc., 179(844):viii+141, 2006.
  • [DdlLS08] Amadeu Delshams, Rafael de la Llave, and Tere M. Seara. Geometric properties of the scattering map of a normally hyperbolic invariant manifold. Adv. Math., 217(3):1096–1153, 2008.
  • [DGR16] Amadeu Delshams, Marian Gidea, and Pablo Roldan. Arnold’s mechanism of diffusion in the spatial circular restricted three-body problem: A semi-analytical argument. Physica D: Nonlinear Phenomena, 334:29–48, 2016.
  • [dlLK19] Rafael de la Llave and Florian Kogelbauer. Global persistence of Lyapunov subcenter manifolds as spectral submanifolds under dissipative perturbations. SIAM Journal on Applied Dynamical Systems, 18(4):2099–2142, 2019.
  • [Fen72] Neil Fenichel. Persistence and smoothness of invariant manifolds for flows. Indiana Univ. Math. J., 21:193–226, 1971/1972.
  • [GdlL18] Marian Gidea and Rafael de la Llave. Global Melnikov theory in Hamiltonian systems with general time-dependent perturbations. Journal of NonLinear Science, 28:1657–1707, 2018.
  • [GdlLM21] Marian Gidea, Rafael de la Llave, and Maxwell Musser. Global effect of non-conservative perturbations on homoclinic orbits. Qualitative Theory of Dynamical Systems, 20(1):1–40, 2021.
  • [GEPC17] Ioannis Gkolias, Christos Efthymiopoulos, Giuseppe Pucacco, and Alessandra Celletti. Hamiltonian formulation of the spin-orbit model with time-varying non-conservative forces. Communications in Nonlinear Science and Numerical Simulation, 51:23–38, 2017.
  • [Gio01] Antonio Giorgilli. Unstable equilibria of Hamiltonian systems. Discrete & Continuous Dynamical Systems-A, 7(4):855, 2001.
  • [GJMS01] Gerard Gómez, Angel Jorba, Josep J Masdemont, and Carles Simo. Dynamics And Mission Design Near Libration Points, Vol III: Advanced Methods For Collinear Points, volume 4. World Scientific, 2001.
  • [GLMS01] G Gómez, J Llibre, R Martínez, and C Simó. Dynamics And Mission Design Near Libration Points: Volume I: Fundamentals: The Case of Collinear Libration Points. World Scientific, 2001.
  • [HPS77] M.W. Hirsch, C.C. Pugh, and M. Shub. Invariant manifolds, volume 583 of Lecture Notes in Math. Springer-Verlag, Berlin, 1977.
  • [Jor99] Angel Jorba. A methodology for the numerical computation of normal forms, centre manifolds and first integrals of Hamiltonian systems. Experimental Mathematics, 8(2):155–195, 1999.
  • [KLMR00] Wang Sang Koon, Martin W Lo, Jerrold E Marsden, and Shane D Ross. Heteroclinic connections between periodic orbits and resonance transitions in celestial mechanics. Chaos: An Interdisciplinary Journal of Nonlinear Science, 10(2):427–469, 2000.
  • [LMS85] Jaume Llibre, Regina Martínez, and Carles Simó. Tranversality of the invariant manifolds associated to the Lyapunov family of periodic orbits near L2L_{2} in the restricted three-body problem. J. Differential Equations, 58(1):104–156, 1985.
  • [MNF87] Andrea Milani, Anna Maria Nobili, and Paolo Farinella. Non-gravitational perturbations and satellite geodesy. CRC Press, 1987.
  • [Mos58] Jürgen Moser. On the generalization of a theorem of A. Liapounoff. Communications on Pure and Applied Mathematics, 11(2):257–271, 1958.
  • [PA14] Jeffrey S Parker and Rodney L Anderson. Low-energy lunar trajectory design, volume 12. John Wiley & Sons, 2014.
  • [Pes04] Yakov B. Pesin. Lectures on partial hyperbolicity and stable ergodicity. Zurich Lectures in Advanced Mathematics. European Mathematical Society (EMS), Zürich, 2004.
  • [WZ03] Daniel Wilczak and Piotr Zgliczynski. Heteroclinic connections between periodic orbits in planar restricted circular three-body problem– a computer assisted proof. Communications in mathematical physics, 234(1):37–75, 2003.