This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Inclusion of higher-order terms in the border-collision normal form: persistence of chaos and applications to power converters.

D.J.W. Simpson and P.A. Glendinning
School of Fundamental Sciences, Massey University, Palmerston North, New Zealand
Department of Mathematics, University of Manchester, UK
Abstract

The dynamics near a border-collision bifurcation are approximated to leading order by a continuous, piecewise-linear map. The purpose of this paper is to consider the higher-order terms that are neglected when forming this approximation. For two-dimensional maps we establish conditions under which a chaotic attractor created in a border-collision bifurcation persists for an open interval of parameters beyond the bifurcation. We apply the results to a prototypical power converter model to prove the model exhibits robust chaos.

1 Introduction

While most engineering systems are designed to operate outside chaotic parameter regimes, in some situations the presence of chaos is advantageous. Examples include optical resonators for which additional energy can be stored when photons follow chaotic trajectories as without periodic motion they become trapped for longer times [1]. In mechanical energy harvesters the presence of chaos allows high-energy modes of operation to be stabilised with only a small control force [2]. Also, power converters, when run chaotically, have the advantage of increased electromagnetic compatibility due to broad spectral characteristics [3]. Regardless of whether or not chaos is desired, it is extremely helpful to understand when and why it occurs.

Power converters, and many other engineering systems, function by switching between different modes of operation. There is a growing understanding of the creation of chaos in such systems through use of the border-collision normal form (BCNF). This is a piecewise-linear (or more precisely, piecewise-affine) map that models the dynamics near parameter values at which a fixed point intersects a boundary between two modes of operation [4, 5]. The piecewise-linear nature of the BCNF makes it possible to prove results about chaotic attractors and their persistence, a phenomenon called robust chaos [6, 7, 8].

The BCNF is obtained using coordinate transformations to make the lowest order terms of the map as simple as possible. The map is then truncated, ignoring all terms that are nonlinear with respect to variables and parameters. As such, it is by no means clear that results for chaotic attractors in the BCNF carry over to the full nonlinear models originally being considered. In this paper we prove a persistence result showing that, under certain conditions, the existence of a chaotic attractor in the BCNF implies the existence of a chaotic attractor in the corresponding full model regardless of the nature of the nonlinear terms that have been neglected to form the BCNF. This shows that chaotic attractors created in border-collision bifurcations typically persist for an open interval of parameters beyond the bifurcation and justifies our use of the BCNF for determining when chaotic attractors are created.

In earlier work [9] we described a computational method for determining when the two-dimensional BCNF is chaotic by establishing the existence of a trapping region in phase space and a contracting-invariant expanding cone in tangent space. The trapping region guarantees the existence of an attractor, while the cone ensures it has a positive Lyapunov exponent. In this paper we use the robustness of these objects to demonstrate persistence with respect to higher-order terms. The most difficult technical issue we have to overcome is in showing that the higher-order terms do not cause orbits of the map to accumulate new symbolic itineraries that cannot be handled by the cone.

The remainder of the paper is arranged as follows. In section 2 we describe the two-dimensional BCNF and its relation to nonlinear models. We then state our persistence result in section 3. In section 4 we provide a detailed description of the geometric structure of the phase space of the BCNF that we use in section 5 to prove the persistence result. The result, together with computational methods of [9], are then applied to a model of power converters with pulse-width modulated control, section 6. We believe this verifies, for the first time, the long-standing belief from numerical simulations that chaos occurs robustly in these types of systems. Finally section 7 contains concluding remarks.

2 Border-collision bifurcations and the border-collision normal form

The study of border-collision bifurcations has a long history dating back to at least Feigin and coworkers in the context of relay control [10, 11]. The two-dimensional BCNF was introduced by Nusse and Yorke in [4] motivated by observations of anomalous behaviour in piecewise-linear economics models [12]. The BCNF has since been shown to display a remarkably rich array of dynamics, such as robust chaos [6], multi-stability [13], multi-dimensional attractors [14], and resonance regions with sausage-string structures [15, 16]. The BCNF is relevant for describing a wide-range of physical phenomena, another example being mechanical systems with stick-slip friction [17]; see [18] for a recent review.

Let yf(y;μ)y\mapsto f(y;\mu) be a continuous, piecewise-smooth map with variable y=(y1,y2)2y=(y_{1},y_{2})\in\mathbb{R}^{2} and parameter μ\mu\in\mathbb{R}. We are interested in the dynamics local to a border-collision bifurcation where a fixed point of ff collides with a switching manifold as μ\mu is varied. Assuming the bifurcation occurs at single smooth switching manifold, only two pieces of the map are relevant to the local dynamics. By choosing coordinates so that the switching manifold is y1=0y_{1}=0, we can assume ff has the form

f(y;μ)={fL(y;μ),y10,fR(y;μ),y10,f(y;\mu)=\begin{cases}f_{L}(y;\mu),&y_{1}\leq 0,\\ f_{R}(y;\mu),&y_{1}\geq 0,\end{cases} (2.1)

where fLf_{L} and fRf_{R} are C1C^{1}.

Suppose the border-collision bifurcation occurs at y=(0,0)=𝟎y=(0,0)={\bf 0} when μ=0\mu=0. By continuity, 𝟎{\bf 0} is a fixed point of both fLf_{L} and fRf_{R} when μ=0\mu=0:

fL(𝟎;0)=fR(𝟎;0)=𝟎.f_{L}({\bf 0};0)=f_{R}({\bf 0};0)={\bf 0}. (2.2)

From the map ff we can extract the four key values

τL=trace(DfL(𝟎;0)),δL=det(DfL(𝟎;0)),τR=trace(DfR(𝟎;0)),δR=det(DfR(𝟎;0)),\begin{split}\tau_{L}&={\rm trace}\big{(}{\rm D}f_{L}({\bf 0};0)\big{)},\\ \delta_{L}&={\rm det}\big{(}{\rm D}f_{L}({\bf 0};0)\big{)},\\ \tau_{R}&={\rm trace}\big{(}{\rm D}f_{R}({\bf 0};0)\big{)},\\ \delta_{R}&={\rm det}\big{(}{\rm D}f_{R}({\bf 0};0)\big{)},\end{split} (2.3)

which determine the eigenvalues associated with 𝟎{\bf 0} for the two smooth components of (2.1). We can then use these values to form the piecewise-linear map

g(x)={[τL1δL0]x+[10],x10,[τR1δR0]x+[10],x10.g(x)=\begin{cases}\begin{bmatrix}\tau_{L}&1\\ -\delta_{L}&0\end{bmatrix}x+\begin{bmatrix}1\\ 0\end{bmatrix},&x_{1}\leq 0,\\[12.80373pt] \begin{bmatrix}\tau_{R}&1\\ -\delta_{R}&0\end{bmatrix}x+\begin{bmatrix}1\\ 0\end{bmatrix},&x_{1}\geq 0.\end{cases} (2.4)

This is the two-dimensional BCNF, except often μ\mu-dependence is retained in the constant term. In the remainder of this section we explain how (2.4) can be used to describe the local dynamics of (2.1) near the border-collision bifurcation.

We first write the components of (2.1) as

fL(y;μ)=[a11La12a21La22]y+[b1b2]μ+(𝓎+|μ|),fR(y;μ)=[a11Ra12a21Ra22]y+[b1b2]μ+(𝓎+|μ|),\begin{split}f_{L}(y;\mu)&=\begin{bmatrix}a^{L}_{11}&a_{12}\\ a^{L}_{21}&a_{22}\end{bmatrix}y+\begin{bmatrix}b_{1}\\ b_{2}\end{bmatrix}\mu+\mathpzc{o}\mathopen{}\mathclose{{}\left(\|y\|+|\mu|}\right),\\ f_{R}(y;\mu)&=\begin{bmatrix}a^{R}_{11}&a_{12}\\ a^{R}_{21}&a_{22}\end{bmatrix}y+\begin{bmatrix}b_{1}\\ b_{2}\end{bmatrix}\mu+\mathpzc{o}\mathopen{}\mathclose{{}\left(\|y\|+|\mu|}\right),\end{split} (2.5)

where the two expressions share several coefficients due to the assumed continuity of (2.1) on y1=0y_{1}=0. By dropping the higher-order terms we obtain the piecewise-linear map

h(y;μ)={[a11La12a21La22]y+[b1b2]μ,y10,[a11Ra12a21Ra22]y+[b1b2]μ,y10.h(y;\mu)=\begin{cases}\begin{bmatrix}a^{L}_{11}&a_{12}\\ a^{L}_{21}&a_{22}\end{bmatrix}y+\begin{bmatrix}b_{1}\\ b_{2}\end{bmatrix}\mu,&y_{1}\leq 0,\\[12.80373pt] \begin{bmatrix}a^{R}_{11}&a_{12}\\ a^{R}_{21}&a_{22}\end{bmatrix}y+\begin{bmatrix}b_{1}\\ b_{2}\end{bmatrix}\mu,&y_{1}\geq 0.\end{cases} (2.6)

This approximates (2.1) in the following sense: for any ε>0\varepsilon>0 there exists δ>0\delta>0 such that if y+|μ|<δ\|y\|+|\mu|<\delta then f(y;μ)h(y;μ)<ε(y+|μ|)\big{\|}f(y;\mu)-h(y;\mu)\big{\|}<\varepsilon\mathopen{}\mathclose{{}\left(\|y\|+|\mu|}\right).

The piecewise-linear map (2.6) satisfies the identity h(αy;αμ)=αh(y;μ)h(\alpha y;\alpha\mu)=\alpha h(y;\mu) for any α>0\alpha>0. For this reason the magnitude of μ\mu only affects the spatial scale of the dynamics of (2.6): if Λ2\Lambda\subset\mathbb{R}^{2} is an invariant set of hh for some μ0\mu_{0}, then αΛ\alpha\Lambda is an invariant set of hh with μ=αμ0\mu=\alpha\mu_{0} for all α>0\alpha>0. In view of this scaling property, we can convert (2.6) to (2.4) for any μ>0\mu>0. This is achieved via the change of variables

x=1γμ[10a22a12]y+1γ[0a22b1a12b2],x=\frac{1}{\gamma\mu}\begin{bmatrix}1&0\\ -a_{22}&a_{12}\end{bmatrix}y+\frac{1}{\gamma}\begin{bmatrix}0\\ a_{22}b_{1}-a_{12}b_{2}\end{bmatrix}, (2.7)

where

γ=(1a22)b1+a12b2,\gamma=(1-a_{22})b_{1}+a_{12}b_{2}\,, (2.8)

and is valid assuming

a12\displaystyle a_{12} 0,\displaystyle\neq 0, (2.9)
γ\displaystyle\gamma >0.\displaystyle>0. (2.10)

The condition a120a_{12}\neq 0 ensures (2.7) is invertible (if a12=0a_{12}=0 then (2.6) can be partly decoupled [18]). We require γ0\gamma\neq 0 so that μ\mu unfolds the border-collision bifurcation in a generic fashion, while γ>0\gamma>0 ensures the left and right components of (2.6) transform to their respective components in (2.4). The case γ<0\gamma<0 can be accommodated by simply redefining μ\mu as μ-\mu.

The transformation (2.7) performs a similarity transform to the Jacobian matrices of the components of the map, thus it preserves their traces and determinants. For this reason the values (2.3) were used to construct (2.4) — notice how τL\tau_{L}, δL\delta_{L}, τR\tau_{R}, and δR\delta_{R} are the traces and determinants of the Jacobian matrices of the two components of (2.4).

In summary, for any map of the form (2.1) that has a border-collision bifurcation at μ=0\mu=0, we can use the values (2.3) to form the BCNF (2.4). Then, if the conditions (2.9)–(2.10) are satisfied, the BCNF is affinely conjugate to the piecewise-linear approximation to (2.1) for any μ>0\mu>0. Intuitively this approximation should be reasonable for sufficiently small values of μ\mu. Our persistence result in the next section gives conditions under which the approximation can indeed be justified.

3 Persistence of chaotic attractors

Our main result, Theorem 3.11 below, links the existence of chaotic attractors of the piecewise-linear BCNF (2.4) to attractors of the nonlinear map (2.1) for small μ>0\mu>0. To state the result we need some preliminary definitions and conditions.

Let

gL(x)\displaystyle g_{L}(x) =ALx+[10],\displaystyle=A_{L}x+\begin{bmatrix}1\\ 0\end{bmatrix}, gR(x)\displaystyle g_{R}(x) =ARx+[10],\displaystyle=A_{R}x+\begin{bmatrix}1\\ 0\end{bmatrix},

denote the left and right components of the BCNF, gg, where

AL\displaystyle A_{L} =[τL1δL0],\displaystyle=\begin{bmatrix}\tau_{L}&1\\ -\delta_{L}&0\end{bmatrix}, AR\displaystyle A_{R} =[τR1δR0].\displaystyle=\begin{bmatrix}\tau_{R}&1\\ -\delta_{R}&0\end{bmatrix}.

Theorem 3.11 requires the assumptions

δL\displaystyle\delta_{L} >0,\displaystyle>0, (3.1)
δR\displaystyle\delta_{R} >τR24,\displaystyle>\frac{\tau_{R}^{2}}{4}, (3.2)
gi(𝟎)2\displaystyle g^{-i}({\bf 0})_{2} <0,for all i1,\displaystyle<0,\quad\text{for all $i\geq 1$}, (3.3)

where the subscript in (3.3) indicates we are looking at the second component of the vector gi(𝟎)g^{-i}({\bf 0}). Conditions (3.1) and (3.2) imply that gg is a homeomorphism, so g1g^{-1} exists and (3.3) is well-defined. Condition (3.3) ensures the backwards orbit of the origin does not enter the closed upper half-plane {x2|x20}\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{2}\geq 0}\right\}. In fact, the backwards orbit is constrained to the fourth quadrant of 2\mathbb{R}^{2} so is governed purely by gRg_{R} and simply converges to the unique fixed point of gRg_{R} (a repelling focus), see Fig. 1. Together conditions (3.1)–(3.3) allow us to divide phase space by the number of iterations required to cross the switching manifold

Σ={x2|x1=0},\Sigma=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}=0}\right\},

and this is achieved in section 4. Having a precise understanding of this division is critical to our proof of Theorem 3.11 presented in section 5.

Refer to caption
Figure 1: A sketch of the backwards orbit of the origin for the BCNF (2.4) satisfying (3.1)–(3.3). The backwards orbit converges to the fixed point of gRg_{R} (red triangle).

To motivate the following definitions, consider the forward orbit of a point x2x\in\mathbb{R}^{2} under gg. Suppose it maps under gLg_{L} pp times, then under gRg_{R} qq times. That is, gp+q(x)=gRq(gLp(x))g^{p+q}(x)=g_{R}^{q}\mathopen{}\mathclose{{}\left(g_{L}^{p}(x)}\right), hence Dgp+q(x)=ARqALp{\rm D}g^{p+q}(x)=A_{R}^{q}A_{L}^{p}, assuming no iterates lie on Σ\Sigma where gg is non-differentiable. For orbits that go back and forth across Σ\Sigma without landing on Σ\Sigma, which includes almost all orbits in the chaotic attractors we wish to analyse, derivatives of gn(x)g^{n}(x) for large nn can be expressed as products of matrices of the form ARqALpA_{R}^{q}A_{L}^{p}. Consequently we can estimate Lyapunov exponents based on bounds for the values of pp and qq.

Let

ΠL\displaystyle\Pi_{L} ={x2|x10},\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}\leq 0}\right\},
ΠR\displaystyle\Pi_{R} ={x2|x10},\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}\geq 0}\right\},

denote the closed left and right half-planes.

Definition 3.1.

Given x2x\in\mathbb{R}^{2}, let χL(x)\chi_{L}(x) be the smallest p1p\geq 1 for which gp(x)ΠLg^{p}(x)\notin\Pi_{L} and let χR(x)\chi_{R}(x) be the smallest q1q\geq 1 for which gq(x)ΠRg^{q}(x)\notin\Pi_{R}, if such pp and qq exist.

Now define the regions

ΦL\displaystyle\Phi_{L} ={x2|x1<0,x20},\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}<0,\,x_{2}\leq 0}\right\}, (3.4)
ΦR\displaystyle\Phi_{R} ={x2|x1>0,x20}.\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}>0,\,x_{2}\geq 0}\right\}. (3.5)

The following result shows that when an orbit crosses Σ\Sigma from right to left, it must arrive at a point in ΦL\Phi_{L}. Moreover, ΦL\Phi_{L} is exactly the set of all such points. The set ΦR\Phi_{R} admits the same characterisation for orbits crossing Σ\Sigma from left to right. The result follows immediately from the observation that g1(x)1g^{-1}(x)_{1} and x2x_{2} have opposite signs.

Lemma 3.1.

Suppose δL>0\delta_{L}>0 and δR>0\delta_{R}>0. Then

ΦL\displaystyle\Phi_{L} ={x2ΠR|g1(x)ΠR},\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\setminus\Pi_{R}\,\middle|\,g^{-1}(x)\in\Pi_{R}}\right\},
ΦR\displaystyle\Phi_{R} ={x2ΠL|g1(x)ΠL}.\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\setminus\Pi_{L}\,\middle|\,g^{-1}(x)\in\Pi_{L}}\right\}.

Now given a set Ω2\Omega\subset\mathbb{R}^{2}, suppose there exist numbers 1pminpmax1\leq p_{\rm min}\leq p_{\rm max} and 1qminqmax1\leq q_{\rm min}\leq q_{\rm max} such that

χL(x)\displaystyle\chi_{L}(x) pmin,for allxΩΦL,\displaystyle\geq p_{\rm min}\,,\qquad\text{for all}~{}x\in\Omega\cap\Phi_{L}\,, (3.6)
χL(x)\displaystyle\chi_{L}(x) pmax,for allxΩΠL,\displaystyle\leq p_{\rm max}\,,\qquad\text{for all}~{}x\in\Omega\cap\Pi_{L}\,, (3.7)
χR(x)\displaystyle\chi_{R}(x) qmin,for allxΩΦR,\displaystyle\geq q_{\rm min}\,,\qquad\text{for all}~{}x\in\Omega\cap\Phi_{R}\,, (3.8)
χR(x)\displaystyle\chi_{R}(x) qmax,for allxΩΠR.\displaystyle\leq q_{\rm max}\,,\qquad\text{for all}~{}x\in\Omega\cap\Pi_{R}\,. (3.9)

That is, any point in ΩΠL\Omega\cap\Pi_{L} requires at most pmaxp_{\rm max} iterations to cross Σ\Sigma, and at least pminp_{\rm min} iterations if its preimage lies in ΠR\Pi_{R}. The numbers qminq_{\rm min} and qmaxq_{\rm max} similarly bound the number of iterations required to cross Σ\Sigma from right to left.

For the given set Ω\Omega, let

𝐌Ω={ARqALp|pminppmax,qminqqmax}.{\bf M}_{\Omega}=\mathopen{}\mathclose{{}\left\{A_{R}^{q}A_{L}^{p}\,\big{|}\,p_{\rm min}\leq p\leq p_{\rm max},\,q_{\rm min}\leq q\leq q_{\rm max}}\right\}. (3.10)

In Theorem 3.11, Ω\Omega is be assumed to be a trapping region for gg, that is, g(Ω)int(Ω)g(\Omega)\subset{\rm int}(\Omega), where int(){\rm int}(\cdot) denotes interior. Also, to ensure a positive Lyapunov exponent, we assume 𝐌Ω{\bf M}_{\Omega} has a contracting-invariant, expanding cone. This is defined as follows and illustrated in Fig. 2.

Refer to caption
Figure 2: A sketch of a cone CC and its action under a matrix MM when CC is contracting-invariant and expanding for a collection 𝐌{\bf M} that contains MM. The shaded region is the set MC={Mv|vC}MC=\mathopen{}\mathclose{{}\left\{Mv\,\middle|\,v\in C}\right\}. The coloured curves indicate how unit vectors in CC map under MM.
Definition 3.2.

A cone is a non-empty set C2C\subseteq\mathbb{R}^{2} for which tvCtv\in C for all tt\in\mathbb{R} and vCv\in C. A cone CC is contracting-invariant for a collection of 2×22\times 2 matrices 𝐌{\bf M} if Mvint(C){𝟎}Mv\in{\rm int}(C)\cup\{{\bf 0}\} for all vCv\in C and M𝐌M\in{\bf M}. A cone CC is expanding for 𝐌{\bf M} if there exists c>1c>1 such that Mvcv\|Mv\|\geq c\|v\| for all vCv\in C and M𝐌M\in{\bf M}.

Finally we state the main result. We write Br(x)B_{r}(x) for the ball of radius r>0r>0 centred at x2x\in\mathbb{R}^{2}.

Theorem 3.2.

Let ff be a piecewise-C1C^{1} map of the form (2.1) satisfying (2.2). Let gg be the corresponding map (2.4) formed by using the values (2.3). Suppose

  1. i)

    conditions (2.9)–(2.10) and (3.1)–(3.3) are satisfied,

  2. ii)

    gg has a trapping region Ω2\Omega\subset\mathbb{R}^{2} and (3.6)–(3.9) are satisfied for some 1pminpmax1\leq p_{\rm min}\leq p_{\rm max} and 1qminqmax1\leq q_{\rm min}\leq q_{\rm max}, and

  3. iii)

    there exists a contracting-invariant, expanding cone for 𝐌Ω{\bf M}_{\Omega}.

Then there exists δ>0\delta>0 and s>0s>0 such that for all μ(0,δ)\mu\in(0,\delta) the map ff has a topological attractor ΛμBμs(𝟎)\Lambda_{\mu}\in B_{\mu s}({\bf 0}) with the property that for Lebesgue almost all yΛμy\in\Lambda_{\mu} there exists v2v\in\mathbb{R}^{2} such that

lim infn1nln(Dfn(y)v)>0.\liminf_{n\to\infty}\frac{1}{n}\ln\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|{\rm D}f^{n}(y)v}\right\|}\right)>0. (3.11)

Notice Λμ\Lambda_{\mu} is chaotic in the sense of a positive Lyapunov exponent, as implied by (3.11).

4 A partition of the plane

In this section we describe the geometry of the regions of the left half-plane with different values of χL\chi_{L} and regions of the right half-plane with different values of χR\chi_{R} for the BCNF (2.4). The ways in which their images intersect can be used to establish conditions under which chaotic attractors exist [9]. The regions corresponding to values of χL\chi_{L} were described in [19], so for these we simply state results without proof. The regions corresponding to values of χR\chi_{R} can be described in a similar way; details of these calculations are provided in Appendix A.

Definition 4.1.

For each p,q1p,q\geq 1, let

Dp\displaystyle D_{p} ={xΠL|χL(x)=p},\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\Pi_{L}\,\big{|}\,\chi_{L}(x)=p}\right\}, (4.1)
Eq\displaystyle E_{q} ={xΠR|χR(x)=q}.\displaystyle=\mathopen{}\mathclose{{}\left\{x\in\Pi_{R}\,\big{|}\,\chi_{R}(x)=q}\right\}. (4.2)
Refer to caption
Figure 3: The regions DpD_{p} (4.1) and EqE_{q} (4.2) for a typical instance of the two-dimensional BCNF (2.4), specifically, (τL,δL,τR,δR)=(1.4,1,1.15,1.15)(\tau_{L},\delta_{L},\tau_{R},\delta_{R})=(1.4,1,1.15,1.15). In DpD_{p} iterates require pp iterations to escape the closed left half-plane; in EqE_{q} iterates require qq iterations to escape the closed right half-plane (see Definition 3.1). Here p=3p^{*}=3 (see Definition 4.2), so D1,,D4D_{1},\ldots,D_{4} cover the third quadrant ΦL\Phi_{L} (see Lemma 4.1). Also q=3q^{*}=3 and q=5q^{**}=5 (see Definition 4.3) so E3,,E5E_{3},\ldots,E_{5} cover the first quadrant ΦR\Phi_{R} (see Lemma 4.3). The backwards orbit of the origin is shown with black dots. The fixed point xRx^{R} (a repelling focus) is shown with a red triangle.

First consider the sets DpD_{p}. As shown by Proposition 6.6 of [19], each DpD_{p} is bounded by the lines gLp(Σ)g_{L}^{-p}(\Sigma), gL(p1)(Σ)g_{L}^{-(p-1)}(\Sigma), and Σ\Sigma, Fig. 3. If the line gLp(Σ)g_{L}^{-p}(\Sigma) is not vertical, let mpm_{p} denote its slope and cpc_{p} denote its x2x_{2}-intercept, i.e.,

gLp(Σ)={x2|x2=mpx1+cp}.g_{L}^{-p}(\Sigma)=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{2}=m_{p}x_{1}+c_{p}}\right\}. (4.3)
Definition 4.2.

Let pp^{*} be the smallest p1p\geq 1 for which mp0m_{p}\geq 0, with p=p^{*}=\infty if mp<0m_{p}<0 for all p1p\geq 1.

The next result is a simple consequence of results obtained in [19]. Recall ΦL\Phi_{L} is the third quadrant (3.4).

Lemma 4.1.

Suppose δL>0\delta_{L}>0. If p<p^{*}<\infty then the sets DpΦLD_{p}\cap\Phi_{L}, for p=1,,p+1p=1,\ldots,p^{*}+1, are non-empty and cover ΦL\Phi_{L}. If p=p^{*}=\infty then DpΦLD_{p}\cap\Phi_{L}\neq\varnothing for all p1p\geq 1.

For example with (τL,δL)=(1.4,1)(\tau_{L},\delta_{L})=(1.4,1), as in Fig. 3, we have p=3p^{*}=3. Fig. 4 shows how the value of pp^{*}, and hence the values of pp for which DpΦLD_{p}\cap\Phi_{L}\neq\varnothing, depends on the values of τL\tau_{L} and δL\delta_{L}. Between curves where mi1=0m_{i-1}=0 and mi=0m_{i}=0, we have DpΦLD_{p}\cap\Phi_{L}\neq\varnothing if and only if p{1,2,i+1}p\in\{1,2,\ldots i+1\}. As pp\to\infty these curves accumulate on δL=τL24\delta_{L}=\frac{\tau_{L}^{2}}{4} where ALA_{L} has repeated eigenvalues. With δL>τL24\delta_{L}>\frac{\tau_{L}^{2}}{4}, p=p^{*}=\infty and every DpΦLD_{p}\cap\Phi_{L} is non-empty.

Refer to caption
Figure 4: A division of the (τL,δL)(\tau_{L},\delta_{L})-plane according to values of pp for which DpΦLD_{p}\cap\Phi_{L}\neq\varnothing. The regions are labelled by these values of pp and are bounded by curves where mi=0m_{i}=0 for some ii (see Definition 4.2 and Lemma 4.1). For example, to the left of line τL=0\tau_{L}=0 we have DpΦLD_{p}\cap\Phi_{L}\neq\varnothing if and only if p{1,2}p\in\{1,2\}, so this region is labelled by the numbers 11 and 22. The orange circle (located just to the left of the curve where m3=0m_{3}=0) indicates the values of τL\tau_{L} and δL\delta_{L} used in Fig. 3.

In regards to Theorem 3.11, while the values of pminp_{\rm min} and pmaxp_{\rm max} depend on the particular trapping region Ω\Omega being considered, Lemma 4.1 tells us that the value of pminp_{\rm min} could always be as low as 11, while the value of pmaxp_{\rm max} cannot be more than p+1p^{*}+1.

Next we note that the boundaries of the DpD_{p} intersect the x1x_{1}-axis transversally. This is the case because each gLp(Σ)g_{L}^{-p}(\Sigma) is a line intersecting the x2x_{2}-axis at (0,cp)(0,c_{p}) with cp<0c_{p}<0, see Lemma 6.2 of [19]. These transversal intersections are used to argue persistence in section 5.

Lemma 4.2.

Suppose δL>0\delta_{L}>0 and x=(x1,0)x=(x_{1},0), with x1<0x_{1}<0, is a point on the boundary of some DpD_{p}. Then, local to xx, the boundary of DpD_{p} is a line segment that intersects g(Σ)g(\Sigma) transversally.

We now turn our attention to the sets EqE_{q}. To characterise these we assume τR\tau_{R} and δR\delta_{R} satisfy (3.2) and (3.3). Condition (3.2) implies gRg_{R} has the unique fixed point

xR=1δRτR+1[1δR],x^{R}=\frac{1}{\delta_{R}-\tau_{R}+1}\begin{bmatrix}1\\ -\delta_{R}\end{bmatrix}, (4.4)

which lies in the fourth quadrant. Condition (3.3) ensures that the set of qq for which EqΦRE_{q}\cap\Phi_{R}\neq\varnothing admits a relatively simple characterisation and that the boundaries of the EqE_{q} intersect the x1x_{1}-axis transversally.

By an analogous argument to that for the sets DpD_{p}, the lines gRq(Σ)g_{R}^{-q}(\Sigma), gR(q1)(Σ)g_{R}^{-(q-1)}(\Sigma), and Σ\Sigma form the boundaries of the EqE_{q}. However, their layout is more complicated than that of the DpD_{p}. Fig. 3 shows a typical example.

If the line gRq(Σ)g_{R}^{-q}(\Sigma) is not vertical, let nqn_{q} denote its slope and dqd_{q} denote its x2x_{2}-intercept, i.e.,

gRq(Σ)={x2|x2=nqx1+dq}.g_{R}^{-q}(\Sigma)=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{2}=n_{q}x_{1}+d_{q}}\right\}. (4.5)

If gRq(Σ)g_{R}^{-q}(\Sigma) is vertical we write nq=n_{q}=\infty (in this case gRq(Σ)g_{R}^{-q}(\Sigma) is the line x1=dq1δRx_{1}=-\frac{d_{q-1}}{\delta_{R}}, see Appendix A).

Definition 4.3.

Let qq^{*} be the smallest q1q\geq 1 for which nq>0n_{q}>0 or nq=n_{q}=\infty. Let qq^{**} be the smallest q>qq>q^{*} for which dq0d_{q}\leq 0.

Lemma 4.3.

Suppose (3.2) and (3.3) are satisfied. Then qq^{*} and qq^{**} exist and the sets EqΦRE_{q}\cap\Phi_{R}, for q=q,,qq=q^{*},\ldots,q^{**}, are non-empty and cover ΦR\Phi_{R}.

Refer to caption
Figure 5: A division of the (τR,δR)(\tau_{R},\delta_{R})-plane according to values of qq for which EqΦRE_{q}\cap\Phi_{R}\neq\varnothing. These regions are labelled by these values of qq and bounded by curves where ni=0n_{i}=0 (red) and di=0d_{i}=0 (blue) for some ii (see Definition 4.3 and Lemma 4.3). The black curves bound the region where condition (3.3) is satisfied (each yiy_{i} denotes gRi(𝟎)2g_{R}^{-i}({\bf 0})_{2}). The orange circle indicates the values of τR\tau_{R} and δR\delta_{R} used in Fig. 3.

Lemma 4.3 is proved in Appendix A. The values of qq^{*} and qq^{**} are determined by the values of τR\tau_{R} and δR\delta_{R} as indicated in Fig. 5. As we move about the (τR,δR)(\tau_{R},\delta_{R})-plane, the value of qq^{*} changes by one when we cross a curve where nq=0n_{q}=0, and the value of qq^{**} changes by one when we cross a curve where dq=0d_{q}=0. These curves accumulate on δR=τR24\delta_{R}=\frac{\tau_{R}^{2}}{4} past which ARA_{R} has real eigenvalues and condition (3.2) is no longer satisfied. In Fig. 5, condition (3.3) is satisfied above the piecewise-smooth black curve.

In regards to Theorem 3.11, Lemma 4.3 implies qminqq_{\rm min}\geq q^{*} and qmaxqq_{\rm max}\leq q^{**}. Analogous to Lemma 4.2 we have the following result (proved in Appendix A).

Lemma 4.4.

Suppose (3.2) and (3.3) are satisfied and x=(x1,0)x=(x_{1},0), with x1>0x_{1}>0, is a point on the boundary of some EqE_{q}. Then, local to xx, the boundary of EqE_{q} is a line segment that intersects g(Σ)g(\Sigma) transversally.

5 Proof of Theorem 3.11

The proof is divided into six steps.

Step 1 — Robustness of the cone.
Let CC be a contracting-invariant, expanding cone for 𝐌Ω{\bf M}_{\Omega} of (3.10) and let c>1c>1 be a corresponding expansion factor as in Definition 3.2. Here we show that CC is an invariant expanding cone for a collection of matrices A^RqA^Lp\hat{A}_{R}^{q}\hat{A}_{L}^{p} that are sufficiently close to some matrix ARqALpA_{R}^{q}A_{L}^{p} in 𝐌Ω{\bf M}_{\Omega}. Note, the expansion factor will be c+12\frac{c+1}{2} instead of cc.

We first define a function \mathcal{F} from the space of 2×22\times 2 matrices to the space of subsets of 2\mathbb{R}^{2} as follows: given a 2×22\times 2 matrix NN, let

(N)={Nvv|vC{𝟎}}.\mathcal{F}(N)=\mathopen{}\mathclose{{}\left\{\frac{Nv}{\|v\|}\,\middle|\,v\in C\setminus\{{\bf 0}\}}\right\}.

By the definition of an contracting-invariant expanding cone, given M𝐌ΩM\in{\bf M}_{\Omega} we have (M)int(C)\mathcal{F}(M)\subset{\rm int}(C) and (M)Bc(𝟎)=\mathcal{F}(M)\cap B_{c}({\bf 0})=\varnothing. The function \mathcal{F} is continuous, so if NN is sufficiently close to MM then (N)C\mathcal{F}(N)\subset C and (N)Bc+12(𝟎)=\mathcal{F}(N)\cap B_{\frac{c+1}{2}}({\bf 0})=\varnothing. That is, for all vCv\in C,

NvC,Nvc+12v.\begin{split}Nv&\in C,\\ \|Nv\|&\geq\frac{c+1}{2}\|v\|.\end{split} (5.1)

To be precise, there exists η1>0\eta_{1}>0 such that if NM<η1\|N-M\|<\eta_{1} for some M𝐌ΩM\in{\bf M}_{\Omega} then (5.1) is satisfied for all vCv\in C. Furthermore, there exists η2>0\eta_{2}>0 such that A^LAL<η2\|\hat{A}_{L}-A_{L}\|<\eta_{2} and A^RAR<η2\|\hat{A}_{R}-A_{R}\|<\eta_{2} implies

A^RqA^LpARqALp<η1,\big{\|}\hat{A}_{R}^{q}\hat{A}_{L}^{p}-A_{R}^{q}A_{L}^{p}\big{\|}<\eta_{1}, (5.2)

for all pminppmaxp_{\rm min}\leq p\leq p_{\rm max} and qminqqmaxq_{\rm min}\leq q\leq q_{\rm max}, and that any such A^L\hat{A}_{L} and A^R\hat{A}_{R} are invertible (which is possible because ALA_{L} and ARA_{R} are invertible by (3.1) and (3.2)).

Step 2 — Bounds related to g(Ω)g(\Omega).
By the definition of pminp_{\rm min}, pmaxp_{\rm max}, qminq_{\rm min}, and qmaxq_{\rm max}, see (3.6)–(3.9), we have ΩΦLp=pminpmaxDp\Omega\cap\Phi_{L}\subset\bigcup_{p=p_{\rm min}}^{p_{\rm max}}D_{p} and ΩΦRq=qminqmaxEq\Omega\cap\Phi_{R}\subset\bigcup_{q=q_{\rm min}}^{q_{\rm max}}E_{q}. In this step we use the results of section 4 and the fact that Ω\Omega maps to its interior under gg to control the behaviour of points inside and near g(Ω)g(\Omega).

Since g(Ω)int(Ω)g(\Omega)\subset{\rm int}(\Omega) is compact there exists ε1>0\varepsilon_{1}>0 such that

Bε1(g(Ω))Ω,B_{\varepsilon_{1}}(g(\Omega))\subset\Omega, (5.3)

as illustrated in Fig. 6. Now consider the set U=Bε1(g(Ω)ΦL)ΠLU=B_{\varepsilon_{1}}\mathopen{}\mathclose{{}\left(g(\Omega)\cap\Phi_{L}}\right)\cap\Pi_{L}. This set is contained in ΩΦL\Omega\cap\Phi_{L} except has some points just above the negative x1x_{1}-axis. By Lemma 4.2, we can assume ε1\varepsilon_{1} is small enough that UU does not intersect any ‘other’ regions DpD_{p}, that is Up=pminpmaxDpU\subset\bigcup_{p=p_{\rm min}}^{p_{\rm max}}D_{p}. Further, by shrinking this set by a small amount, the result will be bounded away from the other regions DpD_{p}. That is, there exists ε2>0\varepsilon_{2}>0 such that for all xBε12(g(Ω)ΦL)ΠLx\in B_{\frac{\varepsilon_{1}}{2}}\mathopen{}\mathclose{{}\left(g(\Omega)\cap\Phi_{L}}\right)\cap\Pi_{L}:

  1. i)

    gLp(x)1<ε2g_{L}^{p}(x)_{1}<-\varepsilon_{2} for all p=1,,pmin1p=1,\ldots,p_{\rm min}-1, and

  2. ii)

    if gLp(x)10g_{L}^{p}(x)_{1}\leq 0 for all p=pmin,,pmax1p=p_{\rm min},\ldots,p_{\rm max}-1, then gLpmax(x)1>ε2g_{L}^{p_{\rm max}}(x)_{1}>\varepsilon_{2}.

Also, ΩΠLp=1pmaxDp\Omega\cap\Pi_{L}\subset\bigcup_{p=1}^{p_{\rm max}}D_{p} by (3.7). Thus by (5.3) we can assume ε2\varepsilon_{2} is small enough that for all xBε12(g(Ω))x\in B_{\frac{\varepsilon_{1}}{2}}(g(\Omega)):

  1. iii)

    if gLp(x)10g_{L}^{p}(x)_{1}\leq 0 for all p=1,,pmax1p=1,\ldots,p_{\rm max}-1, then gLpmax(x)1>ε2g_{L}^{p_{\rm max}}(x)_{1}>\varepsilon_{2}.

In view of Lemma 4.4 we can assume ε1\varepsilon_{1} and ε2\varepsilon_{2} are small enough that analogous bounds also hold for gRq(x)1g_{R}^{q}(x)_{1}.

Refer to caption
Figure 6: A sketch of Ω\Omega and its image g(Ω)g(\Omega) illustrating the bound (5.3). For the perturbed map g~\tilde{g} we also sketch the regions ΦL,μ\Phi_{L,\mu} and ΦR,μ\Phi_{R,\mu} (shaded) introduced in Step 4. The part of g~(Ω;μ)\tilde{g}(\Omega;\mu) that lies in ΦL,μ\Phi_{L,\mu} is the set Ψ\Psi (striped) introduced in Step 5.

Step 3 — Change of coordinates.
Here we apply the coordinate change (2.7) for converting ff to gg plus higher order terms. For small μ>0\mu>0 this coordinate change represents a spatial blow-up of phase space near the origin. Since the higher order terms are small near the origin we are able to control the higher order terms by assuming μ\mu is sufficiently small. Here we also let r>0r>0 be such that ΩBr(𝟎)\Omega\subset B_{r}({\bf 0}).

We first decompose (2.7) into the spatial blow-up

x=y~γμ,x=\frac{\tilde{y}}{\gamma\mu}, (5.4)

and the bounded coordinate change

y~=ϕμ(y)=[y1a22y1+a12y2+(a22b1a12b2)μ].\tilde{y}=\phi_{\mu}(y)=\begin{bmatrix}y_{1}\\ -a_{22}y_{1}+a_{12}y_{2}+(a_{22}b_{1}-a_{12}b_{2})\mu\end{bmatrix}. (5.5)

Notice (5.5) is invertible because a120a_{12}\neq 0, (2.9). The coordinate change (5.5) transforms ff to a map

f~(y~;μ)={f~L(y~;μ),y~10,f~R(y~;μ),y~10,\tilde{f}(\tilde{y};\mu)=\begin{cases}\tilde{f}_{L}(\tilde{y};\mu),&\tilde{y}_{1}\leq 0,\\ \tilde{f}_{R}(\tilde{y};\mu),&\tilde{y}_{1}\geq 0,\end{cases} (5.6)

where

f~L(y~;μ)=ALy~+[γμ0]+EL(y~;μ),f~R(y~;μ)=ARy~+[γμ0]+ER(y~;μ).\begin{split}\tilde{f}_{L}(\tilde{y};\mu)&=A_{L}\tilde{y}+\begin{bmatrix}\gamma\mu\\ 0\end{bmatrix}+E_{L}(\tilde{y};\mu),\\ \tilde{f}_{R}(\tilde{y};\mu)&=A_{R}\tilde{y}+\begin{bmatrix}\gamma\mu\\ 0\end{bmatrix}+E_{R}(\tilde{y};\mu).\end{split} (5.7)

The higher order terms ELE_{L} and ERE_{R} are C1C^{1} and (𝓎~+|μ|)\mathpzc{o}\mathopen{}\mathclose{{}\left(\|\tilde{y}\|+|\mu|}\right). Thus given any ε>0\varepsilon>0 there exists δ=δ(ε)>0\delta=\delta(\varepsilon)>0 such that

EL(y~;μ)y~+μ<γεγr+1,for all(y~;μ)Bδγr(𝟎)×(0,δ),ER(y~;μ)y~+μ<γεγr+1,for all(y~;μ)Bδγr(𝟎)×(0,δ),\begin{split}\frac{\mathopen{}\mathclose{{}\left\|E_{L}(\tilde{y};\mu)}\right\|}{\|\tilde{y}\|+\mu}&<\frac{\gamma\varepsilon}{\gamma r+1},\qquad\text{for all}~{}(\tilde{y};\mu)\in B_{\delta\gamma r}({\bf 0})\times(0,\delta),\\ \frac{\mathopen{}\mathclose{{}\left\|E_{R}(\tilde{y};\mu)}\right\|}{\|\tilde{y}\|+\mu}&<\frac{\gamma\varepsilon}{\gamma r+1},\qquad\text{for all}~{}(\tilde{y};\mu)\in B_{\delta\gamma r}({\bf 0})\times(0,\delta),\end{split} (5.8)

and

Df~L(y~;μ)AL<η2,for all(y~;μ)Bδγr(𝟎)×(0,δ),Df~R(y~;μ)AR<η2,for all(y~;μ)Bδγr(𝟎)×(0,δ).\begin{split}\big{\|}{\rm D}\tilde{f}_{L}(\tilde{y};\mu)-A_{L}\big{\|}&<\eta_{2}\,,\qquad\text{for all}~{}(\tilde{y};\mu)\in B_{\delta\gamma r}({\bf 0})\times(0,\delta),\\ \big{\|}{\rm D}\tilde{f}_{R}(\tilde{y};\mu)-A_{R}\big{\|}&<\eta_{2}\,,\qquad\text{for all}~{}(\tilde{y};\mu)\in B_{\delta\gamma r}({\bf 0})\times(0,\delta).\end{split} (5.9)

Then, since γ>0\gamma>0 (2.10), the spatial blow-up (5.4) converts (5.6) with μ>0\mu>0 to a map of the form

g~(x;μ)={g~L(x;μ),x10,g~R(x;μ),x10,\tilde{g}(x;\mu)=\begin{cases}\tilde{g}_{L}(x;\mu),&x_{1}\leq 0,\\ \tilde{g}_{R}(x;\mu),&x_{1}\geq 0,\end{cases} (5.10)

where g~L(x;μ)=1γμf~L(γμx;μ)\tilde{g}_{L}(x;\mu)=\frac{1}{\gamma\mu}\tilde{f}_{L}(\gamma\mu x;\mu) and g~R(x;μ)=1γμf~R(γμx;μ)\tilde{g}_{R}(x;\mu)=\frac{1}{\gamma\mu}\tilde{f}_{R}(\gamma\mu x;\mu). By (5.7)–(5.9) we have

g~L(x;μ)gL(x)<ε,for all(x;μ)Br(𝟎)×(0,δ),g~R(x;μ)gR(x)<ε,for all(x;μ)Br(𝟎)×(0,δ),\begin{split}\mathopen{}\mathclose{{}\left\|\tilde{g}_{L}(x;\mu)-g_{L}(x)}\right\|&<\varepsilon,\qquad\text{for all}~{}(x;\mu)\in B_{r}({\bf 0})\times(0,\delta),\\ \mathopen{}\mathclose{{}\left\|\tilde{g}_{R}(x;\mu)-g_{R}(x)}\right\|&<\varepsilon,\qquad\text{for all}~{}(x;\mu)\in B_{r}({\bf 0})\times(0,\delta),\end{split} (5.11)

and

Dg~L(x;μ)AL<η2,for all(x;μ)Br(𝟎)×(0,δ),Dg~R(x;μ)AR<η2,for all(x;μ)Br(𝟎)×(0,δ).\begin{split}\mathopen{}\mathclose{{}\left\|{\rm D}\tilde{g}_{L}(x;\mu)-A_{L}}\right\|&<\eta_{2}\,,\qquad\text{for all}~{}(x;\mu)\in B_{r}({\bf 0})\times(0,\delta),\\ \mathopen{}\mathclose{{}\left\|{\rm D}\tilde{g}_{R}(x;\mu)-A_{R}}\right\|&<\eta_{2}\,,\qquad\text{for all}~{}(x;\mu)\in B_{r}({\bf 0})\times(0,\delta).\end{split} (5.12)

Further assume εε1\varepsilon\leq\varepsilon_{1} so that, by (5.3), Ω\Omega is a trapping region for g~(x;μ)\tilde{g}(x;\mu) with any μ(0,δ)\mu\in(0,\delta). That Ω\Omega is a trapping region implies g~\tilde{g} has a topological attractor ΓμΩ\Gamma_{\mu}\subset\Omega. Then Λμ=ϕμ1(γμΓμ)\Lambda_{\mu}=\phi_{\mu}^{-1}\mathopen{}\mathclose{{}\left(\gamma\mu\Gamma_{\mu}}\right) is the corresponding attractor of ff. Since γμΓμBγμr(𝟎)\gamma\mu\Gamma_{\mu}\subset B_{\gamma\mu r}({\bf 0}) and ϕμ1(y~)\phi_{\mu}^{-1}(\tilde{y}) is a linear function of the pair (y~;μ)(\tilde{y};\mu), there exists s>0s>0 such that ΛμBμs(𝟎)\Lambda_{\mu}\subset B_{\mu s}({\bf 0}) for all μ(0,δ)\mu\in(0,\delta).

Step 4 — Extend bounds on gg to the perturbed map g~\tilde{g}.
In Step 2 we obtained bounds on the number of iterations required for orbits of the BCNF gg to cross Σ\Sigma. Here we show the same bounds hold for the perturbed map g~\tilde{g}.

We first extend the definitions of χL\chi_{L} and χR\chi_{R} to g~\tilde{g}. Given x2x\in\mathbb{R}^{2}, let χL,μ(x)\chi_{L,\mu}(x) be the smallest p1p\geq 1 for which g~p(x;μ)ΠL\tilde{g}^{p}(x;\mu)\notin\Pi_{L} and let χR,μ(x)\chi_{R,\mu}(x) be the smallest q1q\geq 1 for which g~q(x;μ)ΠR\tilde{g}^{q}(x;\mu)\notin\Pi_{R}, if such pp and qq exist. In view of Lemma 3.1, we can similarly generalise ΦL\Phi_{L} and ΦR\Phi_{R} by defining

ΦL,μ={x2|x1<0,g~1(x;μ)10},ΦR,μ={x2|x1>0,g~1(x;μ)10}.\begin{split}\Phi_{L,\mu}&=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}<0,\,\tilde{g}^{-1}(x;\mu)_{1}\geq 0}\right\},\\ \Phi_{R,\mu}&=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}>0,\,\tilde{g}^{-1}(x;\mu)_{1}\leq 0}\right\}.\end{split} (5.13)

These sets are sketched in Fig. 6 and for sufficiently small μ<0\mu<0 they are within ε12\frac{\varepsilon_{1}}{2} of ΦL\Phi_{L} and ΦR\Phi_{R} in the bounded set Ω\Omega. Next, take 0<ε<ε120<\varepsilon<\frac{\varepsilon_{1}}{2} small enough so that, by (5.11), g~(Ω;μ)ΦL,μBε12(g(Ω)ΦL)\tilde{g}(\Omega;\mu)\cap\Phi_{L,\mu}\subset B_{\frac{\varepsilon_{1}}{2}}\mathopen{}\mathclose{{}\left(g(\Omega)\cap\Phi_{L}}\right) and g~(Ω;μ)ΦR,μBε12(g(Ω)ΦR)\tilde{g}(\Omega;\mu)\cap\Phi_{R,\mu}\subset B_{\frac{\varepsilon_{1}}{2}}\mathopen{}\mathclose{{}\left(g(\Omega)\cap\Phi_{R}}\right) for all μ(0,δ)\mu\in(0,\delta). For small enough ε>0\varepsilon>0, (5.11) also implies

g~Lp(x;μ)gLp(x)<ε2,for all(x;μ)Ω×(0,δ)and allp=1,2,,pmax,g~Rq(x;μ)gRq(x)<ε2,for all(x;μ)Ω×(0,δ)and allq=1,2,,qmax.\begin{split}\mathopen{}\mathclose{{}\left\|\tilde{g}_{L}^{p}(x;\mu)-g_{L}^{p}(x)}\right\|&<\varepsilon_{2},\qquad\text{for all}~{}(x;\mu)\in\Omega\times(0,\delta)~{}\text{and all}~{}p=1,2,\ldots,p_{\rm max}\,,\\ \mathopen{}\mathclose{{}\left\|\tilde{g}_{R}^{q}(x;\mu)-g_{R}^{q}(x)}\right\|&<\varepsilon_{2},\qquad\text{for all}~{}(x;\mu)\in\Omega\times(0,\delta)~{}\text{and all}~{}q=1,2,\ldots,q_{\rm max}\,.\end{split} (5.14)

Then by (i) and (ii) of Step 2, and analogous bounds on gRq(x)1g_{R}^{q}(x)_{1},

pminχL,μ(x)pmax,for all(x;μ)g~(Ω;μ)ΦL,μ×(0,δ),qminχR,μ(x)qmax,for all(x;μ)g~(Ω;μ)ΦL,μ×(0,δ).\begin{split}p_{\rm min}&\leq\chi_{L,\mu}(x)\leq p_{\rm max},\qquad\text{for all}~{}(x;\mu)\in\tilde{g}(\Omega;\mu)\cap\Phi_{L,\mu}\times(0,\delta),\\ q_{\rm min}&\leq\chi_{R,\mu}(x)\leq q_{\rm max},\qquad\text{for all}~{}(x;\mu)\in\tilde{g}(\Omega;\mu)\cap\Phi_{L,\mu}\times(0,\delta).\end{split} (5.15)

Step 5 — Construct an induced map FF.
Let

Ψ=g~(Ω;μ)ΦL,μ,\Psi=\tilde{g}(\Omega;\mu)\cap\Phi_{L,\mu}\,,

as indicated in Fig. 6. In this step we introduce an induced map F:ΨΨF:\Psi\to\Psi that provides the first return to Ψ\Psi under iterations of g~\tilde{g}.

Fix μ(0,δ)\mu\in(0,\delta). Points in Ψ\Psi map to g~(Ω;μ)ΦR,μ\tilde{g}(\Omega;\mu)\cap\Phi_{R,\mu} under χL,μ\chi_{L,\mu} iterations of g~L\tilde{g}_{L}. Similarly, points in g~(Ω;μ)ΦR,μ\tilde{g}(\Omega;\mu)\cap\Phi_{R,\mu} map to Ψ\Psi under χR,μ\chi_{R,\mu} iterations of g~=g~R\tilde{g}=\tilde{g}_{R}. Consequently we can define F:ΨΨF:\Psi\to\Psi by

F(x)=g~Rq(g~Lp(x;μ);μ),F(x)=\tilde{g}_{R}^{q}\big{(}\tilde{g}_{L}^{p}(x;\mu);\mu\big{)}, (5.16)

where p=χL,μ(x)p=\chi_{L,\mu}(x) and q=χR,μ(g~Lp(x;μ))q=\chi_{R,\mu}\mathopen{}\mathclose{{}\left(\tilde{g}_{L}^{p}(x;\mu)}\right). Let Σ={x2|g~i(x)Σ for some i0}\Sigma_{\infty}=\mathopen{}\mathclose{{}\left\{x\in\mathbb{R}^{2}\,\middle|\,\tilde{g}^{i}(x)\in\Sigma\text{~{}for some~{}}i\geq 0}\right\} be the set of all points whose forward orbits under g~\tilde{g} intersect Σ\Sigma. Then DF(x;μ){\rm D}F(x;\mu) is well-defined at any xΨΣx\in\Psi\setminus\Sigma_{\infty}. By (5.2) and (5.12),

DF(x;μ)ARqALp<η1,\mathopen{}\mathclose{{}\left\|{\rm D}F(x;\mu)-A_{R}^{q}A_{L}^{p}}\right\|<\eta_{1}\,, (5.17)

where pp and qq are as in (5.16). Note (5.17) also relies on the bounds pminppmaxp_{\rm min}\leq p\leq p_{\rm max} and qminqqmaxq_{\rm min}\leq q\leq q_{\rm max} provided by (5.15).

Step 6 — Bound the Lyapunov exponent.
Finally we verify (3.11). Choose any xΩΣx\in\Omega\setminus\Sigma_{\infty}. We have g~(x;μ)Bε12(g(Ω))\tilde{g}(x;\mu)\in B_{\frac{\varepsilon_{1}}{2}}(g(\Omega)) by (5.11) because εε12\varepsilon\leq\frac{\varepsilon_{1}}{2}. Then by (5.14) and (i)–(iii) of Step 2, there exists kpmax+qmax+1k\leq p_{\rm max}+q_{\rm max}+1 such that g~k(x;μ)Ψ\tilde{g}^{k}(x;\mu)\in\Psi. Let x(0)=g~k(x;μ)x^{(0)}=\tilde{g}^{k}(x;\mu). Also let u(0)C{𝟎}u^{(0)}\in C\setminus\{{\bf 0}\} and u=(Dg~k(x;μ))1u(0)u=\mathopen{}\mathclose{{}\left({\rm D}\tilde{g}^{k}(x;\mu)}\right)^{-1}u^{(0)} (the inverse exists by the last remark in Step 1).

For each j1j\geq 1, let x(j)=F(x(j1))x^{(j)}=F\mathopen{}\mathclose{{}\left(x^{(j-1)}}\right) and let pj1p_{j-1} and qj1q_{j-1} be the corresponding pp and qq values in (5.16). Then for all j1j\geq 1 we have x(j)=g~nj(x;μ)x^{(j)}=\tilde{g}^{n_{j}}(x;\mu) where nj=k+(p0+q0)+(p1+q1)++(pj1+qj1)n_{j}=k+(p_{0}+q_{0})+(p_{1}+q_{1})+\cdots+(p_{j-1}+q_{j-1}). For all j1j\geq 1, let u(j)=DF(x(j1))u(j1)u^{(j)}={\rm D}F\mathopen{}\mathclose{{}\left(x^{(j-1)}}\right)u^{(j-1)}. By (5.1) and (5.17) and an inductive argument on jj, we have u(j)Cu^{(j)}\in C and u(j)>c+12u(j1)\|u^{(j)}\|>\frac{c+1}{2}\|u^{(j-1)}\| for all j1j\geq 1. Hence u(j)>(c+12)ju(0)\|u^{(j)}\|>\mathopen{}\mathclose{{}\left(\frac{c+1}{2}}\right)^{j}\|u^{(0)}\|. Since u(j)=Dg~nj(x;μ)uu^{(j)}={\rm D}\tilde{g}^{n_{j}}(x;\mu)u, we have

lim infn1nln(Dg~n(x;μ)u)\displaystyle\liminf_{n\to\infty}\frac{1}{n}\ln\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|{\rm D}\tilde{g}^{n}(x;\mu)u}\right\|}\right) =lim infj1njln(u(j))\displaystyle=\liminf_{j\to\infty}\frac{1}{n_{j}}\ln\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|u^{(j)}}\right\|}\right)
lim infj1k+j(pmax+qmax)ln((c+12)ju(0))\displaystyle\geq\liminf_{j\to\infty}\frac{1}{k+j(p_{\rm max}+q_{\rm max})}\ln\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left(\tfrac{c+1}{2}}\right)^{j}\|u^{(0)}\|}\right)
=ln(c+12)pmax+qmax>0.\displaystyle=\frac{\ln\mathopen{}\mathclose{{}\left(\frac{c+1}{2}}\right)}{p_{\rm max}+q_{\rm max}}>0.

Since the coordinate transformation ϕμ\phi_{\mu} is invertible, the same bound applies to ff with y=ϕμ1(γμx)y=\phi^{-1}_{\mu}(\gamma\mu x) and v=(Dϕμ(y))1uv=\mathopen{}\mathclose{{}\left({\rm D}\phi_{\mu}(y)}\right)^{-1}u, i.e. (3.11). Since this applies to any xΩΣx\in\Omega\setminus\Sigma_{\infty}, where Σ\Sigma_{\infty} has zero Lebesgue measure, (3.11) holds for Lebesgue almost all yΛμy\in\Lambda_{\mu}. \Box

6 An application to power converters

Power converters take a raw input voltage and use control strategies to produce an output that is close to a desired voltage [20, 21]. These have applications in many areas including personal electronic equipment where the voltage from a domestic electricity supplier is different from the voltage required by the device. A prototypical DC/DC power converter model described in [22, 23] is written in terms of time-dependent variables X(t)X(t) and Y(t)Y(t) that represent linear combinations of an internal current and voltage as

dXdt=λ1(XH(ξ(t)η(t))),dYdt=λ2(YH(ξ(t)η(t))),\begin{split}\frac{dX}{dt}&=\lambda_{1}\big{(}X-H(\xi(\lfloor t\rfloor)-\eta(t))\big{)},\\ \frac{dY}{dt}&=\lambda_{2}\big{(}Y-H(\xi(\lfloor t\rfloor)-\eta(t))\big{)},\end{split} (6.1)

where HH is the Heaviside function and

ξ(t)\displaystyle\xi(t) =X(t)θY(t)+q2ω,\displaystyle=X(t)-\theta Y(t)+\frac{q}{2\omega}, (6.2)
η(t)\displaystyle\eta(t) =qαω(tt).\displaystyle=\frac{q}{\alpha\omega}\big{(}t-\lfloor t\rfloor\big{)}. (6.3)

The function ξ(t)\xi(t) is the control signal employed by the converter. In this model time has been scaled so that the switching period is 11, see section 5.2 of [22] for more details. The floor function t\lfloor t\rfloor denotes the largest integer less than or equal to tt.

Here we fix

λ1=0.977,λ2=0.232,q=35.606,θ=4.2,α=70,\begin{split}\lambda_{1}&=-0.977,\\ \lambda_{2}&=-0.232,\\ q&=35.606,\\ \theta&=4.2,\\ \alpha&=70,\end{split} (6.4)

and vary the value of ω\omega which represents input voltage and is a controllable parameter. The values (6.4) are as given in [23] except we have used a slightly larger value of α\alpha so that the border-collision bifurcation produces a chaotic attractor instead of an invariant torus.

Refer to caption
Figure 7: A time series of (6.1) with (6.4) and ω=5.9\omega=5.9 showing the control signal ξ(t)\xi(t) on the vertical axis. Also ξ(t)\xi(\lfloor t\rfloor) is shown in purple and η(t)\eta(t) is shown in green. Where ξ(t)>η(t)\xi(\lfloor t\rfloor)>\eta(t) we have H=1H=1 in (6.1) and the time series is coloured black; where ξ(t)<η(t)\xi(\lfloor t\rfloor)<\eta(t) we have H=0H=0 in (6.1) and the time series is coloured orange.

Fig. 7 shows a typical time series of (6.1). On the vertical axis we have plotted ξ(t)\xi(t) which is an affine function of the variables. The system switches from H=1H=1 (black) to H=0H=0 (orange) when the purple line ξ(t)\xi(\lfloor t\rfloor) meets the green line η(t)\eta(t), and switches back to H=1H=1 at integer times.

We now provide a stroboscopic map that captures the dynamics of (6.1). This map is given in [23] and is straight-forward to derive. Let w(n)=(X(n),Y(n))w^{(n)}=(X(n),Y(n)), for nn\in\mathbb{Z}, denote the solution to (6.1) at integer times. The stroboscopic map, w(n+1)=p(w(n))w^{(n+1)}=p\mathopen{}\mathclose{{}\left(w^{(n)}}\right), is

p(w)=[eλ1(w11)+eλ1(1z)eλ2(w21)+eλ2(1z)],p(w)=\begin{bmatrix}{\rm e}^{\lambda_{1}}(w_{1}-1)+{\rm e}^{\lambda_{1}(1-z)}\\ {\rm e}^{\lambda_{2}}(w_{2}-1)+{\rm e}^{\lambda_{2}(1-z)}\end{bmatrix}, (6.5)

where

z={0,φ0,αωφq,0φqαω,1,φqαω,z=\begin{cases}0,&\varphi\leq 0,\\ \frac{\alpha\omega\varphi}{q},&0\leq\varphi\leq\frac{q}{\alpha\omega},\\ 1,&\varphi\geq\frac{q}{\alpha\omega},\end{cases} (6.6)

and

φ=w1θw2+q2ω.\varphi=w_{1}-\theta w_{2}+\frac{q}{2\omega}. (6.7)

Fig. 8 shows a bifurcation diagram of (6.5). As the value of ω\omega is increased, a stable fixed point undergoes a border-collision bifurcation at

ωBCB=q1θ(1α12)5.4045.\omega_{\rm BCB}=\frac{q}{1-\theta}\mathopen{}\mathclose{{}\left(\frac{1}{\alpha}-\frac{1}{2}}\right)\approx 5.4045. (6.8)

Numerical simulations suggest that a chaotic attractor is created in the border-collision bifurcation. The numerically computed Lyapunov exponent remains positive until ω5.555\omega\approx 5.555. Fig. 9(a) shows a phase portrait of the chaotic attractor.

Refer to caption
Figure 8: A bifurcation diagram (lower plot) of (6.5) with (6.4) and a numerically computed Lyapunov exponent λ\lambda (upper plot). A border-collision bifurcation occurs at ω=ωBCB5.4045\omega=\omega_{\rm BCB}\approx 5.4045. For 50005000 different values of ω\omega the Lyapunov exponent was estimated from 10610^{6} iterates of one orbit.
Refer to captionRefer to captiona)b)
Figure 9: Panel (a) shows a phase portrait of (6.5) with (6.4) and ω=5.45\omega=5.45. Specifically we show 10410^{4} points of a forward orbit with the first 100100 points removed. Panel (b) similarly shows part of the forward orbit of the corresponding two-dimensional BCNF, except converted to the coordinates of the stroboscopic map.

The map (6.5) has two switching manifolds. The border-collision bifurcation occurs on the switching manifold φ=qαω\varphi=\frac{q}{\alpha\omega}, so for the purposes of analyzing the local dynamics associated with this bifurcation we can ignore the φ0\varphi\leq 0 component of (6.6). Upon performing the affine change of variables

y=[qαω(w1θw2+q2ω)1w2],y=\begin{bmatrix}\frac{q}{\alpha\omega}-\mathopen{}\mathclose{{}\left(w_{1}-\theta w_{2}+\frac{q}{2\omega}}\right)\\ 1-w_{2}\end{bmatrix}, (6.9)

the map (without the φ0\varphi\leq 0 component of (6.6)) takes the form (2.1). Also let

μ=ωωBCB,\mu=\omega-\omega_{\rm BCB}\,, (6.10)

which with (2.2) is satisfied, i.e. the border-collision bifurcation occurs at y=𝟎y={\bf 0} when μ=0\mu=0. By differentiating (6.5) and evaluating it at the border-collision bifurcation we obtain

τL=eλ1+eλ2,δL=eλ1+λ2,τR=eλ1+eλ2α21θ1(λ1θλ2),δR=eλ1+λ2α21θ1(λ1eλ2θλ2eλ1).\begin{split}\tau_{L}&={\rm e}^{\lambda_{1}}+{\rm e}^{\lambda_{2}},\\ \delta_{L}&={\rm e}^{\lambda_{1}+\lambda_{2}},\\ \tau_{R}&={\rm e}^{\lambda_{1}}+{\rm e}^{\lambda_{2}}-\frac{\frac{\alpha}{2}-1}{\theta-1}\big{(}\lambda_{1}-\theta\lambda_{2}\big{)},\\ \delta_{R}&={\rm e}^{\lambda_{1}+\lambda_{2}}-\frac{\frac{\alpha}{2}-1}{\theta-1}\mathopen{}\mathclose{{}\left(\lambda_{1}{\rm e}^{\lambda_{2}}-\theta\lambda_{2}{\rm e}^{\lambda_{1}}}\right).\end{split} (6.11)

By substituting (6.4) and (6.8) into (6.11) and rounding to four decimal places we obtain

τL=1.1694,δL=0.2985,τR=1.1970,δR=4.6325.\begin{split}\tau_{L}&=1.1694,\\ \delta_{L}&=0.2985,\\ \tau_{R}&=1.1970,\\ \delta_{R}&=4.6325.\end{split} (6.12)
Refer to caption
Figure 10: A forward invariant region ΩFI\Omega_{\rm FI} for the BCNF (2.4) with parameter values (6.12) corresponding to the border-collision bifurcation of the power converter model. The regions DpD_{p} and EqE_{q} are shaded as in Fig. 3.

We now show that this border-collision bifurcation satisfies the conditions of Theorem 3.11. Certainly (2.9) is satisfied; (2.10) is also satisfied due to signs choices made when constructing (6.9). Conditions (3.1)–(3.3) are satisfied with p=p^{*}=\infty in Definition 4.2 (the eigenvalues of ALA_{L} are complex) and q=2q^{*}=2 and q=3q^{**}=3 in Definition 4.3.

With (6.12) the BCNF gg has the forward invariant region ΩFI\Omega_{\rm FI} shown in Fig. 10. This region was constructed using the algorithm of [9] (specifically ΩFI\Omega_{\rm FI} is the union of images of a ‘recurrent’ region Ωrec\Omega_{\rm rec}). For ΩFI\Omega_{\rm FI} we have

pmin\displaystyle p_{\rm min} =6,\displaystyle=6, pmax\displaystyle p_{\rm max} =8,\displaystyle=8, qmin\displaystyle q_{\rm min} =2,\displaystyle=2, qmax\displaystyle q_{\rm max} =3,\displaystyle=3, (6.13)

in (3.6)–(3.9). While ΩFI\Omega_{\rm FI} does not map to its interior, i.e. only g(ΩFI)ΩFIg(\Omega_{\rm FI})\subset\Omega_{\rm FI}, by generalising the approach used in [19] we have observed numerically that ΩFI\Omega_{\rm FI} can be shrunk by a small amount to produce a trapping region ΩΩFI\Omega\subset\Omega_{\rm FI} that necessarily satisfies (3.6)–(3.9) with the same bounds on pp and qq.

For the collection 𝐌Ω{\bf M}_{\Omega}, the algorithm in [9] also produces the invariant expanding cone

C={a[cos(θ)sin(θ)]|a,θ0θθ1},C=\mathopen{}\mathclose{{}\left\{a\begin{bmatrix}\cos(\theta)\\ \sin(\theta)\end{bmatrix}\,\middle|\,a\in\mathbb{R},\,\theta_{0}\leq\theta\leq\theta_{1}}\right\},

where θ00.8062\theta_{0}\approx 0.8062 and θ12.0227\theta_{1}\approx 2.0227. By Proposition 8.1 of [9], this cone can be enlarged slightly to produce a cone that is contracting-invariant and expanding.

This shows that the border-collision bifurcation of the power converter model satisfies the conditions of Theorem 3.11. We can therefore conclude that the model has a chaotic attractor for all ωBCB<ω<ωBCB+δ\omega_{\rm BCB}<\omega<\omega_{\rm BCB}+\delta, for some δ>0\delta>0. Based on the numerically computed Lyapunov exponent shown in Fig. 8, we could possibly take δ=0.15\delta=0.15. By inverting the coordinate changes required to go from the stroboscopic map pp to the BCNF gg, we were able to reproduce the attractor of gg with (6.12) in the coordinates of pp, and this shown in Fig. 9(b).

7 Discussion

The border-collision normal form has attracted a great deal of attention because it acts as a paradigm for the dynamics of general piecewise-smooth systems, and because it has a broad variety of applications. In both contexts it is important to know which features of the dynamics are particular to the piecewise-linear character of the BCNF, and which are persistent features of piecewise-smooth systems. Certainly in applications this question is fundamental to the interpretation of results.

In this paper we have established techniques that can be used to prove that a chaotic attractor observed in the BCNF persists with the addition of nonlinear terms close to a border-collision bifurcation. The techniques are sufficiently simple that the conditions can be checked in explicit examples, and we have achieved this for a prototypical power converter model. This approach appears to be quite effective because it is possible to use analytic (or simple, finite numerical calculations) to prove the existence of chaotic attractors in the BCNF, then use persistence arguments to infer the existence of chaotic attractors in the original system. This eliminates the need to compute asymptotic quantities, such as Lyapunov exponents, or to rely on a visual examination of numerically computed attractors.

For the power converter model, the chaotic attractor appears to vary continuously (with respect to Hausdorff metric) as ω\omega is varied just past the border-collision bifurcation. This is why the attractor of the BCNF shown in Fig. 9(b) closely resembles that of the power converter shown in Fig. 9(a). It remains to determine general conditions that ensure continuity, perhaps by employing the result of Hoang et. al. [24], see [25].

This work feeds into a larger (and often unspoken) question about the BCNF: is it a normal form in the strict, bifurcation theory sense of the term [26]? The dynamics of a (strict) normal form is equivalent to those of the original system in some neighbourhood of parameter space and phase space. In contrast, in Theorem 3.11 the neighbourhood Bμs(𝟎)B_{\mu s}({\bf 0}) shrinks to a point at the bifurcation. These concepts depend on the type of equivalence being used and the result of this paper comes a step closer to showing that, with an appropriate definition of equivalence, the BCNF can indeed be a normal form in this stronger, technical sense.

Acknowledgements

The authors were supported by Marsden Fund contract MAU1809, managed by Royal Society Te Apārangi.

Appendix A Calculations for the regions EiE_{i}

Here we prove Lemmas 4.3 and 4.4.

We first show how the values of nqn_{q} and dqd_{q} can be computed iteratively. By substituting x2=nq1x1+dq1x_{2}=n_{q-1}x_{1}+d_{q-1}, which represents gR(q1)(Σ)g_{R}^{-(q-1)}(\Sigma), into

gR1(x)=[1δRx2x1+τRδRx21],g_{R}^{-1}(x)=\begin{bmatrix}-\frac{1}{\delta_{R}}\,x_{2}\\ x_{1}+\frac{\tau_{R}}{\delta_{R}}\,x_{2}-1\end{bmatrix}, (A.1)

we obtain

gR1([x1nq1x1+dq1])=[nq1δRτRnq1δR+1]x1+[dq1δRτRdq1δR1],g_{R}^{-1}\mathopen{}\mathclose{{}\left(\begin{bmatrix}x_{1}\\ n_{q-1}x_{1}+d_{q-1}\end{bmatrix}}\right)=\begin{bmatrix}-\frac{n_{q-1}}{\delta_{R}}\\ \frac{\tau_{R}n_{q-1}}{\delta_{R}}+1\end{bmatrix}x_{1}+\begin{bmatrix}-\frac{d_{q-1}}{\delta_{R}}\\ \frac{\tau_{R}d_{q-1}}{\delta_{R}}-1\end{bmatrix}, (A.2)

which represents gRq(Σ)g_{R}^{-q}(\Sigma). From (A.2) we see that the slope and x2x_{2}-intercept of gRq(Σ)g_{R}^{-q}(\Sigma) are

nq=δRnq1τR,dq=dq1nq11,\begin{split}n_{q}&=-\frac{\delta_{R}}{n_{q-1}}-\tau_{R}\,,\\ d_{q}&=-\frac{d_{q-1}}{n_{q-1}}-1,\end{split} (A.3)

assuming nq10n_{q-1}\neq 0. If nq1=0n_{q-1}=0, then nq=n_{q}=\infty, and from (A.2) we see that gRq(Σ)g_{R}^{-q}(\Sigma) is the vertical line x1=dq1δRx_{1}=-\frac{d_{q-1}}{\delta_{R}}. Further, by substituting this into (A.1) we obtain

gR1([dq1δRx2])=[1δRτRδR]x2+[0dq1δR1],g_{R}^{-1}\mathopen{}\mathclose{{}\left(\begin{bmatrix}-\frac{d_{q-1}}{\delta_{R}}\\ x_{2}\end{bmatrix}}\right)=\begin{bmatrix}-\frac{1}{\delta_{R}}\\ \frac{\tau_{R}}{\delta_{R}}\end{bmatrix}x_{2}+\begin{bmatrix}0\\ -\frac{d_{q-1}}{\delta_{R}}-1\end{bmatrix},

and therefore in this case the slope and x2x_{2}-intercept of gR(q+1)(Σ)g_{R}^{-(q+1)}(\Sigma) are

nq+1=τR,dq+1=dq1δR1.\begin{split}n_{q+1}&=-\tau_{R}\,,\\ d_{q+1}&=-\frac{d_{q-1}}{\delta_{R}}-1.\end{split} (A.4)

To obtain starting values for the iterations, notice gR(Σ)g_{R}(\Sigma) is the x1x_{1}-axis, so n1=d1=0n_{-1}=d_{-1}=0. By substituting these into (A.4) we obtain n1=τRn_{1}=-\tau_{R} and d1=1d_{1}=-1.

In summary, starting with n1=τRn_{1}=-\tau_{R} and d1=1d_{1}=-1 we can use (A.3) to iteratively generate nqn_{q} and dqd_{q} for all q1q\geq 1, using instead (A.4) for the special case nq1=0n_{q-1}=0.

We now prove Lemmas 4.3 and 4.4 together. This is achieved by carefully characterising the regions EqE_{q} and here the reader may find it helpful to refer to Fig. 3.

Proof of Lemmas 4.3 and 4.4.

We first describe the backwards orbit of 𝟎{\bf 0} under gRg_{R}. Notice 𝟎Σ{\bf 0}\in\Sigma and gR1(𝟎)=(0,1)Σg_{R}^{-1}({\bf 0})=(0,-1)\in\Sigma. Thus, for all q0q\geq 0, the points gRq(𝟎)g_{R}^{-q}({\bf 0}) and gR(q+1)(𝟎)g_{R}^{-(q+1)}({\bf 0}) lie on gRq(Σ)g_{R}^{-q}(\Sigma). They are distinct points, hence gRq(Σ)g_{R}^{-q}(\Sigma) is the unique line that passes through these points. By (3.2)–(3.3),

gRq(𝟎){x2|x1>0,x2<0},for all q2.g_{R}^{-q}({\bf 0})\in\{x\in\mathbb{R}^{2}\,\big{|}\,x_{1}>0,\,x_{2}<0\},\qquad\text{for all $q\geq 2$}. (A.5)

Thus for each q2q\geq 2 and any point x2x\in\mathbb{R}^{2} sufficiently close to gRq(𝟎)g_{R}^{-q}({\bf 0}), we have χR(x)q1\chi_{R}(x)\geq q-1 by the definition of χR\chi_{R}. Further, there exist points arbitrarily close to gRq(𝟎)g_{R}^{-q}({\bf 0}) such that χR(x)=q1\chi_{R}(x)=q-1. Therefore

gRq(𝟎)cl(Ei),for all i=1,,q2,gRq(𝟎)Eq1,\begin{split}g_{R}^{-q}({\bf 0})&\notin{\rm cl}(E_{i}),\qquad\text{for all $i=1,\ldots,q-2$},\\ g_{R}^{-q}({\bf 0})&\in\partial E_{q-1}\,,\end{split} (A.6)

where cl(){\rm cl}(\cdot) denotes closure and \partial denotes boundary.

Next we characterise the regions EqE_{q} up to q=qq=q^{*}. By definition, E1E_{1} consists of all xΠRx\in\Pi_{R} for which gR(x)1<0g_{R}(x)_{1}<0. We have gR(x)1=τRx1+x2+1g_{R}(x)_{1}=\tau_{R}x_{1}+x_{2}+1, therefore

E1={xΠR|x10,x2<n1x1+d1},E_{1}=\mathopen{}\mathclose{{}\left\{x\in\Pi_{R}\,\big{|}\,x_{1}\geq 0,\,x_{2}<n_{1}x_{1}+d_{1}}\right\}, (A.7)

recalling n1=τRn_{1}=-\tau_{R} and d1=1d_{1}=-1. For each q2q\geq 2, EqE_{q} consists of all xΠRx\in\Pi_{R} for which gR(x)Eq1g_{R}(x)\in E_{q-1}. It follows that

Eq=gR1(Eq1)ΠR,for all q2.E_{q}=g_{R}^{-1}(E_{q-1})\cap\Pi_{R}\,,\qquad\text{for all $q\geq 2$.} (A.8)

By (A.7), E1E_{1} is the region bounded by two rays emanating from gR1(𝟎)=(0,1)g_{R}^{-1}({\bf 0})=(0,-1). One ray is part of Σ\Sigma, the other ray is part of gR1(Σ)g_{R}^{-1}(\Sigma) and contains the point gR2(𝟎)g_{R}^{-2}({\bf 0}). Then from (A.6) and (A.8), for all q{1,,q}q\in\{1,\ldots,q^{*}\} the region EqE_{q} is bounded by two rays emanating from gRq(𝟎)g_{R}^{-q}({\bf 0}). One ray is part of gR(q1)(Σ)g_{R}^{-(q-1)}(\Sigma), the other ray is part of gRq(Σ)g_{R}^{-q}(\Sigma) and contains the point gR(q+1)(𝟎)g_{R}^{-(q+1)}({\bf 0}). By (A.5) and the definition of qq^{*}, EqΦR=E_{q}\cap\Phi_{R}=\varnothing for all q{1,,q1}q\in\{1,\ldots,q^{*}-1\}, whereas EqΦRE_{q^{*}}\cap\Phi_{R}\neq\varnothing.

Notice EqE_{q^{*}} contains an infinite section of the positive x1x_{1}-axis. The preimage of the x1x_{1}-axis under gRg_{R} is the x2x_{2}-axis, thus Eq+1E_{q^{*}+1} contains an infinite section of the positive x2x_{2}-axis. Therefore Eq+1E_{q^{*}+1} has three boundaries:

  1. i)

    a ray (part of gRq(Σ)g_{R}^{-q^{*}}(\Sigma) emanating from gR(q+1)(𝟎)g_{R}^{-(q^{*}+1)}({\bf 0})),

  2. ii)

    the line segment from gR(q+1)(𝟎)g_{R}^{-(q^{*}+1)}({\bf 0}) to (0,dq+1)\mathopen{}\mathclose{{}\left(0,d_{q^{*}+1}}\right), and

  3. iii)

    the part of Σ\Sigma above (0,dq+1)\mathopen{}\mathclose{{}\left(0,d_{q^{*}+1}}\right).

Finally, if q>q+1q^{**}>q^{*}+1, then for all q{q+2,,q}q\in\{q^{*}+2,\ldots,q^{**}\}, EqE_{q} is the triangle with vertices gRq(𝟎)g_{R}^{-q}({\bf 0}), (0,dq1)(0,d_{q-1}), and (0,dq)(0,d_{q}). This in part relies on the observation dq>1d_{q^{**}}>-1 which is a consequence of (A.3) and the definition of qq^{**}. For each q{q+1,,q}q\in\{q^{*}+1,\ldots,q^{**}\} we have EqΦRE_{q}\cap\Phi_{R}\neq\varnothing because dq1>0d_{q-1}>0. Our precise description of the EqE_{q} implies ΦRq=qqEq\Phi_{R}\subset\bigcup_{q=q^{*}}^{q^{**}}E_{q}, and this completes the proof Lemma 4.3. Further, we have shown that the boundaries of EqE_{q} do not coincide with an interval the x1x_{1}-axis or have vertices on the x1x_{1}-axis, and this completes the proof of Lemma 4.4. ∎

References

  • [1] C. Liu, A. Di Falco, D. Molinari, Y. Khan, B.S. Ooi, T.F. Krauss, and A. Fratalocchi. Enhanced energy storage in chaotic optical resonators. Nature Photon., 7:473–478, 2013.
  • [2] A. Kumar, S.F. Ali, and A. Arockiarajan. Enhanced energy harvesting from nonlinear oscillators via chaos control. IFAC-PapersOnLine, 49(1):35–40, 2016.
  • [3] J.H.B. Deane and D.C. Hamill. Improvement of power supply EMC by chaos. Electron. Lett., 32(12):1045, 1996.
  • [4] H.E. Nusse and J.A. Yorke. Border-collision bifurcations including “period two to period three” for piecewise smooth systems. Phys. D, 57:39–57, 1992.
  • [5] M. di Bernardo, C.J. Budd, A.R. Champneys, and P. Kowalczyk. Piecewise-smooth Dynamical Systems. Theory and Applications. Springer-Verlag, New York, 2008.
  • [6] S. Banerjee, J.A. Yorke, and C. Grebogi. Robust chaos. Phys. Rev. Lett., 80(14):3049–3052, 1998.
  • [7] S. Banerjee and C. Grebogi. Border collision bifurcations in two-dimensional piecewise smooth maps. Phys. Rev. E, 59(4):4052–4061, 1999.
  • [8] P. Glendinning. Robust chaos revisited. Eur. Phys. J. Special Topics, 226(9):1721–1738, 2017.
  • [9] P. Glendinning and D.J.W. Simpson. Chaos in the border-collision normal form: A computer-assisted proof using induced maps and invariant expanding cones. arXiv:2108.05999, 2021.
  • [10] V.A. Brousin, Yu.I. Neimark, and M.I. Feigin. On some cases of dependence of periodic motions of relay system upon parameters. Izv. Vyssh. Uch. Zav. Radiofizika, 4:785–800, 1963. In Russian.
  • [11] M.I. Feigin. Doubling of the oscillation period with CC-bifurcations in piecewise continuous systems. Prikl. Mat. Mekh., 34(5):861–869, 1970. In Russian.
  • [12] C.H. Hommes and H.E. Nusse. “Period three to period two” bifurcation for piecewise linear models. J. Economics, 54(2):157–169, 1991.
  • [13] M. Dutta, H.E. Nusse, E. Ott, J.A. Yorke, and G. Yuan. Multiple attractor bifurcations: A source of unpredictability in piecewise smooth systems. Phys. Rev. Lett., 83(21):4281–4284, 1999.
  • [14] P. Glendinning. Bifurcation from stable fixed point to 2D attractor in the border collision normal form. IMA J. Appl. Math., 81(4):699–710, 2016.
  • [15] D.J.W. Simpson and J.D. Meiss. Shrinking point bifurcations of resonance tongues for piecewise-smooth, continuous maps. Nonlinearity, 22(5):1123–1144, 2009.
  • [16] D.J.W. Simpson. The structure of mode-locking regions of piecewise-linear continuous maps: I. Nearby mode-locking regions and shrinking points. Nonlinearity, 30(1):382–444, 2017.
  • [17] M. di Bernardo, P. Kowalczyk, and A. Nordmark. Sliding bifurcations: A novel mechanism for the sudden onset of chaos in dry friction oscillators. Int. J. Bifurcation Chaos, 13(10):2935–2948, 2003.
  • [18] D.J.W. Simpson. Border-collision bifurcations in n\mathbb{R}^{n}. SIAM Rev., 58(2):177–226, 2016.
  • [19] D.J.W. Simpson. Detecting invariant expanding cones for generating word sets to identify chaos in piecewise-linear maps. Submitted., 2020.
  • [20] S. Banerjee and G.C. Verghese, editors. Nonlinear Phenomena in Power Electronics. IEEE Press, New York, 2001.
  • [21] C.K. Tse. Complex Behavior of Switching Power Converters. CRC Press, Boca Raton, FL, 2003.
  • [22] Z.T. Zhusubaliyev and E. Mosekilde. Bifurcations and Chaos in Piecewise-Smooth Dynamical Systems. World Scientific, Singapore, 2003.
  • [23] Z.T. Zhusubaliyev and E. Mosekilde. Equilibrium-torus bifurcation in nonsmooth systems. Phys. D, 237:930–936, 2008.
  • [24] L.T. Hoang, E.J. Olson, and J.C. Robinson. On the continuity of global attractors. Proc. Amer. Math. Soc., 143:4389–4395, 2015.
  • [25] P.A. Glendinning and D.J.W. Simpson. Robust chaos and the continuity of attractors. Trans. Math. Appl., 4(1):tnaa002, 2020.
  • [26] Yu.A. Kuznetsov. Elements of Bifurcation Theory., volume 112 of Appl. Math. Sci. Springer-Verlag, New York, 3rd edition, 2004.