This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Local well-posedness for quasilinear problems: a primer

Mihaela Ifrim Department of Mathematics, University of Wisconsin, Madison [email protected]  and  Daniel Tataru Department of Mathematics, University of California at Berkeley [email protected]
Abstract.

Proving local well-posedness for quasilinear problems in pde’s presents a number of difficulties, some of which are universal and others of which are more problem specific. On one hand, a common standard for what well-posedness should mean has existed for a long time, going back to Hadamard. On the other hand, in terms of getting there, there are by now both many variations, but also many misconceptions.

The aim of these expository notes is to collect a number of both classical and more recent ideas in this direction, and to assemble them into a cohesive road-map that can be then adapted to the reader’s problem of choice.

Key words and phrases:
quasilinear evolutions, local well-posedness, frequency envelopes
1991 Mathematics Subject Classification:
Primary: 35L45, 35L50, 35L60.

1. Introduction

Local well-posedness is the first question to ask for any evolution problem in partial differential equations. These notes, prepared by the authors for a summer school at MSRI [13] in 2020, aim to discuss ideas and strategies for local well-posedness in quasilinear and fully nonlinear evolution equations, primarily of hyperbolic type. We hope to persuade the reader that the structure presented here should be adopted as the standard for proving these results. Of course, there are many possible variations, and we try to point out some of them in our many remarks. While a few of the ideas here can be found in several of the classical books, see e.g. [30],[10], [3],[24], some of the others have appeared only in articles devoted to specific problems, and have never been collected together, to the best of our knowledge.

1.1. Nonlinear evolutions

For our exposition we will adopt a two track structure, where we will broadly discuss ideas for a general problem, and in parallel implement these ideas on a simple, classical concrete example.


Our general problem will be a nonlinear partial differential equation of the form

(1.1) ut=N(u),u(0)=u0,u_{t}=N(u),\qquad u(0)=u_{0},

i.e. a first order system in time, where we think of uu as a scalar or a vector valued function belonging to a scale of either real or complex Sobolev spaces. This scale will be chosen to be Hs:=Hs(n)H^{s}:=H^{s}(\mathbb{R}^{n}) for the purpose of this discussion, though in practice it often has to be adapted to the class of problems to be considered. The nonlinearity NN represents a nonlinear function of uu and its derivatives,

N(u)=N({αu}|α|k),N(u)=N(\{\partial^{\alpha}u\}_{|\alpha|\leq k}),

where we will refer to kk as the order of the evolution. Here typical examples include k=1k=1 (hyperbolic equations), k=2k=2 (Schrödinger type evolutions) and k=3k=3 (KdV type evolutions). But many other situations arise in models which are nonlocal, e.g. in water waves one encounters k=12k=\frac{1}{2} for gravity waves respectively k=32k=\frac{3}{2} for capillary waves.

Some problems are most naturally formulated as second order evolutions in time, for instance nonlinear wave equations. While some such problems admit also good first order in time formulations (e.g. the compressible Euler flow), it is sometimes better to treat them as second order. Regardless, our road-map still applies, with obvious adjustments.


Our model problem will be a classical first order symmetric hyperbolic system in ×n\mathbb{R}\times\mathbb{R}^{n}, of the form

(1.2) tu=𝒜j(u)ju,u(0)=u0,\partial_{t}u={\mathcal{A}}^{j}(u)\partial_{j}u,\qquad u(0)=u_{0},

where uu takes values in m\mathbb{R}^{m} and the m×mm\times m matrices 𝒜j{\mathcal{A}}^{j} are symmetric, and smooth as functions of uu. Here the order of the nonlinearity NN is k=1k=1, and the scale of Sobolev spaces to be used is indeed the Sobolev scale.


1.2. What is well-posedness ?

To set the expectations for our problems, we recall the classical Hadamard standard for well-posedness, formulated relative to our chosen scale of spaces.

Definition 1.1.

The problem (1.1) is locally well-posed in a Sobolev space Hs(n)H^{s}(\mathbb{R}^{n}) if the following properties are satisfied:

(i) Existence:

For each u0Hsu_{0}\in H^{s} there exists some time T>0T>0 and a solution uC([0,T];Hs)u\in C([0,T];H^{s}).

(ii) Uniqueness:

The above solution is unique in C([0,T];Hs)C([0,T];H^{s}).

(iii) Continuous dependence:

The data to solution map is continuous from u0Hsu_{0}\in H^{s} to uC([0,T];Hs)u\in C([0,T];H^{s}).

As a historical remark, we note that Hadamard primarily discussed the question of well-posedness in the context of linear pde’s, specifically for the Laplace and wave equation, beginning with an incipient form in [8], and a more developed form in [9]. It is in the latter reference where the continuous dependence is discussed, seemingly inspired by Cauchy’s theorem for ode’s.

The above definition should not be taken as universal, but rather as a good starting point, which may need to be adjusted depending on the problem. Consider for instance the uniqueness statement, which, as given in (ii), is in the strongest form, which is often referred to as unconditional uniqueness. Often this may need to be relaxed somewhat, particularly when low regularity solutions are concerned. Some common variations concerning uniqueness are as follows:

  1. a)

    The solutions uu in (i) are shown to belong to a smaller space, XTsC([0,T];Hs(n))X^{s}_{T}\subset C([0,T];H^{s}(\mathbb{R}^{n})), and then the uniqueness in (ii) holds in the same class.

  2. b)

    Unconditional uniqueness holds apriori only in a more regular class HNH^{N} with N>sN>s, but the data to solution map extends continuously as a map from HsH^{s} to C([0,T];Hs)C([0,T];H^{s}).

Since we are discussing nonlinear equations here, the lifespan of the solutions need not be infinite, i.e. there is always the possibility that solutions may blow up in finite time. In particular, in the context of well-posed problems it is natural to consider the notion of maximal lifespan, which is the largest TT for which the solution exists in C([0,T);Hs)C([0,T);H^{s}); here the limit of u(t)u(t) as tt approaches TT cannot exist, or else the solution uu may be continued further.

In this context, the last property in the definition should be interpreted to mean in particular that, for a solution uC([0,T];Hs)u\in C([0,T];H^{s}), small perturbations of the initial data u0u_{0} yield solutions which are also defined in [0,T][0,T]. This in turn implies that the maximal lifespan T=T(u0)T=T(u_{0}) is lower semicontinuous as a function of u0Hs(n)u_{0}\in H^{s}(\mathbb{R}^{n}).

In view of the above discussion, it is always interesting to provide more precise assertions about the lifespan of solutions, or, equivalently, continuation (or blow-up) criteria for the solutions. Some interesting examples are as follows:

  1. a)

    The lifespan T(u0)T(u_{0}) is bounded from below uniformly for data in a bounded set,

    T(u0)C(u0Hs)>0.T(u_{0})\geq C(\|u_{0}\|_{H^{s}})>0.

    This implies a blow-up criteria as follows:

    limtT(u0)u(t)Hs=.\lim_{t\to T(u_{0})}\|u(t)\|_{H^{s}}=\infty.
  2. b)

    The blow-up may be characterized in terms of weaker bounds,

    limtT(u0)u(t)Y=.\lim_{t\to T(u_{0})}\|u(t)\|_{Y}=\infty.

    relative to a Banach topology YHsY\supset H^{s}, or perhaps a time integrated version thereof

    0T(u0)u(t)Y𝑑t=.\int_{0}^{T(u_{0})}\|u(t)\|_{Y}dt=\infty.

To conclude our discussion of the above definition, we note that many well-posedness statements also provide additional properties for the flow:

Higher regularity:

if the initial data has more regularity u0Hσu_{0}\in H^{\sigma} with σ>s\sigma>s, then this regularity carries over to the solution, uC[(0,T);Hσ]u\in C[(0,T);H^{\sigma}], with bounds and lifespan bounds depending only on the HsH^{s} size of the data.

Weak Lipschitz bounds:

on bounded sets in HsH^{s}, the flow is Lipschitz in a weaker topology (e.g. up to Hs1H^{s-1} in our model problem).

Both of these properties are often an integral part of a complete theory, and frequently also serve as intermediate steps in establishing the main well-posedness result.

In all of the above discussion, a common denominator remains the fact that the solution to data map is locally continuous, but not uniformly continuous. It is very natural indeed to redefine (expand) the notion of quasilinear evolution equations to include all flows which share this property.


In many problems of this type, one is interested not only in local well-posedness in some Sobolev space HsH^{s}, but also in lowering the exponent ss as much as possible. We will refer to such solutions as rough solutions. Then, a natural question is what kind of regularity thresholds should one expect or aim for in such problems? One clue in this direction comes from the scaling symmetry, whenever available. As an example, our model problem exhibits the scaling symmetry

u(t,x)u(λt,λx),λ>0.u(t,x)\to u(\lambda t,\lambda x),\qquad\lambda>0.

The scale invariant initial data Sobolev space corresponding to this symmetry is the homogeneous space H˙sc\dot{H}^{s_{c}}, where sc=n/2s_{c}=n/2. This space is called the critical Sobolev space, and should heuristically be thought of as an absolute lower bound for any reasonable well-posedness result. Whereas in some semilinear dispersive evolutions one can actually reach this threshold, in nonlinear flows it seems to be out of reach in general.

1.3. A set of results for the model problem

In order to state the results, we begin with a discussion of control parameters. We will use two such control parameters. The first one is

A=uL.A=\|u\|_{L^{\infty}}.

This is a scale invariant quantity, which appears in the implicit constants in all of our bounds. Our second control parameter is

B=uL,B=\|\nabla u\|_{L^{\infty}},

which instead will be shown to control the energy growth in all the energy estimates. Precisely, the norm BB, plays the role of the YY norm mentioned in the discussion above.


The primary well-posedness result for the model problem is as follows:

Theorem 1.

The equation (1.2) is locally well-posed in HsH^{s} in the Hadamard sense for s>d2+1s>\frac{d}{2}+1.

The reader will notice that this result is one derivative above scaling. It is also optimal in some cases, including the scalar case (where the problem can be solved locally using the method of characteristics), but not optimal in many other cases where the system is dispersive.

For the uniqueness result we have in effect a stronger statement, which only requires Lipschitz bounds for uu. This however does not improve the scaling comparison relative to the critical spaces:

Theorem 2.

Uniqueness holds in the Lipschitz class, and we have the L2L^{2} difference bound

(1.3) (u1u2)(t)L2eC(A)0tB(s)𝑑s(u1u2)(0)L2.\|(u_{1}-u_{2})(t)\|_{L^{2}}\lesssim e^{C(A)\int_{0}^{t}B(s)\,ds}\|(u_{1}-u_{2})(0)\|_{L^{2}}.

This is exactly the kind of weak Lipschitz bound discussed earlier. With a bit of additional effort, for the HsH^{s} solutions in Theorem 1 this may be extended to a larger range of Sobolev spaces,

(1.4) (u1u2)L([0,T];Hσ)(u1u2)(0)Hσ,|σ|s1.\|(u_{1}-u_{2})\|_{L^{\infty}([0,T];H^{\sigma})}\lesssim\|(u_{1}-u_{2})(0)\|_{H^{\sigma}},\qquad|\sigma|\leq s-1.

The small price to pay here is that now the implicit constant in the estimate depends not only on AA and BB but also on the norms of u1u_{1} and u2u_{2} in C([0,T];Hs)C([0,T];H^{s}).


A key role in the proof of the well-posedness result is played by the energy estimates, which are also of independent interest:

Theorem 3.

The following bounds hold for for solutions to (1.2) for all s0s\geq 0:

(1.5) u(t)HseC(A)0tB(s)𝑑su(0)Hs.\|u(t)\|_{H^{s}}\lesssim e^{C(A)\int_{0}^{t}B(s)\,ds}\|u(0)\|_{H^{s}}.

Finally, as a corollary of the last result, we obtain a continuation criteria for solutions:

Theorem 4.

Solutions can be continued in HsH^{s} for as long as B\int B remains finite.

Theorem 1 has been first proved by Kato [16], borrowing ideas from nonlinear semigroup theory, see e.g. Barbu’s book [4]. The existence and uniqueness part, as well as the energy estimates, can also be found in standard references, e.g. in the books of Taylor [30], Hörmander [10] and Sogge [24] (in the last two the wave equation is considered, but the idea is similar). However, interestingly enough, the continuous dependence part is missing in all these references. We did find presentations of continuous dependence arguments inspired from Kato’s work in Chemin’s book [3], and also on Tao’s blog [26].

Our objective for remainder of the paper will be to provide complete proofs for the four theorems above, which the reader may take as a guide for his problem of choice. While these results are not new in the model case we consider, to the best of our knowledge this is the first time when the proofs of these results are presented in this manner. Along the way, we will also provide extensive comments and pointers to alternative methods developed along the years.

In particular, we would emphasize the frequency envelope approach for the regularization and continuous dependence parts, as well as the time discretization approach for the existence proof. The frequency envelope approach has been repeatedly used by the authors, jointly with different collaborators, in a number of papers, see e.g. [23], [29],[18], [12], [15], with some of the ideas crystalizing along the way. The version of the existence proof based on a time discretization is in some sense very classical, going back to ideas which have originally appeared in the context of semigroup theory; however, its implementation is inspired from the authors’ recent work [15], though the situation considered here is considerably simpler.

1.4. An outline of these notes

Our strategy will be, in each section, to provide some ideas and a broader discussion in the context of the general equation (1.1), and then show how this works in detail in the context of our chosen example (1.2).

In the next section we introduce the paradifferential form of our equations, both the main equation and its linearization. This is an idea that goes back to work of Bony [6], and helps clarify the roles played by different frequency interaction modes in the equation. Another very useful reference here is Metivier’s more recent book [21].

Section 3 is devoted to the energy estimates, in multiple contexts. These are presented both for the full equation, for its linearization, for its associated linear paradifferential flow, and for differences of solutions. The latter, in turn, yields the uniqueness part of the well-posedness theorem. A common misconception here has been that for well-posedness it suffices to prove energy estimates for the full equation. Instead, in our presentation we regard the bound for the linearized problem as fundamental, though, at the implementation level, it is the paradifferential flow bound which can be found at the core.

Section 4 provides two approaches for the existence part of the well-posedness theorem. The first one, more classical, is based on an iteration scheme, which works well on our model problem but may run into implementation issues in more complex problems. The second approach, which we regard as more robust, relies on a time discretization, and is somewhat related to nonlinear semigroup theory, which also inspired Kato’s work. Two other possible strategies, which have played a role historically, are briefly outlined.

Section 5 introduces Tao’s notion of frequency envelopes (see for example [27]), which is very well suited to track the flow of energy as time progresses. This is then used to show how rough solutions can be obtained as uniform limits of smooth solutions. This is a key step in many well-posedness arguments, and helps decouple the regularity for the initial existence result from the rough data results.

Finally the last section of the paper is devoted to the continuous dependence result, where we provide the modern, frequency envelopes based approach. At the same time, for a clean, elegant reinterpretation of Kato’s original strategy we refer the reader to Tao’s blog [26].

1.5. Acknowledgements

The first author was supported by a Luce Professorship, by the Sloan Foundation, and by an NSF CAREER grant DMS-1845037. The second author was supported by the NSF grant DMS-1800294 as well as by a Simons Investigator grant from the Simons Foundation. Both authors are extremely grateful to MSRI for their full support in holding the graduate summer school “Introduction to water waves” in a virtual format due to the less than ideal circumstances.

2. A menagerie of related equations

While ultimately one would want all the results stated in terms of the full nonlinear equation, any successful approach to quasilinear problems needs to also consider a succession of closely related linear equations, as well as associated reformulations of the nonlinear flow. Here we aim to motivate and describe these related flows, stripping away technicalities.

2.1. The linearized equation

This plays a key role in comparing different solutions; we will write it in the form

(2.1) vt=DN(u)v,v(0)=v0,v_{t}=DN(u)v,\qquad v(0)=v_{0},

where DNDN stands for the differential of NN, which in our setting is a partial differential operator of order kk. One may also reinterpret the equation for the difference of two solutions as a perturbed linearized equation with a quadratic source term; some caution is required here, because often some structure is lost in doing this, and the question is whether that is not too much.

In the particular case of (1.2), the linearized equation takes the form

(2.2) tv=𝒜j(u)jv+D𝒜j(u)vju,v(0)=v0.\partial_{t}v={\mathcal{A}}^{j}(u)\partial_{j}v+D{\mathcal{A}}^{j}(u)v\,\partial_{j}u,\qquad v(0)=v_{0}.

2.2. The linear paradifferential equation

One distinguishing feature of quasilinear evolutions is that the nonlinearity cannot be interpreted as perturbative. Nevertheless, one may seek to separate parts of the nonlinearity which can be seen as perturbative, at least at high regularity, in order to better isolate and understand the nonperturbative part.

To narrow things down, consider a nonlinear term which is quadratic, say of the form αu1βu2\partial^{\alpha}u_{1}\partial^{\beta}u_{2}, and consider the three modes of interaction between these terms, according to the Littlewood-Paley trichotomy, or paraproduct decomposition,

αu1βu2=Tαu1βu2+Tβu2αu1+Π(αu1,βu2),\partial^{\alpha}u_{1}\partial^{\beta}u_{2}=T_{\partial^{\alpha}u_{1}}\partial^{\beta}u_{2}+T_{\partial^{\beta}u_{2}}\partial^{\alpha}u_{1}+\Pi(\partial^{\alpha}u_{1},\partial^{\beta}u_{2}),

where the three terms represent the lowlow-highhigh, highhigh-lowlow respectively the highhigh-highhigh frequency interactions. The high-high interactions in the last term are always perturbative at high regularity, so are placed into the perturbative box. But one cannot do the same with the low-high or high-low interactions, which are kept on the nonperturbative side. This is closely related to the linearization, and indeed, at the end of the day, we are left with a paradifferential style nonperturbative part of our evolution, which we can formally write as

(2.3) wt=TDN(u)w,w(0)=w0.w_{t}=T_{DN(u)}w,\qquad w(0)=w_{0}.

Here, one can naively use Bony’s notion of paraproduct [6] to define the linear operator TDN(u)T_{DN(u)} as

TDN(u)w=|α|kTpαN(u)αw,T_{DN(u)}w=\sum_{|\alpha|\leq k}T_{\partial_{p^{\alpha}}N(u)}\partial^{\alpha}w,

where pαp^{\alpha} is a placeholder for the αu\partial^{\alpha}u argument of the nonlinearity NN. However there are also other related choices one can make, see for instance the discussion at the end of this subsection. For a discussion on the use of paradifferential calculus in nonlinear PDE’s (though not the above notation) we refer the reader to Metivier’s book [21].

One can think of the above evolution as a linear evolution of high frequency waves on a low frequency background. Then one can interpret solving the nonperturbative part of our evolution as an infinite dimensional triangular system, where each dyadic frequency of the solution is obtained at some step by solving a linear system with coefficients depending only on the lower components, and in turn it affects the coefficients of the equations for the higher frequency components. Of course, this should only be understood in a philosophical sense, because a variable coefficient flow in general does not preserve frequency localizations. This can sometimes be achieved with careful choices of the paraproduct quantizations, but it never seems worthwhile to implement, as the perturbative terms will mix frequencies anyway and add tails.

Turning to our model problem, in a direct interpretation the associated paradifferential equation will have the form

(2.4) tw=T𝒜j(u)jw+TD𝒜j(u)juw,w(0)=w0.\partial_{t}w=T_{{\mathcal{A}}^{j}(u)}\partial_{j}w+T_{D{\mathcal{A}}^{j}(u)\partial_{j}u}w\,,\qquad w(0)=w_{0}.

However, upon closer examination one may see several choices that could be made. Considering for instance the first paraproduct, which of the following expressions would make the better choice at frequency 2k2^{k} ?

𝒜j(u)<k8jwk,𝒜j(u<k8)jwk,[𝒜j(u<k8)]<k4jwk.{\mathcal{A}}^{j}(u)_{<k-8}\partial_{j}w_{k},\qquad{\mathcal{A}}^{j}(u_{<k-8})\partial_{j}w_{k},\qquad[{\mathcal{A}}^{j}(u_{<k-8})]_{<k-4}\partial_{j}w_{k}.

The last one may seem the most complicated, but it is also the most accurate. In many cases, including our model problem, it makes no difference in practice. However, one should be aware that often a simpler choice, which is made for convenience in one problem, might not work anymore in a more complex setting.

Remark 2.1.

Here the frequency gap, which was set to be equal to 88 in the above formulas, is chosen rather arbitrarily; its role is simply to enforce the frequency separation between the coefficients and the leading term. On occasion, particularly in large data problems, it is also useful to work instead with a large frequency gap as a proxy for smallness, see e.g. [25].

2.3. The paradifferential formulation of the main equations

Consider first our general equation (1.1), which we can write in the form

(2.5) ut=TDN(u)u+F(u),u(0)=u0.u_{t}=T_{DN(u)}u+F(u),\qquad u(0)=u_{0}.

Here one would hope that the paradifferential source term can be seen as perturbative, in the sense that

F:HsHs, Lipschitz.F:H^{s}\to H^{s},\qquad\text{ Lipschitz}.

Similarly we can write the linearized equation (2.1) in the same format,

(2.6) vt=TDN(u)v+Flin(u)v,v(0)=v0,v_{t}=T_{DN(u)}v+F^{lin}(u)v,\qquad v(0)=v_{0},

with the appropriate nonlinearity FlinF^{lin}. This is still based on the paradifferential equation (2.5), but can no longer be interpreted as the direct paralinearization of the linearized equation. This is because the expression Flin(u)vF^{lin}(u)v also contains some low-high interactions, precisely those where vv is the low frequency factor.

3. Energy estimates

Energy estimates are a critical part of any well-posedness result, even if they do not tell the entire story. In this section we begin with a heuristic discussion of several ideas in the general case, and then continue with some more concrete analysis in the model case.

3.1. The general case

Consider first the energy estimates for the general problem (1.1), where it is simpler to think of this in the paradifferential formulation (2.3). An energy estimate for this problem is an estimate that allows us to control the time evolution of the Sobolev norms of the solution. In the simplest formulation, the idea would be to prove that

ddtuHσ2CuHσ2,\frac{d}{dt}\|u\|^{2}_{H^{\sigma}}\lesssim C\|u\|^{2}_{H^{\sigma}},

with a constant CC that at the very least depends on the HsH^{s} norm of uu.

There are two points that one should take into account when considering such estimates. The first is that it is often useful to strenghten such bounds by relaxing the dependence of the constant CC on uu. Heuristically, the idea is that this constant measures the effect of nonlinear interactions, which are strongest when our functions are pointwise large, not only large in an L2L^{2} sense. Thus, it is often possible to replace the constant CC with an analogue of the uniform control norm BB in the model case, perhaps with some additional implicit dependence on another scale invariant uniform control parameter AA. See however the discussion in Remark 3.2.

A second point is that, although it is tempting to try to work directly with the HsH^{s} norm, it is often the case that the straight HsH^{s} norm is not well adapted to the structure of the problem; see e.g. what happens in water waves [2], [12]. Then it is useful to construct energy functionals EσE^{\sigma} adapted to the problem at hand. For these energies we should aim for the following properties

  1. i)

    Energy equivalence:

    (3.1) Eσ(u)uHσ2.E^{\sigma}(u)\approx\|u\|_{H^{\sigma}}^{2}.
  2. ii)

    Energy propagation

    (3.2) ddtEσ(u)ABuHσ2,\frac{d}{dt}E^{\sigma}(u)\lesssim_{A}B\|u\|^{2}_{H^{\sigma}},

    where the control parameter BB satisfies

    (3.3) BuHs.B\lesssim\|u\|_{H^{s}}.

Now consider our main equation written in the form (2.3). For the perturbative part of the nonlinearity FF we hope to have some boundedness,

(3.4) F(u)HσABuHσ.\|F(u)\|_{H^{\sigma}}\lesssim_{A}B\|u\|_{H^{\sigma}}.

This in turn allows us to reduce nonlinear energy bounds of the form (3.2) to similar bounds for the linear paradifferential equation (2.5). One may legitimately worry here that some structure is lost when we decouple the paradifferential coefficients from the evolution variable; however, the point is that these two objects are indeed separate, as they represent different frequencies of the solution.

Remark 3.1.

In our discussion here we took the simplified view that bounds for FF begin at σ=0\sigma=0. But this is not always the case in practice, and often one needs to identify the lower range for σ\sigma where this works; see e.g. the nonlinear wave equation [23], the wave map equation [29], or the water wave problem considered in [1].

Now consider the paradifferential evolution (2.5), and begin with the L2L^{2} case by setting σ=0\sigma=0. Then we need to produce a linearized type energy Eu0,linE^{0,lin}_{u} so that the solutions satisfy

(3.5) ddtEu0,lin(w)ABwL22.\frac{d}{dt}E^{0,lin}_{u}(w)\lesssim_{A}B\|w\|^{2}_{L^{2}}.

Then the associated nonlinear energy at σ=0\sigma=0 would be

E0(u)=Eu0,lin(u).E^{0}(u)=E^{0,lin}_{u}(u).

If Eu0,lin(w)=wL22E_{u}^{0,lin}(w)=\|w\|_{L^{2}}^{2}, then the bound (3.5) would simply require that the paradifferential operator TDN(u)T_{DN(u)} is essentially antisymmetric in L2L^{2}. If that is not true, then the backup plan is to find an equivalent Hilbert norm on L2L^{2} so that the antisymmetry holds. Some care is however needed; if this norm depends on uu, then this dependence needs to be mild.

The next step is to consider a larger σ\sigma. By interpolation it suffices to work with integer σ\sigma, in which case one might simply differentiate (2.3),

(σw)t=TDN(u)(σw)+[σ,TDN(u)]w.(\partial^{\sigma}w)_{t}=T_{DN(u)}(\partial^{\sigma}w)+[\partial^{\sigma},T_{DN(u)}]w.

Here we would be done if the last commutator is bounded from HσH^{\sigma} into L2L^{2}. In principle that would be the case almost automatically at least when the order kk of NN is at most one. One can heuristically associate this with the finite speed of propagation in the high frequency limit.

Remark 3.2.

The case k>1k>1, which corresponds to an infinite speed of propagation, is often more delicate; see e.g. [18, 17, 19] for quasilinear Schrödinger flows, or [14] for capillary waves. There one needs to further develop the function space structure based on either dispersive properties of solutions, or on normal forms analysis.


3.2. Coifman-Meyer and Moser type estimates

Before considering our model problem, we briefly review some standard bilinear and nonlinear estimates that play a role later on. In the context of bilinear estimates, a standard tool is to consider the Littlewood-Paley paraproduct type decomposition of the product of two functions, which leads to Coifman-Meyer type estimates, see [7], [22]:

Proposition 3.3.

Using the standard paraproduct notations, one has the following estimates

(3.6) TfgL2fLgL2,\displaystyle\|T_{f}g\|_{L^{2}}\lesssim\|f\|_{L^{\infty}}\|g\|_{L^{2}},
TfgL2gBMOfL2,\displaystyle\|T_{f}g\|_{L^{2}}\lesssim\|g\|_{BMO}\|f\|_{L^{2}},
Π(f,g)L2fBMOgL2,\displaystyle\|\Pi(f,g)\|_{L^{2}}\lesssim\|f\|_{BMO}\|g\|_{L^{2}},

as well as the commutator bound

(3.7) [Pk,f]gL22kxfLgL2.\|[P_{k},f]g\|_{L^{2}}\lesssim 2^{-k}\|\partial_{x}f\|_{L^{\infty}}\|g\|_{L^{2}}.

Here PkP_{k} is the Littlewood-Paley projection onto frequencies 2k\approx 2^{k}.

These results are standard in the harmonic/microlocal analysis community. For nonlinear expressions we use Moser type estimates instead:

Proposition 3.4.

The following Moser estimate holds for a smooth function FF, with F(0)=0F(0)=0, and s0s\geq 0:

F(u)HsuLuHs.\|F(u)\|_{H^{s}}\lesssim_{\|u\|_{L^{\infty}}}\|u\|_{H^{s}}.

Of course many more extensions of both the bilinear and the nonlinear estimates above are available.

3.3. The model case

We now turn our attention to our model problem, where, if we adopt the expression (2.4) for the paradifferential flow, the source term F(u)F(u) is given by

(3.8) F(u)=𝒜jjuT𝒜j(u)juTD𝒜j(u)juu.F(u)={\mathcal{A}}^{j}\partial_{j}u-T_{{\mathcal{A}}^{j}(u)}\partial_{j}u-T_{D{\mathcal{A}}^{j}(u)\partial_{j}u}u.

We can rewrite this in the form

(3.9) F(u)=Π(𝒜j(u),ju)+Tju𝒜j(u)TD𝒜j(u)juu.F(u)=\Pi({\mathcal{A}}^{j}(u),\partial_{j}u)+T_{\partial_{j}u}{\mathcal{A}}^{j}(u)-T_{D{\mathcal{A}}^{j}(u)\partial_{j}u}u.

For this expression we can show that it always plays a perturbative role:

Proposition 3.5.

The above nonlinearity FF satisfies the following bounds:

i) Sobolev bounds

(3.10) F(u)HσABuHσ,σ0.\|F(u)\|_{H^{\sigma}}\lesssim_{A}B\|u\|_{H^{\sigma}},\qquad\sigma\geq 0.

ii) Difference bounds

(3.11) F(u)F(v)HσAB[uvHσ+uvL(uHσ+vHσ)],σ0,\|F(u)-F(v)\|_{H^{\sigma}}\lesssim_{A}B\left[\|u-v\|_{H^{\sigma}}+\|u-v\|_{L^{\infty}}(\|u\|_{H^{\sigma}}+\|v\|_{H^{\sigma}})\right],\qquad\sigma\geq 0,

as well as

(3.12) F(u)F(v)L2ABuvL2.\|F(u)-F(v)\|_{L^{2}}\lesssim_{A}B\|u-v\|_{L^{2}}.

The next to last bound shows in particular that FF is Lipschitz in HsH^{s} for s>d/2s>d/2. The simplification in the case σ=0\sigma=0 is also useful in order to bound differences of solutions in the L2L^{2} topology.

Proof.

i) We use the expression (3.9) for FF. The first term can be estimated using a version of the Coifman-Meyer estimates and Moser estimates by

Π(𝒜j(u),ju)Hσ𝒜j(u)HσjuBMOABuHσ.\|\Pi({\mathcal{A}}^{j}(u),\partial_{j}u)\|_{H^{\sigma}}\lesssim\|{\mathcal{A}}^{j}(u)\|_{H^{\sigma}}\|\partial_{j}u\|_{BMO}\lesssim_{A}B\|u\|_{H^{\sigma}}.

For the second term we use again paraproduct bounds and Moser estimates to get

Tju𝒜j(u)HσjuL𝒜j(u)HσAjuLuHσ.\|T_{\partial_{j}u}{\mathcal{A}}^{j}(u)\|_{H^{\sigma}}\lesssim\|\partial_{j}u\|_{L^{\infty}}\|{\mathcal{A}}^{j}(u)\|_{H^{\sigma}}\lesssim_{A}\|\partial_{j}u\|_{L^{\infty}}\|u\|_{H^{\sigma}}.

The third term is similar to the second.

ii) First, we note the representation

𝒜(u)𝒜(v)=:G(u,v)(uv),{\mathcal{A}}(u)-{\mathcal{A}}(v)=:G(u,v)(u-v),

which we use to separate uvu-v factors. Here G(u,v)G(u,v) is a smooth function of uu and vv. Then taking differences in the first term of FF, we need two estimates

Π(𝒜j(u),j(uv))Hσ𝒜j(u)LuvHσABuvHσ\|\Pi({\mathcal{A}}^{j}(u),\partial_{j}(u-v))\|_{H^{\sigma}}\lesssim\|\partial{\mathcal{A}}^{j}(u)\|_{L^{\infty}}\|u-v\|_{H^{\sigma}}\lesssim_{A}B\|u-v\|_{H^{\sigma}}

respectively

Π(G(u,v)(uv),jv)HσG(u,v)(uv)HσvLAB(uvHσ+uvL(uHσ+vHσ)),\|\Pi(G(u,v)(u-v),\partial_{j}v)\|_{H^{\sigma}}\lesssim\|G(u,v)(u-v)\|_{H^{\sigma}}\|\partial v\|_{L^{\infty}}\lesssim_{A}B(\|u-v\|_{H^{\sigma}}+\|u-v\|_{L^{\infty}}(\|u\|_{H^{\sigma}}+\|v\|_{H^{\sigma}})),

noting that for σ=0\sigma=0 the last term can be avoided.

Similarly we have two estimates corresponding to the second term in FF, namely

Tju𝒜j(u)Tjv𝒜j(v)Hσ\displaystyle\|T_{\partial_{j}u}{\mathcal{A}}^{j}(u)-T_{\partial_{j}v}{\mathcal{A}}^{j}(v)\|_{H^{\sigma}} =Tju[G(u,v)(uv)]T[jujv]𝒜j(v)Hσ\displaystyle=\|T_{\partial_{j}u}[G(u,v)(u-v)]-T_{\left[\partial_{j}u-\partial_{j}v\right]}{\mathcal{A}}^{j}(v)\|_{H^{\sigma}}
Tju[G(u,v)(uv)]Hσ+T[jujv]𝒜j(v)Hσ,\displaystyle\lesssim\|T_{\partial_{j}u}[G(u,v)(u-v)]\|_{H^{\sigma}}+\|T_{\left[\partial_{j}u-\partial_{j}v\right]}{\mathcal{A}}^{j}(v)\|_{H^{\sigma}},

where

T[jujv]𝒜j(v)HσuvLj𝒜j(v)HσABuvLvHσ,\|T_{\left[\partial_{j}u-\partial_{j}v\right]}{\mathcal{A}}^{j}(v)\|_{H^{\sigma}}\lesssim\|u-v\|_{L^{\infty}}\|\partial_{j}{\mathcal{A}}^{j}(v)\|_{H^{\sigma}}\lesssim_{A}B\|u-v\|_{L^{\infty}}\|v\|_{H^{\sigma}},

respectively

Tju[G(u,v)(uv)]HσAjuL(uvHσ+uvL(uHσ+vHσ)),\|T_{\partial_{j}u}[G(u,v)(u-v)]\|_{H^{\sigma}}\lesssim_{A}\|\partial_{j}u\|_{L^{\infty}}(\|u-v\|_{H^{\sigma}}+\|u-v\|_{L^{\infty}}(\|u\|_{H^{\sigma}}+\|v\|_{H^{\sigma}})),

both with obvious simplifications if σ=0\sigma=0. Finally, the bounds for the third term in FF are similar to the ones for the second.

Remark 3.6.

For this Proposition 3.5 one can further relax BB to a BMOBMO norm,

B=uBMO.B=\|\nabla u\|_{BMO}.

On the other hand we can also simplify the paradifferential equation (2.4) to a simpler version

wt=T𝒜j(u)jw,w_{t}=T_{{\mathcal{A}}^{j}(u)}\partial_{j}w,

but in this case we no longer can relax BB to a BMO norm.

Next we consider the paradifferential equation:

Proposition 3.7.

Assume that uLt,xu\in L_{t,x}^{\infty} and uLt1Lx\nabla u\in L^{1}_{t}L_{x}^{\infty} (i.e. BLt1B\in L^{1}_{t}). Then the paradifferential equation (2.4) is well-posed in all HσH^{\sigma} spaces, σ\sigma\in\mathbb{R}, and

(3.13) ddtwHσ2ABwHσ2.\frac{d}{dt}\|w\|_{H^{\sigma}}^{2}\lesssim_{A}B\|w\|_{H^{\sigma}}^{2}.
Proof.

We first consider the energy estimate, where we work with the corresponding inhomogeneous equation,

(3.14) tw=T𝒜j(u)jw+TD𝒜j(u)juw+f,w(0)=w0.\partial_{t}w=T_{{\mathcal{A}}^{j}(u)}\partial_{j}w+T_{D{\mathcal{A}}^{j}(u)\partial_{j}u}w+f\,,\qquad w(0)=w_{0}.

The L2L^{2} bound is easiest; we have

12ddtwL22=wT𝒜j(u)jw+wTD𝒜j(u)juw+wfdx.\frac{1}{2}\frac{d}{dt}\|w\|_{L^{2}}^{2}=\int w\cdot T_{{\mathcal{A}}^{j}(u)}\partial_{j}w+w\cdot T_{D{\mathcal{A}}^{j}(u)\partial_{j}u}w+w\cdot f\,dx.

In the second term we simply estimate the para-coefficient in LL^{\infty}. In the first term we commute and integrate by parts, to arrive at

12wTj𝒜j(u)w+w(T𝒜j(u)(T𝒜j(u)))jwdx,\frac{1}{2}\int-w\cdot T_{\partial_{j}{\mathcal{A}}^{j}(u)}w+w\cdot(T_{{\mathcal{A}}^{j}(u)}-(T_{{\mathcal{A}}^{j}(u)})^{*})\partial_{j}w\,dx,

where due to the symmetry of the matrices 𝒜j{\mathcal{A}}^{j} we have the bound

(3.15) (T𝒜j(u)(T𝒜j(u)))jwL2ABwL2,\|(T_{{\mathcal{A}}^{j}(u)}-(T_{{\mathcal{A}}^{j}(u)})^{*})\partial_{j}w\|_{L^{2}}\lesssim_{A}B\|w\|_{L^{2}},

which shows that the corresponding paraproduct operators are self-adjoint at leading order. Here we use the notation to denote the adjoint of an operator. Hence we obtain

|ddtwL22|ABwL22+wL2fL2,\left|\frac{d}{dt}\|w\|_{L^{2}}^{2}\right|\lesssim_{A}B\|w\|_{L^{2}}^{2}+\|w\|_{L^{2}}\|f\|_{L^{2}},

which further by Gronwall’s inequality yields

(3.16) wLt([0,T];Lx2)Ae0TB𝑑t(w(0)Lx2+fLt1Lx2).\|w\|_{L_{t}^{\infty}([0,T];L_{x}^{2})}\lesssim_{A}e^{\int_{0}^{T}B\,dt}(\|w(0)\|_{L_{x}^{2}}+\|f\|_{L_{t}^{1}L^{2}_{x}}).

This by itself does not prove well-posedness in L2L^{2}, it only proves uniqueness. However, a similar bound will hold for the backward adjoint system in the same spaces; this is because the adjoint system coincides with the direct system modulo L2L^{2} bounded terms. Together, these two pieces of information yield L2L^{2} well-posedness for the paradifferential system in L2L^{2}. This is a standard linear duality argument, where the solutions are constructed by a direct application of the Hahn-Banach Theorem. In a nutsell, one has the following equivalencies, see for instance [11]:

Energy estimates for the direct forward problemExistence for the adjoint backward problem\text{Energy estimates for the direct forward problem}\Longleftrightarrow\text{Existence for the adjoint backward problem}
Energy estimates for the adjoint backward problemExistence for the direct forward problem\text{Energy estimates for the adjoint backward problem}\Longleftrightarrow\text{Existence for the direct forward problem}

Exactly the same argument applies in HσH^{\sigma}, with the small change that now the the adjoint system should be considered in HσH^{-\sigma}. There the bound (3.15) is replaced by

(3.17) (DσT𝒜j(u)(T𝒜j(u))Dσ)jwL2𝒜(u)LwHσABwHσ.\|(\langle D\rangle^{\sigma}T_{{\mathcal{A}}^{j}(u)}-(T_{{\mathcal{A}}^{j}(u)})^{*}\langle D\rangle^{\sigma})\partial_{j}w\|_{L^{2}}\lesssim\|\nabla{\mathcal{A}}(u)\|_{L^{\infty}}\|w\|_{H^{\sigma}}\lesssim_{A}B\|w\|_{H^{\sigma}}.

Combining the last two propositions, Proposition 3.5 and Proposition 3.7, we obtain the HσH^{\sigma} bound in Theorem 3.

3.4. The linearized equation

Next, we turn our attention to the linearized equation, which we also write in a paradifferential form

(3.18) tv=T𝒜j(u)jv+TD𝒜j(u)juv+Flin(u)v,v(0)=v0,\partial_{t}v=T_{{\mathcal{A}}^{j}(u)}\partial_{j}v+T_{D{\mathcal{A}}^{j}(u)\partial_{j}u}v+F^{lin}(u)v,\qquad v(0)=v_{0},

where

Flin(u)v:=Π(𝒜j(u),jv)+Π(D𝒜j(u)ju,v)+Tjv𝒜j(u)+Tv(D𝒜j(u)ju):=FΠlin(u)v+FTlin(u)v.F^{lin}(u)v:=\Pi({\mathcal{A}}^{j}(u),\partial_{j}v)+\Pi(D{\mathcal{A}}^{j}(u)\partial_{j}u,v)+T_{\partial_{j}v}{\mathcal{A}}^{j}(u)+T_{v}(D{\mathcal{A}}^{j}(u)\partial_{j}u):=F^{lin}_{\Pi}(u)v+F^{lin}_{T}(u)v.

We note here that the equation (3.18) is not exactly a true paralinearization of the linearized equation, as FTlin(u)vF^{lin}_{T}(u)v does contain low-high interactions. This difference is observed in the estimates satisfied by the two terms.

On one hand, the term FΠlin(u)vF^{lin}_{\Pi}(u)v satisfies good bounds in all Sobolev spaces,

(3.19) FΠlin(u)vHσABvHσ,σ0,\|F^{lin}_{\Pi}(u)v\|_{H^{\sigma}}\lesssim_{A}B\|v\|_{H^{\sigma}},\qquad\sigma\geq 0,

so it can be seen as a true perturbative term. This a simple, Coifman-Meyer type estimate which is left for the reader.

On the other hand, assuming we know that uHsu\in H^{s}, the term FTlin(u)vF^{lin}_{T}(u)v can at best be estimated in Hs1H^{s-1}, and there of course we could not use the control norms, instead we would have to use the full HsH^{s} norm of uu. However, we can use the control norms for L2L^{2} bounds to directly estimate

(3.20) FTlin(u)vL2ABvL2.\|F^{lin}_{T}(u)v\|_{L^{2}}\lesssim_{A}B\|v\|_{L^{2}}.

Combining the last two estimates with Proposition 3.7 we perturbatively obtain

Proposition 3.8.

Assume that ALA\in L^{\infty} and that BL1B\in L^{1}. Then the linearized equation (2.2) is well-posed in L2L^{2}, with bounds

(3.21) ddtvL22ABvL22.\frac{d}{dt}\|v\|_{L^{2}}^{2}\lesssim_{A}B\|v\|_{L^{2}}^{2}.

We observe the obvious fact that one does not need paradifferential calculus in order to prove this proposition; a simple integration by parts suffices. However, it is instructive to dissect the terms in the equation and understand their respective roles. Also, it is interesting to observe that in appropriate settings, the linearized equation can be thought of as a perturbation of the associated paradifferential equation.

Remark 3.9.

Well-posedness and bounds for the linearized equation can be also obtained in all HσH^{\sigma} spaces for |σ|s1|\sigma|\leq s-1. However, this can no longer be done in terms of our control parameters; for instance if σ=s1\sigma=s-1 then we need to use the full HsH^{s} norm of the solutions. While interesting, this observation will not be needed for the rest of the paper.

3.5. Difference bounds and uniqueness

The easiest way to compare two solutions u1u_{1} and u2u_{2} for (1.1) is to subtract their respective equations, to obtain an equation for v=u1u2v=u_{1}-u_{2}. In the general case, using the form (2.5) of the equation, we obtain

vt=TDN(u1)v+TDN(u1)DN(u2)u2+F(u1)F(u2).v_{t}=T_{DN(u_{1})}v+T_{DN(u_{1})-DN(u_{2})}u_{2}+F(u_{1})-F(u_{2}).

Here we identify this equation as the paradifferential equation associated to u1u_{1}, but with two source terms, which we would like to interpret as perturbative in a low regularity Sobolev space, say L2L^{2}. That would yield a bound of the form

(3.22) v(t)L2eC(A)0tB(s)𝑑sv(0)L2,\|v(t)\|_{L^{2}}\lesssim e^{C(A)\int_{0}^{t}B(s)ds}\|v(0)\|_{L^{2}},

where A=A1+A2A=A_{1}+A_{2}, B=B1+B2B=B_{1}+B_{2}, with Ai=uiLA_{i}=\|u_{i}\|_{L^{\infty}}, and Bi=uiLB_{i}=\|\nabla u_{i}\|_{L^{\infty}}, for i=1,2¯i=\overline{1,2}.

Let us see how this works out in our model problem. We will show that

Proposition 3.10.

Let u1u_{1} and u2u_{2} be two Lipschitz solutions to (1.2) with associated control parameters A1,B1A_{1},B_{1} respectively A2,B2A_{2},B_{2}. Then their difference v=u1u2v=u_{1}-u_{2} satisfies the bound (3.22).

Proof.

We have already seen in Proposition 3.7 that the paradifferential evolution is well-posed in L2L^{2}, and in Proposition 3.5 that we have a good Lipschitz bound for FF. It remains to bound the remaining difference

TDN(u1)DN(u2)u2L2ABu1u2L2.\|T_{DN(u_{1})-DN(u_{2})}u_{2}\|_{L^{2}}\lesssim_{A}B\|u_{1}-u_{2}\|_{L^{2}}.

For this we write

TDN(u1)DN(u2)u2=\displaystyle T_{DN(u_{1})-DN(u_{2})}u_{2}= T𝒜j(u1)𝒜j(u2)ju2+TD𝒜j(u1)ju1D𝒜j(u2)ju2u2\displaystyle\ T_{{\mathcal{A}}^{j}(u_{1})-{\mathcal{A}}^{j}(u_{2})}\partial_{j}u_{2}+T_{D{\mathcal{A}}^{j}(u_{1})\partial_{j}u_{1}-D{\mathcal{A}}^{j}(u_{2})\partial_{j}u_{2}}u_{2}
=\displaystyle= T𝒜j(u1)𝒜j(u2)ju2+T(D𝒜j(u1)D𝒜j(u2))ju1u2TjD𝒜j(u2)(u1u2)u2\displaystyle\ T_{{\mathcal{A}}^{j}(u_{1})-{\mathcal{A}}^{j}(u_{2})}\partial_{j}u_{2}+T_{(D{\mathcal{A}}^{j}(u_{1})-D{\mathcal{A}}^{j}(u_{2}))\partial_{j}u_{1}}u_{2}-T_{\partial_{j}D{\mathcal{A}}^{j}(u_{2})(u_{1}-u_{2})}u_{2}
+Tj(D𝒜j(u2)(u1u2))u2.\displaystyle\ +T_{\partial_{j}(D{\mathcal{A}}^{j}(u_{2})(u_{1}-u_{2}))}u_{2}.

For the first term we have a Coifman-Meyer type bound

T𝒜(u1)𝒜(u2)u2L2u1u2L2u2BMOBu1u2L2.\|T_{{\mathcal{A}}(u_{1})-{\mathcal{A}}(u_{2})}\nabla u_{2}\|_{L^{2}}\lesssim\|u_{1}-u_{2}\|_{L^{2}}\|\nabla u_{2}\|_{BMO}\lesssim B\|u_{1}-u_{2}\|_{L^{2}}.

The second term is even easier,

T(D𝒜j(u1)D𝒜j(u2))ju1u2L2(D𝒜j(u1)D𝒜j(u2))ju1L2u2LABu1u2L2,\|T_{(D{\mathcal{A}}^{j}(u_{1})-D{\mathcal{A}}^{j}(u_{2}))\partial_{j}u_{1}}u_{2}\|_{L^{2}}\lesssim\|(D{\mathcal{A}}^{j}(u_{1})-D{\mathcal{A}}^{j}(u_{2}))\partial_{j}u_{1}\|_{L^{2}}\|u_{2}\|_{L^{\infty}}\lesssim_{A}B\|u_{1}-u_{2}\|_{L^{2}},

and the third term is similar. Finally, in the fourth term we can use Coifman-Meyer to rebalance again the derivatives and obtain

Tj(D𝒜j(u2)(u1u2)u2L2D𝒜j(u2)(u1u2)L2u2BMO,\|T_{\partial_{j}(D{\mathcal{A}}^{j}(u_{2})(u_{1}-u_{2})}u_{2}\|_{L^{2}}\lesssim\|D{\mathcal{A}}^{j}(u_{2})(u_{1}-u_{2})\|_{L^{2}}\|\nabla u_{2}\|_{BMO},

concluding as before. ∎

Remark 3.11.

The observant reader may have noticed that for our model problem the difference bound can be directly proved using a simple integration by parts, without any need for paradifferential calculus, and may wonder why we are doing it this way. There are three reasons for this: (i) to show that it works, (ii) to show how both the bound for the full equation and the bound for the difference equation can be seen as two sides of the same coin, and (iii) to provide a guide for the reader for situations where a simpler approach does not work.

Remark 3.12.

In the same vein as in Remark 3.9, bounds for the difference equation can be also obtained in all HσH^{\sigma} spaces for |σ|s1|\sigma|\leq s-1.

Remark 3.13.

In our particular example it was easy to cast the difference equation in a form which is very much like the linearized equation. However, this is not always the case. For this reason, we point out that there is another way one can think of difference bounds, namely by viewing the two initial data u01u_{01} and u02u_{02} as being connected via a one parameter family of data u0hu_{0h} where h[1,2]h\in[1,2]. Then we can interpret the difference u2u1u_{2}-u_{1} as

u2u1=12ddhuh𝑑h,u_{2}-u_{1}=\int_{1}^{2}\frac{d}{dh}u_{h}\,dh,

where uhu_{h} are the solutions with data u0hu_{0h}. Here the integrand represents a solution to the linearized equation around uhu_{h}. Hence difference bounds for u2u1u_{2}-u_{1} can be obtained by integrating bounds for the linearized equation. The only downside to such an argument is that such bounds will require the control parameters for the entire family of solutions, rather than just the end-points.

4. Existence of solutions

Here we consider the question of existence of solutions for the evolution (1.1) with initial data in HsH^{s}, where ss will be taken sufficiently large. The idea here is to construct a good sequence of approximate solutions unu^{n}, which will eventually be shown to converge in a weaker topology. The tricky bit is to choose the correct iteration scheme.

Naively, one might think of trying to base such a scheme on the linearized flow, setting

t(un+1un)DN(un)(un+1un)=(tunN(un)),(un+1un)(0)=0,\partial_{t}(u^{n+1}-u^{n})-DN(u^{n})(u^{n+1}-u^{n})=-(\partial_{t}u^{n}-N(u^{n})),\qquad(u^{n+1}-u^{n})(0)=0,

where the expression on the right represents the error at step nn. Here one can eliminate the time derivative of unu^{n} and rewrite this as

tun+1DN(un)un+1=N(un)DN(un)un,un+1(0)=u0.\partial_{t}u^{n+1}-DN(u^{n})u^{n+1}=N(u^{n})-DN(u^{n})u^{n},\qquad u^{n+1}(0)=u_{0}.

This would be akin to a Nash-Moser scheme, which, even when it works, loses derivatives. That may be reasonable in a small divisor situation, but not so much if our goal is to obtain a Hadamard style well-posedness result. Nevertheless, Nash-Moser schemes have been used on occasion to produce solutions for quasilinear evolutions, though often they prove to be unnecessary.

Remark 4.1.

We observe that for the existence of solutions one does not need to work from the start at low regularity. As we will see, rough solutions can be constructed later on as limits of smooth solutions. This is strictly speaking not necessary in our model problem, but for more nonlinear, geometric problems it does seem to make a difference. This is because in such situations it is often easier to compare exact solutions via the linearized equation which is a geometric object, instead of working with approximate solutions where the geometric character might be lost.

We will present two strategies to prove existence, and at the end we point out several other methods which have been successfully used in existence proofs.

4.1. Take 1: an iterative/fixed point construction

In order not to lose derivatives in the approximation scheme, the idea here is to carefully choose how to distribute un+1u^{n+1} and unu^{n} in the iteration. A key observation is that, whereas solving the linearized equation would cause a loss of derivatives, solving the paradifferential equation does not in general. Then, a good starting point would be the formulation (2.3) of the equations, which would suggest the following iteration scheme:

(4.1) tun+1TDN(un)un+1=F(un),un+1(0)=u0.\partial_{t}u^{n+1}-T_{DN(u^{n})}u^{n+1}=F(u^{n}),\qquad u^{n+1}(0)=u_{0}.

We will apply this scheme on a time interval [0,T][0,T], with T=T(M)T=T(M) sufficiently small depending on the initial data size

M:=u0Hs.M:=\|u_{0}\|_{H^{s}}.

For the above sequence unu^{n} the aim would be to inductively prove two uniform bounds in [0,T][0,T]:

(4.2) unLtHxsCM,\|u^{n}\|_{L_{t}^{\infty}H_{x}^{s}}\leq CM,

and

(4.3) un+1unLtLx2C(M)Tunun1LtLx2,\|u^{n+1}-u^{n}\|_{L_{t}^{\infty}L_{x}^{2}}\leq C(M)T\|u^{n}-u^{n-1}\|_{L_{t}^{\infty}L_{x}^{2}},

where CC is a fixed large constant. In the last bound, the time interval size TT is used in order to gain smallness for the constant, which is needed in order to obtain convergence. Together, these two bounds imply convergence in LtLx2L_{t}^{\infty}L_{x}^{2} to some function uu, as well as LtHxsL_{t}^{\infty}H_{x}^{s} regularity for the limit. This in general suffices in order to show that the limit solves the equation.

To obtain uniform bounds for this evolution one would need two pieces of information:

  1. (1)

    Well-posedness of the paradifferential equation (2.3) in L2L^{2} and more generally in all HsH^{s} spaces. Heuristically, the two should be equivalent, as the operator TDN(un)T_{DN(u^{n})} does not change the dyadic frequency localization. In practice though it might not be as easy, as leakage to other frequencies may occur, and in particular even the associated Hamilton flow might not preserve the dyadic localization on a unit time scale.

  2. (2)

    Lipschitz property of FF in Sobolev spaces. More generally, a bound of the form

    (4.4) F(u)F(v)HσC(uHs,vHs)uvHσ,σ0,\|F(u)-F(v)\|_{H^{\sigma}}\leq C(\|u\|_{H^{s}},\|v\|_{H^{s}})\|u-v\|_{H^{\sigma}},\qquad\sigma\geq 0,

    which should be thought of as a Moser type inequality.

In addition to uniform bounds in a strong norm HsH^{s}, one would also like to have convergence in a weaker topology, say L2L^{2} for the purpose of this presentation. The difference equation reads

(4.5) (tTDN(un))(un+1un)=F(un)F(un1)+(TDN(un1)TDN(un))un.(\partial_{t}-T_{DN(u^{n})})(u^{n+1}-u^{n})=F(u^{n})-F(u^{n-1})+(T_{DN(u^{n-1})}-T_{DN(u^{n})})u^{n}.

Here energy estimates in L2L^{2} would follow from (1) and (2) above, provided that the last difference has a good bound

(TDN(un1)TDN(un))unL2C(un1Hs,unHs)unun1L2.\|(T_{DN(u^{n-1})}-T_{DN(u^{n})})u^{n}\|_{L^{2}}\lesssim C(\|u^{n-1}\|_{H^{s}},\|u^{n}\|_{H^{s}})\|u_{n}-u_{n-1}\|_{L^{2}}.

This is in general relatively straightforward if ss is large enough.

Remark 4.2.

The argument above yields solutions which are apriori only in LtHxsL_{t}^{\infty}H_{x}^{s} as opposed to C(Hs)C(H^{s}), as desired. Getting continuity in HσH^{\sigma} for σ<s\sigma<s is relatively straightforward by interpolation, but proving continuity in HsH^{s} requires considerable extra work111e.g. by showing continuity in time of solutions to the linear paradifferential equation. if one wants a direct argument. The easy way out is to rely on the arguments in the next section, where we show that all HsH^{s} solutions can be seen as uniform limits of smooth solutions.

Remark 4.3.

The above iterative argument can be rephrased as a fixed point argument as follows. For uC[0,T;Hs]u\in C[0,T;H^{s}] we define Lu(t):=vLu(t):=v as the solution to

tvTDN(u)v=F(u),v(0)=u0\partial_{t}v-T_{DN(u)}v=F(u),\qquad v(0)=u_{0}

Then the desired solution uu has to be a fixed point for LL. Solutions to this fixed point problem may often be obtained using the contraction principle in the right topology. Precisely, the strategy is to choose the domain of LL to be the ball B(0,CM)B(0,CM) in L[0,T;Hs]L^{\infty}[0,T;H^{s}], but endow this ball with a weaker topology, e.g. C[0,T;L2]C[0,T;L^{2}]. Then both the mapping properties of LL and the small Lipschitz constant can be achieved by choosing the time TT sufficiently small. Here for the domain we have to choose LL^{\infty} rather than continuity in order to guarantee completeness.


We now implement this scheme for our model problem. Denoting M=u0HsM=\|u_{0}\|_{H^{s}}, we will prove inductively that for fixed large enough TT and small enough TT, we have the bound

unC(0,T;Hs)CM.\|u^{n}\|_{C(0,T;H^{s})}\leq CM.

Taking this as induction hypothesis we have the following bounds for the control parameters AnA^{n} and BnB^{n} associated to unu^{n}:

An,BnCM.A^{n},B^{n}\lesssim CM.

Then we can estimate un+1u^{n+1} in HsH^{s} by combining Proposition 3.7 and Proposition 3.5 to obtain

ddtun+1Hs2C(M)(1+un+1Hs2),\frac{d}{dt}\|u^{n+1}\|_{H^{s}}^{2}\lesssim C(M)(1+\|u^{n+1}\|_{H^{s}}^{2}),

and by Gronwall’s inequality we arrive at

unC(0,T;Hs)MeC(M)T,\|u^{n}\|_{C(0,T;H^{s})}\lesssim Me^{C(M)T},

with a universal implicit constant. This completes the induction if we first choose CC large enough (to dominate the implicit constant), and then TT small enough (depending on CC and MM).

On the other hand, in order to prove the convergence in L2L^{2} we use the equation (4.5) for the difference un+1unu^{n+1}-u^{n}, and claim that the following L2L^{2} estimate holds:

(4.6) ddtun+1unL22C(M)un+1unL22+C(M)unun1L22.\frac{d}{dt}\|u^{n+1}-u^{n}\|_{L^{2}}^{2}\lesssim C(M)\|u^{n+1}-u^{n}\|_{L^{2}}^{2}+C(M)\|u^{n}-u^{n-1}\|_{L^{2}}^{2}.

Assuming this is true, by Gronwall’s inequality we obtain

un+1unC(0,T;L2)C(M)TeC(M)Tunun1C(0,T;L2),\|u^{n+1}-u^{n}\|_{C(0,T;L^{2})}\lesssim C(M)Te^{C(M)T}\|u^{n}-u^{n-1}\|_{C(0,T;L^{2})},

which gives us the small Lipschitz constant if TT is sufficiently small, depending only on MM.

It remains to prove (4.6). For the paradifferential equation we can use Proposition 3.7 and for the FF difference we can use Proposition 3.5, so it remains to examine the last term in (4.5), and show that

(TDN(un1)TDN(un))unL2C(M)un1unL2.\|(T_{DN(u^{n-1})}-T_{DN(u^{n})})u^{n}\|_{L^{2}}\lesssim C(M)\|u^{n-1}-u^{n}\|_{L^{2}}.

In the case of the model problem the difference on the left reads

T𝒜j(un1)𝒜j(un)jun+TD𝒜j(un1)jun1D𝒜j(un)junun.T_{{\mathcal{A}}^{j}(u^{n-1})-{\mathcal{A}}^{j}(u^{n})}\partial_{j}u^{n}+T_{D{\mathcal{A}}^{j}(u^{n-1})\partial_{j}u^{n-1}-D{\mathcal{A}}^{j}(u^{n})\partial_{j}u^{n}}u^{n}.

For the first term we have the obvious bound

T𝒜j(un1)𝒜j(un)junL2𝒜j(un1)𝒜j(un)L2junLC(M)un1unL2.\|T_{{{\mathcal{A}}}^{j}(u^{n-1})-{\mathcal{A}}^{j}(u^{n})}\partial_{j}u^{n}\|_{L^{2}}\lesssim\|{\mathcal{A}}^{j}(u^{n-1})-{\mathcal{A}}^{j}(u^{n})\|_{L^{2}}\|\partial_{j}u^{n}\|_{L^{\infty}}\lesssim C(M)\|u^{n-1}-u^{n}\|_{L^{2}}.

The second term is split into three parts,

T(D𝒜j(un1)D𝒜j(un))jununTjD𝒜j(un1)(un1un)un+Tj[D𝒜j(un1)(un1un)]un,T_{(D{\mathcal{A}}^{j}(u^{n-1})-D{\mathcal{A}}^{j}(u^{n}))\partial_{j}u^{n}}u^{n}-T_{\partial_{j}D{\mathcal{A}}^{j}(u^{n-1})(u^{n-1}-u^{n})}u^{n}+T_{\partial_{j}[D{\mathcal{A}}^{j}(u^{n-1})(u^{n-1}-u^{n})]}u^{n},

where the first two parts are easy to estimate. A similar bound follows for the third term after we move the derivative onto the high frequency factor, using an estimate of the form

TfgL2fL2gBMO,\|T_{\partial f}g\|_{L^{2}}\lesssim\|f\|_{L^{2}}\|\partial g\|_{BMO},

which is a corollary of the second bound in (3.6).

4.2. Take 2: a time discretization method

Here the idea is to discretize time at a small scale ϵ\epsilon, and to construct approximate discrete solutions uϵ(jϵ)u^{\epsilon}(j\epsilon) with the following properties:

  1. i)

    Uniform bounds

    (4.7) uϵ(jϵ)HsCM,jMϵ1;\|u^{\epsilon}(j\epsilon)\|_{H^{s}}\leq CM,\qquad j\ll_{M}\epsilon^{-1};
  2. ii)

    Approximate solution

    (4.8) uϵ((j+1)ϵ)uϵ(jϵ)ϵN(uϵ(jϵ))L2ϵ2.\|u^{\epsilon}((j+1)\epsilon)-u^{\epsilon}(j\epsilon)-\epsilon N(u^{\epsilon}(j\epsilon))\|_{L^{2}}\lesssim\epsilon^{2}.

Once this is done, if ss is large enough222For instance in our model case case s>n/2+1s>n/2+1 suffices. then it is a relatively straightforward matter to show that a uniform limit uu exists333Here one may extend uϵu^{\epsilon} to all times by linear interpolation. on a subsequence as ϵ0\epsilon\to 0, by applying the Arzela-Ascoli theorem. This works in a time interval [0,T][0,T] with TM1T\ll_{M}1. By passing to the limit in the above bounds in a weak topology, it follows the limit uu solves the equation and has regularity

uL(0,T;Hs)Lip(0,T;L2).u\in L^{\infty}(0,T;H^{s})\cap\text{Lip}(0,T;L^{2}).

The nice feature of this method is that one really only needs to carry out one single step. Precisely, given u0Hsu_{0}\in H^{s} with size MM, and 0<ϵ10<\epsilon\ll 1, one needs to find u1u_{1} (which corresponds to uϵ(ϵ)u^{\epsilon}(\epsilon) above) with the following properties:

  1. i)’

    Uniform bounds

    (4.9) u1Hs(1+C(M)ϵ)u0Hs;\|u_{1}\|_{H^{s}}\leq(1+C(M)\epsilon)\|u_{0}\|_{H^{s}};
  2. ii)’

    Approximate solution

    (4.10) u1u0ϵN(u0)L2ϵ2.\|u_{1}-u_{0}-\epsilon N(u_{0})\|_{L^{2}}\lesssim\epsilon^{2}.

Reiterating this, the bound (4.7) follows by applying a discrete form of Gronwall’s inequality.

Remark 4.4.

The ϵ2\epsilon^{2} bound in ii)’ can be harmlessly replaced by ϵ1+δ\epsilon^{1+\delta} with a small constant δ>0\delta>0.

Remark 4.5.

Sometimes the square HsH^{s} norm of uu is not the correct quantity to propagate in time, and one needs to replace it with appropriate equivalent energies EsE^{s} in property (ii)’.

Remark 4.6.

The choice of the L2L^{2} in (ii)’ above was in order to keep the exposition simple. However, sometimes a different topology may be required by the problem, see e.g. [29], [1].

The remaining question is how to construct the single iterate satisfying properties (i)’,(ii)’ above. The obvious choice would be Euler’s method, which is to set

u1=u0+ϵN(u0),u_{1}=u_{0}+\epsilon N(u_{0}),

but this does not work because it loses derivatives.

Inspired by the nonlinear semigroup theory [4], one may choose instead to solve

u1ϵN(u1)=u0.u_{1}-\epsilon N(u_{1})=u_{0}.

This idea has potential at least when this is an elliptic equation. Alternatively one may opt for a paradifferential version

u1ϵTDN(u0)u1=u0+ϵF(u0),u_{1}-\epsilon T_{DN(u_{0})}u_{1}=u_{0}+\epsilon F(u_{0}),

which has the advantage that one only needs to solve a linear elliptic equation. However, ellipticity is not guaranteed.

Instead, here we will adopt a two steps approach, which has the advantage that no partial differential equation needs to be solved. Precisely, our steps are as follows:


STEP 1: Regularization. Here we take the initial data u0u_{0} and we regularize it on an ϵ\epsilon dependent scale. Precisely, if kk is the order of the nonlinearity NN, then it is natural to choose the spatial truncation frequency scale to be ϵ12k\epsilon^{-\frac{1}{2k}}, which corresponds to an order 2k2k parabolic regularization; this regularization scale is needed in order to be able to bound the error in the Euler step. Then our regularization u~\tilde{u} would have the following properties:

  1. (a)

    Regularization

    (4.11) u~Hs+kϵ12u0Hs.\|\tilde{u}\|_{H^{s+k}}\lesssim\epsilon^{-\frac{1}{2}}\|u_{0}\|_{H^{s}}.
  2. (b)

    Energy bound

    (4.12) Es(u~)(1+C(M)ϵ)Es(u0).E^{s}(\tilde{u})\leq(1+C(M)\epsilon)E^{s}(u_{0}).
  3. (c)

    Approximate solution

    (4.13) u~u0L2ϵ2.\|\tilde{u}-u_{0}\|_{L^{2}}\lesssim\epsilon^{2}.

STEP 2: Euler iteration. Here we simply set

(4.14) u1=u~+ϵN(u~),u_{1}=\tilde{u}+\epsilon N(\tilde{u}),

so that the approximate solution bound (4.10) becomes relatively straightforward, and the energy bound (4.9) becomes akin to proving the energy estimate; see the example below.


We now implement the above strategy on our chosen model problem. Here our chosen energy is simply the Sobolev norm,

EN(u)=uHN2.E^{N}(u)=\|u\|_{H^{N}}^{2}.

Our equation has order k=1k=1, so the proper regularization scale is δx=ϵ12\delta x=\epsilon^{\frac{1}{2}}. Hence, we use a Littlewood-Paley projector to simply define

u~=P<ϵ12u,\tilde{u}=P_{<\epsilon^{-\frac{1}{2}}}u,

and the three properties (a), (b) and (c) above are trivially satisfied.

Next we turn our attention to the Euler iteration (4.14) for which we need to establish the properties (i)’ and (ii)’. We begin with (i)’, where it suffices to compare the energies of u1u_{1} and u~\tilde{u}. For |α|N|\alpha|\leq N we have

αu1=αu~+ϵα(𝒜j(u~)ju~).\partial^{\alpha}u_{1}=\partial^{\alpha}\tilde{u}+\epsilon\partial^{\alpha}({\mathcal{A}}^{j}(\tilde{u})\partial_{j}\tilde{u}).

If |α|<N|\alpha|<N, then in the second term on the right we have at most NN derivatives, so this term has size O(ϵ)O(\epsilon) in the L2L^{2} norm

α(𝒜j(u~)ju~)L2Au~HN,\|\partial^{\alpha}({\mathcal{A}}^{j}(\tilde{u})\partial_{j}\tilde{u})\|_{L^{2}}\lesssim_{A}\|\tilde{u}\|_{H^{N}},

and we can neglect it.

It remains to consider |α|=N|\alpha|=N. Then we can separate the terms with no more than NN derivatives and estimate them as above, using appropriate interpolation inequalities,

α(𝒜j(u~)ju~)=𝒜j(u~)αju~+OL2(Bu~HN).\partial^{\alpha}({\mathcal{A}}^{j}(\tilde{u})\partial_{j}\tilde{u})={\mathcal{A}}^{j}(\tilde{u})\partial^{\alpha}\partial_{j}\tilde{u}+O_{L^{2}}(B\|\tilde{u}\|_{H^{N}}).

Hence we have

αu1=αu~+ϵ𝒜j(u~)αju~+OL2(ϵ),\partial^{\alpha}u_{1}=\partial^{\alpha}\tilde{u}+\epsilon{\mathcal{A}}^{j}(\tilde{u})\partial^{\alpha}\partial_{j}\tilde{u}+O_{L^{2}}(\epsilon),

and, neglecting O(ϵ)O(\epsilon) terms, we compute L2L^{2} norms,

αu1L22=αu~L22+2ϵαu~𝒜j(u~)αju~dx+ϵ2Aj(u~)αju~L22.\|\partial^{\alpha}u_{1}\|_{L^{2}}^{2}=\|\partial^{\alpha}\tilde{u}\|_{L^{2}}^{2}+2\epsilon\int\partial^{\alpha}\tilde{u}\cdot{\mathcal{A}}^{j}(\tilde{u})\partial^{\alpha}\partial_{j}\tilde{u}\,dx+\epsilon^{2}\|A^{j}(\tilde{u})\partial^{\alpha}\partial_{j}\tilde{u}\|^{2}_{L^{2}}.

The last L2L^{2} norm has size O(ϵ)O(\epsilon) in view of property (a) above. In the integral, on the other hand, we use the symmetry of 𝒜{\mathcal{A}} to integrate by parts,

2αu~𝒜j(u~)αju~dx=αu~j𝒜j(u~)αu~dx,2\int\partial^{\alpha}\tilde{u}\cdot{\mathcal{A}}^{j}(\tilde{u})\partial^{\alpha}\partial_{j}\tilde{u}\,dx=-\int\partial^{\alpha}\tilde{u}\cdot\partial_{j}{\mathcal{A}}^{j}(\tilde{u})\partial^{\alpha}\tilde{u}\,dx,

which can again be estimated by ABu~HN2\lesssim_{A}B\|\tilde{u}\|_{H^{N}}^{2}. Thus we obtain

u1HN2A(1+ϵB)u~HN2,\|u_{1}\|_{H^{N}}^{2}\lesssim_{A}(1+\epsilon B)\|\tilde{u}\|_{H^{N}}^{2},

as desired, as BB can be estimated by the Sobolev norm of u0u_{0} by Sobolev embeddings.

It remains to consider (ii)’, where, by (c) above, it suffices to show that

𝒜j(u)ju𝒜j(u~)ju~L2Mϵ.\|{\mathcal{A}}^{j}(u)\partial_{j}u-{\mathcal{A}}^{j}(\tilde{u})\partial_{j}\tilde{u}\|_{L^{2}}\lesssim_{M}\epsilon.

This is a soft argument, where we simply write

𝒜j(u)ju𝒜j(u~)ju~L2M𝒜(u)𝒜(u~)L2+juju~L2Muu~H1,\|{\mathcal{A}}^{j}(u)\partial_{j}u-{\mathcal{A}}^{j}(\tilde{u})\partial_{j}\tilde{u}\|_{L^{2}}\lesssim_{M}\|{\mathcal{A}}(u)-{\mathcal{A}}(\tilde{u})\|_{L^{2}}+\|\partial_{j}u-\partial_{j}\tilde{u}\|_{L^{2}}\lesssim_{M}\|u-\tilde{u}\|_{H^{1}},

where the H1H^{1} norm on the right is bounded by interpolating (c) above with the uniform HNH^{N} bound provided by (b). This requires N2N\geq 2.

4.3. Other strategies

Most of the other strategies to prove existence of solutions are based on constructing approximate flows, and solutions are obtained as limits of solutions to the approximate flows. There are two such methods which are more widely used.


a) Parabolic regularization. Here one uses a parabolic regularization of the original flow (1.1), defining the approximate solutions uϵu^{\epsilon} by

utϵ=N(uϵ)ϵ(Δ)kuϵ,u(0)=u0,u^{\epsilon}_{t}=N(u^{\epsilon})-\epsilon(-\Delta)^{k}u^{\epsilon},\qquad u(0)=u_{0},

where the correct choice for the parabolic term seems to be to double the order of the original equation. These problems can often be solved for a short, ϵ\epsilon dependent time, as semilinear problems, with a direct, fixed point argument. However, in doing this, the main challenge is to prove uniform in ϵ\epsilon bounds for these approximate flows. This sometimes requires more careful choices of the regularization term, to make it fit better with the geometry of the problem.


b) Galerkin approximation. Here the idea is to work with a low frequency projector in the equation, e.g. of the type

ut=P<hN(P<hu)u_{t}=P_{<h}N(P_{<h}u)

with hh\to\infty, see e.g. the example in [30]. The local solvability for this evolution becomes trivial as this evolution is an ordinary differential equation in a Hilbert space, but the challenge is again to prove uniform in ϵ\epsilon bounds for these approximate flows. The double use of the projector above is a choice that usually facilitates achieving this objective. Depending on the problem, this may require careful choices for the frequency projectors, adapted to the problem.

5. Rough solutions as limits of smooth solutions

Here we explore the idea of constructing rough solutions as limits of smooth solutions. There are at least two good reasons to do this, which we discuss in order:

  1. (1)

    In quasilinear problems one does not expect any sort of uniformly continuous dependence of solutions on the initial data, so the continuity of the flow map becomes a purely qualitative assertion. However, one can still ask for a quantitative way of comparing solutions, and such a quantitative venue is found by using the regular approximations as a convenient proxy. This is discussed in the last section.

  2. (2)

    It is also often the case that more regular solutions are sometimes easier to produce, and in such situations, obtaining the rough solutions as limits of smooth solutions might be the only option. This is particularly the case in problems where the state space is not a linear space, such as Schrödinger maps [20], Yang-Mills or other problems with a nontrivial gauge structure. See also [15] for an implementation of this idea in a free boundary problem. This is because in such problems it is always easier to obtain estimates for the linearized equations, or at least to compare exact solutions, rather than to cook up a constructive scheme which is consistent with the geometry.

To make this analysis quantitative, it is very useful to track the flow of energy between different frequencies. Whereas energy cascades (energy migration to higher frequencies) have long been associated with blow-up phenomena, well-posedness should correspond to a lack thereof. To quantify this, we will use Tao’s notion of frequency envelopes.

5.1. Frequency envelopes

Frequency envelopes, introduced by Tao (see for example [27]), are a very useful device in order to track the evolution of the energy of solutions between dyadic energy shells. As there is always nearby leakage between the dyadic shells in nonlinear flows, on needs to do this in a more stable way, rather than look directly at the exact amount of energy in every shell.

This is realized via the following definition:

Definition 5.1.

We say that {ck}k02\{c_{k}\}_{k\geq 0}\in\ell^{2} is a frequency envelope for a function uu in HsH^{s} if we have the following two properties:

a) Energy bound:

(5.1) PkuHsck,\|P_{k}u\|_{H^{s}}\leq c_{k},

b) Slowly varying

(5.2) ckcj2δ|jk|,j,k.\frac{c_{k}}{c_{j}}\lesssim 2^{\delta|j-k|},\quad j,k\in\mathbb{N}.

Here PkP_{k} represent the standard Littlewood-Paley projectors, and δ\delta is a positive constant, which is taken small enough in order to account for energy leakage between nearby frequencies.

One can also try to limit from above the size of a frequency envelope, for instance by requiring that

uHs2ck2.\|u\|_{H^{s}}^{2}\approx\sum c_{k}^{2}.

We call such envelopes sharp. Such frequency envelopes always exist, for instance one can take

ck=supj2δ|jk|cj.c_{k}=\sup_{j}2^{-\delta|j-k|}c_{j}.

For a better understanding see Figure 1 below, where the actual dyadic norms, indicated by red bullets on a logarithmic scale, are lifted (based on the above formula) to a slowly varying frequency envelope, indicated by the green circles.

12345678910hhlogch\log c_{h}\bullet\bullet\bullet\bullet\bullet\bullet\ = dyadic norms uh\|u_{h}\| = sharp frequency envelope chc_{h}
Figure 1. Construction of sharp frequency envelopes.

We will use frequency envelopes in order to track the evolution of energy in time as follows: we start with a sharp frequency envelope for the initial data, and then seek to show that we can propagate this frequency envelope to the solutions to our quasilinear flow, at least for a short time.

Remark 5.2.

One alternative here is to unbalance the choice of δ\delta in (5.2), asking for a small δ\delta if k<jk<j, but replacing δ\delta with a large constant for k>jk>j. This heuristically corresponds to a better control of leakage to higher frequencies, and it is useful in order to deal with higher regularity properties also within the frequency envelope set-up.

5.2. Regularized data

Consider an initial data u0Hsu_{0}\in H^{s} with size MM, and let {ck}k0\left\{c_{k}\right\}_{k\geq 0} be a sharp frequency envelope for u0u_{0} in HsH^{s}. For u0u_{0} we consider a family of regularizations u0hH:=s=0Hsu_{0}^{h}\in H^{\infty}:=\cap_{s=0}^{\infty}H^{s} at frequencies 2h\lesssim 2^{h} where hh is a dyadic frequency parameter. This parameter can be taken either discrete or continuous, depending on whether we have access to difference bounds or only to the linearized equation. Suppose we work with differences. Then the family u0hu^{h}_{0} can be taken to have similar properties to Littlewood-Paley truncations:

  1. i)

    Uniform bounds:

    (5.3) Pku0hHsck.\|P_{k}u^{h}_{0}\|_{H^{s}}\lesssim c_{k}.
  2. ii)

    High frequency bounds:

    (5.4) u0hHs+j2jhch,j>0.\|u^{h}_{0}\|_{H^{s+j}}\lesssim 2^{jh}c_{h},\qquad j>0.
  3. iii)

    Difference bounds:

    (5.5) u0h+1u0hL22shch.\|u^{h+1}_{0}-u^{h}_{0}\|_{L^{2}}\lesssim 2^{-sh}c_{h}.
  4. iv)

    Limit as hh\to\infty:

    (5.6) u0=limhu0h in Hs.u_{0}=\lim_{h\to\infty}u_{0}^{h}\qquad\text{ in }H^{s}.

Correspondingly, we obtain a family of smooth solutions uhu^{h}.

Here in the simplest setting where the phase space is linear one may simply choose u0h=P<hu0u^{h}_{0}=P_{<h}u_{0}, which would have all the above properties. However, in geometric settings where the phase space is nonlinear, a more complex regularization method may be needed, for instance using a corresponding geometric heat flow, see [28] or a variable scale regularization as in [15].

5.3. Uniform bounds

Corresponding to the above family of regularized data, we obtain a family of smooth solutions uhu^{h}. For this we can use the energy estimates as in Theorem 3 to propagate Sobolev regularity for solutions as well as difference bounds as in Proposition 3.10. This yields a time interval [0,T][0,T] where all these solutions exist, and whose size TT depends only on M=u0HsM=\|u_{0}\|_{H^{s}}, where we have the following properties:

  1. i)

    High frequency bounds:

    (5.7) uhC(0,T;Hs+j)2jhch,j>0.\|u^{h}\|_{C(0,T;H^{s+j})}\lesssim 2^{jh}c_{h},\qquad j>0.
  2. ii)

    Difference bounds:

    (5.8) uh+1uhC(0,T;L2)2shch.\|u^{h+1}-u^{h}\|_{C(0,T;L^{2})}\lesssim 2^{-sh}c_{h}.

From (5.7) one may obtain a similar bound for the difference uh+1uhu^{h+1}-u^{h}. Interpolating this with (5.8), we also have

(5.9) uh+1uhC(0,T;Hm)2(sm)hch,m0.\|u^{h+1}-u^{h}\|_{C(0,T;H^{m})}\lesssim 2^{-(s-m)h}c_{h},\qquad m\geq 0.

One may use these bounds to establish uniform frequency envelope bounds for uhu^{h},

(5.10) PkuhC(0,T;Hs)ck2N(kh)+,\|P_{k}u^{h}\|_{C(0,T;H^{s})}\lesssim c_{k}2^{-N(k-h)_{+}},

on the same time interval which depends only on the initial data HsH^{s} size. This is a direct consequence of (5.7) for khk\geq h, while if k<hk<h we can use the telescopic expansion

uh=uk+l=kh1(ul+1ul),u^{h}=u^{k}+\sum_{l=k}^{h-1}\left(u^{l+1}-u^{l}\right),

and use (5.7) for the first term and (5.8) for the differences.

5.4. The limiting solution

Consider now the convergence of uhu^{h} as hh\to\infty. From the difference bounds (5.8) we obtain convergence in L2L^{2} to a limit uC(0,T;L2)u\in C(0,T;L^{2}), with

uuhC(0,T;L2)2sh.\|u-u^{h}\|_{C(0,T;L^{2})}\lesssim 2^{-sh}.

On the other hand, expanding the difference as a telescopic sum we get

uuh=m=hum+1um,u-u^{h}=\sum_{m=h}^{\infty}u^{m+1}-u^{m},

where, in view of the above bounds (5.7) and (5.8), each summand is essentially concentrated at frequency 2m2^{m}, with HsH^{s} size cmc_{m} and exponentially decreasing tails. This leads to

(5.11) uuhC(0,T;Hs)ch:=(mhcm2)12,\|u-u^{h}\|_{C(0,T;H^{s})}\lesssim c_{\geq h}:=\left(\sum_{m\geq h}c_{m}^{2}\right)^{\frac{1}{2}},

so we also have convergence in C(0,T;Hs)C(0,T;H^{s}).

This type of argument plays multiple roles:

  1. (1)

    It produces rough solutions as smooth solutions, justifying the earlier assertion that it often suffices to carry out the initial construction of solutions only in a smooth setting.

  2. (2)

    It establishes the continuity of solutions as HsH^{s} valued flows, which is sometimes missing from the constructive proof of existence.

  3. (3)

    It provides the quantitative bound (5.11) for the difference between the rough and the smooth solutions, which plays a key role in the continuous dependence proof in the next section.

6. Continuous dependence

Here we use frequency envelopes in order to prove continuous dependence of the solution uC(0,T;Hs)u\in C(0,T;H^{s}) as a function of the initial data u0Hsu_{0}\in H^{s}, and also discuss some historical alternatives.

6.1. The continuous dependence proof

Consider a sequence of initial data

u0ju0inHs,s>d2+1,u_{0j}\to u_{0}\qquad in\ H^{s},\quad s>\frac{d}{2}+1,

and the corresponding solutions uju_{j}, uu which exist with a uniform lifespan [0,T][0,T], where TT depends only on the initial data size u0Hs\|u_{0}\|_{H^{s}}. We will prove that ujuu_{j}\to u in C(0,T;Hs)C(0,T;H^{s}). Once we have this property, it automatically extends to any larger time interval [0,T1][0,T_{1}] where the solution uu is defined and satisfies uC(0,T1;Hs)u\in C(0,T_{1};H^{s}). This should be understood in the sense that for all large enough jj, the solutions uju_{j} are defined in [0,T1][0,T_{1}], with similar regularity, and the convergence holds as jj\to\infty.

The difference bounds in Proposition 3.10 guarantee that ujuu_{j}\to u in C(0,T;L2)C(0,T;L^{2}). Since uju_{j} are uniformly bounded in C(0,T;Hs)C(0,T;H^{s}), this also implies convergence in C(0,T;Hσ)C(0,T;H^{\sigma}) for every 0σ<s0\leq\sigma<s, but not for σ=s\sigma=s.

It remains to consider the convergence in the strong topology, i.e. in HsH^{s}. Rather than trying to compare the solutions uju_{j} and uu directly, we will use as a proxy the approximate solutions ujhu_{j}^{h}, respectively uhu^{h}. For these, we will take advantage of the fact that their initial data converge in all Sobolev norms,

u0jhu0hinHσ,0σ<.u_{0j}^{h}\to u_{0}^{h}\qquad\text{in}\ H^{\sigma},\quad 0\leq\sigma<\infty.

Hence, according to the preceding discussion, we have convergence of the regular solutions in all Sobolev norms,

ujhuhinC(0,T;Hσ),0σ<.u_{j}^{h}\to u^{h}\qquad\text{in}\ C(0,T;H^{\sigma}),\quad 0\leq\sigma<\infty.

To compare the solutions uu and uju^{j} themselves, we use the triangle inequality,

(6.1) ujuC(0,T;Hs)ujhuhC(0,T;Hs)+uhuC(0,T;Hs)+ujhujC(0,T;Hs).\|u_{j}-u\|_{C(0,T;H^{s})}\lesssim\|u_{j}^{h}-u^{h}\|_{C(0,T;H^{s})}+\|u^{h}-u\|_{C(0,T;H^{s})}+\|u_{j}^{h}-u_{j}\|_{C(0,T;H^{s})}.

The first term goes to zero as jj\to\infty for fixed hh, while the second goes to zero as hh\to\infty, but does not depend on jj. It is the third term which is the problem, and for which we need to gain some smallness uniformly in jj.

However, in the previous section we have learned to estimate such differences using frequency envelopes. Precisely, let {ck}k0\left\{c_{k}\right\}_{k\geq 0}, respectively {ckj}k0\left\{c_{k}^{j}\right\}_{k\geq 0} be frequency envelopes for the initial data u0u_{0}, respectively u0ju_{0}^{j} in HsH^{s}. Then, as we saw in the previous section, we can estimate the last two terms above in terms of frequency envelopes and obtain

(6.2) ujuC(0,T;Hs)ujhuhC(0,T;Hs)+ch+chj.\|u_{j}-u\|_{C(0,T;H^{s})}\lesssim\|u_{j}^{h}-u^{h}\|_{C(0,T;H^{s})}+c_{\geq h}+c^{j}_{\geq h}.

The important observation is that the convergence u0ju0u_{0j}\to u_{0} in HsH^{s} allows us to choose the frequency envelopes cc, respectively cjc^{j} so that

cjcin 2.c^{j}\to c\qquad\text{in }\ell^{2}.

This implies that

limjchj=ch.\lim_{j\to\infty}c^{j}_{\geq h}=c_{\geq h}.

Hence, passing to the limit jj\to\infty in the relation (6.1), we obtain

(6.3) lim supjujuC(0,T;Hs)ch,\limsup_{j\to\infty}\|u_{j}-u\|_{C(0,T;H^{s})}\lesssim c_{\geq h},

and finally letting hh\to\infty we obtain

limjujuC(0,T;Hs)=0,\lim_{j\to\infty}\|u_{j}-u\|_{C(0,T;H^{s})}=0,

as desired.

6.2. Comparison with Kato and Bona-Smith

The more classical approach for continuous dependence goes back to Kato [16] as well as a variation due to Bona-Smith [5]. We will briefly describe this approach using our notations and set-up; we caution the reader that the original arguments in these papers are not self-contained and are instead mixed with the other parts of well-posedness proofs, so it is not exactly easy to correlate the papers with the description below. In effect our discussion below is more closely based on the interpretations of Kato’s work provided by Chemin [3] and, even closer, by Tao [26].

This also relies on the use of some sort of approximate solutions uhu^{h}. However, in this approach one aims to directly estimate the difference uhuu^{h}-u in HsH^{s} in terms of the corresponding initial data. One might at first hope to directly track the difference uhuC(0,T;Hs)\|u^{h}-u\|_{C(0,T;H^{s})}, but this cannot work without knowledge that the low frequencies of the difference (i.e. below 2h2^{h}) are better controlled. So the better object to track turns out to be a norm of the form

(6.4) uhuHs+2khuhuHsk,\|u^{h}-u\|_{H^{s}}+2^{kh}\|u^{h}-u\|_{H^{s-k}},

where we recall that kk is the order of our nolinearity. Here the second part can be estimated directly for any two HsH^{s} solutions, see Remark 3.12, so one can think of this as decoupled as a two step process. To better understand why this works, it is useful to write the equation for the difference w=uhuw=u^{h}-u in a paradifferential form

(6.5) tw+TDN(u)w=[F(u)F(uh)]+TDN(u)DN(uh)uh,\partial_{t}w+T_{DN(u)}w=[F(u)-F(u^{h})]+T_{DN(u)-DN(u^{h})}u^{h},

which should essentially be thought of as a perturbation of the linear paradifferential flow, which can be estimated in all Sobolev spaces. The FF difference is tame because FF admits Lipschitz bounds in all Sobolev spaces, so the issue is the last term.

There there is seemingly a loss of kk derivatives, but these derivatives are applied to uhu^{h}, which has higher regularity bounds, so they yield losses of at most a 2kh2^{kh} factor. But this factor can be absorbed by the lower frequency paradifferential coefficients given by DN(u)DN(uh)DN(u)-DN(u^{h}), in view of the 2kh2^{kh} factor in (6.4). Here it is important that we wrote the equation using TDN(u)T_{DN(u)} rather TDN(uh)T_{DN(u^{h})} on the left, which allows us to use uhu^{h} as the argument in the last term on the right.

In Kato’s argument the same principle is used to get HsH^{s} bounds not only for the difference uhuu^{h}-u but also for uhvu^{h}-v for an arbitrary solution vv. In Bona-Smith’s, version, on the other hand, one estimates only uhuu^{h}-u, but the proof is more roundabout in that uhu^{h} is not only assumed to have regularized data, but also to solve a regularized equation, thus combining the existence and the continuous dependence arguments.

In our opinion, working with frequency envelopes has definite advantages:

  • It provides more accurate information on the solutions.

  • It does not require any direct difference bounds in the strong HsH^{s} topology.

  • By working with a continuous, rather than a discrete family of regularizations one can fully replace difference estimates by bounds for the linearized equation, which is to be preferred in many cases, in particular in geometric contexts where the state space is an infinite dimensional manifold.

References

  • [1] A. Ai, M. Ifrim, and D. Tataru. Two dimensional gravity waves at low regularity I: Energy estimates. arXiv e-prints, page arXiv:1910.05323, Oct. 2019.
  • [2] T. Alazard, N. Burq, and C. Zuily. On the Cauchy problem for gravity water waves. Invent. Math., 198(1):71–163, 2014.
  • [3] H. Bahouri, J.-Y. Chemin, and R. Danchin. Fourier Analysis and Nonlinear Partial Differential Equations, volume 343 of Grundlehren der mathematischen Wissenschaften. Springer-Verlag Berlin Heidelberg, 2011.
  • [4] V. Barbu. Nonlinear semigroups and differential equations in Banach spaces. Editura Academiei Republicii Socialiste România, Bucharest; Noordhoff International Publishing, Leiden, 1976. Translated from the Romanian.
  • [5] J. L. Bona and R. Smith. The initial-value problem for the Korteweg-de Vries equation. Philos. Trans. Roy. Soc. London Ser. A, 278(1287):555–601, 1975.
  • [6] J.-M. Bony. Calcul symbolique et propagation des singularités pour les équations aux dérivées partielles non linéaires. Ann. Sci. École Norm. Sup. (4), 14(2):209–246, 1981.
  • [7] R. R. Coifman and Y. Meyer. Au delà des opérateurs pseudo-différentiels, volume 57 of Astérisque. Société Mathématique de France, Paris, 1978. With an English summary.
  • [8] J. Hadamard. Sur les problèmes aux dérivés partielles et leur signification physique. Princeton University Bulletin, 13:49–52, 1902.
  • [9] J. Hadamard. Lectures on Cauchy’s Problem in Linear Partial Differential Equations. Mrs. Hepsa Ely Silliman memorial lectures. Yale University Press, 1923.
  • [10] L. Hörmander. Lectures on nonlinear hyperbolic differential equations, volume 26 of Mathématiques & Applications (Berlin) [Mathematics & Applications]. Springer-Verlag, Berlin, 1997.
  • [11] L. Hörmander. The analysis of linear partial differential operators. III. Classics in Mathematics. Springer, Berlin, 2007. Pseudo-differential operators, Reprint of the 1994 edition.
  • [12] J. K. Hunter, M. Ifrim, and D. Tataru. Two dimensional water waves in holomorphic coordinates. Comm. Math. Phys., 346(2):483–552, 2016.
  • [13] M. Ifrim and D. Tataru. Lectures, Summer Graduate School: Introduction to water waves; MSRI, Summer 2020. https://www.msri.org/summer_schools/910/schedules. Accessed:2020-08-10.
  • [14] M. Ifrim and D. Tataru. The lifespan of small data solutions in two dimensional capillary water waves. Arch. Ration. Mech. Anal., 225(3):1279–1346, 2017.
  • [15] M. Ifrim and D. Tataru. The compressible Euler equations in a physical vacuum: a comprehensive Eulerian approach. arXiv e-prints, page arXiv:2007.05668, July 2020.
  • [16] T. Kato. The Cauchy problem for quasi-linear symmetric hyperbolic systems. Arch. Rational Mech. Anal., 58(3):181–205, 1975.
  • [17] J. L. Marzuola, J. Metcalfe, and D. Tataru. Quasilinear Schrödinger equations I: Small data and quadratic interactions. Adv. Math., 231(2):1151–1172, 2012.
  • [18] J. L. Marzuola, J. Metcalfe, and D. Tataru. Quasilinear Schrödinger equations, II: Small data and cubic nonlinearities. Kyoto J. Math., 54(3):529–546, 2014.
  • [19] J. L. Marzuola, J. Metcalfe, and D. Tataru. Quasilinear Schrödinger equations III: Large Data and Short Time. arXiv e-prints, page arXiv:2001.01014, Jan. 2020.
  • [20] H. McGahagan. An approximation scheme for Schrödinger maps. Comm. Partial Differential Equations, 32(1-3):375–400, 2007.
  • [21] G. Métivier. Para-differential calculus and applications to the Cauchy problem for nonlinear systems, volume 5 of Centro di Ricerca Matematica Ennio De Giorgi (CRM) Series. Edizioni della Normale, Pisa, 2008.
  • [22] C. Muscalu and W. Schlag. Classical and multilinear harmonic analysis. Vol. II, volume 138 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 2013.
  • [23] H. F. Smith and D. Tataru. Sharp local well-posedness results for the nonlinear wave equation. Ann. of Math. (2), 162(1):291–366, 2005.
  • [24] C. D. Sogge. Lectures on non-linear wave equations. International Press, Boston, MA, second edition, 2008.
  • [25] J. Sterbenz and D. Tataru. Energy dispersed large data wave maps in 2+12+1 dimensions. Comm. Math. Phys., 298(1):139–230, 2010.
  • [26] T. Tao. Blog, Local well-posedness for the Euler equations. https://terrytao.wordpress.com/2018/10/09/254a-notes-3-local-well-posedness-for-the-euler-equations/#apb-diff. Accessed:2020-08-10.
  • [27] T. Tao. Global regularity of wave maps. II. Small energy in two dimensions. Comm. Math. Phys., 224(2):443–544, 2001.
  • [28] T. Tao. Geometric renormalization of large energy wave maps. In Journées “Équations aux Dérivées Partielles”, pages Exp. No. XI, 32. École Polytech., Palaiseau, 2004.
  • [29] D. Tataru. Rough solutions for the wave maps equation. Amer. J. Math., 127(2):293–377, 2005.
  • [30] M. E. Taylor. Partial differential equations III. Nonlinear equations, volume 117 of Applied Mathematical Sciences. Springer, New York, second edition, 2011.