This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Regularity for a special case of two-phase Hele-Shaw flow via parabolic integro-differential equations

Farhan Abedin  and  Russell W. Schwab Department of Mathematics
Michigan State University
619 Red Cedar Road
East Lansing, MI 48824
[email protected], [email protected]
(Date: ,  arXiv ver 2)
Abstract.

We establish that the C1,γC^{1,\gamma} regularity theory for translation invariant fractional order parabolic integro-differential equations (via Krylov-Safonov estimates) gives an improvement of regularity mechanism for solutions to a special case of a two-phase free boundary flow related to Hele-Shaw. The special case is due to both a graph assumption on the free boundary of the flow and an assumption that the free boundary is C1,DiniC^{1,\textnormal{Dini}} in space. The free boundary then must immediately become C1,γC^{1,\gamma} for a universal γ\gamma depending upon the Dini modulus of the gradient of the graph. These results also apply to one-phase problems of the same type.

Key words and phrases:
Global Comparison Property, Integro-differential Operators, Dirichlet-to-Neumann, Free Boundaries, Hele-Shaw, Fully Nonlinear Equations, Viscosity Solutiuons, Krylov-Safonov
2010 Mathematics Subject Classification:
35B51, 35R09, 35R35, 45K05, 47G20, 49L25, 60J75, 76D27, 76S05
R. Schwab acknowledges partial support from the NSF with DMS-1665285. F. Abedin acknowledges support from the AMS and the Simons Foundation with an AMS–Simons Travel Grant.

1. Introduction

This paper has two goals. The first is to give a precise characterization of the integro-differential operators that can be used to represent the solution of some free boundary flows with both one and two phases, of what we call Hele-Shaw type. We give a characterization that is precise enough to determine whether or not existing integro-differential results apply to this setting. The second goal is to show that, indeed, a new regularization mechanism resulting from parabolic integro-differential theory is applicable. This will show that solutions that are C1,DiniC^{1,\textnormal{Dini}} must immediately become C1,γC^{1,\gamma} regular. We note that there is an earlier and stronger regularization mechanism for the one-phase Hele-Shaw flow by Choi-Jerison-Kim [26] which shows that Lipschitz solutions with a dimensionally small Lipschitz norm must be C1C^{1} regular and hence classical. We want to emphasize that in our context, both one and two phase problems are treated under the exact same methods. For simplicity and technical reasons, we focus on the case in which the free boundary is the graph of a time dependent function on n\mathbb{R}^{n}, n2n\geq 2.

These free boundary problems are the time dependent evolution of the zero level set of a function U:n+1×[0,T]U:\mathbb{R}^{n+1}\times[0,T]\to\mathbb{R} that satisfies the following equation, with VV representing the normal velocity on {U(,t)>0}\partial\{U(\cdot,t)>0\}, and GG a prescribed balance law. Here A1A_{1} and A2A_{2} are two (possibly different) elliptic constant coefficient diffusion matrices that dictate the equations:

{tr(A1D2U)=0in{U(,t)>0}tr(A2D2U)=0in{U(,t)<0}U(,t)=1on{xn+1=0}U(,t)=1on{xn+1=L}V=G(ν+U,νU)on{U(,t)>0}.\displaystyle\begin{cases}\textnormal{tr}(A_{1}D^{2}U)=0\ &\text{in}\ \{U(\cdot,t)>0\}\\ \textnormal{tr}(A_{2}D^{2}U)=0\ &\text{in}\ \{U(\cdot,t)<0\}\\ U(\cdot,t)=1\ &\text{on}\ \{x_{n+1}=0\}\\ U(\cdot,t)=-1\ &\text{on}\ \{x_{n+1}=L\}\\ V=G(\partial^{+}_{\nu}U,\partial^{-}_{\nu}U)\ &\text{on}\ \partial\{U(\cdot,t)>0\}.\end{cases} (1.1)

Without loss of generality, we take A1=IdA_{1}=\textnormal{Id} (which can be obtained by an orthogonal change of coordinates). The prescribed values for UU at xn+1=0x_{n+1}=0 and xn+1=Lx_{n+1}=L can be thought of as an ambient background pressure for UU, and the free boundary, {U=0}\{U=0\}, will be located in between.

As mentioned above, this work treats the special case of the free boundary problem in which the boundary of the positivity set can be given as the graph of a function over n\mathbb{R}^{n}. To this end, we will use the notation, DfD_{f}, as

Df={(x,xn+1)n+1: 0<xn+1<f(x)},\displaystyle D_{f}=\{(x,x_{n+1})\in\mathbb{R}^{n+1}\ :\ 0<x_{n+1}<f(x)\},

and in our context, we will assume that for some f:n×[0,T]f:\mathbb{R}^{n}\times[0,T]\to\mathbb{R},

{U(,t)>0}=Df(,t)\displaystyle\{U(\cdot,t)>0\}=D_{f(\cdot,t)}

and

{U(,t)>0}=graph(f(,t)).\displaystyle\partial\{U(\cdot,t)>0\}=\textnormal{graph}(f(\cdot,t)).

The main technical part of our work is centered on the properties of the (fully nonlinear) operator we call II, which is defined for the one-phase problem as

{ΔUf=0inDfUf=1onn×{0}Uf=0onΓf=graph(f),\displaystyle\begin{cases}\Delta U_{f}=0\ &\text{in}\ D_{f}\\ U_{f}=1\ &\text{on}\ \mathbb{R}^{n}\times\{0\}\\ U_{f}=0\ &\text{on}\ \Gamma_{f}=\textnormal{graph}(f),\end{cases} (1.2)

and II is the map,

I(f,x)=νUf(x,f(x)).\displaystyle I(f,x)=\partial_{\nu}U_{f}(x,f(x)). (1.3)

We note, the map II does not depend on tt and it is a fully nonlinear function of ff (in the sense that it does not have a divergence structure, and it fails linearity in the highest order terms acting on ff – in fact it is fails linearity for all terms). Here, II, can be thought of as a nonlinear Dirichlet-to-Neumann operator, but one that tracks how a particular solution depends on the boundary. This type of operator is not at all new, and we will briefly comment on its rather long history later on, in Section 2.

It turns out (probably not surprisingly) that the key features of (1.1) are entirely determined by the properties of the mapping, II. To this end, we will define a two phase version of this operator via the positive and negative sets,

Df+={(x,xn+1): 0<xn+1<f(x)},\displaystyle D_{f}^{+}=\{(x,x_{n+1})\ :\ 0<x_{n+1}<f(x)\}, (1.4)
Df={(x,xn+1):f(x)<xn+1<L},\displaystyle D_{f}^{-}=\{(x,x_{n+1})\ :\ f(x)<x_{n+1}<L\}, (1.5)

with the equation, (recall we take A1=IdA_{1}=\textnormal{Id})

{ΔUf=0inDf+tr(A2D2Uf)=0inDfUf=0onΓfUf=1on{xn+1=0}Uf=1on{xn+1=L}.\displaystyle\begin{cases}\Delta U_{f}=0\ &\text{in}\ D_{f}^{+}\\ \textnormal{tr}(A_{2}D^{2}U_{f})=0\ &\text{in}\ D_{f}^{-}\\ U_{f}=0\ &\text{on}\ \Gamma_{f}\\ U_{f}=1\ &\text{on}\ \{x_{n+1}=0\}\\ U_{f}=-1\ &\text{on}\ \{x_{n+1}=L\}.\end{cases} (1.6)

We define the respective normal derivatives to the positive and negative sets:

forX0Γf,andν(X0)the unit normal derivative to Γf, pointing into the setD+f,\displaystyle\text{for}\ X_{0}\in\Gamma_{f},\ \text{and}\ \nu(X_{0})\ \text{the unit normal derivative to $\Gamma_{f}$, pointing into the set}\ D^{+}f,
ν+U(X0):=limt0U(X0+tν(X0))U(X0)tandνU(X0)=limt0U(X0tν(X0))U(X0)t.\displaystyle\partial^{+}_{\nu}U(X_{0}):=\lim_{t\to 0}\frac{U(X_{0}+t\nu(X_{0}))-U(X_{0})}{t}\ \ \text{and}\ \ \partial^{-}_{\nu}U(X_{0})=-\lim_{t\to 0}\frac{U(X_{0}-t\nu(X_{0}))-U(X_{0})}{t}. (1.7)

With these, we can define the operator, HH, as

H(f,x):=G(I+(f,x),I(f,x))1+|f|2,\displaystyle H(f,x):=G(I^{+}(f,x),I^{-}(f,x))\cdot\sqrt{1+\left|\nabla f\right|^{2}}, (1.8)

where

I+(f,x):=ν+Uf(x,f(x)),andI(f,x):=νUf(x,f(x)).\displaystyle I^{+}(f,x):=\partial_{\nu}^{+}U_{f}(x,f(x)),\ \ \text{and}\ \ I^{-}(f,x):=\partial_{\nu}^{-}U_{f}(x,f(x)). (1.9)

The standard ellipticity assumption on GG is the following:

Gis Lipschitz andλaG(a,b)Λ,λbG(a,b)Λ.\displaystyle G\ \text{is Lipschitz and}\ \ \lambda\leq\frac{\partial}{\partial a}G(a,b)\leq\Lambda,\ \ \lambda\leq-\frac{\partial}{\partial b}G(a,b)\leq\Lambda. (1.10)

A canonical example of GG for the two-phase problem is G(a,b)=abG(a,b)=a-b, whereas a one-phase problem will simply be given by G(a,b)=G~(a)G(a,b)=\tilde{G}(a), and the problem often referred to as one-phase Hele-Shaw flow is G(a,b)=aG(a,b)=a (we note that the name “Hele-Shaw” has multiple meanings, depending upon the literature involved; both instances can be seen in Saffman-Taylor [61]).

In a previous work, [24], it was shown that under the graph assumption, the flow (1.1) is equivalent in the sense of viscosity solutions for free boundary problems to viscosity solutions of the nonlinear, nonlocal, parabolic equation for ff

{tf=G(I+(f),I(f))1+|f|2inn×[0,T],f(,0)=f0onn×{0}.\displaystyle\begin{cases}\partial_{t}f=G(I^{+}(f),I^{-}(f))\cdot\sqrt{1+\left|\nabla f\right|^{2}}\ &\text{in}\ \mathbb{R}^{n}\times[0,T],\\ f(\cdot,0)=f_{0}\ &\text{on}\ \mathbb{R}^{n}\times\{0\}.\end{cases} (1.11)

We remark that a viscosity solution for the respective equations (1.1) and (1.11) (they are different objects) will exist whenever the free boundary (or in this case, ff) is uniformly continuous, i.e. in very low regularity conditions.

In this paper, we explore a higher regularity regime, already assuming the existence of a classical solution of (1.1). Whenever ff remains in a particular convex set of C1,DiniC^{1,\textnormal{Dini}} (the set of C1C^{1} functions whose gradients enjoy a Dini modulus), we will show that the operator HH takes a precise form as an integro-differential operator. This convex set is denoted as, 𝒦(δ,L,m,ρ)\mathcal{K}(\delta,L,m,\rho), and is made up as

Cρ1,Dini(n)={f:n|fLand is Dini continuous with modulusρ},\displaystyle C^{1,\textnormal{Dini}}_{\rho}(\mathbb{R}^{n})=\{f:\mathbb{R}^{n}\to\mathbb{R}\ |\ \nabla f\in L^{\infty}\ \text{and is Dini continuous with modulus}\ \rho\},
𝒦(δ,L,m,ρ)={fCρ1,Dini:δ<f<Lδ,|f|m}.\displaystyle\mathcal{K}(\delta,L,m,\rho)=\{f\in C^{1,\textnormal{Dini}}_{\rho}\ :\ \delta<f<L-\delta,\ \left|\nabla f\right|\leq m\}. (1.12)

We note that the extra requirement δ<f<Lδ\delta<f<L-\delta is simply that the free boundary remains away from the fixed boundary where the pressure conditions are imposed.

The first theorem gives the integro-differential structure of HH, and the details of which ellipticity class it falls into.

Theorem 1.1.

Assume that GG satisfies (1.10) and HH is the operator defined by (1.8), using the equation, (1.6).

  1. (i)

    For each fixed δ\delta, LL, mm, ρ\rho, that define the set 𝒦\mathcal{K} in (1.12) there exists a collection {aij,cij,bij,Kij}××n×Borel(n{0})\{a^{ij},c^{ij},b^{ij},K^{ij}\}\subset{\mathbb{R}\times\mathbb{R}\times\mathbb{R}^{n}\times\textnormal{Borel}(\mathbb{R}^{n}\setminus\{0\})} (depending upon δ\delta, LL, mm, ρ\rho), so that

    f𝒦(δ,L,m,ρ),H(f,x)=minimaxj(aij+cijf(x)+bijf(x)+nδyf(x)Kij(y)𝑑y),\displaystyle\forall\ f\in\mathcal{K}(\delta,L,m,\rho),\ \ \ H(f,x)=\min_{i}\max_{j}\left(a^{ij}+c^{ij}f(x)+b^{ij}\cdot\nabla f(x)+\int_{\mathbb{R}^{n}}\delta_{y}f(x)K^{ij}(y)dy\right),

    where for an r0r_{0} depending upon δ\delta, LL, mm, we use the notation,

    δyf(x)=f(x+y)f(x)𝟙Br0(y)f(x)y.\displaystyle\delta_{y}f(x)=f(x+y)-f(x)-{\mathbbm{1}}_{B_{r_{0}}}(y)\nabla f(x)\cdot y. (1.13)
  2. (ii)

    Furthermore, there exists R0R_{0} and CC, depending on δ\delta, LL, mm, ρ\rho, so that for all i,ji,j,

    yn,C1|y|n1𝟙BR0(y)Kij(y)C|y|n1,\displaystyle\forall\ y\in\mathbb{R}^{n},\ \ \ C^{-1}\left|y\right|^{-n-1}{\mathbbm{1}}_{B_{R_{0}}}(y)\leq K^{ij}(y)\leq C\left|y\right|^{-n-1},

    and

    sup0<r<r0|bijBr0BryKij(y)𝑑y|C.\displaystyle\sup_{0<r<r_{0}}\left|b^{ij}-\int_{B_{r_{0}}\setminus B_{r}}yK^{ij}(y)dy\right|\leq C.

    The value of r0r_{0} in (1.13) depends on R0R_{0}.

The second result of this paper is to use the above result, plus recent results for parabolic integro-differential equations that include (1.11), thanks to part (ii) of Theorem 1.1, to deduce regularity for the resulting free boundary (in this case, the set Γf=graph(f(,t))\Gamma_{f}=\textnormal{graph}(f(\cdot,t))). This is the content of our second main result.

Theorem 1.2.

There exist universal constants, C>0C>0 and γ(0,1)\gamma\in(0,1), depending upon δ\delta, LL, mm, and ρ\rho, which define 𝒦\mathcal{K} in (1.12) so that if ff solves (1.11) and for all t[0,T]t\in[0,T], f(,t)𝒦(δ,L,m,ρ)f(\cdot,t)\in\mathcal{K}(\delta,L,m,\rho), then fC1,γ(n×[T2,T])f\in C^{1,\gamma}(\mathbb{R}^{n}\times[\frac{T}{2},T]), and

fC1,γ(n×[T2,T])C(δ,L,m,ρ)(1+T)Tγf(,0)C0,1.\displaystyle\lVert f\rVert_{C^{1,\gamma}(\mathbb{R}^{n}\times[\frac{T}{2},T])}\leq\frac{C(\delta,L,m,\rho)(1+T)}{T^{\gamma}}\lVert f(\cdot,0)\rVert_{C^{0,1}}.

In particular, under the assumption that for all t[0,T]t\in[0,T], {U(,t)>0}=graph(f(,t))\partial\{U(\cdot,t)>0\}=\textnormal{graph}(f(\cdot,t)), and for all t[0,T]t\in[0,T], f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho), we conclude that {U>0}\partial\{U>0\} is a C1,γC^{1,\gamma} hypersurface in space and time.

Remark 1.3.

It is important to note the strange presentation of the C1,γC^{1,\gamma} estimate in Theorem 1.2 with only f(,0)C0,1\lVert f(\cdot,0)\rVert_{C^{0,1}} on the right hand side. We emphasize that we have not proved that Lipschitz free boundaries become C1,γC^{1,\gamma}, due to the constant, C(δ,L,m,ρ)C(\delta,L,m,\rho). As the reader will see in Section 8, the constant CC depends in a complicated way on the parameters, δ\delta, LL, mm, ρ\rho, as all of these impact the boundary behavior of the Green’s function for the elliptic equations in (1.1), which in turn changes the estimates in Theorem 1.1, and hence the resulting parabolic estimates in Section 8. Nevertheless, once one knows that f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho) for some fixed choice of δ\delta, LL, mm, ρ\rho, subsequently decreasing the Lipschitz norm of f(,0)f(\cdot,0) would decrease the C1,γC^{1,\gamma} norm of the solution at later times. However, since the parameters δ\delta, LL, mm, ρ\rho give an upper bound on the quantity, for each tt, f(,t)Cρ1,Dini\lVert f(\cdot,t)\rVert_{C^{1,\textnormal{Dini}}_{\rho}}, a reasonable interpretation of the result is rather given as

fC1,γ(n×[T2,T])C(δ,L,m,ρ)(1+T)Tγsupt[0,T]f(,t)Cρ1,Dini.\displaystyle\lVert f\rVert_{C^{1,\gamma}(\mathbb{R}^{n}\times[\frac{T}{2},T])}\leq\frac{C(\delta,L,m,\rho)(1+T)}{T^{\gamma}}\sup_{t\in[0,T]}\lVert f(\cdot,t)\rVert_{C^{1,\textnormal{Dini}}_{\rho}}.

We note that the work [24] established the equivalence between free boundary viscosity solutions of some Hele-Shaw type evolutions, like (1.1), and viscosity solutions of fractional integro-differential parabolic equations in (1.11). However, the results in [24] focused on this equivalence at the level of viscosity solutions and low regularity properties, and they stopped short of addressing the question of a regularization phenomenon that may occur in a slightly higher regularity regime. As shown in the current paper, one needs to obtain much more precise information about the integro-differential operators appearing in, for instance, Theorem 1.1 in order to utilize recent tools from the realm of integro-differential equations to investigate how this equation regularizes. Furthermore, obtaining the estimate as in Theorem 1.1 required a slightly different approach than the one pursued in [24], instead invoking a finite dimensional approximation technique from [42]. This can be seen in Sections 7.1 and 7.2.

2. Some Historical Background and Related Results

Basically, (1.1) is a two-phase Hele-Shaw type problem without surface tension and neglecting the effects of gravity. For our purposes, we are interested in (1.1) for mathematical reasons to uncover some of its structural properties and to explore the possibility of regularizing effects. Thus, we do not comment much on the model’s physical origins. The fact that (1.1) governs a two-phase situation is important for us to demonstrate that these techniques work for both one and two phase problems of a certain type.

In the following discussion, we attempt to focus on results most closely related to (1.1), and we note that a more extended discussion can be found in the works [23] and [24].

2.1. Hele-Shaw type free boundary problems without gravity.

In most of the existing literature, (1.1) is studied in its one-phase form, where the set {U<0}\{U<0\} is ignored by simply dictating that the velocity condition is V=G(ν+U+)V=G(\partial^{+}_{\nu}U^{+}).

Some of the earliest works for short time existence and uniqueness are [39] and [40], where a type of variational problem is studied in [39] and a classical solution (for short time) is produced in [40]. For the one-phase problem, under a smoothness and convexity assumption, [36] gives global in time smooth solutions. Viscosity solutions for the one-phase version of (1.1) are defined and shown to exist and be unique in [50], which follows the approach first developed in [14] for the stationary two-phase problem and subsequently used in [5] for the two-phase Stefan problem. A follow-up modification of the definition of viscosity solutions for (1.1) was given in [24, Section 9]. Of course, for our results, we are assuming already the existence of a classical solution, and so none of the definitions of viscosity solutions for (1.1) are invoked here. (However, we do invoke viscosity solutions for the function ff, as they are useful even when studying smooth solutions, such as in investigating the equation for discrete spatial derivatives of solutions. But the notion of solution for ff is entirely different from that of UfU_{f}.)

Moving on to issues of regularity, beyond the smooth initial data case in [40], and the convex case in [36], there are a number of works. All of the following works apply to the one-phase problem. With some assumptions on the quantity |Ut|/|DU|\left|U_{t}\right|/\left|DU\right|, [51] showed a Lipschitz free boundary becomes C1C^{1} in space-time with a modulus, and long time regularity, involving propagation of a Lipschitz modulus, was obtained in [49]. Subsequently, the extra condition on the space-time non-degeneracy in [51] was removed in the work of [26], where under a dimensional small Lipschitz condition on the initial free boundary, Lipschitz free boundaries must be C1C^{1} in space-time and hence classical. This was then followed up by the work [27] where more precise results can be proved when the solution starts from a global Lipschitz graph. In this context, it is fair to say that our results are the extension of [27] to the two-phase case, but with paying the extra price of requiring C1,DiniC^{1,\textnormal{Dini}} regularity of the initial graph instead of being only Lipschitz. There is another regularity result for the one-phase Hele-Shaw problem in [23] that follows more the strategy of [37] and [62], instead of [13], [15], [26], [51]. In [23] the approach to regularity for the one-phase Hele-Shaw invoked parabolic regularity theory for fractional equations, but in that context the regularity theory applied to a blow-up limit of the solutions under a flatness condition in space-time, which resulted in a local C1,γC^{1,\gamma} space-time regularity for the solution. Thus already [23] foreshadowed the type of strategy that we have pursued in Theorem 1.2.

2.2. The nonlinear Dirichlet-to-Neumann mapping

In this paper, it is reasonable to call the operator, II, defined in (1.2) and (1.3) a nonlinear version of the classical Dirichlet-to-Neumann mapping. In this case it records the dependence on the shape of the domain of a particular harmonic function. This operator II, and the resulting mapping defined as HH in (1.8) are key components in our analysis, as well as were one of the main ingredients in the previous work [24]. Such operators are not new, and they have a relatively long study, particularly in some water wave equations (in fact, the authors in [24] were unaware of this long history). Although the map, II, appearing in (1.3), is not exactly the operator appearing in earlier works, it is very similar. Most of the earlier versions are a slight variant on the following: given two functions, h:nh:\mathbb{R}^{n}\to\mathbb{R} and ψ:n\psi:\mathbb{R}^{n}\to\mathbb{R}, Uh,ψU_{h,\psi} is the unique, bounded, harmonic function,

{ΔUh,ψ=0in{(x,xn+1)n+1:xn+1<h(x)}Uh,ψ(x,xn+1)=ψ(x)ongraph(h)\displaystyle\begin{cases}\Delta U_{h,\psi}=0\ &\text{in}\ \{(x,x_{n+1})\in\mathbb{R}^{n+1}\ :\ x_{n+1}<h(x)\}\\ U_{h,\psi}(x,x_{n+1})=\psi(x)\ &\text{on}\ \textnormal{graph}(h)\end{cases}

and the Dirichlet-to-Nuemann operator is

[G~(h)ψ](x):=νUh,ψ(x,h(x))1+|h(x)|2.\displaystyle[\tilde{G}(h)\psi](x):=\partial_{\nu}U_{h,\psi}(x,h(x))\sqrt{1+\left|\nabla h(x)\right|^{2}}.

We note that this operator is in fact in the literature usually denoted as G(h)ψG(h)\psi, but we use G~(h)ψ\tilde{G}(h)\psi due to the conflicting notation with our use of “GG” in (1.1), which is entirely different. The reader should note that in this context, it is very frequent that ψ\psi actually does not depend on xn+1x_{n+1}, which can be justified in that G~(h)\tilde{G}(h) often is used when acting on such vertically constant boundary data. Sometimes instead of taking UU to be defined in the subgraph of hh, there may be other boundary conditions, such as, for example when h>1h>1, a no flux condition νUh,ψ|{xn+1=0}=0\partial_{\nu}U_{h,\psi}|_{\{x_{n+1}=0\}}=0, or even there could be a fixed bottom boundary with a nontrivial shape. For the purposes of discussion, the equation in the subgraph of hh will suffice. The use of the map, G~(h)\tilde{G}(h), appears to go back to [71] and then [34]. The operator, G~(h)\tilde{G}(h) was revisited in [60] for the sake of improving computational tractability for various problems like (1.1) that may involve interfaces moving via a normal derivative. The work [57] investigates the mapping and boundedness properties of G~(h)\tilde{G}(h) on various Sobolev spaces for proving well-posedness of water wave equations, and also gives a very detailed description of the usage of G~\tilde{G} in earlier works on water waves; we refer to [57] for more discussion on the history of G~\tilde{G} in water wave results. The subsequent article [1] showed a more careful analysis of G~\tilde{G} could give improved conditions on well-posedness in gravity water waves. G~\tilde{G} recently played a central role in [59] for well-posedness of the Muskat problem and in [2], [3] for well-posedness of the one-phase Hele-Shaw equation with gravity as well as to deduce results related to Lyapunov functionals for the solution.

2.3. Hele-Shaw type free boundary problems with gravity– Muskat type problems

A pair of free boundary problems that is closely related to (1.1), but pose their own set of additional challenges are those that are also called Hele-Shaw and Muskat problems. They can be cast as both one and two phase problems, and they govern the free surface between two fluids of different density and possibly different viscosity. We note that in both, gravity is taken into account, and this changes the nature of the equation a bit away from (1.1); also the pressure is not required to be constant along the free boundary. There is a large amount of literature on this class of problems, and we focus on the ones most closely related to (1.1). A feature that links the Muskat problem to that considered in this paper is to rewrite the solution for the original problem in n+1n+1 space dimensions as a problem in nn dimensions that governs the free surface itself, directly, via a nonlinear equation that is inherently integro-differential in nature and which linearizes to the fractional heat equation of order 1/21/2. Already the reformulation of the problem in terms of integro-differential equations goes back to [4], [16], [65], with gloabl existence of solutions with small data in [65] and short time existence of solutions with large data in an appropriate Sobolev space in [4]. This method of writing the Muskat problem as an equation for the free surface directly continues in [32], and this is an integro-differential type equation for the gradient of the free surface function, where for a 2-dimensional interface is

tf=ρ2ρ14π2(f(x,t)f(xy,t))y(y2+[f(x,t)f(xy,t)]2)3/2𝑑y.\displaystyle\partial_{t}f=\frac{\rho_{2}-\rho_{1}}{4\pi}\int_{\mathbb{R}^{2}}\frac{(\nabla f(x,t)-\nabla f(x-y,t))\cdot y}{(y^{2}+[f(x,t)-f(x-y,t)]^{2})^{3/2}}dy. (2.1)

This formulation was then used to show that near a stable solution that is sufficiently regular, the equation linearizes to the 1/2-heat equation, and [32] further showed existence of solutions in this region (see a few more comments about linearization in Section 9). It was subsequently used to produce many well-posedness and regularity results, both short time and global time, a few of which are: [29], [30], [31].

There are (at least) two other variants on studying the Muskat problem as an equation for the free surface alone, and the ones that are very close in spirit to our work are, on the one hand, [17], [18], [33], and on the other hand, [3], [59]. In [33], equation (2.1) was rewritten as a fully nonlinear integro-differential equation on ff itself, instead of xf\partial_{x}f, which is given in 1-d as

tf=f(y,t)f(x,t)(yx)xf(x,t)(yx)2+(f(y,t)f(x,t))2𝑑y,\displaystyle\partial_{t}f=\int_{\mathbb{R}}\frac{f(y,t)-f(x,t)-(y-x)\partial_{x}f(x,t)}{(y-x)^{2}+(f(y,t)-f(x,t))^{2}}dy, (2.2)

which is an equation of the form,

tf=δyf(x,t)Kf(y,t)𝑑y,\displaystyle\partial_{t}f=\int_{\mathbb{R}}\delta_{y}f(x,t)K_{f}(y,t)dy,

where Kf0K_{f}\geq 0 is a kernel that depends on ff and has the same structure as what we provide in Theorem 1.1 above. The integro-differential equation for ff (as opposed to xf\partial_{x}f) played a role in [33] to show non-expansion of the Lipschitz norm of solutions with nice enough data. The integro-differential nature of the Muskat problem was subsequently utilized in [17], [18] to study well-posedness for Lipschitz data as well as establish regularizing effects from (2.2). Thus, in spirit, our work combined with [24] is very close to [17], [18]. The other variation closely related to our work is to utilize the equation for ff given by the operator, G~(f)\tilde{G}(f) shown above, and this is used in [3] for one-phase Hele-Shaw with gravity and [59] for both the one and two phase Muskat problem. The analogy is easiest to see for the one phase problem, and in both [3] and [59] it is established that if the graph of ff gives the free surface, then ff can be completely characterized by the flow

tf=G~(f)fonn×[0,T].\displaystyle\partial_{t}f=\tilde{G}(f)f\ \ \ \text{on}\ \mathbb{R}^{n}\times[0,T]. (2.3)

At least for the Hele-Shaw type flow we study in (1.1), it appears as though the first result to show that weak solutions (viscosity solutions) of (1.1) are equivalent to the flow governed by the Dirichlet-to-Neumann operator acting on ff, as above in (2.3) (in our context, this is HH in (1.8) and (1.11)), was proved in [24]. The reduction to the equation for the free surface is not surprising, as a similar (and more complicated) reduction to a system for the free surface in water waves was known since [34] (also appearing in [1], [57], among others)– the novelty in [24] was that the reduction holds for viscosity solutions, which may not be classical. In [24] it was shown that under the graph assumption, the notion of the viscosity free boundary solution for UU is equivalent to the viscosity solution of the equation for ff, which is (1.11). Furthermore, global in time existence and uniqueness for (1.11)– or well-posedness– holds, and it can be used to construct solutions to (1.1), as well as show that a modulus of continuity for the initial interface will be preserved for all time. Subsequently, both [3] and [59] showed that for respectively the one-phase Hele-Shaw with gravity and the Muskat problem, the equation (2.3) is equivalent to solving the original free boundary problem, and that this equation is globally in time well posed in Hs(n)H^{s}(\mathbb{R}^{n}) for s>1+n2s>1+\frac{n}{2}, regardless of the size of the initial data in HsH^{s}. Thus, the work in [24] and our work here is again, very closely related to [3], [59], by utilizing (2.3) directly. There is an important difference to note, however, where the results in [24] and our results here exploit the fact that HH enjoys the global comparison property (see Definition 5.23) and the structure provided by Theorem 1.1, contrasted with [3], [59] for which the analysis is derived from the properties of G~\tilde{G} as a mapping on HsH^{s}.

2.4. Parabolic integro-differential equations

For the sake of presentation, in the context of this paper, the parabolic integro-differential equations that we utilize are of the form

tf=b(x)f+nδhf(x,t)K(x,h)𝑑h,\displaystyle\partial_{t}f=b(x)\cdot\nabla f+\int_{\mathbb{R}^{n}}\delta_{h}f(x,t)K(x,h)dh, (2.4)

with δhf(x)=f(x+h)f(x)𝟙Br0(h)f(x)h\delta_{h}f(x)=f(x+h)-f(x)-{\mathbbm{1}}_{B_{r_{0}}}(h)\nabla f(x)\cdot h, and their nonlinear counterparts given as those in Theorem 1.1. Here, bb is a bounded vector field, and K0K\geq 0. The main issue for our work is the possibility that solutions of (2.4) enjoy some sort of extra regularity when KK has better behavior than simply being non-negative. Are solutions to (2.4) Hölder continuous in some way that still allows for rough coefficients? Are they C1,αC^{1,\alpha}? We note that as written, (2.4), is an equation in non-divergence form, and in the literature, the theory that addresses these questions commonly carries the name Krylov-Safonov results, which comes from the result for local, second order parabolic equations [56] (in the divergence case, they usually carry the name De Giorgi - Nash - Moser). These questions pertaining to (2.4) have gathered considerable attention in the past 20 or so years, and most of the works relevant to our study find their origins in either [8] or [9], followed by a combination of [12] and [66]. Examples of the works on parabolic equations that are close to our needs include [20], [21], [22], [63], [64], [67], [68]. We note there are many references for elliptic problems and problems involving existence and uniqueness of viscosity solutions which are not mentioned above.

A common feature of most of the parabolic works listed above is that they arose from the interest of studying the probabilistic implications and analytical properties of equations like (2.4) for their sake as fundamental mathematical objects in their own right. A typical and frequently mentioned application among the nonlinear works is their relationship to optimal control and differential games. There has also been interest in utilizing equations like (2.4) in situations which are not necessarily originally posed as an integro-differential equation, such as we do in this work as it pertains to (1.11). In our case, we find that (1.11) coincidentally landed within the scope of existing results, as the reader may see in Sections 4.2 and 8. This is not always the case, and sometimes the intended application of the integro-differential theory has led to new advances in the integro-differential field. One recent occurrence of this is the application of integro-differential techniques to the Boltzmann equation. For the homogeneous Boltzmann equation, new integro-differential results were first produced in [63] to be subsequently applied in [69] (which is mentioned in [63, Section 1B]). Even more advanced techniques were required for the inhomogenous Boltzmann equation, and one can see the evolution of the integro-differential theory in [47], which was followed by [44], [45], [46].

3. Notation and Assumptions

We will collect some notation here.

  • nn is the dimension of the free boundary hypersurface, with n2n\geq 2.

  • X=(x,xn+1)n+1X=(x,x_{n+1})\in\mathbb{R}^{n+1}.

  • Br(x)nB_{r}(x)\subset\mathbb{R}^{n} and Brn+1(X)n+1B^{n+1}_{r}(X)\subset\mathbb{R}^{n+1}. When the context is clear, the superscript may be dropped.

  • d(x,y)d(x,y) is the distant between xx and yy, d(x,E)d(x,E) is the distance between xx and a set EE, and may be abbreviated d(x)d(x) when d(x,E)d(x,E) is understood for a particular EE.

  • νf(X)\nu_{f}(X) is the unit normal vector to the boundary at XΓfX\in\Gamma_{f}, often abbreviated without the subscript.

  • I+I^{+} and II^{-} are the respective normal derivatives from the positive and negative phases of UfU_{f}, defined using (1.6), (1.7), and (1.9).

  • Cρ1,DiniC^{1,\textnormal{Dini}}_{\rho} is a Banach space, as in Stein [70, Chapter VI, Cor 2.2.3 and Exercise 4.6] (also see (1.12) as well as XρX_{\rho} in Definition 5.9).

  • XρX_{\rho}, see Definition 5.9 and Remark 5.10, cf. [70, Chapter VI, Cor 2.2.3 and Exercise 4.6].

  • C0(N)C^{0}(\mathbb{R}^{N}) is the space of continuous functions on N\mathbb{R}^{N}.

  • Cb0(N)C^{0}_{b}(\mathbb{R}^{N}) is the Banach space of continuous bounded functions with the norm L\lVert\cdot\rVert_{L^{\infty}}.

  • Cb1,α(N)C^{1,\alpha}_{b}(\mathbb{R}^{N}) is the space of functions that are bounded with bounded derivatives, with the derivatives α\alpha-Hölder continuous.

  • Df={(x,xn+1): 0<xn+1<f(x)}=Df+D_{f}=\{(x,x_{n+1})\ :\ 0<x_{n+1}<f(x)\}=D^{+}_{f}, Df={(x,xn+1):f(x)<xn+1<L}D^{-}_{f}=\{(x,x_{n+1})\ :\ f(x)<x_{n+1}<L\}

  • Γf=graph(f)={(x,xn+1):xn+1=f(x)}\Gamma_{f}=\textnormal{graph}(f)=\{(x,x_{n+1})\ :\ x_{n+1}=f(x)\}.

  • dσfd\sigma_{f} the surface measure on Γf\Gamma_{f}, often abbreviated without the subscript as dσd\sigma.

  • GfG_{f} the Green’s function in DfD_{f} for the operator in (1.6).

  • PfP_{f} the Poisson kernel for DfD_{f} on Γf\Gamma_{f}.

  • 𝒦(δ,L,m,ρ)\mathcal{K}(\delta,L,m,\rho), see (1.12).

  • δhf(x)=f(x+h)f(x)𝟙Br0(h)f(x)h\delta_{h}f(x)=f(x+h)-f(x)-{\mathbbm{1}}_{B_{r_{0}}}(h)\nabla f(x)\cdot h

4. Background results on Green’s Functions and Parabolic Equations

This section has two subsections, collecting respectively background results related to Green’s functions for equations in Dini domains and background results for fractional parabolic equations.

4.1. Boundary behavior of Green’s functions

We utilize results about the boundary behavior of Green’s functions for equations with Dini coefficients in domains that have C1,DiniC^{1,\textnormal{Dini}} boundaries, for a Dini modulus, ω\omega. We will use the shorthand d(x):=dist(x,Ω)d(x):=\text{dist}(x,\partial\Omega) for xΩx\in\Omega. The main way in which we use the boundary behavior of the Green’s function is to deduce the boundary behavior of the Poisson kernel as well as that of solutions that may vanish on a portion of the boundary. The study of the boundary behavior of Green’s functions is a well developed topic, and none of what we present here is new. The results in either Theorem 4.1 or Proposition 4.5 reside in the literature in various combinations of [10], [25], [72], among other references.

Theorem 4.1.

If GfG_{f} is the Green’s function for the domain, DfD_{f}, then there exist positive constants C1C_{1}, C2C_{2}, and R0R_{0}, that depend upon the Dini modulus of f\nabla f and other universal parameters so that for all x,yDfx,y\in D_{f} with |xy|R0\left|x-y\right|\leq R_{0}

C1min{d(x)d(y)|xy|n+1,14|xy|n1}Gf(x,y)C2min{d(x)d(y)|xy|n+1,14|xy|n1}.C_{1}\min\left\{\frac{d(x)d(y)}{\left|x-y\right|^{n+1}},\frac{1}{4\left|x-y\right|^{n-1}}\right\}\leq G_{f}(x,y)\leq C_{2}\min\left\{\frac{d(x)d(y)}{\left|x-y\right|^{n+1}},\frac{1}{4\left|x-y\right|^{n-1}}\right\}. (4.1)

The essential ingredient in the proof of Theorem 4.1 is the following Lemma 4.3 on the growth of solutions away from their zero set. Before stating this result, we need a few definitions. Denote by [x0,z0][x_{0},z_{0}] the closed line segment with endpoints x0,z0Ωx_{0},z_{0}\in\Omega, and denote by 𝒜2r(x0)\mathcal{A}_{2r}(x_{0}) the annulus B2r(x0)\Br(x0)B_{2r}(x_{0})\backslash B_{r}(x_{0}).

Definition 4.2.

A domain Ωn+1\Omega\subset\mathbb{R}^{n+1} satisfies the uniform interior ball condition with radius ρ0\rho_{0} if for every ξΩ\xi\in\partial\Omega, there exists an open ball BB of radius ρ0\rho_{0} such that BΩB\subset\Omega and B¯Ω={ξ}\overline{B}\cap\partial\Omega=\{\xi\}.

Observe that since δfLδ\delta\leq f\leq L-\delta and f\nabla f has a Dini modulus of continuity ρ\rho, there exists a C1,DiniC^{1,\text{Dini}} map Tf:Df¯n×[0,L]T_{f}:\overline{D_{f}}\rightarrow\mathbb{R}^{n}\times[0,L] satisfying

{Tf(Df)=n×[0,L],Tf(Γf)={xn+1=L},Tf({xn+1=0})={xn+1=0}.\begin{cases}T_{f}(D_{f})=\mathbb{R}^{n}\times[0,L],\\ T_{f}(\Gamma_{f})=\left\{x_{n+1}=L\right\},\\ T_{f}(\left\{x_{n+1}=0\right\})=\left\{x_{n+1}=0\right\}.\end{cases} (4.2)

Consequently, the function Vf:=UfTf1V_{f}:=U_{f}\circ T_{f}^{-1} satisfies an equation of the form LAVf=div(A(y)Vf(y))=0L_{A}V_{f}=-\text{div}(A(y)\nabla V_{f}(y))=0 on n×[0,L]\mathbb{R}^{n}\times[0,L], where the coefficients A()A(\cdot) satisfy 0<λ𝕀n+1AΛ𝕀n+10<\lambda\mathbb{I}_{n+1}\leq A\leq\Lambda\mathbb{I}_{n+1} with λ,Λ\lambda,\Lambda depending on δ,L,m\delta,L,m, and are Dini continuous on n×[0,L]\mathbb{R}^{n}\times[0,L] up to the boundary with a modulus of continuity ω\omega. Thus, for the purposes of the next lemma, we will only consider a domain Ωn+1\Omega\subset\mathbb{R}^{n+1} which satisfies the uniform interior ball condition with radius ρ0\rho_{0} and a solution to a uniformly elliptic equation in divergence form on Ω\Omega with coefficients having a Dini modulus of continuity ω\omega.

Lemma 4.3.

Suppose Ωn+1\Omega\subset\mathbb{R}^{n+1} satisfies the uniform interior ball condition with radius ρ0\rho_{0}. Let uC2(Ω)C(Ω¯)u\in C^{2}(\Omega)\cap C(\overline{\Omega}) be non-negative and satisfy

{LAu=0in Ω,u=0on ΓΩ,\begin{cases}L_{A}u=0\quad\text{in }\Omega,\\ u=0\quad\text{on }\Gamma\subset\partial\Omega,\end{cases}

Then there exist positive constants C=C(n,λ,Λ)C=C(n,\lambda,\Lambda) and r0=r0(n,ω,λ,Λ)ρ02r_{0}=r_{0}(n,\omega,\lambda,\Lambda)\leq\frac{\rho_{0}}{2} such that for all balls B2r(x0)ΩB_{2r}(x_{0})\subset\Omega with B2r(x0)¯Γ\overline{B_{2r}(x_{0})}\cap\Gamma\neq\emptyset and rr0r\leq r_{0}, we have the estimate

u(x)Cru(x0)d(x)+o(d(x)) for all x[x0,z0]𝒜2r(x0),z0B2r(x0)¯Γ.u(x)\geq\frac{C}{r}u(x_{0})d(x)+o(d(x))\qquad\text{ for all }x\in[x_{0},z_{0}]\cap\mathcal{A}_{2r}(x_{0}),\ z_{0}\in\overline{B_{2r}(x_{0})}\cap\Gamma. (4.3)

Let us state some useful consequences of Lemma 4.3 and Theorem 4.1. First, notice that Lemma 4.3 implies the following uniform linear growth of UfU_{f} away from Γf\Gamma_{f}.

Lemma 4.4.

There exist a constant C>0C>0 that depends on δ\delta, LL, mm, ρ\rho, so that for all f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho), for UfU_{f} defined in (1.2), and for all YΓfY\in\Gamma_{f},

sCUf(Ysyn+1)Cs,andsCUf(Y+sνf(Y))Cs.\displaystyle\frac{s}{C}\leq U_{f}(Y-sy_{n+1})\leq Cs,\ \ \text{and}\ \ \frac{s}{C}\leq U_{f}(Y+s\nu_{f}(Y))\leq Cs.

(Recall νf\nu_{f} is the inward normal to DfD_{f}.)

Theorem 4.1 also induces the following behavior on the Poisson kernel.

Proposition 4.5.

If f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho) and PfP_{f} is the Poisson kernel for the domain, DfD_{f}, then there exists constants C1C_{1}, C2C_{2}, C3C_{3} and R0R_{0}, that depend upon δ,L,m,ρ\delta,L,m,\rho and other universal parameters so that for all XDfX\in D_{f}, YΓfY\in\Gamma_{f}, with |XY|R0\left|X-Y\right|\leq R_{0},

C1d(X)|XY|n+1Pf(X,Y)C2d(X)|XY|n+1.\displaystyle C_{1}\frac{d(X)}{\left|X-Y\right|^{n+1}}\leq P_{f}(X,Y)\leq C_{2}\frac{d(X)}{\left|X-Y\right|^{n+1}}.

Furthermore, there exists an exponent, α(0,1]\alpha\in(0,1], depending on δ,L,m,ρ\delta,L,m,\rho and universal parameters, so that for XΓfX\in\Gamma_{f} and with R>R0R>R_{0},

ΓfBRPf(X+sν(X),Y)𝑑σf(Y)CsRα.\int_{\Gamma_{f}\setminus B_{R}}P_{f}(X+s\nu(X),Y)d\sigma_{f}(Y)\leq\frac{Cs}{R^{\alpha}}. (4.4)

For technical reasons, we also need a slight variation on Proposition 4.5, which is related to conditions necessary to invoke results from the earlier work [42] that we state here in Theorem 5.24.

Lemma 4.6.

There exists constants c0c_{0}, C>0C>0 and α(0,1]\alpha\in(0,1], depending on δ\delta, LL, mm, ρ\rho, so that if fCρ1,Dini(B2R(0))f\in C^{1,\textnormal{Dini}}_{\rho}(B_{2R}(0)), δfLδ\delta\leq f\leq L-\delta, and |f|m\left|\nabla f\right|\leq m, then for XBRΓfX\in B_{R}\cap\Gamma_{f} and 0<s<c00<s<c_{0},

ΓfB2RPf(X+sν(X),Y)𝑑σf(Y)CsRα.\displaystyle\int_{\Gamma_{f}\setminus B_{2R}}P_{f}(X+s\nu(X),Y)d\sigma_{f}(Y)\leq\frac{Cs}{R^{\alpha}}.
Remark 4.7.

It is worth noting that based on purely the Lipschitz constant of ff, one would obtain this same estimate in Lemma 4.6, but with the upper bound of CsαrαC\frac{s^{\alpha}}{r^{\alpha}}. The Dini condition in B2RB_{2R} is what allows to obtain ss, instead of sαs^{\alpha} in the estimate.

For the convenience of the reader, we have provided proofs of the above results in the Appendix. See [25] for a parabolic version of related results.

4.2. Background results on regularity for integro-differential equations

For our purposes, we will invoke results for parabolic integro-differential equations that originate mainly in Chang Lara - Davila [22] and Silvestre [67].

Following [19] and [22], we consider fully nonlinear parabolic equations whose linear versions are (tLK,b)u(\partial_{t}-L_{K,b})u, where for u:n×u:\mathbb{R}^{n}\times\mathbb{R}\rightarrow\mathbb{R},

LK,bu(x,t):=b(x,t)u(x,t)+nδhu(x,t)K(x,t,h)𝑑h,L_{K,b}u(x,t):=b(x,t)\cdot\nabla u(x,t)+\int_{\mathbb{R}^{n}}\delta_{h}u(x,t)K(x,t,h)\ dh, (4.5)

b(x,t)nb(x,t)\in\mathbb{R}^{n} is a bounded vector field and δhu(x,t):=u(x+h,t)u(x,t)𝟙Br0(h)u(x,t)h\delta_{h}u(x,t):=u(x+h,t)-u(x,t)-{\mathbbm{1}}_{B_{r_{0}}}(h)\nabla u(x,t)\cdot h. For any r(0,r0)r\in(0,r_{0}), consider the rescaled function

ur(x,t):=1ru(rx,rt).u_{r}(x,t):=\frac{1}{r}u(rx,rt).

A direct calculation shows that if uu satisfies the equation (tLK,b)u(x,t)=φ(x,t)(\partial_{t}-L_{K,b})u(x,t)=\varphi(x,t), then uru_{r} satisfies the equation (tLKr,br)ur(x,t)=φr(x,t)(\partial_{t}-L_{K_{r},b_{r}})u_{r}(x,t)=\varphi_{r}(x,t), where

Kr(x,t,h):=rn+1K(rx,rt,rh),br(x,t):=b(rx,rt)Br0\BrhK(rx,rt,h)𝑑h,φr(x,t)=φ(rx,rt).K_{r}(x,t,h):=r^{n+1}K(rx,rt,rh),\quad b_{r}(x,t):=b(rx,rt)-\int_{B_{r_{0}}\backslash B_{r}}hK(rx,rt,h)\ dh,\quad\varphi_{r}(x,t)=\varphi(rx,rt).

Based on this scaling behavior, we are led to consider the following class of linear operators.

Definition 4.8 (cf. Section 2 of [22]).

Given a positive number Λ\Lambda, the class Λ\mathcal{L}_{\Lambda} is the collection of linear operators of the form LK,bL_{K,b} as in (4.5) with KK and bb satisfying the properties

(i)Λ1|h|n1K(x,t,h)Λ|h|n1for all (x,t,h)n×[0,T]×n,\displaystyle\mathrm{(i)}\ \Lambda^{-1}\left|h\right|^{-n-1}\leq K(x,t,h)\leq\Lambda\left|h\right|^{-n-1}\qquad\text{for all }(x,t,h)\in\mathbb{R}^{n}\times[0,T]\times\mathbb{R}^{n},
(ii)sup0<ρ<1,(x,t)n+1|b(x,t)Br0BρhK(x,t,h)𝑑h|Λ.\displaystyle\mathrm{(ii)}\ \sup_{0<\rho<1,\ (x,t)\in\mathbb{R}^{n+1}}\left|b(x,t)-\int_{B_{r_{0}}\setminus B_{\rho}}hK(x,t,h)dh\right|\leq\Lambda.

Let us show that if b,KΛb,K\in\mathcal{L}_{\Lambda} then br,KrΛb_{r},K_{r}\in\mathcal{L}_{\Lambda} for all r(0,1)r\in(0,1). We suppress the dependence on tt. The bounds (i) on the kernels are immediate: for the upper bound, we have

Kr(x,h)=rn+1K(rx,rh)rn+1Λ|rh|n1=Λ|h|n1,K_{r}(x,h)=r^{n+1}K(rx,rh)\leq r^{n+1}\Lambda|rh|^{-n-1}=\Lambda|h|^{-n-1},

while for the lower bound, we have

Kr(x,h)=rn+1K(rx,rh)Λ1rn+1|rh|n1Λ1|h|n1.K_{r}(x,h)=r^{n+1}K(rx,rh)\geq\Lambda^{-1}r^{n+1}|rh|^{-n-1}\geq\Lambda^{-1}|h|^{-n-1}.

To show that br,Krb_{r},K_{r} satisfy (ii), let ρ(0,1)\rho\in(0,1) and xnx\in\mathbb{R}^{n} be arbitrary. Then

|br(x)B1BρhKr(x,h)𝑑h|\displaystyle\left|b_{r}(x)-\int_{B_{1}\setminus B_{\rho}}hK_{r}(x,h)dh\right| =|b(rx)B1BrhK(rx,h)𝑑hB1Bρhrn+1K(rx,rh)𝑑h|\displaystyle=\left|b(rx)-\int_{B_{1}\setminus B_{r}}hK(rx,h)dh-\int_{B_{1}\setminus B_{\rho}}hr^{n+1}K(rx,rh)dh\right|
=|b(rx)B1BrhK(rx,h)𝑑hBrBρrhK(rx,h)𝑑h|\displaystyle=\left|b(rx)-\int_{B_{1}\setminus B_{r}}hK(rx,h)dh-\int_{B_{r}\setminus B_{\rho r}}hK(rx,h)dh\right|
=|b(rx)B1BρrhK(rx,h)𝑑h|Λ.\displaystyle=\left|b(rx)-\int_{B_{1}\setminus B_{\rho r}}hK(rx,h)dh\right|\leq\Lambda.

Consequently, br,KrΛb_{r},K_{r}\in\mathcal{L}_{\Lambda}.

The class Λ\mathcal{L}_{\Lambda} gives rise to the extremal operators

Λ+(u)=supLΛL(u),Λ(u)=infLΛL(u).\displaystyle\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}(u)=\sup_{L\in\mathcal{L}_{\Lambda}}L(u),\quad\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}(u)=\inf_{L\in\mathcal{L}_{\Lambda}}L(u). (4.6)

These operators are typically used to characterize differences of a given nonlocal operator, say JJ, in which one would require

Λ(uv)J(u)J(v)MΛ+(uv),\displaystyle\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}(u-v)\leq J(u)-J(v)\leq M^{+}_{\mathcal{L}_{\Lambda}}(u-v), (4.7)

where one can change the operators by changing the set of functionals included in Λ\mathcal{L}_{\Lambda}. This is what is known as determining an “ellipticity” class for JJ. By the scale invariance of Λ\mathcal{L}_{\Lambda} we know that Λ±(ur)(x)=Λ±(u)(rx)\mathcal{M}^{\pm}_{\mathcal{L}_{\Lambda}}(u_{r})(x)=\mathcal{M}^{\pm}_{\mathcal{L}_{\Lambda}}(u)(rx).

The cylinders corresponding to the maximal operators Λ±\mathcal{M}^{\pm}_{\mathcal{L}_{\Lambda}} are

Qr=(r,0]×Br(0),Qr(t0,x0)=(t0r,t0]×Br(x0).Q_{r}=(-r,0]\times B_{r}(0),\qquad Q_{r}(t_{0},x_{0})=(t_{0}-r,t_{0}]\times B_{r}(x_{0}).
Definition 4.9.

The function uu is a viscosity supersolution of the equation

tuΛu=φ\partial_{t}u-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}u=\varphi

if for all ε>0\varepsilon>0 and ψ:(t,x)×n\psi:(t,x)\in\mathbb{R}\times\mathbb{R}^{n}\rightarrow\mathbb{R} left-differentiable in tt, twice pointwise differentiable in xx, and satisfying ψ(t,x)u(t,x)\psi(t,x)\leq u(t,x) with equality at (t0,x0)(t_{0},x_{0}), the function vεv_{\varepsilon} defined as

vε(t,x)={ψ(t,x)if (t,x)Qε(t0,x0),u(t,x)otherwisev_{\varepsilon}(t,x)=\begin{cases}\psi(t,x)\quad\text{if }(t,x)\in Q_{\varepsilon}(t_{0},x_{0}),\\ u(t,x)\quad\text{otherwise}\end{cases}

satisfies the inequality

tvε(t0,x0)Λvε(t0,x0)φ(t0,x0).\partial_{t}v_{\varepsilon}(t_{0},x_{0})-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}v_{\varepsilon}(t_{0},x_{0})\geq\varphi(t_{0},x_{0}).

The corresponding definition of a viscosity subsolution is obtained by considering a function ψ\psi satisfying ψ(t,x)u(t,x)\psi(t,x)\geq u(t,x) with equality at (t0,x0)(t_{0},x_{0}), and requiring

tvε(t0,x0)Λvε(t0,x0)φ(t0,x0).\displaystyle\partial_{t}v_{\varepsilon}(t_{0},x_{0})-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}v_{\varepsilon}(t_{0},x_{0})\leq\varphi(t_{0},x_{0}).

The same definitions hold for tuΛ+=φ\partial_{t}u-\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}=\varphi.

The main regularity result that we need is stated below, and can be found in [22]; see also [63, 67, 68].

Proposition 4.10 (Hölder Estimate, Section 7 of [22]).

Suppose uu is bounded in n×[0,t0]\mathbb{R}^{n}\times[0,t_{0}] and satisfies in the viscosity sense

{tuΛ+uAtuΛuA\begin{cases}\partial_{t}u-\mathcal{M}_{\mathcal{L}_{\Lambda}}^{+}u\leq A\\ \partial_{t}u-\mathcal{M}_{\mathcal{L}_{\Lambda}}^{-}u\geq-A\end{cases} (4.8)

in Qt0(t0,x0)Q_{t_{0}}(t_{0},x_{0}) for some constant A>0A>0. Then there exist constants C>0C>0 and γ(0,1)\gamma\in(0,1), depending only on nn and Λ\Lambda, such that

uCγ(Qt02(t0,x0))Ct0γ(uL(n×[0,t0])+t0A).||u||_{C^{\gamma}(Q_{\frac{t_{0}}{2}}(t_{0},x_{0}))}\leq\frac{C}{t_{0}^{\gamma}}\left(||u||_{L^{\infty}(\mathbb{R}^{n}\times[0,t_{0}])}+t_{0}A\right).
Remark 4.11.

The equations in (4.8) simply say that uu is a subsolution of tuΛ+u=A\partial_{t}u-\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}u=A and a supersolution of tuΛu=A\partial_{t}u-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}u=-A.

Since Proposition 4.10 differs slightly from [22] in that it accommodates the cylinder Qt0(t0,x0)Q_{t_{0}}(t_{0},x_{0}) other than the standard cylinder Q1Q_{1} and also from [67] in that it includes a non-zero right hand side, AA, we make a small comment here as to the appearance of the term t0At_{0}A in the conclusion of the estimate. Indeed, this is simply a result of rescaling the equation. As in [22], we already know that Proposition 4.10 holds for uu that are bounded in n×[1,0]\mathbb{R}^{n}\times[-1,0] and satisfy (4.8) in Q1Q_{1}; in this case, the CγC^{\gamma} estimate holds on Q12Q_{\frac{1}{2}}. Let us now show what happens for arbitrary t0>0t_{0}>0 and x0nx_{0}\in\mathbb{R}^{n}.

Let uu be as in the statement of Propositon 4.10 and define u~(t,x):=u((t0,x0)+t0(t,x))\tilde{u}(t,x):=u((t_{0},x_{0})+t_{0}(t,x)). Notice that if (t,x)Qr(t,x)\in Q_{r}, then (t0,x0)+t0(t,x)Qt0r(t0,x0)(t_{0},x_{0})+t_{0}(t,x)\in Q_{t_{0}r}(t_{0},x_{0}) for all r[0,1]r\in[0,1]. By the translation and scaling invariance properties of the operators tΛ±\partial_{t}-\mathcal{M}^{\pm}_{\mathcal{L}_{\Lambda}}, we thus have

tu~Λ+u~=t0(tuΛ+u)t0A and tu~Λu~=t0(tuΛu)t0A in Q1.\partial_{t}\tilde{u}-\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}\tilde{u}=t_{0}(\partial_{t}u-\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}u)\leq t_{0}A\quad\text{ and }\quad\partial_{t}\tilde{u}-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}\tilde{u}=t_{0}(\partial_{t}u-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}u)\geq-t_{0}A\quad\text{ in }Q_{1}.

On the other hand, we also have u~L(n×[1,0])=uL(n×[0,t0])||\tilde{u}||_{L^{\infty}(\mathbb{R}^{n}\times[-1,0])}=||u||_{L^{\infty}(\mathbb{R}^{n}\times[0,t_{0}])} and for all (t,x)Q12(t,x)\in Q_{\frac{1}{2}},

|u~(t,x)u~(0,0)||(t,x)|γ=|u((t0,x0)+t0(t,x))u(t0,x0)||(t,x)|γ=t0γ|u((t0,x0)+t0(t,x))u(t0,x0)||(t0,x0)+t0(t,x)(t0,x0)|γ.\frac{|\tilde{u}(t,x)-\tilde{u}(0,0)|}{|(t,x)|^{\gamma}}=\frac{|u((t_{0},x_{0})+t_{0}(t,x))-u(t_{0},x_{0})|}{|(t,x)|^{\gamma}}=\frac{t_{0}^{\gamma}|u((t_{0},x_{0})+t_{0}(t,x))-u(t_{0},x_{0})|}{|(t_{0},x_{0})+t_{0}(t,x)-(t_{0},x_{0})|^{\gamma}}.

Consequently, u~Cγ(Q12)=t0γuCγ(Qt02(t0,x0))||\tilde{u}||_{C^{\gamma}(Q_{\frac{1}{2}})}=t_{0}^{\gamma}||u||_{C^{\gamma}(Q_{\frac{t_{0}}{2}}(t_{0},x_{0}))}. The conclusion follows by applying to u~\tilde{u} the version of Proposition 4.10 for functions that are bounded in n×[1,0]\mathbb{R}^{n}\times[-1,0] and satisfy (4.8) in Q1Q_{1}, and then rewriting the resulting CγC^{\gamma} estimate in terms of uu.

5. A Finite Dimensional Approximation For II

An important note for this section is we will take NN to be an arbitrary dimension, and we are looking generically at operators on C1,Dini(N)C^{1,\textnormal{Dini}}(\mathbb{R}^{N}). The application to equation (1.1) will be for N=nN=n (as f:nf:\mathbb{R}^{n}\to\mathbb{R}).

Here we will record some tools that were developed in [42] and [43] to investigate the structure of operators that enjoy what we call a global comparison property (see Definition 5.23, below). The point of these tools is to build linear mappings that can be used to “linearize” the nonlinear operator, II, through the min-max procedure apparent in Theorem 1.1, or more precisely, to reconstruct II from a min-max of a special family of linear operators.

The linear mappings we build to achieve a min-max for II are limits of linear mappings that are differentials of maps with similar properties for a family of simpler operators that can be used to approximate II. The advantage of the approximations constructed in [42] and [43] is that they are operators with the same domain as II but enjoy the property of having finite rank (with the rank going to infinity as the approximates converge to the original). In this regard, even though the original operator and approximating operators are nonlinear, the approximates behave as Lipschitz operators on a high, but finite dimensional space, and are hence differentiable almost everywhere. This differentiability makes the min-max procedure straightforward, and it is then passed through the limit back to the original operator, II. The basis for our finite dimensional approximation to II is the Whitney extension for a family of discrete and finite subsets of n\mathbb{R}^{n}, whose union is dense in n\mathbb{R}^{n}. The reason for doing this is that we can restrict the functions to be identically zero outside of a finite set, and naturally, the collection of these functions is a finite dimensional vector space. Thus, Lipschitz operators on those functions will be differentiable almost everywhere, and as mentioned this is one of the main points of [42] to represent II as a min-max over linear operators.

5.1. The Whitney Extension

Here we just list some of the main properties of the Whitney extension constructed in [42]. It is a variant of the construction in Stein [70], where in [42] it is designed to preserve the grid structure of 2mN2^{-m}\mathbb{Z}^{N}. We refer the reader to [42, Section 4] for complete details.

Definition 5.1.

For each mm\in\mathbb{N}, the finite set, GmG_{m}, is defined as

Gm=2mN.\displaystyle G_{m}=2^{-m}\mathbb{Z}^{N}.

We will call hmh_{m} the grid size, defined as hm=2mh_{m}=2^{-m}.

We note that in [42, Section 4], the sets for the Whitney extension were constructed as a particular disjoint cube decomposition that covers nhmN\mathbb{R}^{n}\setminus h_{m}\mathbb{Z}^{N} and was shown to be invariant under translations of GmG_{m} by any vector in GmG_{m}. For each mm, we will index these sets by kk\in\mathbb{N}, and we will call them Qm,kQ_{m,k}. See [42, Section 4] for the precise details of Qm,kQ_{m,k} and ϕm,k\phi_{m,k}. Here we record these results.

Lemma 5.2 (Lemma 4.3 in [42]).

For every mm\in\mathbb{N}, there exists a collection of cubes {Qm,k}k\{Q_{m,k}\}_{k} such that

  1. (1)

    The cubes {Qm,k}k\{Q_{m,k}\}_{k} have pairwise disjoint interiors.

  2. (2)

    The cubes {Qm,k}k\{Q_{m,k}\}_{k} cover dGm\mathbb{R}^{d}\setminus G_{m}.

  3. (3)

    There exist a universal pair of constants, c1c_{1}, c2c_{2}, so that

    c1diam(Qm,k)dist(Qm,k,Gm)c2diam(Qm,k).\displaystyle c_{1}\textnormal{diam}(Q_{m,k})\leq\textnormal{dist}(Q_{m,k},G_{m})\leq c_{2}\textnormal{diam}(Q_{m,k}).
  4. (4)

    For every hGmh\in G_{m}, there is a bijection σh:\sigma_{h}:\mathbb{N}\to\mathbb{N} such that Qm,k+h=Qm,σhkQ_{m,k}+h=Q_{m,\sigma_{h}k} for every kk\in\mathbb{N}.

Remark 5.3.

Just for clarity, we make explicit for the reader: the parameter, mm\in\mathbb{N}, is used for the grid size, 2mN2^{-m}\mathbb{Z}^{N}, and the parameter, kk\in\mathbb{N}, in Qm,kQ_{m,k}, etc. is the index resulting from a cube decomposition of NGm\mathbb{R}^{N}\setminus G_{m}.

Remark 5.4.

In what follows, given a cube QQ, we shall denote by QQ^{*} the cube with the same center as QQ but whose sides are increased by a factor of 9/89/8. Observe that for every mm and kk, we have Qm,kn22mNQ_{m,k}^{*}\subset\mathbb{R}^{n}\setminus 2^{2-m}\mathbb{Z}^{N}, and that any given xx lies in at most some number C(N)C(N) of the cubes Qm,kQ_{m,k}^{*}.

Proposition 5.5 (Proposition 4.6 in [42]).

For every mm, there is a family of functions ϕm,k(x)\phi_{m,k}(x), with kk\in\mathbb{N}, such that

  1. (1)

    0ϕm,k(x)10\leq\phi_{m,k}(x)\leq 1 for every kk and ϕm,k0\phi_{m,k}\equiv 0 outside Qm,kQ_{m,k}^{*} (using the notation in Remark 5.4)

  2. (2)

    kϕm,k(x)=1\sum_{k}\phi_{m,k}(x)=1 for every xnGmx\in\mathbb{R}^{n}\setminus G_{m}.

  3. (3)

    There is a constant CC, independent of mm and kk, such that

    |ϕm,k(x)|Cdiam(Qm,k).\displaystyle|\nabla\phi_{m,k}(x)|\leq\frac{C}{\textnormal{diam}(Q_{m,k})}.
  4. (4)

    For every zGmz\in G_{m}, we have

    ϕm,k(xz)=ϕm,σzk(x),k,x,\displaystyle\phi_{m,k}(x-z)=\phi_{m,\sigma_{z}k}(x),\;\;\forall\;k,\;x,

    where σz\sigma_{z} are the bijections introduced in Lemma 5.2.

We will call {ϕm,k}\{\phi_{m,k}\} the corresponding partition of unity for {Qm,k}\{Q_{m,k}\} that is appropriate for the Whitney extension. As in [42, Section 4], we use the following finite difference operator to construct approximate Taylor polynomials for the Whitney extension. Denote by m1u(x)\nabla_{m}^{1}u(x) the unique vector that satisfies for xGmx\in G_{m} and j=1,,nj=1,\dots,n

m1u(x)ej=12hm(u(x+hmej)u(xhmej)).\displaystyle\nabla_{m}^{1}u(x)\cdot e_{j}=\frac{1}{2h_{m}}(u(x+h_{m}e_{j})-u(x-h_{m}e_{j})). (5.1)

Note that this exploits the fact that x±hmejGmx\pm h_{m}e_{j}\in G_{m} if xGmx\in G_{m}.

In order to define the polynomials that will be used to build the Whitney extension, we need some notation for the centers of cubes and closest points in GmG_{m}.

Definition 5.6.

For each mm and kk, we will call ym,ky_{m,k} the center of the cube Qm,kQ_{m,k}, and y^m,k\hat{y}_{m,k} will denote the unique element of GmG_{m} so that

d(ym,k,Gm)=|ym,ky^m,k|.\displaystyle d(y_{m,k},G_{m})=\left|y_{m,k}-\hat{y}_{m,k}\right|.

For f:Nf:\mathbb{R}^{N}\to\mathbb{R}, we can now define a polynomial used to approximate it:

Definition 5.7.

Using the discrete gradient, m1f\nabla_{m}^{1}f in (5.1), we define a first order polynomial depending on ff, mm, kk, as

forxQm,k,Pf,k1(x)=f(y^m,k)+m1f(y^mk)(xy^m,k).\displaystyle\text{for}\ x\in Q_{m,k},\ \ P^{1}_{f,k}(x)=f(\hat{y}_{m,k})+\nabla_{m}^{1}f(\hat{y}_{mk})\cdot(x-\hat{y}_{m,k}).

Given any ff, we denote the mm-level truncation, f~m\tilde{f}_{m} as

f~m=f𝟙B2m.\displaystyle\tilde{f}_{m}=f{\mathbbm{1}}_{B_{2^{m}}}.

With all of these ingredients in hand, we can define the Whitney extensions that we will use.

Definition 5.8.

Using the notation of Definition 5.7, and partition of unity, (ϕm,k)(\phi_{m,k}), in Proposition 5.5, the zero order Whitney extension is

Em0(f,x)={f~m(x)ifxGm,kf~m(y^m,k)ϕm,k(x)ifxGm,\displaystyle E^{0}_{m}(f,x)=\begin{cases}\tilde{f}_{m}(x)\ &\text{if}\ x\in G_{m},\\ \sum_{k\in\mathbb{N}}\tilde{f}_{m}(\hat{y}_{m,k})\phi_{m,k}(x)\ &\text{if}\ x\not\in G_{m},\end{cases}

and the first order Whitney extension is

Em1(f,x)={f~m(x)ifxGm,kPf~m,k1(x)ϕm,k(x)ifxGm.\displaystyle E^{1}_{m}(f,x)=\begin{cases}\tilde{f}_{m}(x)\ &\text{if}\ x\in G_{m},\\ \sum_{k\in\mathbb{N}}P^{1}_{\tilde{f}_{m,k}}(x)\phi_{m,k}(x)\ &\text{if}\ x\not\in G_{m}.\end{cases}

5.2. The finite dimensional approximation

As mentioned above, we give an approximation procedure and min-max formula for generic operators acting on convex subsets of C1,Dini(N)C^{1,\textnormal{Dini}}(\mathbb{R}^{N}). We will call these operators, J:XρC0(N)J:X_{\rho}\to C^{0}(\mathbb{R}^{N}), where the Banach space, XρX_{\rho} appears below, in Definition 5.9. Our particular interest is the eventual application of this material to the operator II defined in (1.2) and (1.3).

The spaces that are used for the domain of the operators, JJ, are given here.

Definition 5.9.
Xρ={fC1,Dini(N):Cf,s.t.|f(x)f(y)|Cfρ(|xy|) for all x,yN}.\displaystyle X_{\rho}=\left\{f\in C^{1,\textnormal{Dini}}(\mathbb{R}^{N})\ :\ \exists\ C_{f},\ \textnormal{s.t.}\ \left|\nabla f(x)-\nabla f(y)\right|\leq C_{f}\rho(\left|x-y\right|)\ \textnormal{ for all }x,y\in\mathbb{R}^{N}\right\}.
Xρ,x={fXρ:Cf,s.t.|f(y)f(x)|Cf|yx|ρ(|yx|) for all yN}.\displaystyle X_{\rho,x}=\left\{f\in X_{\rho}\ :\ \exists\ C_{f},\ \textnormal{s.t.}\ \left|f(y)-f(x)\right|\leq C_{f}\left|y-x\right|\rho(\left|y-x\right|)\ \textnormal{ for all }y\in\mathbb{R}^{N}\right\}.

We note that XρX_{\rho} is a Banach space with the usual norm on C1C^{1} combined with the additional Dini semi-norm

[f]ρ=infC{supx,y|f(x)f(y)|Cρ(|xy|},\displaystyle[\nabla f]_{\rho}=\inf_{C}\{\sup_{x,y}\left|\nabla f(x)-\nabla f(y)\right|\leq C\rho(\left|x-y\right|\},

see [70, Chapter VI, Cor 2.2.3 and Exercise 4.6]. Furthermore, Xρ,xX_{\rho,x} is a subspace of XρX_{\rho} consisting of those functions vanishing with a rate at xx.

Remark 5.10.

We note that fXρf\in X_{\rho} if and only if

x,yN,|f(x+y)f(x)f(x)y|Cf|y|ρ(|y|).\displaystyle\forall\ x,y\in\mathbb{R}^{N},\ \ \left|f(x+y)-f(x)-\nabla f(x)\cdot y\right|\leq C_{f}\left|y\right|\rho(\left|y\right|).

Without loss of generality, ρ\rho can be chosen so that ρ~(y)=fL|y|ρ(|y|)\tilde{\rho}(y)=\lVert f\rVert_{L^{\infty}}\left|y\right|\rho(\left|y\right|) satisfies ρ~Xρ\tilde{\rho}\in X_{\rho}. This means that whenever f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho), we have that ψ(y)=δ+fL|y|ρ(|y|)\psi(y)=\delta+\lVert f\rVert_{L^{\infty}}\left|y\right|\rho(\left|y\right|) satisfies ψ𝒦(δ,L,m,ρ)\psi\in\mathcal{K}(\delta,L,m,\rho).

The first step in making operators with finite rank is to first restrict input functions to the finite set, GmG_{m}. So, we define the restriction operator,

Tm:C0(N)Gm,Tmf:=f|Gm.\displaystyle T_{m}:C^{0}(\mathbb{R}^{N})\to\mathbb{R}^{G_{m}},\ \ \ T_{m}f:=f|_{G_{m}}. (5.2)

Thus, we can use the restriction operator to create a projection of XρX_{\rho} onto a finite dimensional subspace of functions depending only on their values on GmG_{m}:

πm=Em1Tm:XρXρ.\displaystyle\pi_{m}=E^{1}_{m}\circ T_{m}:X_{\rho}\to X_{\rho}. (5.3)

One of the reasons for using the Whitney extension to define E1E^{1} is that operators such as πm\pi_{m} will be Lipschitz, and with a norm that is independent from GmG_{m}.

Theorem 5.11 (Stein Chapter VI result 4.6 [70]).

Em0E^{0}_{m} is linear, and if gg is Lipschitz on GmG_{m}, then Em0gE^{0}_{m}g is Lipschitz on N\mathbb{R}^{N} with the same Lipschitz constant. Furthermore, πm\pi_{m} is linear and, for a constant, C>0C>0 that depends only on dimension, for all fXρf\in X_{\rho},

πmfXρCfXρ\displaystyle\lVert\pi_{m}f\rVert_{X_{\rho}}\leq C\lVert f\rVert_{X_{\rho}}

On top of the boundedness of πm\pi_{m}, we have intentionally constructed the sets GmG_{m}, the cubes {Qm,k}k\{Q_{m,k}\}_{k}, and the partition functions ϕm,k\phi_{m,k}, to respect translations over GmG_{m}.

Definition 5.12.

For f:Nf:\mathbb{R}^{N}\to\mathbb{R}, and zNz\in\mathbb{R}^{N}, we define the translation operator τz\tau_{z} as

τzf(x)=f(x+z).\displaystyle\tau_{z}f(x)=f(x+z).

In particular, property (4) of Proposition 5.5 gives the following translation invariance of πm\pi_{m}.

Lemma 5.13 (Proposition 4.14 of [42]).

If f:Nf:\mathbb{R}^{N}\to\mathbb{R}, zGmz\in G_{m}, fixed, and τz\tau_{z} in Definition 5.12, then

πm(τzf)=τz(πmf),andEm0Tm(τzf)=τz(Em0Tmf).\displaystyle\pi_{m}(\tau_{z}f)=\tau_{z}\left(\pi_{m}f\right),\ \ \text{and}\ \ E^{0}_{m}\circ T_{m}(\tau_{z}f)=\tau_{z}\left(E^{0}_{m}\circ T_{m}f\right).

With these nice facts about the projection operator, πm\pi_{m}, we can thus define our approximating operators to JJ, in which the approximates have finite rank.

Definition 5.14.

Given JJ that is a Lipschitz mapping of XρCb0(N)X_{\rho}\to C^{0}_{b}(\mathbb{R}^{N}) the finite dimensional approximation, JmJ_{m}, is defined as

Jm:=Em0TmJEm1Tm=Em0TmJπm,\displaystyle J^{m}:=E^{0}_{m}\circ T_{m}\circ J\circ E^{1}_{m}\circ T_{m}=E^{0}_{m}\circ T_{m}\circ J\circ\pi_{m}, (5.4)

where E0E^{0} and E1E^{1} appear in Definition 5.8, TmT_{m} is defined in (5.2), and πm\pi_{m} is defined in (5.3).

Below, we will see JmJ^{m} are Lipschitz maps. It will also matter in which way JmJJ^{m}\to J; for our purposes, it is enough that these approximate operators converge pointwise to JJ over XρX_{\rho}, in the following sense.

Proposition 5.15 (Corollary 5.20 of [42]).

For all fXρf\in X_{\rho}, for each R>0R>0,

limmJm(f)J(f)L(BR)=0.\displaystyle\lim_{m\to\infty}\lVert J^{m}(f)-J(f)\rVert_{L^{\infty}(B_{R})}=0.

A property that was observed in [43] and also used in [42] is the “almost” preservation of ordering by the projections, πm\pi_{m}. Although ordering is, in general, not preserved, on functions that are regular enough, there is a quantifiable error term. We record this here because it plays a fundamental role later on, in Section 7, to preserve certain estimates. In particular, we eventually focus on the fact that our operators have an extra structure called the global comparison property (see Definition 5.23), and so whenever JJ enjoys the global comparison property, then JmJ_{m} almost enjoys the global comparison property, up to a quantifiable error term over a large enough subspace of XρX_{\rho}. The main ingredient to this end is the following lemma.

Lemma 5.16 (Lemma 4.17 of [42]).

If wC1,α(N)w\in C^{1,\alpha}(\mathbb{R}^{N}), x0Gmx_{0}\in G_{m}, w0w\geq 0, w(x0)=0w(x_{0})=0, then there exists a function, Rα,m,w,x0C1,α/2(N)R_{\alpha,m,w,x_{0}}\in C^{1,\alpha/2}(\mathbb{R}^{N}) with Rα,m,w,x0(x0)=0R_{\alpha,m,w,x_{0}}(x_{0})=0,

xN,πmw(x)+Rα,m,w,x0(x)0,andRα,m,w,x0C1,α/2ChnβwC1,α,\displaystyle\forall\ x\in\mathbb{R}^{N},\ \pi_{m}w(x)+R_{\alpha,m,w,x_{0}}(x)\geq 0,\ \ \ \textnormal{and}\ \ \ \lVert R_{\alpha,m,w,x_{0}}\rVert_{C^{1,\alpha/2}}\leq Ch_{n}^{\beta}\lVert w\rVert_{C^{1,\alpha}},

where β(0,1)\beta\in(0,1) and depends upon α\alpha.

Remark 5.17.

If J:XρCb0J:X_{\rho}\to C^{0}_{b} is Lipschitz, then for any modulus, ω\omega so that ωρ\omega\leq\rho, JJ is also a Lipschitz mapping on XωX_{\omega}. In particular, for all α(0,1)\alpha\in(0,1), such a JJ is a Lipschitz mapping on Cb1,α(N)C^{1,\alpha}_{b}(\mathbb{R}^{N}).

5.3. A subset of “supporting” linear operators, 𝒟J\mathcal{D}_{J}

The main reason for using the approximating operators, JmJ_{m}, is that as maps that have finite rank, they are effectively maps on a finite dimensional space and hence are differentiable at almost every fXρf\in X_{\rho}. Furthermore, this a.e. ff differentiability endows them with a natural min-max structure. It turns out that taking limits of “linearizations” of JmJ_{m} produces a rich enough family to construct a min-max representation for the original JJ. That is the purpose of this subsection.

First, we have some notation for the set of “supporting” differentials of maps on XρX_{\rho}. The first is simply the collection of limits of derivatives of a map that is differentiable almost everywhere.

Definition 5.18 (Differential Set Almost Everywhere).

If JJ is differentiable a.e. XρX_{\rho}, we call the differential set,

𝒟J=c.h.{L=limkDJ[fk;]:fkfandJis differentiable atf},\displaystyle\mathcal{D}J=\textnormal{c.h.}\{L=\lim_{k}DJ[f_{k};\cdot]\ :\ f_{k}\to f\ \text{and}\ J\ \text{is differentiable at}\ f\},

where we used the abbreviation “c.h.” to denote the convex hull. Here DJ[f;]DJ[f;\cdot] is the derivative of JJ at ff.

This is used to build a weaker notion of “differential” set that we will use later, which is the limits of all derivatives of approximating operators.

Definition 5.19 (Weak Differential Set).

For J:XρCb0(N)J:X_{\rho}\to C^{0}_{b}(\mathbb{R}^{N}), we can define a weak differential set as the following:

𝒟J=c.h.{L:mk,Lmk𝒟Jmks.t.fXρ,limkLmk(f,)=L(f,)},\displaystyle\mathcal{D}_{J}=\textnormal{c.h.}\{L\ :\ \exists m_{k},\ L_{m_{k}}\in\mathcal{D}J^{m_{k}}\ \text{s.t.}\ \forall\ f\in X_{\rho},\ \lim_{k\to\infty}L_{m_{k}}(f,\cdot)=L(f,\cdot)\}, (5.5)

where we used the abbreviation “c.h.” to denote the convex hull. Here, JmkJ^{m_{k}} are the approximating operators for JJ that are given in Definition 5.14.

Lemma 5.20.

If J:XρCb0(n)J:X_{\rho}\to C^{0}_{b}(\mathbb{R}^{n}) is Lipschitz and translation invariant, then so are JmJ^{m}, and all L𝒟JmL\in\mathcal{D}J^{m} enjoy a bound which is the Lipschitz norm of JmJ^{m}.

Main idea of proof of Lemma 5.20.

We do not give all the details here, but simply comments on a few points. First of all, the Lipschitz nature of JmJ^{m} is evident from that of JJ and Theorem 5.11 (translation invariance is not used here). Furthermore, as JmJ^{m} is a Lipschitz function on a finite dimensional space, we see that all L𝒟JmL\in\mathcal{D}J^{m} must be realized as limits of derivatives of JmJ^{m}. However, it is easily checked that the operator norm of any differential is bounded by the Lipschitz norm of the original operator, hence the claim about L𝒟JmL\in\mathcal{D}J^{m}. Finally, we need to address the translation invariance of JmJ^{m} and LL. This follows immediately from the translation invariance properties of the projection and extension operators listed in Lemma 5.13. Furthermore, again, this translation invariance will also be inherited by any derivative of JmJ^{m} and hence L𝒟JmL\in\mathcal{D}J^{m}. ∎

The reason that the set, 𝒟J\mathcal{D}_{J}, is useful for our purposes is that it gives a sort of “maximal” mean value inequality, which is just a variant on the usual mean value theorem (cf. Lebourg’s Theorem in [28]).

Lemma 5.21 (Lemma 5.2 and Remark 5.4 of [42]).

If 𝒦\mathcal{K} is a convex subset of XρX_{\rho} and J:𝒦Cb0(N)J:\mathcal{K}\to C^{0}_{b}(\mathbb{R}^{N}) is Lipschitz, then

f,g𝒦,J(f)J(g)maxL𝒟JL(fg),\displaystyle\forall\ f,g\in\mathcal{K},\ \ J(f)-J(g)\leq\max_{L\in\mathcal{D}_{J}}L(f-g),

where 𝒟J\mathcal{D}_{J} is from Definition 5.19.

Sketch of Lemma 5.21.

We note more careful details are given in [42, Section 5], and so we just give the main idea. Given f,g𝒦f,g\in\mathcal{K}, the usual Mean Value theorem of Lebourg [28] shows that there exists t[0,1]t\in[0,1] and z=tf+(1t)gz=tf+(1-t)g with the property that there is at least one L𝒟J(z)L\in\mathcal{D}_{J}(z) (the differential only at zz) with the property that

J(f)J(g)=L(fg).\displaystyle J(f)-J(g)=L(f-g).

Hence taking that maximum gives the result. The actual result requires a small amount more detail in the invocation of Lebourg’s mean value theorem, which is presented in [42, Section 5].

From this mean value inequality, a generic min-max formula for JJ becomes immediate.

Corollary 5.22.

Given a convex subset 𝒦Xρ\mathcal{K}\subset X_{\rho}, and J:𝒦Cb0(N)J:\mathcal{K}\to C^{0}_{b}(\mathbb{R}^{N}) that is Lipschitz, JJ can be realized in the following way:

f𝒦,J(f,x)=ming𝒦maxL𝒟JJ(g,x)+L(fg,x),\displaystyle\forall\ f\in\mathcal{K},\ \ J(f,x)=\min_{g\in\mathcal{K}}\max_{L\in\mathcal{D}_{J}}J(g,x)+L(f-g,x),

where 𝒟J\mathcal{D}_{J} is from Definition 5.19.

Proof of Corollary 5.22.

For generic f,gXρf,g\in X_{\rho}, we can utilize Lemma 5.21, and then taking the minimum over all gXρg\in X_{\rho} yields the claim.

The next result needs a feature we call the global comparison property.

Definition 5.23.

We say that J:XρC0(N)J:X_{\rho}\to C^{0}(\mathbb{R}^{N}) obeys the global comparison property (GCP) provided that for all f,gXρf,g\in X_{\rho} and x0x_{0} such that fgf\leq g and f(x0)=g(x0)f(x_{0})=g(x_{0}), JJ satisfies J(f,x0)J(g,x0)J(f,x_{0})\leq J(g,x_{0}).

In the case that JJ enjoys the GCP, more can be said. This is one of the main results from [42] and [43].

Theorem 5.24 (Theorem 1.11 in [42], Theorem 1.6 [43]).

If 𝒦\mathcal{K} is a convex subset of XρX_{\rho} and J:𝒦Cb0(N)J:\mathcal{K}\to C^{0}_{b}(\mathbb{R}^{N}) is such that

  1. (i)

    JJ is Lipschitz

  2. (ii)

    JJ is translation invariant

  3. (iii)

    JJ enjoys the GCP

  4. (iv)

    there exists a modulus, ω\omega, with limRω(R)=0\lim_{R\to\infty}\omega(R)=0 and

    f,g𝒦,withfginB2R,J(f)J(g)L(BR)ω(R)fgL(N),\displaystyle\forall\ f,g\in\mathcal{K},\ \text{with}\ f\equiv g\ \text{in}\ B_{2R},\ \lVert J(f)-J(g)\rVert_{L^{\infty}(B_{R})}\leq\omega(R)\lVert f-g\rVert_{L^{\infty}(\mathbb{R}^{N})}, (5.6)

then for each L𝒟JL\in\mathcal{D}_{J}, there exists the following parameters that are independent of xx:

cL,bLN,μLmeasures(n{0}),\displaystyle c_{L}\in\mathbb{R},\ b_{L}\in\mathbb{R}^{N},\ \mu_{L}\in\textnormal{measures}(\mathbb{R}^{n}\setminus\{0\}),

such that for all ff,

L(f,x)=cLf(x)+bLf(x)+nδhf(x)μL(dh),\displaystyle L(f,x)=c_{L}f(x)+b_{L}\cdot\nabla f(x)+\int_{\mathbb{R}^{n}}\delta_{h}f(x)\mu_{L}(dh),

and JJ can be represented as

f𝒦,J(f,x)=ming𝒦maxL𝒟JJ(g,x)+L(fg,x).\displaystyle\forall\ f\in\mathcal{K},\ \ J(f,x)=\min_{g\in\mathcal{K}}\max_{L\in\mathcal{D}_{J}}J(g,x)+L(f-g,x).

Here, for some appropriate, fixed, r0r_{0}, depending upon JJ, we use the notation

δhf(x)=f(x+h)f(x)𝟙Br0(h)f(x)h.\displaystyle\delta_{h}f(x)=f(x+h)-f(x)-{\mathbbm{1}}_{B_{r_{0}}}(h)\nabla f(x)\cdot h.

Furthermore, for a universal C>0C>0, we have

supL𝒟J{|cL|+|bL|+Nmin{|h|ρ(|h|),1}μL(dh)}CJLip,XρCb0.\displaystyle\sup_{L\in\mathcal{D}_{J}}\left\{\left|c_{L}\right|+\left|b_{L}\right|+\int_{\mathbb{R}^{N}}\min\{\left|h\right|\rho(\left|h\right|),1\}\mu_{L}(dh)\right\}\leq C\lVert J\rVert_{Lip,\ X_{\rho}\to C^{0}_{b}}.
Remark 5.25.

Generically, r0r_{0} can be taken as r0=1r_{0}=1, allowing for a change to each of the corresponding bLb_{L}, but in our context, it is more natural to choose r0r_{0} depending on JJ.

Comments on the proof of Theorem 5.24.

As the way Theorem 5.24 is stated does not match exactly the statements of those in [42, Theorem 1.11] or [43, Theorem 1.6], some comments are in order. The point is that we explicitly show that the min-max representation for JJ uses the set of linear mappings, 𝒟J\mathcal{D}_{J}, which is not made explicit in the theorems in [42], [43]. This is purely a matter of presentation.

By Lemma 5.20, we know that since JJ is translation invariant, then also all L𝒟JmL\in\mathcal{D}J^{m} are translation invariant. Taking this fact in hand, and combining it with the analysis that appears in [42, Section 3], in particular, [42, Lemma 3.9], we see that all L𝒟JmL\in\mathcal{D}J^{m} have the form claimed here in Theorem 5.24. The passage from operators in 𝒟Jm\mathcal{D}J^{m} to 𝒟J\mathcal{D}_{J} and the preservation of their structure follows in the same way as in [42, Section 5]. We note that the structured imparted on L𝒟JmL\in\mathcal{D}J^{m} by the fact that LL is an operator that is translation invariant and enjoys the GCP allows us to remove any requirement of [42, Assumption 1.4] as it pertains to the arguments in [42, Section 5].

Remark 5.26.

A curious reader may notice that in [42], all of Theorems 1.9, 1.10, and 1.11 apply to the JJ that we study herein. The most relevant two are Theorems 1.10 and 1.11 in [42], and in particular as here JJ is translation invariant, Theorem 1.10 in [42] is much simpler in that there is no requirement for (5.6) as we do above. The reason Theorem 1.10 in [42] does not suit us here is subtle, and is based on the fact that we will subsequently require a non-degeneracy property of all of the LL used to reconstruct JJ as a min-max. In our case this will result from using the approximations JmJ^{m} as above, and to describe the limits of Lm𝒟JmL_{m}\in\mathcal{D}J^{m}, we need an extra condition to get some compactness on the nonlocal terms, which is the use of (5.6). The type of non-degeneracy we will need for LL will be apparent in Section 7, and we will add some further discussion later.

6. Lipschitz Property of II and HH

First, we will show that for each fixed choice of parameters, δ\delta, LL, mm, ρ\rho, II is a Lipschitz mapping, from 𝒦(δ,L,m,ρ)\mathcal{K}(\delta,L,m,\rho) to Cb0(n)C^{0}_{b}(\mathbb{R}^{n}). The main properties of HH are deduced from the more basic operator, II, which we study first. Then, later in the section we will show how the same results follow for HH.

6.1. The analysis for the operator, II

Because HH is defined as a function of two operators that take the form, (1.3), the key result in proving HH is Lipschitz is to prove that II as in (1.3) is Lipschitz.

Proposition 6.1.

If II is the operator defined via (1.2) and (1.3), then for each δ\delta, LL, mm, ρ\rho fixed, II is a Lipschitz mapping,

I:𝒦(δ,L,m,ρ)Cb0(n),\displaystyle I:\mathcal{K}(\delta,L,m,\rho)\to C^{0}_{b}(\mathbb{R}^{n}),

and the Lipschitz norm of II depends upon all of δ\delta, LL, mm, ρ\rho.

Because of the definition of I+I^{+} and II^{-} using (1.6) and (1.9), we see that all of the argument in the domain Df+D^{+}_{f} for the operator, II (which is, by definition I+I^{+}), have direct analogs to the operator II^{-} and the domain DfD_{f}^{-}. Thus, we state the following as a corollary of the techniques that prove Proposition 6.1, but we do not provide a proof.

Corollary 6.2.

The operator, II^{-}, defined in (1.6) and (1.9) has the same Lipschitz property as II in Proposition 6.1.

Before we can establish Proposition 6.1, we give some more basic results.

Lemma 6.3.

For R0R_{0} as in Theorem 4.1, there exists a universal C>0C>0 and α(0,1]\alpha\in(0,1], so that if ψ0\psi\geq 0, ψ(0)=0\psi(0)=0, ψ(y)c|y|ρ(y)\psi(y)\leq c\left|y\right|\rho(y), f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho), and f+ψ𝒦(δ,L,m,ρ)f+\psi\in\mathcal{K}(\delta,L,m,\rho), then for ν=νf=νf+ψ\nu=\nu_{f}=\nu_{f+\psi} and X0=(0,f(0))=(0,(f+ψ)(0))X_{0}=(0,f(0))=(0,(f+\psi)(0)), with UfU_{f}, Uf+ψU_{f+\psi} as in (1.2),

1C(ΓfBR0n+1(X0)ψ(y)|YX0|n1𝑑Y)\displaystyle\frac{1}{C}\left(\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\psi(y)\left|Y-X_{0}\right|^{-n-1}dY\right)
νUf+ψ(X0)νUf(X0)\displaystyle\ \ \ \ \leq\partial_{\nu}U_{f+\psi}(X_{0})-\partial_{\nu}U_{f}(X_{0})
C(R0αψL(nBR0)+ΓfBR0n+1(X0)ψ(y)|YX0|n1𝑑Y).\displaystyle\ \ \ \ \ \ \ \ \leq C\left(R_{0}^{-\alpha}\lVert\psi\rVert_{L^{\infty}(\mathbb{R}^{n}\setminus B_{R_{0}})}+\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\psi(y)\left|Y-X_{0}\right|^{-n-1}dY\right).

Recall, BRnB_{R}\subset\mathbb{R}^{n} and BRn+1(X0)n+1B^{n+1}_{R}(X_{0})\subset\mathbb{R}^{n+1}.

Corollary 6.4.

With ff and ψ\psi as in Lemma 6.3,

1C(BR0ψ(y)|y|n1𝑑y)\displaystyle\frac{1}{C}\left(\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\right)
νUf+ψ(X0)νUf(X0)\displaystyle\ \ \ \ \leq\partial_{\nu}U_{f+\psi}(X_{0})-\partial_{\nu}U_{f}(X_{0})
C(R0αψL(nBR0)+BR0ψ(y)|y|n1𝑑y),\displaystyle\ \ \ \ \ \ \ \ \leq C\left(R_{0}^{-\alpha}\lVert\psi\rVert_{L^{\infty}(\mathbb{R}^{n}\setminus B_{R_{0}})}+\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\right),

where the integration occurs over n\mathbb{R}^{n} instead of Γf\Gamma_{f}.

Remark 6.5.

The exponent, α\alpha, in Lemma 6.11 and Corollary 6.4 is the same exponent that appears in the second part of Proposition 4.5, from (4.4).

First we note how the corollary follows from Lemma 6.3.

Proof of Corollary 6.4.

Because Γf\Gamma_{f} is a C1,DiniC^{1,\textnormal{Dini}} graph, we know that up to a constant (depending on only the Lipschitz norm of ff),

1CΓf(BR0(X0))ψ(y)|X0Y|n+1𝑑Y\displaystyle\frac{1}{C}\int_{\Gamma_{f}\cap\left(B_{R_{0}}(X_{0})\right)}\psi(y)\left|X_{0}-Y\right|^{n+1}dY
BR0(0)ψ(h)|h|n1𝑑h\displaystyle\leq\int_{B_{R_{0}}(0)}\psi(h)\left|h\right|^{-n-1}dh
CΓf(BR0(X0))ψ(y)|X0Y|n+1𝑑Y,\displaystyle\leq C\int_{\Gamma_{f}\cap\left(B_{R_{0}}(X_{0})\right)}\psi(y)\left|X_{0}-Y\right|^{n+1}dY,

and we emphasize that the first and third integrals occur on the set Γf\Gamma_{f}, whereas the second integral is over a subset of n\mathbb{R}^{n}. ∎

Proof of Lemma 6.3.

This lemma uses, via the fact that ψ0\psi\geq 0, a sort of “semigroup” property of UfU_{f} (recall UfU_{f}, Uf+ψU_{f+\psi} are as in (1.2)). In particular, since DfDf+ψD_{f}\subset D_{f+\psi}, we can decompose Uf+ψU_{f+\psi} as the following

Uf+ψ=Uf+W,\displaystyle U_{f+\psi}=U_{f}+W,

where WW is the unique solution of

{ΔW=0inDfW=0on{xn+1=0}W=Uf+ψ|ΓfonΓf.\displaystyle\begin{cases}\Delta W=0\ &\text{in}\ D_{f}\\ W=0\ &\text{on}\ \{x_{n+1}=0\}\\ W=U_{f+\psi}|_{\Gamma_{f}}\ &\text{on}\ \Gamma_{f}.\end{cases}

We can invoke the linear growth of Uf+ψU_{f+\psi} away from Γf+ψ\Gamma_{f+\psi} given in Lemma 4.4 to see that

Y=(y,yn+1)=(y,f(y))Γf,ψ(y)CUf+ψ(Y)Cψ(y).\displaystyle\forall\ Y=(y,y_{n+1})=(y,f(y))\in\Gamma_{f},\ \ \frac{\psi(y)}{C}\leq U_{f+\psi}(Y)\leq C\psi(y). (6.1)

Now, we can fix 0<s<<10<s<<1 and use the Poisson kernel, PfP_{f}, to evaluate Uf+ψ(X0+sν(X0))U_{f+\psi}(X_{0}+s\nu(X_{0})) (and we recall that X0=(0,f(0))X_{0}=(0,f(0))). We first show the details of the next argument as they pertain to the lower bound. The argument for the upper bound follows by analogous arguments, invoking the upper bound on Uf+ψ(Y)U_{f+\psi}(Y), given previously. We will also use the boundary behavior of PfP_{f} given in Proposition 4.5 (the lower bound in BR0(X0)B_{R_{0}}(X_{0}) here, and the upper bound for the analogous upper bound argument on Uf+ψU_{f+\psi}). Thus, we can estimate:

Uf+ψ(X0+sνf(X0))\displaystyle U_{f+\psi}\left(X_{0}+s\nu_{f}(X_{0})\right)
=Uf(X0+sνf(X0))+W(X0+sνf(X0))\displaystyle=U_{f}(X_{0}+s\nu_{f}(X_{0}))+W(X_{0}+s\nu_{f}(X_{0}))
=Uf(X0+sνf(X0))+ΓfUf+ψ|Γf(Y)Pf(X0+sνf(X0),Y)dY\displaystyle=U_{f}(X_{0}+s\nu_{f}(X_{0}))+\int_{\Gamma_{f}}U_{f+\psi}|_{\Gamma_{f}}(Y)P_{f}\left(X_{0}+s\nu_{f}(X_{0}),Y\right)dY
Uf(X0+sνf(X0))+Γfψ(y)CPf(X0+sνf(X0),Y)𝑑Y\displaystyle\geq U_{f}(X_{0}+s\nu_{f}(X_{0}))+\int_{\Gamma_{f}}\frac{\psi(y)}{C}P_{f}\left(X_{0}+s\nu_{f}(X_{0}),Y\right)dY (6.2)
Uf(X0+sνf(X0))+ΓfBR0n+1(X0)C~sψ(y)|X0Y|n1𝑑Y,\displaystyle\geq U_{f}(X_{0}+s\nu_{f}(X_{0}))+\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\tilde{C}s\psi(y)\left|X_{0}-Y\right|^{-n-1}dY,

where in the second to last line, we invoke the estimate of Lemma 4.4 as in (6.1). (We have used νf\nu_{f} as the inward normal derivative to DfD_{f} and we recall the notation Y=(y,yn+1)Y=(y,y_{n+1}), as well as R0R_{0} originating in Proposition 4.5.) We note the use of the assumption that ψ(y)c|y|ρ(|y|)\psi(y)\leq c\left|y\right|\rho(\left|y\right|) in order that the following integral is well defined:

ΓfBR0n+1(X0)ψ(y)|X0Y|n1𝑑Y.\displaystyle\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\psi(y)\left|X_{0}-Y\right|^{-n-1}dY.

Thus, since Uf+ψ(0,f(0))=0=Uf(0,f(0))U_{f+\psi}(0,f(0))=0=U_{f}(0,f(0)), as well νf+ψ(X0)=νf(X0)\nu_{f+\psi}(X_{0})=\nu_{f}(X_{0}) (as (f+ψ)(0)=f(0)\nabla(f+\psi)(0)=\nabla f(0)), we see that by rearranging terms, dividing by ss, and taking s0s\to 0 (with an abuse of the use of the constant, CC)

νUf+tψ(X0)νUf(X0)C(ΓfBR0n+1(X0)ψ(y)|X0Y|n1𝑑Y).\displaystyle\partial_{\nu}U_{f+t\psi}(X_{0})-\partial_{\nu}U_{f}(X_{0})\geq C\left(\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\psi(y)\left|X_{0}-Y\right|^{-n-1}dY\right).

Now, we mention the minor modification to obtain the upper bound. Working just as above, we can start at the upper bound analog of line (6.2), and then we invoke Proposition 4.5, both the pointwise estimates in BR0B_{R_{0}} and the integral estimate in BR0CB_{R_{0}}^{C} in (4.4). This yields:

Uf+ψ(X0+sνf(X0))\displaystyle U_{f+\psi}\left(X_{0}+s\nu_{f}(X_{0})\right)
Uf(X0+sνf(X0))+Γfψ(y)CPf(X0+sνf(X0),Y)𝑑Y\displaystyle\leq U_{f}(X_{0}+s\nu_{f}(X_{0}))+\int_{\Gamma_{f}}\frac{\psi(y)}{C}P_{f}\left(X_{0}+s\nu_{f}(X_{0}),Y\right)dY
Uf(X0+sνf(X0))+ΓfBR0n+1(X0)C~sψ(y)|X0Y|n1𝑑Y\displaystyle\leq U_{f}(X_{0}+s\nu_{f}(X_{0}))+\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\tilde{C}s\psi(y)\left|X_{0}-Y\right|^{-n-1}dY
+ΓfBR0n+1(X0)ψL(nBR0)Pf(X0+sν,Y)𝑑Y\displaystyle\ \ \ \ \ \ \ \ \ \ +\int_{\Gamma_{f}\setminus B^{n+1}_{R_{0}}(X_{0})}\lVert\psi\rVert_{L^{\infty}(\mathbb{R}^{n}\setminus B_{R_{0}})}P_{f}(X_{0}+s\nu,Y)dY
Uf(X0+sνf(X0))+ΓfBR0n+1(X0)C~sψ(y)|X0Y|n1𝑑Y+CsψL(nBR0)R0α.\displaystyle\leq U_{f}(X_{0}+s\nu_{f}(X_{0}))+\int_{\Gamma_{f}\cap B^{n+1}_{R_{0}}(X_{0})}\tilde{C}s\psi(y)\left|X_{0}-Y\right|^{-n-1}dY+\frac{Cs\lVert\psi\rVert_{L^{\infty}(\mathbb{R}^{n}\setminus B_{R_{0}})}}{R_{0}^{\alpha}}.

The upper bound concludes as the lower bound, and this finishes the proof of the lemma.

Lemma 6.6.

There exists a universal C>0C>0 and ε2>0\varepsilon_{2}>0 so that if ψ(0)=0\psi(0)=0, |ψ|ε2\left|\nabla\psi\right|\leq\varepsilon_{2}, f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho), and f+ψ𝒦(δ,L,m,ρ)f+\psi\in\mathcal{K}(\delta,L,m,\rho), then for X0=(0,f(0))X_{0}=(0,f(0)),

|νf+ψUf+ψ(X0)νfUf(X0)|C|ψ(0)|+Cε2ψL.\displaystyle\left|\partial_{\nu_{f+\psi}}U_{f+\psi}(X_{0})-\partial_{\nu_{f}}U_{f}(X_{0})\right|\leq C\left|\nabla\psi(0)\right|+C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}.
Proof of Lemma 6.6.

The main part of this proof is to use a rotation to reduce to the case of Lemma 6.3. Let \mathcal{R} be the unique rotation that satisfies

(νf+ψ(X0))=νf(X0)\displaystyle\mathcal{R}(\nu_{f+\psi}(X_{0}))=\nu_{f}(X_{0})

and leaves

(span{νf+ψ(X0),νf(X0})\displaystyle\left(\textnormal{span}\{\nu_{f+\psi}(X_{0}),\nu_{f}(X_{0}\}\right)^{\perp}

unchanged. Then we can define for a yet to be chosen cutoff function, η\eta, the transformation TT

T:n+1n+1,T(X)=X0+η(|XX0|)(XX0)+(1η(|XX0|))(XX0)\displaystyle T:\mathbb{R}^{n+1}\to\mathbb{R}^{n+1},\ \ T(X)=X_{0}+\eta(\left|X-X_{0}\right|)\mathcal{R}(X-X_{0})+(1-\eta(\left|X-X_{0}\right|))(X-X_{0})

We compose this mapping with Uf+ψU_{f+\psi} to define an auxiliary function,

V(X)=(Uf+ψT1)(X).\displaystyle V(X)=(U_{f+\psi}\circ T^{-1})(X).

If the parameter, ε2\varepsilon_{2}, in the assumption of the lemma is not too large (depending upon the Lipschitz bound on ff, which is mm), the transformation induces a new domain, whose top boundary will still be a graph. Let gg be the unique function which defines the transformed domain, i.e.

TDf+ψ=Dg.\displaystyle TD_{f+\psi}=D_{g}.

By construction, we have νf(X0)=νg(X0)\nu_{f}(X_{0})=\nu_{g}(X_{0}). On top of the previous restriction on ε2\varepsilon_{2}, we can choose it smaller so that gL2m\lVert\nabla g\rVert_{L^{\infty}}\leq 2m. This means that we can also make a choice of η\eta so that

TΓf+ψ=Γg,andg𝒦(δ/2,L+δ/2,2m,ρ~),\displaystyle T\Gamma_{f+\psi}=\Gamma_{g},\ \ \text{and}\ \ g\in\mathcal{K}(\delta/2,L+\delta/2,2m,\tilde{\rho}),

where the new modulus, ρ~\tilde{\rho} is simply ρ~(s)=ρ(Cs)\tilde{\rho}(s)=\rho(Cs), for a universal CC. Finally, we will enforce that η\eta satisfies

η1in[0,r0],andr0=cc|ψ(0)|,\displaystyle\eta\equiv 1\ \text{in}\ [0,r_{0}],\ \ \text{and}\ \ r_{0}=c\lVert\mathcal{R}\rVert\leq c\left|\nabla\psi(0)\right|, (6.3)

which is possible if ε2\varepsilon_{2} is small enough, depending upon δ\delta, LL, mm, ρ\rho.

We remark that these restrictions on ε2\varepsilon_{2} and the choice of η\eta will be such that the function gg, satisfies

|f(x)g(x)|Cε2ψL|x|ρ~(|x|),\displaystyle\left|f(x)-g(x)\right|\leq C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}\left|x\right|\tilde{\rho}(\left|x\right|), (6.4)

as by assumption, |ψ(0)|ε2\left|\nabla\psi(0)\right|\leq\varepsilon_{2}.

We will use three steps to estimate

|νf+ψUf+ψ(X0)νfUf(X0)|,\displaystyle\left|\partial_{\nu_{f+\psi}}U_{f+\psi}(X_{0})-\partial_{\nu_{f}}U_{f}(X_{0})\right|,

using the two additional auxiliary functions, VV and UgU_{g}. We emphasize that VV is not harmonic in all of DgD_{g}.

Step 1:

νf+ψUf+ψ(X0)=νgV(X0).\displaystyle\partial_{\nu_{f+\psi}}U_{f+\psi}(X_{0})=\partial_{\nu_{g}}V(X_{0}). (6.5)

Step 2:

|νgV(X0)νgUg(X0)|C|ψ(0)|.\displaystyle\left|\partial_{\nu_{g}}V(X_{0})-\partial_{\nu_{g}}U_{g}(X_{0})\right|\leq C\left|\nabla\psi(0)\right|. (6.6)

Step 3:

|νgUg(X0)νfUf(X0)|Cε2ψL.\displaystyle\left|\partial_{\nu_{g}}U_{g}(X_{0})-\partial_{\nu_{f}}U_{f}(X_{0})\right|\leq C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}. (6.7)

Step 1 follows by a direct calculation, by the definition of RR and that νg=Rνf+ψ\nu_{g}=R\nu_{f+\psi}.

Next, to establish step 2, we will use that fact that once η\eta is chosen, depending only on ε2\varepsilon_{2} and the collection δ\delta, LL, mm, ρ\rho, if η1\eta\equiv 1 on the interval [0,r0][0,r_{0}], then VV is harmonic in Br0(X0)DgB_{r_{0}}(X_{0})\cap D_{g}. We can then compare the respective normal derivatives of VV and UgU_{g} using a global Lipschitz estimate combined with the comparison principle. Indeed, both VV and UgU_{g} enjoy global Lipschitz estimates, for some CC that depends only on δ\delta, LL, mm, ρ\rho,

VL(Dg),UgL(Dg)C.\displaystyle\lVert\nabla V\rVert_{L^{\infty}(D_{g})},\ \lVert\nabla U_{g}\rVert_{L^{\infty}(D_{g})}\leq C.

Since on the upper part of (Br0(X0)Dg)\partial(B_{r_{0}}(X_{0})\cap D_{g}), we have

V=Ug0onBr0(X0)Γg,\displaystyle V=U_{g}\equiv 0\ \ \text{on}\ \ B_{r_{0}}(X_{0})\cap\Gamma_{g},

it follows from the Lipschitz estimates and (6.3) that

VUgL((Br0(X0)Dg))Cr0Cc|ψ(0)|.\displaystyle\lVert V-U_{g}\rVert_{L^{\infty}(\partial(B_{r_{0}}(X_{0})\cap D_{g}))}\leq Cr_{0}\leq Cc\left|\nabla\psi(0)\right|.

Since the function VUgV-U_{g} is harmonic in Br0(X0)DgB_{r_{0}}(X_{0})\cap D_{g} we can use linearly growing barriers for C1,DiniC^{1,\textnormal{Dini}} domains to deduce that for s>0s>0 and small enough,

|V(X0+sν)Ug(X0+sν)|CsVUgL(Br0(X0)Dg)\displaystyle\left|V(X_{0}+s\nu)-U_{g}(X_{0}+s\nu)\right|\leq Cs\lVert V-U_{g}\rVert_{L^{\infty}(B_{r_{0}}(X_{0})\cap D_{g})}
CsVUgL((Br0(X0)Dg))sC~r0sC~|ψ(0)|.\displaystyle\leq Cs\lVert V-U_{g}\rVert_{L^{\infty}(\partial(B_{r_{0}}(X_{0})\cap D_{g}))}\leq s\tilde{C}r_{0}\leq s\tilde{C}\left|\nabla\psi(0)\right|.

This establishes Step 2 after dividing by ss and taking s0s\to 0. (Note, these are the same type of barriers from Lemma A.1, and they can be combined with a transformation that flattens DgD_{g}.)

Now we finish with Step 3.

We can break up the estimate into two separate parts, for which we define the functions g1g_{1} and g2g_{2} as

g1=min{g,f},andg2=max{g,f}.\displaystyle g_{1}=\min\{g,f\},\ \ \text{and}\ \ g_{2}=\max\{g,f\}.

Notice that f(0)=g(0)=g1(0)=g2(0)\nabla f(0)=\nabla g(0)=\nabla g_{1}(0)=\nabla g_{2}(0), and so we will denote ν=νf(X0)=νg(X0)\nu=\nu_{f}(X_{0})=\nu_{g}(X_{0}). By construction, it follows that

Ug1(X0+sν)Uf(X0+sν)Ug(X0+sν)Uf(X0+sν)Ug2(X0+sν)Uf(X0+sν).\displaystyle U_{g_{1}}(X_{0}+s\nu)-U_{f}(X_{0}+s\nu)\leq U_{g}(X_{0}+s\nu)-U_{f}(X_{0}+s\nu)\leq U_{g_{2}}(X_{0}+s\nu)-U_{f}(X_{0}+s\nu).

The key improvement from this construction is that by the C1,DiniC^{1,\textnormal{Dini}} property of ff, and gg, owing to (6.4),

0g2(y)f(y)Cε2ψL|y|ρ~(|y|),\displaystyle 0\leq g_{2}(y)-f(y)\leq C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}\left|y\right|\tilde{\rho}(\left|y\right|),

and as noted in the assumptions, we know that the function |y|ρ~(y)\left|y\right|\tilde{\rho}(y) is actually in XρX_{\rho}. This is useful because g2fg_{2}-f will be Lipschitz, but may only enjoy a one sided modulus.

First, we will demonstrate the upper bound that comes from Ug2U_{g_{2}}. Defining the function, ψ~\tilde{\psi} as

ψ~=Cε2ψL|y|ρ~(|y|),\displaystyle\tilde{\psi}=C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}\left|y\right|\tilde{\rho}(\left|y\right|),

we see that ψ~\tilde{\psi} satisfies the assumptions of Lemma 6.3 (recall that we have defined the modulus so that |y|ρ(y)\left|y\right|\rho(y) is an element of XρX_{\rho}). Thus we have that

0\displaystyle 0 νUg(X0)νUf(X0)\displaystyle\leq\partial_{\nu}U_{g}(X_{0})-\partial_{\nu}U_{f}(X_{0})
νUf+(g2f)(X0)νUf(X0)\displaystyle\leq\partial_{\nu}U_{f+(g_{2}-f)}(X_{0})-\partial_{\nu}U_{f}(X_{0})
νUf+ψ~(X0)νUf(X0)\displaystyle\leq\partial_{\nu}U_{f+\tilde{\psi}}(X_{0})-\partial_{\nu}U_{f}(X_{0})
C(R0αψ~L+ΓfBR0(X0)ψ~(y)|X0Y|n1𝑑Y)\displaystyle\leq C\left(R_{0}^{-\alpha}\lVert\tilde{\psi}\rVert_{L^{\infty}}+\int_{\Gamma_{f}\setminus B_{R_{0}}(X_{0})}\tilde{\psi}(y)\left|X_{0}-Y\right|^{-n-1}dY\right)
C(R0αCε2ψL+Cε2ψLΓfBR0(X0)ρ~(|y|)|X0Y|n𝑑Y)\displaystyle\leq C\left(R_{0}^{-\alpha}C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}+C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}\int_{\Gamma_{f}\setminus B_{R_{0}}(X_{0})}\tilde{\rho}(\left|y\right|)\left|X_{0}-Y\right|^{-n}dY\right)
C~ε2ψL.\displaystyle\leq\tilde{C}\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}.

The lower bound follows similarly, but we instead use the inequality

fψ~=fCε2ψL|y|ρ~(|y|)g1f,\displaystyle f-\tilde{\psi}=f-C\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}\left|y\right|\tilde{\rho}(\left|y\right|)\leq g_{1}\leq f,

so that

0νUfνUg1νUfνUfψ~.\displaystyle 0\leq\partial_{\nu}U_{f}-\partial_{\nu}U_{g_{1}}\leq\partial_{\nu}U_{f}-\partial_{\nu}U_{f-\tilde{\psi}}.

Thus, we can invoke Lemma 6.3 with ff replaced by fψ~f-\tilde{\psi}, ψ=ψ~\psi=\tilde{\psi}, and f+ψ~f+\tilde{\psi} replaced by ff. The rest of the calculation is the same. This completes Step 3 and the proof of the lemma.

Because the operator, II, is translation invariant, it is useful to define an auxiliary operator, fixed at x=0x=0.

Definition 6.7.

The functional, ii, is defined as

i:𝒦(δ,L,m,ρ),i(f):=I(f,0),\displaystyle i:\mathcal{K}(\delta,L,m,\rho)\to\mathbb{R},\ \ i(f):=I(f,0),

and analogously, using (1.6) and (1.9), we have

i+(f)=i(f)=I+(f,0)=I(f,0),andi(f)=I(f,0).\displaystyle i^{+}(f)=i(f)=I^{+}(f,0)=I(f,0),\ \ \text{and}\ \ i^{-}(f)=I^{-}(f,0).
Lemma 6.8.

There exists a constant, CC depending upon δ\delta, LL, mm, ρ\rho so that if 0<ε<δ/20<\varepsilon<\delta/2, is a constant and f𝒦(δ,L,m,ρ)f\in\mathcal{K}(\delta,L,m,\rho), then

i+(f)Cεi+(f+ε)i+(f)\displaystyle i^{+}(f)-C\varepsilon\leq i^{+}(f+\varepsilon)\leq i^{+}(f)

and

i(f)+Cεi(f+ε)i(f).\displaystyle i^{-}(f)+C\varepsilon\geq i^{-}(f+\varepsilon)\geq i^{-}(f).
Proof of Lemma 6.8.

We note that the restriction on ε\varepsilon is simply to keep both i+(f)i^{+}(f) and i(f)i^{-}(f) well-defined. If we were working with i+i^{+} only, no restriction on ε\varepsilon would be necessary. Furthermore, we will only establish the inequalities as they pertain to i+i^{+}. The corresponding pair of inequalities for ii^{-} are analogous.

We first translate the function, Uf+εU_{f+\varepsilon}, down so that it vanishes on Γf\Gamma_{f}. To this end, we define

V(X):=Uf+ε(x,xd+1+ε),\displaystyle V(X):=U_{f+\varepsilon}(x,x_{d+1}+\varepsilon),

so that VV is defined in Df+D^{+}_{f}, and V=0V=0 on Γf\Gamma_{f}. As Uf+ε1U_{f+\varepsilon}\leq 1, we see that V1V\leq 1 on {xd+1=0}\{x_{d+1}=0\}. This and the comparison principle implies that VUfV\leq U_{f} in DfD_{f}, and hence,

νV(X0)νUf(X0).\displaystyle\partial_{\nu}V(X_{0})\leq\partial_{\nu}U_{f}(X_{0}).

But νV(0,f(0))=νUf+ε(0,f(0)+ε)=i+(f+ε)\partial_{\nu}V(0,f(0))=\partial_{\nu}U_{f+\varepsilon}(0,f(0)+\varepsilon)=i^{+}(f+\varepsilon). This establishes the second inequality.

For the first inequality, we note that Uf+εU_{f+\varepsilon} enjoys a uniform Lipschitz estimate depending on δ\delta, LL, mm, ρ\rho. Thus, there is a universal CC so that, in particular,

on{xd+1=0}, 1CεV1.\displaystyle\text{on}\ \{x_{d+1}=0\},\ \ 1-C\varepsilon\leq V\leq 1.

Thus 0(UfV)Cε0\leq(U_{f}-V)\leq C\varepsilon everywhere in DfD_{f}. Again, by the universal Lipschitz estimate, we see that

0ν(UfV)(0,f(0))Cε.\displaystyle 0\leq\partial_{\nu}(U_{f}-V)(0,f(0))\leq C\varepsilon.

Hence, this shows that

i+(f)i+(f+ε)Cε,\displaystyle i^{+}(f)-i^{+}(f+\varepsilon)\leq C\varepsilon,

which gives the first inequality of the Lemma.

Although not used until the next subsection, it will be worthwhile to record a result about ii which is an immediate consequence of Corollary 6.4.

Lemma 6.9.

If ff and ψ\psi are functions as in Lemma 6.3, then for the same constants as in Corollary 6.4,

1C(BR0ψ(y)|y|n1𝑑y)i+(f+ψ)i+(f)C(R0αψL(nBR0)+BR0ψ(y)|y|n1𝑑y),\displaystyle\frac{1}{C}\left(\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\right)\leq i^{+}(f+\psi)-i^{+}(f)\leq C\left(R_{0}^{-\alpha}\lVert\psi\rVert_{L^{\infty}(\mathbb{R}^{n}\setminus B_{R_{0}})}+\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\right),

and

1C(BR0ψ(y)|y|n1𝑑y)i(f)i(f+ψ)C(R0αψL(nBR0)+BR0ψ(y)|y|n1𝑑y).\displaystyle\frac{1}{C}\left(\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\right)\leq i^{-}(f)-i^{-}(f+\psi)\leq C\left(R_{0}^{-\alpha}\lVert\psi\rVert_{L^{\infty}(\mathbb{R}^{n}\setminus B_{R_{0}})}+\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\right).

We are now in a position to prove Proposition 6.1.

Proof of Proposition 6.1.

We first note that we will choose parameters, ε1\varepsilon_{1} and ε2\varepsilon_{2}, depending upon δ\delta, LL, mm, and ρ\rho so that we establish the proposition whenever

fgLε1,andfgLε2.\displaystyle\lVert f-g\rVert_{L^{\infty}}\leq\varepsilon_{1},\ \ \text{and}\ \ \lVert\nabla f-\nabla g\rVert_{L^{\infty}}\leq\varepsilon_{2}. (6.8)

Assuming we have already proved the proposition under this restriction on fgf-g, we see that we can choose the Lipschitz constant to also depend upon ε1\varepsilon_{1} and ε2\varepsilon_{2}. Indeed if either fg>ε1\lVert f-g\rVert>\varepsilon_{1} or fg>ε2\lVert\nabla f-\nabla g\rVert>\varepsilon_{2}, since II is bounded on 𝒦(δ,L,m,ρ)\mathcal{K}(\delta,L,m,\rho), we see that

I(f)I(g)LI(f)L+I(g)L2C2C(ε11fgL+ε21fgL)\displaystyle\lVert I(f)-I(g)\rVert_{L^{\infty}}\leq\lVert I(f)\rVert_{L^{\infty}}+\lVert I(g)\rVert_{L^{\infty}}\leq 2C\leq 2C(\varepsilon_{1}^{-1}\lVert f-g\rVert_{L^{\infty}}+\varepsilon_{2}^{-1}\lVert\nabla f-\nabla g\rVert_{L^{\infty}})

(as under the assumption on fgf-g, 1<(ε11fg+ε21fg)1<(\varepsilon_{1}^{-1}\lVert f-g\rVert+\varepsilon_{2}^{-1}\lVert\nabla f-\nabla g\rVert)).

Now, we explain how to choose ε1\varepsilon_{1} and ε2\varepsilon_{2} and establish the proposition under (6.8). We note that with ii as in Definition 6.7, by translation invariance,

I(f,x)=i(τxf).\displaystyle I(f,x)=i(\tau_{x}f).

Thus, we will establish that ii is Lipschitz.

Let us assume, without loss of generality that f(0)>g(0)f(0)>g(0). First, we take

ε=f(0)g(0),\displaystyle\varepsilon=f(0)-g(0),

and we define the new function,

g~=g+ε.\displaystyle\tilde{g}=g+\varepsilon.

Since f,g𝒦(δ,L,m,ρ)f,g\in\mathcal{K}(\delta,L,m,\rho), we can choose the parameter, ε1<δ/2\varepsilon_{1}<\delta/2, so that

g~𝒦(δ/2,L,m,ρ).\displaystyle\tilde{g}\in\mathcal{K}(\delta/2,L,m,\rho).

Next, we take ε2\varepsilon_{2} to be the parameter from Lemma 6.6 that corresponds to the set 𝒦(δ/2,L,m,ρ)\mathcal{K}(\delta/2,L,m,\rho). Under this assumption, we see that ψ=g~f\psi=\tilde{g}-f satisfies the assumptions of Lemma 6.6. Hence, because by definition i(f)=νUf(X0)i(f)=\partial_{\nu}U_{f}(X_{0}), we see that

|i(g~)i(f)|C|(g~f)(0)|+ε2g~fLC(fgL+fgL).\displaystyle\left|i(\tilde{g})-i(f)\right|\leq C\left|\nabla(\tilde{g}-f)(0)\right|+\varepsilon_{2}\lVert\tilde{g}-f\rVert_{L^{\infty}}\leq C(\lVert f-g\rVert_{L^{\infty}}+\lVert\nabla f-\nabla g\rVert_{L^{\infty}}).

Furthermore, Lemma 6.8 shows that

|i(g~)i(g)|C|f(0)g(0)|CfgL.\displaystyle\left|i(\tilde{g})-i(g)\right|\leq C\left|f(0)-g(0)\right|\leq C\lVert f-g\rVert_{L^{\infty}}.

This shows that ii is Lipschitz, and hence also II.

6.2. Analysis For HH

Because of the assumptions on GG, the following corollary is immediate from Proposition 6.1 and Corollary 6.2, recalling that I+=II^{+}=I.

Corollary 6.10.

For each δ\delta, LL, mm, ρ\rho fixed, HH is a Lipschitz mapping,

H:𝒦(δ,L,m,ρ)Cb0(n),\displaystyle H:\mathcal{K}(\delta,L,m,\rho)\to C^{0}_{b}(\mathbb{R}^{n}),

and the Lipschitz norm of HH depends upon all of δ\delta, LL, mm, ρ\rho.

The results of Lemma 6.3 and Corollary 6.4 are also used in building appropriate finite dimensional approximations to II and HH. We note that HH also enjoys these properties.

Lemma 6.11.

The results in Lemma 6.3, Corollary 6.4, and Lemma 6.9 hold for the operator,

h(f)=H(f,0).\displaystyle h(f)=H(f,0). (6.9)
Proof of Lemma 6.11.

Since ψ(0)=0\nabla\psi(0)=0, we have (f+tψ)(0)=f(0)\nabla(f+t\psi)(0)=\nabla f(0) for all t0t\geq 0. Consequently,

h(f+ψ)h(f)=(G(i+(f+ψ),i(f+ψ))G(i+(f),i(f)))1+|f(0)|2.h(f+\psi)-h(f)=\left(G(i^{+}(f+\psi),i^{-}(f+\psi))-G(i^{+}(f),i^{-}(f))\right)\sqrt{1+\left|\nabla{f}(0)\right|^{2}}.

We proceed to estimate G(i+(f+ψ),i(f+ψ))G(i+(f),i(f))G(i^{+}(f+\psi),i^{-}(f+\psi))-G(i^{+}(f),i^{-}(f)). First, observe that i+(f+ψ)i+(f)i^{+}(f+\psi)\geq i^{+}(f) and i(f+ψ)i(f)i^{-}(f+\psi)\leq i^{-}(f). By the assumptions on GG, we thus have

G(i+(f+ψ),i(f+ψ))G(i+(f),i(f))\displaystyle G(i^{+}(f+\psi),i^{-}(f+\psi))-G(i^{+}(f),i^{-}(f))
=\displaystyle=\ G(i+(f+ψ),i(f+ψ))G(i+(f),i(f+ψ))+G(i+(f),i(f+ψ))G(i+(f),i(f))\displaystyle G(i^{+}(f+\psi),i^{-}(f+\psi))-G(i^{+}(f),i^{-}(f+\psi))+G(i^{+}(f),i^{-}(f+\psi))-G(i^{+}(f),i^{-}(f))
\displaystyle\geq\ λ(i+(f+ψ)i+(f))+λ(i(f)i(f+ψ)).\displaystyle\lambda\left(i^{+}(f+\psi)-i^{+}(f)\right)+\lambda\left(i^{-}(f)-i^{-}(f+\psi)\right).

Similarly,

G(i+(f+ψ),i(f+ψ))G(i+(f),i(f))Λ(i+(f+ψ)i+(f))+Λ(i(f)i(f+ψ)).G(i^{+}(f+\psi),i^{-}(f+\psi))-G(i^{+}(f),i^{-}(f))\leq\Lambda\left(i^{+}(f+\psi)-i^{+}(f)\right)+\Lambda\left(i^{-}(f)-i^{-}(f+\psi)\right).

The claim now follows from Lemma 6.3 and Corollary 6.4, where we note the factor 1+|f(0)|2\sqrt{1+\left|\nabla f(0)\right|^{2}} is controlled by (1+m)(1+m), and so can be absorbed into the constant in the resulting inequalities.

The extension of the results in Lemma 6.3, Corollary 6.4, and Lemma 6.9 to the operator HH also implies that the remaining key result above applies to HH as well. We omit the proof, as it follows the same adaptations as in the previous Lemma.

Lemma 6.12.

The results in Lemma 6.6 hold for the operator, h(f)=H(f,0)h(f)=H(f,0).

7. Proof of Theorem 1.1

Before we can prove Theorem 1.1, we must make a number of observations about how II behaves with respect to some positive perturbations in Xρ,0X_{\rho,0} and especially what this behavior implies for the linear operators in 𝒟I\mathcal{D}_{I} (recall 𝒟I\mathcal{D}_{I} is in Definition 5.19, which is applicable since II is Lipschitz). Then we will show how these properties carry over to HH, and finally we will collect the ideas to finish the proof of Theorem 1.1.

7.1. Estimates on the Lévy measures for II and HH

We will show that once a Lipschitz operator, JJ, with the GCP enjoys bounds similar to those in Lemma 6.3 and Corollary 6.4, then as a consequence, its resulting linear supporting operators are comparable to a modified 1/2-Laplacian, and subsequently the corresponding Lévy measures have a density that is comparable to the 1/2-Laplacian. The main result in this direction is Proposition 7.5. Basically, the analysis we use follows almost exactly as some arguments in [43, Section 4.6] regarding inequalities for extremal operators and linear functionals in the min-max representation.

For this subsection, we will assume that JJ is an operator as in Section 5, and assume further that JJ satisfies the assumptions of Theorem 5.24 and the conclusion of Lemma 6.3.

We can utilize the translation invariance of JJ to focus on linear functionals via evaluation at x=0x=0.

Definition 7.1.
𝒟J(0)={(Xρ):LDJ,s.t.f𝒦,(f)=L(f,0)}.\displaystyle\mathcal{D}_{J}(0)=\{\ell\in\left(X_{\rho}\right)^{*}\ :\ \exists\ L\in D_{J},\ \textnormal{s.t.}\ \forall\ f\in\mathcal{K},\ \ell(f)=L(f,0)\}. (7.1)

We will compare the linear support functionals of JJ to a modified version of the 1/2-Laplacian, which we define here.

Definition 7.2.

With the constant, R0R_{0} as in Theorem 4.1, the linear operator, LΔL_{\Delta}, is defined as

LΔ(f,x)=BR0δhf(x)|h|n1𝑑h,\displaystyle L_{\Delta}(f,x)=\int_{B_{R_{0}}}\delta_{h}f(x)\left|h\right|^{-n-1}dh,

which is well defined for all fXρf\in X_{\rho}. Note, this is simply the 1/2-Laplacian, but computed with a truncated kernel.

Lemma 7.3.

Let R0R_{0} be the constant in Theorem 4.1. There exists a constant, C>0C>0, so that if JJ is an operator that satisfies the assumptions of Theorem 5.24 as well as the outcome of Lemma 6.3, JmJ^{m} and (LΔ)m\left(L_{\Delta}\right)^{m} are the finite dimensional approximations to JJ and LΔL_{\Delta}, defined in (5.4), then for all xGmx\in G^{m} and ψXρ,xC1,α(n)\psi\in X_{\rho,x}\cap C^{1,\alpha}(\mathbb{R}^{n}), with f+ψ𝒦f+\psi\in\mathcal{K} and supp(ψ)BR0(x)\textnormal{supp}(\psi)\subset B_{R_{0}}(x),

ChmβψC1,α+1C(LΔ)m(ψ,x)Jm(f+ψ,x)Jm(f,x)C(LΔ)m(ψ,x)+ChmβψC1,α,\displaystyle-Ch_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}+\frac{1}{C}\left(L_{\Delta}\right)^{m}(\psi,x)\leq J^{m}(f+\psi,x)-J^{m}(f,x)\leq C\left(L_{\Delta}\right)^{m}(\psi,x)+Ch_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}},

where hmh_{m} is the grid size parameter from Definition 5.1 and β\beta is the exponent from Lemma 5.16.

Proof of Lemma 7.3.

We note that by the translation invariance of both JJ and LΔL_{\Delta}, it suffices to prove this result for x=0x=0 (see Lemma 5.20). We need to utilize Lemma 5.16 because we will also use Lemma 6.3 and Corollary 6.4, which require ψ\psi to be non-negative. Given such a ψ\psi, it is not true in general that πmψ0\pi_{m}\psi\geq 0, but we can correct with a quantifiable remainder. This is what follows.

Let RR be the function, Rα,m,w,0R_{\alpha,m,w,0} which results from Lemma 5.16 when applied to ψ\psi. Let ψ~m\tilde{\psi}_{m} be the function,

ψ~m=πmψ+R,\displaystyle\tilde{\psi}_{m}=\pi_{m}\psi+R,

so that ψ~mXρ,0\tilde{\psi}_{m}\in X_{\rho,0}.

This means we can apply Corollary 6.4 to πmf+ψ~m\pi_{m}f+\tilde{\psi}_{m}, and this gives

1CBR0ψ~m(y)|y|n1𝑑yJ(πmf+ψ~m,0)J(πmf,0)CBR0ψ~m(y)|y|n1𝑑y,\displaystyle\frac{1}{C}\int_{B_{R_{0}}}\tilde{\psi}_{m}(y)\left|y\right|^{-n-1}dy\leq J(\pi_{m}f+\tilde{\psi}_{m},0)-J(\pi_{m}f,0)\leq C\int_{B_{R_{0}}}\tilde{\psi}_{m}(y)\left|y\right|^{-n-1}dy,

and hence since ψ~m(0)=0\tilde{\psi}_{m}(0)=0 and ψ~(0)=0\nabla\tilde{\psi}(0)=0,

1CLΔ(ψ~m,0)J(πmf+ψ~m,0)J(πmf,0)CLΔ(ψ~m,0).\displaystyle\frac{1}{C}L_{\Delta}(\tilde{\psi}_{m},0)\leq J(\pi_{m}f+\tilde{\psi}_{m},0)-J(\pi_{m}f,0)\leq CL_{\Delta}(\tilde{\psi}_{m},0).

Using the continuity of LΔL_{\Delta} over C1,α/2C^{1,\alpha/2} as well as the Lipschitz nature of JJ over XρX_{\rho} and C1,αC^{1,\alpha} (recall that πmψXρC1,α\pi_{m}\psi\in X_{\rho}\cap C^{1,\alpha} as well as Remark 5.17), we obtain

C~RC1,α/2+1CLΔ(πmψ,0)J(πmf+πmψ,0)J(πmf,0)CLΔ(πmψ,0)+C~RC1,α/2.\displaystyle-\tilde{C}\lVert R\rVert_{C^{1,\alpha/2}}+\frac{1}{C}L_{\Delta}(\pi_{m}\psi,0)\leq J(\pi_{m}f+\pi_{m}\psi,0)-J(\pi_{m}f,0)\leq CL_{\Delta}(\pi_{m}\psi,0)+\tilde{C}\lVert R\rVert_{C^{1,\alpha/2}}.

Invoking Lemma 5.16, for the parameter, β\beta in Lemma 5.16,

C~hmβψC1,α+1CLΔ(πmψ,0)J(πmf+πmψ,0)J(πmf,0)CLΔ(πmψ,0)+C~hmβψC1,α.\displaystyle-\tilde{C}h_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}+\frac{1}{C}L_{\Delta}(\pi_{m}\psi,0)\leq J(\pi_{m}f+\pi_{m}\psi,0)-J(\pi_{m}f,0)\leq CL_{\Delta}(\pi_{m}\psi,0)+\tilde{C}h_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}.

Finally, using the fact that the operator Em0TmE_{m}^{0}\circ T_{m} is linear, preserves ordering, and agrees with its input function over GmG^{m}, we see that applying Em0TmE^{0}_{m}\circ T_{m} to each of the operators in the last inequality, not evaluated at x=0x=0, and then relabeling constants and evaluating at x=0Gmx=0\in G^{m},

ChmβψC1,α+1C(LΔ)m(ψ,0)Jm(f+ψ,0)Jm(f,0)C(LΔ)m(ψ,0)+C~hmβψC1,α.\displaystyle-Ch_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}+\frac{1}{C}\left(L_{\Delta}\right)^{m}(\psi,0)\leq J^{m}(f+\psi,0)-J^{m}(f,0)\leq C\left(L_{\Delta}\right)^{m}(\psi,0)+\tilde{C}h_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}.

Corollary 7.4.

If JJ satisfies the assumptions of Theorem 5.24 as well as the outcome of Lemma 6.3, then for all L𝒟JL\in\mathcal{D}_{J}, the constant CC and functions ψ\psi as in Lemma 7.3,

1CLΔ(ψ,x)L(ψ,x)CLΔ(ψ,x).\displaystyle\frac{1}{C}L_{\Delta}(\psi,x)\leq L(\psi,x)\leq CL_{\Delta}(\psi,x).
Proof of Corollary 7.4.

We recall that 𝒟J\mathcal{D}_{J} is a convex hull of limits of linear operators that are derivatives of JmJ^{m}. Thus, it suffices to prove the result for fXρf\in X_{\rho} so that f=limfmf=\lim f_{m} and that JmJ^{m} is differentiable at fmf_{m}, i.e.

ψ,DJm(fm)[ψ]=lims0Jm(fm+sψ)Jm(fm)s,\displaystyle\forall\ \psi,\ DJ^{m}(f_{m})[\psi]=\lim_{s\to 0}\frac{J^{m}(f_{m}+s\psi)-J^{m}(f_{m})}{s},

and

L=limmDJm(fm).\displaystyle L=\lim_{m\to\infty}DJ^{m}(f_{m}).

Thus, for all ψ\psi satisfying the requirements of Lemma 7.3, we see that

ChmβψC1,α+1C(LΔ)m(ψ)DJm(fm)[ψ]C(LΔ)m(ψ)+ChmβψC1,α.\displaystyle-Ch_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}+\frac{1}{C}\left(L_{\Delta}\right)^{m}(\psi)\leq DJ^{m}(f_{m})[\psi]\leq C\left(L_{\Delta}\right)^{m}(\psi)+Ch_{m}^{\beta}\lVert\psi\rVert_{C^{1,\alpha}}.

We can now take limits as m0m\to 0, using that hm0h_{m}\to 0 and Proposition 5.15 that shows (LΔ)mLΔ(L_{\Delta})^{m}\to L_{\Delta}, to conclude the result of the corollary for such LL, fmf_{m}, and ff. Since these inequalities are stable under convex combinations, we are finished.

Just as above, thanks to translation invariance, we have the luxury of focusing on all of the operators in 𝒟J\mathcal{D}_{J} evaluated at x=0x=0. Thus, as an immediate consequence of Corollary 7.4, we obtain the following result.

Proposition 7.5.

For all 𝒟J(0)\ell\in\mathcal{D}_{J}(0), for f𝒦f\in\mathcal{K}, for ψXρ,0\psi\in X_{\rho,0} with supp(ψ)BR0\textnormal{supp}(\psi)\subset B_{R_{0}},

1CBR0ψ(y)|y|n1𝑑y(ψ)CBR0ψ(y)|y|n1𝑑y,\displaystyle\frac{1}{C}\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy\leq\ell(\psi)\leq C\int_{B_{R_{0}}}\psi(y)\left|y\right|^{-n-1}dy,
Corollary 7.6.

If 𝒟J(0)\ell\in\mathcal{D}_{J}(0), and μ\mu_{\ell} is the Lévy measure corresponding to \ell from Theorem 5.24, then there exists a function, KK_{\ell} so that

μ(E)=EK(h)𝑑h,\displaystyle\mu_{\ell}(E)=\int_{E}K_{\ell}(h)dh,

and

hBR0{0},1C|h|n1K(h)C|h|n1.\displaystyle\forall\ \ h\in B_{R_{0}}\setminus\{0\},\ \ \frac{1}{C}\left|h\right|^{-n-1}\leq K_{\ell}(h)\leq C\left|h\right|^{-n-1}.
Proof of Corollary 7.6.

We recall the structure of \ell from Theorem 5.24 and the fact that for any ψ\psi as in Lemma 7.3 we have ψ(0)=0\psi(0)=0 and ψ(0)=0\nabla\psi(0)=0, so that

(ψ)=nψ(h)μ(dh).\displaystyle\ell(\psi)=\int_{\mathbb{R}^{n}}\psi(h)\mu_{\ell}(dh).

Hence, for each fixed rr, from Proposition 7.5 we can already deduce that μ\mu_{\ell} has a density in BR0BrB_{R_{0}}\setminus B_{r}, and that this density must inherit the bounds given in Proposition 7.5. Hence the Corollary holds for the measure μ\mu_{\ell}, restricted to BR0(0)Br(0)B_{R_{0}}(0)\setminus B_{r}(0). Since r>0r>0 was arbitrary, we see that there will be a density on the set BR0(0){0}B_{R_{0}}(0)\setminus\{0\} and that the required bounds still follow from Proposition 7.5.

7.2. Estimates on the drift

Just as the estimates for the Lévy measures corresponding to a mapping, JJ, depended upon a variant of the inequality of Lemma 6.3 being inherited by the finite dimensional approximations, so too will the proof here for the estimate on the drift. This time, we need a finite dimensional version of Lemma 6.6.

Lemma 7.7.

With CC, ε2\varepsilon_{2}, ff, ψ\psi as in Lemma 6.6, we also have

|Jm(f+ψ,0)Jm(f,0)|C(|ψ(0)|+ε2ψL).\displaystyle\left|J^{m}(f+\psi,0)-J^{m}(f,0)\right|\leq C\left(\left|\nabla\psi(0)\right|+\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}\right).
Proof of Lemma 7.7.

Applying Lemma 6.6 or Lemma 6.12 to πmf\pi_{m}f and πmψ\pi_{m}\psi, we obtain

|J(πmf+πmψ,0)J(πmf,0)|C(|(πmψ)(0)|+ε2πmψL).|J(\pi_{m}f+\pi_{m}\psi,0)-J(\pi_{m}f,0)|\leq C\left(|\nabla(\pi_{m}\psi)(0)|+\varepsilon_{2}||\pi_{m}\psi||_{L^{\infty}}\right).

We next apply Theorem 5.11 to bound πmψL||\pi_{m}\psi||_{L^{\infty}} with respect to ψL||\psi||_{L^{\infty}}. Also, since πm\pi_{m} agrees up to first order with its input function on GmG_{m}, and because 0Gm0\in G_{m}, we see that (πmψ)(0)=ψ(0)\nabla(\pi_{m}\psi)(0)=\nabla\psi(0). Finally, using the fact that Em0TmE^{0}_{m}\circ T_{m} is order-preserving and agrees with its input function on GmG_{m}, we obtain the desired estimate. ∎

With this information in hand, we need to address how the drift and Lévy measures given in Theorem 5.24 relate to each other, particularly in the context of the assumptions in Section 4.2. To this end, fix ene\in\mathbb{R}^{n}, |e|=1|e|=1, and a smooth cutoff function ηCc(n)\eta\in C^{\infty}_{c}(\mathbb{R}^{n}) between B1/2B_{1/2} and B1B_{1}. We define the functions, for 0<τr0<\tau\leq r,

ϕ(y)=(ey)η(y)andϕτ,r(y):=τrϕ(yr).\displaystyle\phi(y)=(e\cdot y)\eta(y)\ \ \ \ \text{and}\ \ \ \ \phi_{\tau,r}(y):=\tau r\phi\left(\frac{y}{r}\right). (7.2)

A crucial property of ϕτ,r\phi_{\tau,r} is given in the next lemma.

Lemma 7.8.

There exists a constant, CC, depending on δ\delta, LL, mm, ρ\rho, such that if

𝒟J(0),with(f)=cf(0)+bf(0)+nδhf(0)K(h)𝑑h,\displaystyle\ell\in\mathcal{D}_{J}(0),\ \ \text{with}\ \ell(f)=c_{\ell}f(0)+b_{\ell}\cdot\nabla f(0)+\int_{\mathbb{R}^{n}}\delta_{h}f(0)K_{\ell}(h)dh,

then for ϕτ,r\phi_{\tau,r} defined in (7.2),

|(ϕτ,r)|Cτ for all τr.\displaystyle|\ell(\phi_{\tau,r})|\leq C\tau\quad\text{ for all }\tau\leq r.
Proof of Lemma 7.8.

First, we list a number of properties of ϕτ,r\phi_{\tau,r}.

  1. (i)

    ϕτ,r(0)=0\phi_{\tau,r}(0)=0 and ϕτ,r(0)=τe\nabla\phi_{\tau,r}(0)=\tau e.

  2. (ii)

    There exists a universal constant C>0C^{\prime}>0 such that |ϕτ,r(y)|Cτ|\nabla\phi_{\tau,r}(y)|\leq C^{\prime}\tau for all ysupp(ϕτ,r)y\in\textnormal{supp}(\phi_{\tau,r}).

  3. (iii)

    If η\eta is Cρ1,DiniC^{1,\text{Dini}}_{\rho}, then ϕτ,r\phi_{\tau,r} is Cρ1,DiniC^{1,\text{Dini}}_{\rho}. Indeed, by the concavity of ρ\rho, and since τr\tau\leq r, we see that for any xnx\in\mathbb{R}^{n} and yBr(x)y\in B_{r}(x), we have

    |ϕτ,r(x+y)ϕτ,r(x)ϕτ,r(x)y|\displaystyle|\phi_{\tau,r}(x+y)-\phi_{\tau,r}(x)-\nabla\phi_{\tau,r}(x)\cdot y|
    =|(τe(x+y))η(x+yr)(τex)η(xr)(τey)η(xr)(τexr)(η(xr)y)|\displaystyle=\bigg{|}(\tau e\cdot(x+y))\eta\left(\frac{x+y}{r}\right)-(\tau e\cdot x)\eta\left(\frac{x}{r}\right)-(\tau e\cdot y)\eta\left(\frac{x}{r}\right)-\left(\tau e\cdot\frac{x}{r}\right)\left(\nabla\eta\left(\frac{x}{r}\right)\cdot y\right)\bigg{|}
    =|(τe(x+y))(η(x+yr)η(xr))(τex)(η(xr)yr)|\displaystyle=\bigg{|}(\tau e\cdot(x+y))\left(\eta\left(\frac{x+y}{r}\right)-\eta\left(\frac{x}{r}\right)\right)-\left(\tau e\cdot x\right)\left(\nabla\eta\left(\frac{x}{r}\right)\cdot\frac{y}{r}\right)\bigg{|}
    =|(τe(x+y))(η(x+yr)η(xr)η(xr)yr)+(τey)(η(xr)yr)|\displaystyle=\bigg{|}(\tau e\cdot(x+y))\left(\eta\left(\frac{x+y}{r}\right)-\eta\left(\frac{x}{r}\right)-\nabla\eta\left(\frac{x}{r}\right)\cdot\frac{y}{r}\right)+\left(\tau e\cdot y\right)\left(\nabla\eta\left(\frac{x}{r}\right)\cdot\frac{y}{r}\right)\bigg{|}
    τ|x+y||yr|ρ(yr)+τrηL(n)|y|2\displaystyle\leq\tau|x+y|\bigg{|}\frac{y}{r}\bigg{|}\rho\left(\frac{y}{r}\right)+\frac{\tau}{r}||\nabla\eta||_{L^{\infty}(\mathbb{R}^{n})}|y|^{2}
    |y|τρ(yr)+ηL(n)|y|2\displaystyle\leq|y|\tau\rho\left(\frac{y}{r}\right)+||\nabla\eta||_{L^{\infty}(\mathbb{R}^{n})}|y|^{2}
    |y|τrρ(y)+ηL(n)|y|2\displaystyle\leq|y|\frac{\tau}{r}\rho(y)+||\nabla\eta||_{L^{\infty}(\mathbb{R}^{n})}|y|^{2}
    |y|(ρ(y)+C|y|).\displaystyle\leq|y|(\rho(y)+C|y|).

Without loss of generality, we can assume that for |y|1\left|y\right|\leq 1, |y|ρ(|y|)\left|y\right|\leq\rho(\left|y\right|).

In order to conclude the bound on (ϕτ,r)\ell(\phi_{\tau,r}), we look to Lemma 7.7. This shows that for all 𝒟J(0)\ell\in\mathcal{D}_{J}(0), for all ψXρ\psi\in X_{\rho}

|(ψ)|C(|ψ(0)|+ε2ψL),\displaystyle\left|\ell(\psi)\right|\leq C(\left|\nabla\psi(0)\right|+\varepsilon_{2}\lVert\psi\rVert_{L^{\infty}}),

where CC is a constant that depends only on δ\delta, LL, mm, ρ\rho. Thus as ϕτ,rXρ\phi_{\tau,r}\in X_{\rho}, applying this to ψ=ϕτ,r\psi=\phi_{\tau,r} shows |(ϕτ,r)|Cτ\left|\ell(\phi_{\tau,r})\right|\leq C\tau.

We are finally ready to prove the estimates on the drift.

Lemma 7.9.

There exists a constant, CC, depending on δ\delta, LL, mm, ρ\rho, such that if r0r_{0} and δhf\delta_{h}f are as in Theorem 5.24,

𝒟J(0),with(f)=cf(0)+bf(0)+nδhf(0)K(h)𝑑h,\displaystyle\ell\in\mathcal{D}_{J}(0),\ \ \text{with}\ \ell(f)=c_{\ell}f(0)+b_{\ell}\cdot\nabla f(0)+\int_{\mathbb{R}^{n}}\delta_{h}f(0)K_{\ell}(h)dh,

then for 0<r<r00<r<r_{0},

|bBr0BrhK(h)𝑑h|C.\displaystyle\left|b_{\ell}-\int_{B_{r_{0}}\setminus B_{r}}hK_{\ell}(h)dh\right|\leq C.
Proof of Lemma 7.9.

Fix ene\in\mathbb{R}^{n}, |e|=1|e|=1 and 0<τr<r00<\tau\leq r<r_{0}. Consider the function ϕτ,r\phi_{\tau,r} defined above. We have

(ϕτ,r)\displaystyle\ell(\phi_{\tau,r}) =τ(be)+n(τeh)[η(hr)𝟙Br0(h)]K(h)𝑑h\displaystyle=\tau(b_{\ell}\cdot e)+\int_{\mathbb{R}^{n}}(\tau e\cdot h)\left[\eta\left(\frac{h}{r}\right)-{\mathbbm{1}}_{B_{r_{0}}}(h)\right]\ K_{\ell}(h)dh
=τ(be+Br0\Br/2(eh)[η(hr)1]K(h)𝑑h)\displaystyle=\tau\left(b_{\ell}\cdot e+\int_{B_{r_{0}}\backslash B_{r/2}}(e\cdot h)\left[\eta\left(\frac{h}{r}\right)-1\right]\ K_{\ell}(h)dh\right)
=τ(be+Br0\Br(eh)[η(hr)1]K(h)𝑑h+Br\Br/2(eh)[η(hr)1]K(h)𝑑h)\displaystyle=\tau\left(b_{\ell}\cdot e+\int_{B_{r_{0}}\backslash B_{r}}(e\cdot h)\left[\eta\left(\frac{h}{r}\right)-1\right]\ K_{\ell}(h)dh+\int_{B_{r}\backslash B_{r/2}}(e\cdot h)\left[\eta\left(\frac{h}{r}\right)-1\right]\ K_{\ell}(h)dh\right)
=τ(beBr0\Br(eh)K(h)𝑑h+Br\Br/2(eh)[η(hr)1]K(h)𝑑h)\displaystyle=\tau\left(b_{\ell}\cdot e-\int_{B_{r_{0}}\backslash B_{r}}(e\cdot h)\ K_{\ell}(h)dh+\int_{B_{r}\backslash B_{r/2}}(e\cdot h)\left[\eta\left(\frac{h}{r}\right)-1\right]\ K_{\ell}(h)dh\right)

Consequently,

τ(beBr0\Br(eh)K(h)𝑑h)=(ϕτ,r)+τBr\Br/2(eh)[1η(hr)]K(h)𝑑h.\tau\left(b_{\ell}\cdot e-\int_{B_{r_{0}}\backslash B_{r}}(e\cdot h)\ K_{\ell}(h)dh\right)=\ell(\phi_{\tau,r})+\tau\int_{B_{r}\backslash B_{r/2}}(e\cdot h)\left[1-\eta\left(\frac{h}{r}\right)\right]\ K_{\ell}(h)dh.

Using Lemma 7.8, we have

τ|beBr0\Br(eh)K(h)𝑑h|Cτ+τ|Br\Br/2(eh)[1η(hr)]K(h)𝑑h|.\tau\bigg{|}b_{\ell}\cdot e-\int_{B_{r_{0}}\backslash B_{r}}(e\cdot h)\ K_{\ell}(h)dh\bigg{|}\leq C\tau+\tau\bigg{|}\int_{B_{r}\backslash B_{r/2}}(e\cdot h)\left[1-\eta\left(\frac{h}{r}\right)\right]\ K_{\ell}(h)dh\bigg{|}.

Dividing by τ\tau yields the estimate

|beBr0\Br(eh)K(h)𝑑h|C+|Br\Br/2(eh)[1η(hr)]K(h)𝑑h|.\bigg{|}b_{\ell}\cdot e-\int_{B_{r_{0}}\backslash B_{r}}(e\cdot h)\ K_{\ell}(h)dh\bigg{|}\leq C+\bigg{|}\int_{B_{r}\backslash B_{r/2}}(e\cdot h)\left[1-\eta\left(\frac{h}{r}\right)\right]\ K_{\ell}(h)dh\bigg{|}.

To estimate the integral on the right-hand side, we recall from Corollary 7.6 that K(h)|h|(n+1)K_{\ell}(h)\approx|h|^{-(n+1)}. This yields

|Br\Br/2(eh)[1η(hr)]K(h)𝑑h|\displaystyle\bigg{|}\int_{B_{r}\backslash B_{r/2}}(e\cdot h)\left[1-\eta\left(\frac{h}{r}\right)\right]\ K_{\ell}(h)dh\bigg{|} Br\Br/2C|h|n𝑑h\displaystyle\leq\int_{B_{r}\backslash B_{r/2}}C|h|^{-n}\ dh
=Cr/2rs1𝑑s=Clog(2).\displaystyle=C\int_{r/2}^{r}s^{-1}\ ds=C\log(2).

7.3. Collecting the arguments to finish Theorem 1.1

Here we just list all of the particular previous results that are used to culminate in the proof of Theorem 1.1.

Proof of Theorem 1.1.

First, we note that the function, HH, enjoys the GCP over 𝒦\mathcal{K} (see Definition 5.23). This was already established in [24], but we will briefly comment on it here. Indeed, if f,g𝒦f,g\in\mathcal{K} and fgf\leq g with f(x0)=g(x0)f(x_{0})=g(x_{0}), then we also know that Df+Dg+D_{f}^{+}\subset D_{g}^{+}. Thus, since Ug+0U_{g}^{+}\geq 0 on Γf\Gamma_{f}, we see that Ug+U_{g}^{+} is a supersolution of the same equation that governs Uf+U_{f}^{+}. Since Uf+(x0,f(x0))=0=Ug+(x0,g(x0))U_{f}^{+}(x_{0},f(x_{0}))=0=U_{g}^{+}(x_{0},g(x_{0})), we see that also ν+Uf(x0,f(x0))ν+Ug(x0,g(x0))\partial_{\nu}^{+}U_{f}(x_{0},f(x_{0}))\leq\partial_{\nu}^{+}U_{g}(x_{0},g(x_{0})). Hence, I+(f,x0)I+(g,x0)I^{+}(f,x_{0})\leq I^{+}(g,x_{0}). A similar argument can be applied to νUf\partial_{\nu}^{-}U_{f} and νUg\partial_{\nu}^{-}U_{g}, but this time the ordering is reversed (per the definition in (1.7)), as now we have DgDfD_{g}^{-}\subset D_{f}^{-}. Combining these inequalities with the definition in (1.8), and remembering that GG is increasing in its first variable and decreasing in its second variable (and by assumption on ff and gg, f(x0)=g(x0)\nabla f(x_{0})=\nabla g(x_{0})), we conclude the GCP for HH.

We know that since HH is Lipschitz on 𝒦\mathcal{K} and enjoys the GCP, we will want to invoke Theorem 5.24. However, we still need to establish that the extra decay requirement in (5.6) is satisfied. Indeed it is, which we will show after this current proof in Lemma 7.10, below. Now, assuming we have established (5.6), then Theorem 5.24 shows that all 𝒟H(0)\ell\in\mathcal{D}_{H}(0) enjoy the structure as claimed in part (i) of Theorem 1.1 (𝒟H(0)\mathcal{D}_{H}(0) is from Definition 7.1, following Definition 5.19). After a relabeling of aij=h(g)(g)a^{ij}=h(g)-\ell(g) and the triple cc_{\ell}, bb_{\ell}, and KK_{\ell}, from Theorem 5.24, we see that part (i) has been established.

To conclude part (ii) of the theorem, we can invoke Corollary 7.6 for the Lévy measure estimates and Lemma 7.9 for the bounds involving the drift terms.

Lemma 7.10.

There exists constants, C>0C>0, and α(0,1]\alpha\in(0,1], depending on δ\delta, LL, mm, ρ\rho, and nn, so that if f,g𝒦(δ,L,m,ρ)f,g\in\mathcal{K}(\delta,L,m,\rho) and fgf\equiv g in B2RB_{2R}, then

H(f)H(g)L(BR)CRα.\displaystyle\lVert H(f)-H(g)\rVert_{L^{\infty}(B_{R})}\leq\frac{C}{R^{\alpha}}.
Proof of Lemma 7.10.

First, we will establish that

I+(f)I+(g)L(BR)CRα.\displaystyle\lVert I^{+}(f)-I^{+}(g)\rVert_{L^{\infty}(B_{R})}\leq\frac{C}{R^{\alpha}}. (7.3)

Then, as following the proof of Lemma 6.11, we will see that this estimate carries over to HH as well.

The proof of (7.3) goes very similarly to the proofs of Lemmas 6.3 and 6.6 (specifically, Step 3), combined with Lemma 4.6. As in the proof of Lemma 6.12, we define the functions,

g1=min{f,g}andg2=max{f,g},\displaystyle g_{1}=\min\{f,g\}\ \ \ \text{and}\ \ \ g_{2}=\max\{f,g\},

and by construction, the respective domains are the ordered as follows:

Dg1DfDg2andDg1DgDg2.\displaystyle D_{g_{1}}\subset D_{f}\subset D_{g_{2}}\ \ \ \text{and}\ \ \ D_{g_{1}}\subset D_{g}\subset D_{g_{2}}.

Thus, we see that at least in BRB_{R}, since fgf\equiv g,

νUg1νUfνUg2andνUg1νUgνUg2,\displaystyle\partial_{\nu}U_{g_{1}}\leq\partial_{\nu}U_{f}\leq\partial_{\nu}U_{g_{2}}\ \ \ \text{and}\ \ \ \partial_{\nu}U_{g_{1}}\leq\partial_{\nu}U_{g}\leq\partial_{\nu}U_{g_{2}},

so we have

νUg2νUg1νUf(X)νUg(X)νUg1(X)νUg2(X).\displaystyle\partial_{\nu}U_{g_{2}}-\partial_{\nu}U_{g_{1}}\leq\partial_{\nu}U_{f}(X)-\partial_{\nu}U_{g}(X)\leq\partial_{\nu}U_{g_{1}}(X)-\partial_{\nu}U_{g_{2}}(X).

Furthermore, for the function WW, defined as

{ΔW=0inDg1W=0on{xn+1=0}W=Ug2|Γg1onΓg1,\displaystyle\begin{cases}\Delta W=0\ &\text{in}\ D_{g_{1}}\\ W=0\ &\text{on}\ \{x_{n+1}=0\}\\ W=U_{g_{2}}|_{\Gamma_{g_{1}}}\ &\text{on}\ \Gamma_{g_{1}},\end{cases}

we see that in the smaller domain, Dg1D_{g_{1}},

Ug2=Ug1+W.\displaystyle U_{g_{2}}=U_{g_{1}}+W.

Thus, we have reduced the estimate to

νUg1(X)νUg2(X)=νW(X),\displaystyle\partial_{\nu}U_{g_{1}}(X)-\partial_{\nu}U_{g_{2}}(X)=\partial_{\nu}W(X),

and so

|νUf(X)νUg(X)||νW(X)|.\displaystyle\left|\partial_{\nu}U_{f}(X)-\partial_{\nu}U_{g}(X)\right|\leq\left|\partial_{\nu}W(X)\right|.

As g1g_{1} and g2g_{2} are C1,Dini(B2R)C^{1,\textnormal{Dini}}(B_{2R}) and globally Lipschitz with Lipschitz constant, mm, we see that Lemma 4.6 gives for XΓfBRX\in\Gamma_{f}\cap B_{R}, and s>0s>0,

W(X+sν(X))CsRα,\displaystyle W(X+s\nu(X))\leq\frac{Cs}{R^{\alpha}},

and hence

νW(X)CRα.\displaystyle\partial_{\nu}W(X)\leq\frac{C}{R^{\alpha}}.

Thus, we have established

xBR,|I+(f,x)I+(g,x)|CRα.\displaystyle\forall\ x\in B_{R},\ \ \left|I^{+}(f,x)-I^{+}(g,x)\right|\leq\frac{C}{R^{\alpha}}.

Remark 7.11.

The curious reader may see that our approach to establish Theorem 1.1 deviates slightly from the one given in [24], and we believe the reasons for this deviation are noteworthy. We will discuss this in more detail in Section 9.

8. Proof of Theorem 1.2

Here we will prove Theorem 1.2. As a first step, we wish to exhibit which ellipticity class will apply to the equation solved by the finite differences of ff. Determining this class gives a result that depends on the structure provided in Theorem 1.1, and the class, as well as resulting regularity results will depend on the parameters δ\delta, LL, mm, ρ\rho, which is the source for the dependence in the outcome of Theorem 1.2. The key is to note what are some valid choices for extremal operators that govern our mapping, HH (extremal operators are those defined in (4.6) that satisfy (4.7)). We see from the min-max representation of hh in Theorem 1.1 (recall h(f)=H(f,0)h(f)=H(f,0), as in Lemma 6.11) that if f1,f2𝒦f_{1},f_{2}\in\mathcal{K}, then

h(f1)\displaystyle h(f_{1}) =ming𝒦(max𝒟H(0)h(g)(g)+(f))\displaystyle=\min_{g\in\mathcal{K}}\left(\max_{\ell\in\mathcal{D}_{H}(0)}h(g)-\ell(g)+\ell(f)\right)
max𝒟H(0)h(f2)(f2)+(f1)\displaystyle\leq\max_{\ell\in\mathcal{D}_{H}(0)}h(f_{2})-\ell(f_{2})+\ell(f_{1})
h(f2)+max𝒟H(0)(f1f2).\displaystyle\leq h(f_{2})+\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-f_{2}).

Next, let g1𝒦g_{1}\in\mathcal{K} be such that h(f1)=max𝒟H(0)h(g1)(g1)+(f1)h(f_{1})=\max_{\ell\in\mathcal{D}_{H}(0)}h(g_{1})-\ell(g_{1})+\ell(f_{1}), and let 2𝒟H(0)\ell_{2}\in\mathcal{D}_{H}(0) be such that 2(f2g1)=max𝒟H(0)(f2g1)\ell_{2}(f_{2}-g_{1})=\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{2}-g_{1}). We then find that

h(f1)h(f2)\displaystyle h(f_{1})-h(f_{2}) =(max𝒟H(0)h(g1)(g1)+(f1))ming𝒦(max𝒟H(0)h(g)(g)+(f2))\displaystyle=\left(\max_{\ell\in\mathcal{D}_{H}(0)}h(g_{1})-\ell(g_{1})+\ell(f_{1})\right)-\min_{g\in\mathcal{K}}\left(\max_{\ell\in\mathcal{D}_{H}(0)}h(g)-\ell(g)+\ell(f_{2})\right)
h(g1)+(max𝒟H(0)(f1g1))(max𝒟H(0)h(g1)(g1)+(f2))\displaystyle\geq h(g_{1})+\left(\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-g_{1})\right)-\left(\max_{\ell\in\mathcal{D}_{H}(0)}h(g_{1})-\ell(g_{1})+\ell(f_{2})\right)
=h(g1)+(max𝒟H(0)(f1g1))h(g1)(max𝒟H(0)(f2g1))\displaystyle=h(g_{1})+\left(\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-g_{1})\right)-h(g_{1})-\left(\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{2}-g_{1})\right)
=max𝒟H(0)(f1g1)2(f2g1)\displaystyle=\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-g_{1})-\ell_{2}(f_{2}-g_{1})
2(f1g1)2(f2g1)\displaystyle\geq\ell_{2}(f_{1}-g_{1})-\ell_{2}(f_{2}-g_{1})
=2(f1f2)min𝒟H(0)(f1f2)\displaystyle=\ell_{2}(f_{1}-f_{2})\geq\min_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-f_{2})

In summary, we have

f1,f2𝒦,min𝒟H(0)(f1f2)h(f1)h(f2)max𝒟H(0)(f1f2).\forall\ f_{1},f_{2}\in\mathcal{K},\ \ \ \min_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-f_{2})\leq h(f_{1})-h(f_{2})\leq\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f_{1}-f_{2}). (8.1)

Now let CC be the constant in Theorem 1.1 (ii), and let Λ\mathcal{L}_{\Lambda} be the class of operators from Definition 4.8 with Λ=C\Lambda=C. We claim there exist constants C1,C2C_{1},C_{2} such that

max𝒟H(0)(f)Λ+(f)+C1fL(n),min𝒟H(0)(f)Λ(f)C2fL(n).\max_{\ell\in\mathcal{D}_{H}(0)}\ell(f)\leq\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}(f)+C_{1}||f||_{L^{\infty}(\mathbb{R}^{n})},\quad\min_{\ell\in\mathcal{D}_{H}(0)}\ell(f)\geq\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}(f)-C_{2}||f||_{L^{\infty}(\mathbb{R}^{n})}. (8.2)

First notice that the lower bound on the kernels in Theorem 1.1 (ii) is only valid in a small ball. To be able to apply the regularity results in Section 4.2, namely Proposition 4.10, the kernels must satisfy the lower bound stated in Definition 4.8. To do this, we employ a strategy similar to that in [12, Section 14] for truncated kernels. Indeed, if 𝒟H(0)\ell\in\mathcal{D}_{H}(0), then we may write

(f)(x)=cijf(x)+bijf(x)+nδyf(x)K~ij(y)𝑑yΛ1n\Br0(f(x+y)f(x))|y|n1𝑑y,\ell(f)(x)=c^{ij}f(x)+b^{ij}\cdot\nabla f(x)+\int_{\mathbb{R}^{n}}\delta_{y}f(x)\tilde{K}^{ij}(y)dy-\Lambda^{-1}\int_{\mathbb{R}^{n}\backslash B_{r_{0}}}(f(x+y)-f(x))|y|^{-n-1}\ dy,

where K~ij(y)=Kij(y)+Λ1𝟙n\Br0|y|n1\tilde{K}^{ij}(y)=K^{ij}(y)+\Lambda^{-1}{\mathbbm{1}}_{\mathbb{R}^{n}\backslash B_{r_{0}}}|y|^{-n-1}. Since bij,K~ijΛb^{ij},\tilde{K}^{ij}\in\mathcal{L}_{\Lambda} and Λ1𝟙n\Br0|y|n1L1(n)\Lambda^{-1}{\mathbbm{1}}_{\mathbb{R}^{n}\backslash B_{r_{0}}}|y|^{-n-1}\in L^{1}(\mathbb{R}^{n}), and taking into account the bound on cijc^{ij} given in Theorem 1.1, the inequalities (8.2) hold.

As an immediate consequence of (8.2) and (8.1), we find that

C1|f1f2|+Λ(f1f2)h(f1)h(f2)Λ+(f1f2)+C2|f1f2| for all f1,f2𝒦.-C_{1}|f_{1}-f_{2}|+\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}(f_{1}-f_{2})\leq h(f_{1})-h(f_{2})\leq\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}(f_{1}-f_{2})+C_{2}|f_{1}-f_{2}|\text{ for all }f_{1},f_{2}\in\mathcal{K}. (8.3)

With (8.3) at hand, Theorem 1.2 follows by combining the conclusion of (8.3) with the following C1,γC^{1,\gamma} estimate for translation invariant operators, Proposition 8.1, whose statement and proof are essentially that of [67, Theorem 6.2]. As Λ\mathcal{L}_{\Lambda} depends upon δ\delta, LL, mm, ρ\rho, then so does the constant obtained in Theorem 1.2. We note that by [24, Theorem 1.1 (iii)], the Lipschitz bound on f(,0)f(\cdot,0) is preserved for all time. Thus, in the following Proposition 8.1, when applied to ff in Theorem 1.2, we can replace fC0,1(n×[0,T])\lVert f\rVert_{C^{0,1}(\mathbb{R}^{n}\times[0,T])} by fC0,1(n×{0})\lVert f\rVert_{C^{0,1}(\mathbb{R}^{n}\times\{0\})}.

We provide the standard argument for the proof of Proposition 8.1 using difference quotients for the sake of completeness.

Proposition 8.1.

Suppose uC0,1(n×[0,t0])u\in C^{0,1}(\mathbb{R}^{n}\times[0,t_{0}]) is a viscosity solution of the translation invariant non-local equation tuJ(u)=0\partial_{t}u-J(u)=0 in n×(0,t0)\mathbb{R}^{n}\times(0,t_{0}), where JJ satisfies the ellipticity condition

C1|uv|+Λ(uv)J(u)J(v)Λ+(uv)+C2|uv|,for all u,vC0,1(n).-C_{1}|u-v|+\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}(u-v)\leq J(u)-J(v)\leq\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}(u-v)+C_{2}|u-v|,\quad\text{for all }u,v\in C^{0,1}(\mathbb{R}^{n}). (8.4)

Then we have the estimate

uC1,γ(Qt02(t0,x0))C(1+t0)t0γuC0,1(n×[0,t0]),||u||_{C^{1,\gamma}(Q_{\frac{t_{0}}{2}}(t_{0},x_{0}))}\leq\frac{C(1+t_{0})}{t_{0}^{\gamma}}||u||_{C^{0,1}(\mathbb{R}^{n}\times[0,t_{0}])},

where CC and γ\gamma are the constants from Proposition 4.10.

Remark 8.2.

The constants CC and γ\gamma arising from Propostion 4.10 depend upon the ellipticity class, Λ\mathcal{L}_{\Lambda}. Since, as above, our particular choice of class, Λ\mathcal{L}_{\Lambda}, depends on the estimates of Theorem 1.1, which depend upon δ\delta, LL, mm, ρ\rho, we see that an invocation of Proposition 8.1 for our situation retains such dependence on the constants in the C1,γC^{1,\gamma} estimate.

Proof of Proposition 8.1.

For (x,t)Q1(x,t)\in Q_{1}, consider the difference quotient in space

vh(x,t):=u(x+h,t)u(x,t)|h|.v_{h}(x,t):=\frac{u(x+h,t)-u(x,t)}{|h|}.

Using the ellipticity condition (8.4) and the translation invariance of JJ, we find that in the viscosity sense, vhv_{h} solves

C2|vh|+Λ+(vh)I(u(+h,t),x,t)I(u,x,t)|h|=tvh(x,t).C_{2}|v_{h}|+\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}(v_{h})\geq\frac{I(u(\cdot+h,t),x,t)-I(u,x,t)}{|h|}=\partial_{t}v_{h}(x,t).

Since vhL(n×[0,t0])uC0,1(n×[0,t0])||v_{h}||_{L^{\infty}(\mathbb{R}^{n}\times[0,t_{0}])}\leq||u||_{C^{0,1}(\mathbb{R}^{n}\times[0,t_{0}])} independently of hh, it follows that vhv_{h} satisfies the inequality, in the viscosity sense,

tvh(x,t)Λ+(vh)(x,t)uC0,1(n×[0,t0])for all (x,t)n×(0,t0).\partial_{t}v_{h}(x,t)-\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}}(v_{h})(x,t)\leq||u||_{C^{0,1}(\mathbb{R}^{n}\times[0,t_{0}])}\qquad\text{for all }(x,t)\in\mathbb{R}^{n}\times(0,t_{0}).

A similar argument shows vhv_{h} also satisfies in the viscosity sense,

tvh(x,t)Λ(vh)(x,t)uC0,1(n×[0,t0])for all (x,t)n×(0,t0).\partial_{t}v_{h}(x,t)-\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}}(v_{h})(x,t)\geq-||u||_{C^{0,1}(\mathbb{R}^{n}\times[0,t_{0}])}\qquad\text{for all }(x,t)\in\mathbb{R}^{n}\times(0,t_{0}).

Applying Proposition 4.10 to vhv_{h}, we conclude that

vhCγ(Qt02(t0,x0))C(1+t0)t0γuC0,1(n×[0,t0]).||v_{h}||_{C^{\gamma}(Q_{\frac{t_{0}}{2}}(t_{0},x_{0}))}\leq\frac{C(1+t_{0})}{t_{0}^{\gamma}}||u||_{C^{0,1}(\mathbb{R}^{n}\times[0,t_{0}])}.

Since the right-hand side is independent of hh, we may let |h|0|h|\rightarrow 0 to obtain the desired C1,γC^{1,\gamma} estimate in space. By considering (x,t)n×(0,t0)(x,t)\in\mathbb{R}^{n}\times(0,t_{0}) the difference quotient in time

wh(x,t):=u(x,t+h)u(x,t)|h|,w_{h}(x,t):=\frac{u(x,t+h)-u(x,t)}{|h|},

with hh sufficiently small and carrying out an argument as above, we also obtain a C1,γC^{1,\gamma} estimate in time. The one extra step is that once we obtain the regularity in space, we see that utu_{t} is bounded in the viscosity sense, and hence uu is Lipschitz in time. Thus, whw_{h} is a bounded viscosity solution of the extremal inequalities. Another invocation of Proposition 4.10 concludes the regularity in time.

9. Commentary on Many Issues

9.1. Where is the min-max structure utilized?

The first place the min-max in Theorem 1.1 is used is to identify the correct class of integro-differential operators for invoking the Krylov-Safonov theory. In most of the existing literature on regularity theory (as well as existence and uniqueness theory), a min-max structure is assumed for the given equations. However, the min-max structure is quickly replaced by simply requiring the existence of a class of linear nonlocal operators so that the relevant nonlinear operator, say JJ, satisfies inequalities such as (4.7). Then, as one sees by, e.g. Proposition 4.10, it is these extremal inequalities that govern the regularity theory. Thus, as outlined in Section 8, as soon as a min-max, plus some properties of the ingredients are obtained as in Theorem 1.1 one can deduce which ellipticity class and results will apply to solutions of tf=H(f)\partial_{t}f=H(f). It was rather striking to find in the case of (1.11), under the extra C1,DiniC^{1,\textnormal{Dini}} regularity assumption for ff, that the resulting ellipticity class had already been studied in the literature as in [22]. Furthermore, thanks to the translation invariance of HH, combined with the inequalities (4.7), it is not hard to show that the finite differences, w=1|h|(f(+h)f())w=\frac{1}{\left|h\right|}(f(\cdot+h)-f(\cdot)) satisfy, in the viscosity sense, the pair of inequalities (4.6). This is key to obtaining the C1,γC^{1,\gamma} regularity for ff.

In some sense, the min-max provided by Theorem 1.1 gives a way of “linearizing” the equation, but in a possibly slightly different manner than sometimes carried out. One way to linearize (1.11) would be to fix a very smooth solution, f0f_{0}, and then find an equation, say tψ=Lf0ψ\partial_{t}\psi=L_{f_{0}}\psi, where Lf0L_{f_{0}} is an operator with coefficients depending upon f0f_{0}, and the equation governs functions of the form f=f0+εψf=f_{0}+\varepsilon\psi for ε<<1\varepsilon<<1. The min-max gives a different linear equation in the sense that for any solution, say ff, of (1.11), one can think ff itself solves a linear equation with bounded measurable coefficients of the form,

tf=cf(x)f(x)+bf(x)f(x)+nδhf(x)Kf(x,h)𝑑h,\displaystyle\partial_{t}f=c^{*}_{f}(x)f(x)+b^{*}_{f}(x)\cdot\nabla f(x)+\int_{\mathbb{R}^{n}}\delta_{h}f(x)K^{*}_{f}(x,h)dh,

where cfc^{*}_{f}, bfb^{*}_{f}, KfK^{*}_{f} are all xx-depended coefficients that can be any of those that attain the min-max for ff in Theorem 1.1 at a given xx. Of course, one cannot expect these coefficients to be better than bounded and measurable in xx, and this is one reason why it is typically presented in the elliptic and parabolic literature that linear equations with bounded measurable coefficients are as easy or hard to treat (it depends upon your point of view) as fully nonlinear equations that are translation invariant. Of course, we “linearized” equation (1.11) in neither of the two approaches mentioned above, but as earlier, we found that linearizing for w=1|h|(f(+h)f())w=\frac{1}{\left|h\right|}(f(\cdot+h)-f(\cdot)) gives the inequalities pertinent to Proposition 4.10. If one used the mean value theorem, it would formally give a linear equation with bounded measurable coefficients, assuming that HH was a Fréchet differentiable map (but one can obtain the inequalities (4.7) without any assumption of differentiability of HH, thanks to the min-max).

In Section 8, the min-max representation of hh suggests that the natural maximal and minimal operators corresponding to hh should be the ones given by (8.1). However, one does not know if there is regularity theory available for these maximal and minimal operators. One annoyance in this direction is that the class of linear operators used to define them is, in general, not invariant under translations and dilations. Certainly the needed regularity is true, but the arguments to produce such results are better implemented for a larger class of equations, such as those described in Definition 4.8. The bounds obtained in Theorem 1.1 (ii) instead allow us to estimate hh by a different set of maximal and minimal operators as shown in (8.3), where the operators Λ+\mathcal{M}^{+}_{\mathcal{L}_{\Lambda}} and Λ\mathcal{M}^{-}_{\mathcal{L}_{\Lambda}} are defined using a class of linear operators which satisfies the translation and dilation invariance properties necessary to invoke existing regularity theory while also containing the linear functionals that support hh.

We also note an interesting departure from an easier min-max approach as utilized in [24] and [42, Theorem 1.10]. The curious reader may see that since HH is translation invariant, there is a quicker and more straightforward way to obtaining the first half of Theorem 1.1 stated in part (i). The translation invariance means that it suffices to look only at H(f,0)H(f,0), and as a Lipschitz functional from the Banach space, XρX_{\rho}, to \mathbb{R}, H(f,0)H(f,0) enjoys a larger collection of tools from the nonlinear analysis setting built in Clarke’s book [28]. The mean value theorem of Lebourg [28] that we give a variant on in Lemma 5.21 has a more straightforward presentation using a more natural subdifferential set than the one defined in Definition 5.19. This is the approach that is pursued in proving the corresponding result in [24, Theorem 1.4] and [42, Theorem 1.10]. The problem with using the more natural subdifferential set that circumvents the cumbersome details of the finite dimensional approximations is that it is very hard to capture in the linear operators for the min-max the non-degeneracy property that is proved in Lemma 6.3. For a lack of a better analogy, it is like saying that for the function A:NA:\mathbb{R}^{N}\to\mathbb{R} given by A(x)=|x|A(x)=\left|x\right|, one can think of the contrast in reconstructing AA, by considering the set of all possible supporting hyperplanes, versus considering the actual derivative DADA at any point where DADA may exist. In the former situation, one cannot avoid that degenerate linear functionals, such as the zero functional, appear in the collection that makes up a min-max (just a max, actually) representation of AA, whereas in the latter, one can see that the only differentials that would be used will be those with norm 11, and hence are “non-degenerate” in a sense. This is the reason for the finite dimensional approximations used in Section 7 because a non-degeneracy property like that in Lemma 6.3 can be preserved in the functionals used for the min-max in Corollary 5.22.

9.2. A counter example

There are interesting pathologies in Hele-Shaw free boundary problems related to the contrast between UU being regular in space-time and {U>0}\partial\{U>0\} being regular in space-time. Aside from the fact that there are geometries in which the free boundary may stagnate and then immediately jump in space-time (see [54]), there are solutions of (1.1) with space-planar free boundaries such as (see [51])

U(X,t)=a(t)(Xn+1+0ta(s)𝑑s),\displaystyle U(X,t)=a(t)\left(X_{n+1}+\int_{0}^{t}a(s)ds\right),

with e.g. aa is a bounded function of tt. The zero set is, of course, given by

{U>0}={Xn+1=0ta(s)𝑑s},hencef(x,t)=0ta(s)𝑑s.\displaystyle\partial\{U>0\}=\left\{X_{n+1}=-\int_{0}^{t}a(s)ds\right\},\ \ \text{hence}\ \ f(x,t)=-\int_{0}^{t}a(s)ds.

We note that this special solution does not necessarily satisfy the spatial boundary conditions prescribed by (1.1), and indeed, in the absence of further restrictions on aa, it is not true that fC1,γ(n×[τ,T])f\in C^{1,\gamma}(\mathbb{R}^{n}\times[\tau,T]). However, if one insists that this solution does satisfy (1.1) exactly, we then see the boundary condition that U(0,t)=1U(0,t)=1 means that

a(t)(0ta(s)𝑑s)=1,\displaystyle a(t)\left(\int_{0}^{t}a(s)ds\right)=1,

whereby a(t)=±(2t+c)1/2a(t)=\pm(2t+c)^{-1/2}, for some c0c\geq 0, and hence 0ta(s)𝑑s=±(2t+c)1/2\int_{0}^{t}a(s)ds=\pm(2t+c)^{1/2}. In order that U>0U>0, we see that in fact a(t)=(2t+c)1/2a(t)=-(2t+c)^{-1/2}, and hence

U(X,t)=(2t+c)1/2(Xn+1(2t+c)1/2),in{0<Xn+1<(2t+c)1/2},\displaystyle U(X,t)=-(2t+c)^{-1/2}\left(X_{n+1}-(2t+c)^{1/2}\right),\ \ \ \text{in}\ \ \ \{0<X_{n+1}<(2t+c)^{1/2}\},

and so

f(x,t)=(2t+c)1/2.\displaystyle f(x,t)=(2t+c)^{1/2}.

In particular, requiring that the free boundary resides in the region n×[δ,Lδ]\mathbb{R}^{n}\times[\delta,L-\delta], we see

c>δ2.\displaystyle c>\delta^{2}.

Thus, indeed, fC(n×[0,T])f\in C^{\infty}(\mathbb{R}^{n}\times[0,T]), with a norm that depends on δ\delta, which is compatible with the result in Theorem 1.2.

9.3. Some questions

Here, we list some questions related to (1.1) and Theorem 1.2.

  • Is the gain in regularity given in Theorem 1.2 enough to prove higher regularity, such as a CC^{\infty} free boundary? This would be related to higher regularity via Schauder or bootstrap methods for integro-differential equations, such as that pursued in e.g. [7], [38], [48], [58]; or like the analysis for free boundary problems that attains smooth solutions, such as in [6], [27], [52], [53].

  • Is it possible to include variable coefficients in equation (1.1) and obtain the same regularity of the solution? This could be for either a divergence form operator or a non-divergence form operator. It is conceivable that similar regularity should hold, and one may expect to use either directly, or modifications of works such as [38], [55], [64], when the order of the kernels is 11.

  • How does incorporating an inhomogeneous boundary law, V=G(X,ν+U+,νU)V=G(X,\partial^{+}_{\nu}U^{+},\partial^{-}_{\nu}U^{-}), in (1.1) change the outcome of the results? At least when G(X,ν+U+,νU)=g(X)G~(ν+U+,νU)G(X,\partial^{+}_{\nu}U^{+},\partial^{-}_{\nu}U^{-})=g(X)\tilde{G}(\partial^{+}_{\nu}U^{+},\partial^{-}_{\nu}U^{-}) it appears as though the steps would be very similar, but if the XX dependence is more general, the analysis in Section 8, may be complicated by the fact that the equation is not translation invariant, and the xx dependence is not as easily isolated.

  • The most important question to address could be to adapt the method to apply to situations in which {U>0}\partial\{U>0\} is only locally a space-time graph of a function. In many free boundary problems related to (1.1), it is not natural to assume that the free boundary is globally the graph of some function. Rather, without assuming the free boundary is a graph, some low regularity assumption like a Lipschitz condition or a flatness condition then forces the free boundary to in fact be locally a graph that is quite regular (at least for small time that avoids different regions of the free boundary colliding and causing topological changes). This could be attained by including as a parameter in the definition of II, some extra space-time boundary condition that allows II to act on functions that are merely defined in, say, B1B_{1}, instead of n\mathbb{R}^{n}, with this extra boundary condition providing the information of the free boundary outside of B1B_{1}.

  • Another interesting question is to address the possibility to modify the method to apply to Stefan type problems wherein (1.1) now requires UU to solve a parabolic problem in the sets {U>0}\{U>0\} and {U<0}\{U<0\}. Of course, the two-phase Stefan problem itself is already rather well understood, but there are many variations that could be considered. This would require adapting the results in Section 5 to accommodate operators acting on f:n×[0,T]f:\mathbb{R}^{n}\times[0,T] that satisfy the GCP in space-time, rather than simply looking at those operators that satisfy the GCP in space.

Appendix A Proofs related to Green’s function estimates

Before proving Lemma 4.3, we recall the following fact from [41].

Lemma A.1.

(cf. Lemma 3.2 in [41]) Suppose AA is λ,Λ\lambda,\Lambda uniformly elliptic and Dini continuous with modulus, ω\omega, and vv solves the Dirichlet problem

{LAv=0in 𝒜2r(x0),x0Ω,r1,v=1on Br(x0),v=0on B2r(x0).\displaystyle\begin{cases}L_{A}v=0\quad\text{in }\mathcal{A}_{2r}(x_{0}),\ x_{0}\in\Omega,\ r\leq 1,\\ v=1\quad\text{on }\partial B_{r}(x_{0}),\\ v=0\quad\text{on }\partial B_{2r}(x_{0}).\end{cases} (A.1)

There exists a constant K=K(n,λ,Λ,ω)>0K=K(n,\lambda,\Lambda,\omega)>0 such that

|v(x)|Krfor all x𝒜2r(x0).|\nabla v(x)|\leq\frac{K}{r}\qquad\text{for all }x\in\mathcal{A}_{2r}(x_{0}).
Proof of Lemma 4.3.

We first perform a reduction to a model problem. By Harnack’s inequality applied to the non-negative solution uu in the ball Br(x0)B_{r}(x_{0}), we know there exists a constant C~=C~(n,λ,Λ)\tilde{C}=\tilde{C}(n,\lambda,\Lambda) such that

infBr(x0)uC~u(x0).\inf_{\partial B_{r}(x_{0})}u\geq\tilde{C}u(x_{0}).

Rescaling uu, we may thus assume infBr(x0)u=1\inf_{\partial B_{r}(x_{0})}u=1. Let vv be the solution to the problem

{LAv=0in 𝒜2r(x0),v=1on Br(x0),v=0on B2r(x0).\begin{cases}L_{A}v=0\quad\text{in }\mathcal{A}_{2r}(x_{0}),\\ v=1\quad\text{on }\partial B_{r}(x_{0}),\\ v=0\quad\text{on }\partial B_{2r}(x_{0}).\end{cases} (A.2)

We recall that by assumption, 𝒜2r(x0)Ω\mathcal{A}_{2r}(x_{0})\subset\Omega, and hence as u0u\geq 0, by the maximum principle, uvu\geq v on 𝒜2r(x0)\mathcal{A}_{2r}(x_{0}) and so it suffices to prove the estimate (4.3) for the function vv.

Consider the constant coefficient operator L0:=div(A(z0))L_{0}:=-\text{div}(A(z_{0})\nabla\cdot), and let v^\hat{v} solve the problem

{L0v^=0in 𝒜2r(x0),v^=1on Br(x0),v^=0on B2r(x0).\begin{cases}L_{0}\hat{v}=0\quad\text{in }\mathcal{A}_{2r}(x_{0}),\\ \hat{v}=1\quad\text{on }\partial B_{r}(x_{0}),\\ \hat{v}=0\quad\text{on }\partial B_{2r}(x_{0}).\end{cases} (A.3)

The function w:=v^vw:=\hat{v}-v vanishes on the boundary of 𝒜2r(x0)\mathcal{A}_{2r}(x_{0}). If G0G_{0} is the Green’s function for the operator L0L_{0}, then by the representation formula for L0L_{0}, we have for all x𝒜2r(x0)x\in\mathcal{A}_{2r}(x_{0})

w(x)=𝒜2r(x0)G0(x,y)L0w(y)𝑑y=𝒜2r(x0)yG0(x,y),A(z0)w(y)𝑑y.w(x)=\int_{\mathcal{A}_{2r}(x_{0})}G_{0}(x,y)L_{0}w(y)\ dy=\int_{\mathcal{A}_{2r}(x_{0})}\left\langle\nabla_{y}G_{0}(x,y),A(z_{0})\nabla w(y)\right\rangle\ dy.

Now since v^\hat{v} solves (A.3), we know that

𝒜2r(x0)yG0(x,y),A(z0)v^(y)𝑑y=0.\int_{\mathcal{A}_{2r}(x_{0})}\left\langle\nabla_{y}G_{0}(x,y),A(z_{0})\nabla\hat{v}(y)\right\rangle\ dy=0.

Consequently,

w(x)=𝒜2r(x0)yG0(x,y),A(z0)v(y)𝑑y.w(x)=-\int_{\mathcal{A}_{2r}(x_{0})}\left\langle\nabla_{y}G_{0}(x,y),A(z_{0})\nabla v(y)\right\rangle\ dy.

Next, since vv solves (A.2), we know that

𝒜2r(x0)yG0(x,y),A(y)v(y)𝑑y=0.\int_{\mathcal{A}_{2r}(x_{0})}\left\langle\nabla_{y}G_{0}(x,y),A(y)\nabla v(y)\right\rangle\ dy=0.

It follows that

w(x)=𝒜2r(x0)yG0(x,y),(A(z0)A(y))v(y)𝑑y.w(x)=-\int_{\mathcal{A}_{2r}(x_{0})}\left\langle\nabla_{y}G_{0}(x,y),(A(z_{0})-A(y))\nabla v(y)\right\rangle\ dy.

Differentiating in xx yields

Dw(x)=𝒜2r(x0)Dx,y2G0(x,y),(A(z0)A(y))v(y)𝑑y.Dw(x)=-\int_{\mathcal{A}_{2r}(x_{0})}\left\langle D^{2}_{x,y}G_{0}(x,y),(A(z_{0})-A(y))\nabla v(y)\right\rangle\ dy.

Evaluating at x=z0x=z_{0}, we thus conclude

Dw(z0)=𝒜2r(x0)Dx,y2G0(z0,y),(A(z0)A(y))v(y)𝑑y.Dw(z_{0})=-\int_{\mathcal{A}_{2r}(x_{0})}\left\langle D^{2}_{x,y}G_{0}(z_{0},y),(A(z_{0})-A(y))\nabla v(y)\right\rangle\ dy.

Now by estimates for the Green’s function for constant coefficient operators, we know there exists a constant C1=C1(n,λ,Λ)>0C_{1}=C_{1}(n,\lambda,\Lambda)>0 such that

|Dx,y2G0(z0,y)|C1|z0y|n.|D^{2}_{x,y}G_{0}(z_{0},y)|\leq C_{1}|z_{0}-y|^{-n}.

It follows that

|Dw(z0)|C1𝒜2r(x0)|A(z0)A(y)||z0y|n|v(y)|𝑑y.|Dw(z_{0})|\leq C_{1}\int_{\mathcal{A}_{2r}(x_{0})}\frac{|A(z_{0})-A(y)|}{|z_{0}-y|^{n}}|\nabla v(y)|\ dy.

By Lemma A.1, there exists a constant K=K(n,λ,Λ,ω)>0K=K(n,\lambda,\Lambda,\omega)>0 such that

|v(y)|Krfor all y𝒜2r(x0).|\nabla v(y)|\leq\frac{K}{r}\qquad\text{for all }y\in\mathcal{A}_{2r}(x_{0}).

Therefore,

|Dw(z0)|C1Kr𝒜2r(x0)|A(z0)A(y)||z0y|n𝑑y.|Dw(z_{0})|\leq\frac{C_{1}K}{r}\int_{\mathcal{A}_{2r}(x_{0})}\frac{|A(z_{0})-A(y)|}{|z_{0}-y|^{n}}\ dy.

We now write the integral above as

𝒜2r(x0)|A(z0)A(y)||z0y|n𝑑y\displaystyle\int\limits_{\mathcal{A}_{2r}(x_{0})}\frac{|A(z_{0})-A(y)|}{|z_{0}-y|^{n}}\ dy =𝒜2r(x0)Br(z0)|A(z0)A(y)||z0y|n𝑑y+𝒜2r(x0)\Br(z0)|A(z0)A(y)||z0y|n𝑑y\displaystyle=\int\limits_{\mathcal{A}_{2r}(x_{0})\cap B_{r}(z_{0})}\frac{|A(z_{0})-A(y)|}{|z_{0}-y|^{n}}\ dy+\int\limits_{\mathcal{A}_{2r}(x_{0})\backslash B_{r}(z_{0})}\frac{|A(z_{0})-A(y)|}{|z_{0}-y|^{n}}\ dy
=I+II.\displaystyle=\text{I}+\text{II}.

Converting to polar coordinates centered at z0z_{0}, and using the Dini continuity of the coefficients A()A(\cdot) yields

IC20rω(t)t𝑑t,\text{I}\leq C_{2}\int_{0}^{r}\frac{\omega(t)}{t}\ dt,

for a dimensional constant C2>0C_{2}>0. To control II, we notice that |z0y|r|z_{0}-y|\geq r if y𝒜2r(x0)\Br(z0)y\in\mathcal{A}_{2r}(x_{0})\backslash B_{r}(z_{0}), and so

IIrn𝒜2r(x0)|A(z0)A(y)|dyrn|𝒜2r(x0)|supy𝒜2r(x0)ω(|z0y|)|C3supy𝒜2r(x0)ω(|z0y|),\text{II}\leq r^{-n}\int\limits_{\mathcal{A}_{2r}(x_{0})}|A(z_{0})-A(y)|\ dy\leq r^{-n}|\mathcal{A}_{2r}(x_{0})|\sup_{y\in\mathcal{A}_{2r}(x_{0})}\omega(|z_{0}-y|)|\leq C_{3}\sup_{y\in\mathcal{A}_{2r}(x_{0})}\omega(|z_{0}-y|),

where C3>0C_{3}>0 is a dimensional constant. It follows that given ε>0\varepsilon>0, there exists r0=r0(n,ω,λ,Λ,ε)r_{0}=r_{0}(n,\omega,\lambda,\Lambda,\varepsilon) such that if rr0r\leq r_{0}, then |Dw(z0)|εr|Dw(z_{0})|\leq\frac{\varepsilon}{r}.

By Taylor expansion around z0z_{0}, we have

v(x)=v(z0)+Dv(z0)(xz0)+o(|xz0|)for all x[x0,z0]𝒜2r(x0).v(x)=v(z_{0})+Dv(z_{0})\cdot(x-z_{0})+o(|x-z_{0}|)\qquad\text{for all }x\in[x_{0},z_{0}]\cap\mathcal{A}_{2r}(x_{0}).

Let Dνφ(z0):=Dφ(z0),ν(z0)D_{\nu}\varphi(z_{0}):=\left\langle D\varphi(z_{0}),\nu(z_{0})\right\rangle denote the derivative of a function φ\varphi in the direction of the inward pointing unit normal vector ν(z0)\nu(z_{0}) to B2r(x0)\partial B_{2r}(x_{0}) at z0z_{0}. Since v(z0)=0v(z_{0})=0 and d(x)ν(z0)=xz0d(x)\nu(z_{0})=x-z_{0}, we see that

v(x)=Dνv(z0)d(x)+o(d(x))for all x[x0,z0]𝒜2r(x0).v(x)=D_{\nu}v(z_{0})d(x)+o(d(x))\qquad\text{for all }x\in[x_{0},z_{0}]\cap\mathcal{A}_{2r}(x_{0}).

Writing v=v^wv=\hat{v}-w, we thus obtain

v(x)=(Dνv^(z0)Dνw(z0))d(x)+o(d(x))for all x[x0,z0]𝒜2r(x0).v(x)=\left(D_{\nu}\hat{v}(z_{0})-D_{\nu}w(z_{0})\right)d(x)+o(d(x))\qquad\text{for all }x\in[x_{0},z_{0}]\cap\mathcal{A}_{2r}(x_{0}).

Now, by explicit calculation of v^\hat{v}, it is possible to show that there exists a constant C4=C4(n,λ,Λ)>0C_{4}=C_{4}(n,\lambda,\Lambda)>0 such that

Dνv^(z0)C4r.D_{\nu}\hat{v}(z_{0})\geq\frac{C_{4}}{r}.

If we now choose ε:=C42\varepsilon:=\frac{C_{4}}{2} above, we obtain

Dνv^(z0)Dνw(z0)C4rεr=C42r.D_{\nu}\hat{v}(z_{0})-D_{\nu}w(z_{0})\geq\frac{C_{4}}{r}-\frac{\varepsilon}{r}=\frac{C_{4}}{2r}.

Therefore, there exist constants C=C(n,λ,Λ)>0C=C(n,\lambda,\Lambda)>0 and r0=r0(n,ω,λ,Λ)>0r_{0}=r_{0}(n,\omega,\lambda,\Lambda)>0 such that if rr0r\leq r_{0}, then

v(x)Crd(x)+o(d(x))for all x[x0,z0]𝒜2r(x0).v(x)\geq\frac{C}{r}\ d(x)+o(d(x))\qquad\text{for all }x\in[x_{0},z_{0}]\cap\mathcal{A}_{2r}(x_{0}).

From here on, we assume we are working with Ωn+1\Omega\subset\mathbb{R}^{n+1}. Before we prove Theorem 4.1, let us first recall a number of useful facts from [11, 41]. For any y0Ωy_{0}\in\partial\Omega and r>0r>0, let Δr(y0):=Br(y0)Ω\Delta_{r}(y_{0}):=B_{r}(y_{0})\cap\partial\Omega. We denote by Wr,y0W_{r,y_{0}} the solution to the Dirichlet problem

{LAWr,y0=0in Ω,Wr,y0=𝟙Δr(y0)on Ω,\begin{cases}L_{A}W_{r,y_{0}}=0\quad\text{in }\Omega,\\ W_{r,y_{0}}=\mathbbm{1}_{\Delta_{r}(y_{0})}\quad\text{on }\partial\Omega,\end{cases} (A.4)

i.e. Wr,y0(x)W_{r,y_{0}}(x) is the harmonic measure of Δr(y0)\Delta_{r}(y_{0}), based at xx.

Lemma A.2.

(cf. Lemma 2.1 in [11]) There exist positive numbers r0=r0(m)r_{0}=r_{0}(m) and C=C(λ,Λ,m)C=C(\lambda,\Lambda,m) such that for rr0r\leq r_{0}, we have

Wr,y0(y0+rν(y0))C.W_{r,y_{0}}(y_{0}+r\nu(y_{0}))\geq C.
Lemma A.3.

(cf. Lemma 2.2 in [11]) There exist positive numbers r0=r0(m)r_{0}=r_{0}(m) and c=c(λ,Λ,m)c=c(\lambda,\Lambda,m) such that for rr0r\leq r_{0} and for all xB3r(y0)Ωx\notin B_{3r}(y_{0})\cap\Omega, we have

c1rn1G(y0+rν(y0),x)Wr,y0(x)crn1G(y0+rν(y0),x),c^{-1}r^{n-1}G(y_{0}+r\nu(y_{0}),x)\leq W_{r,y_{0}}(x)\leq cr^{n-1}G(y_{0}+r\nu(y_{0}),x),

where GG is the Green’s function corresponding to LAL_{A} in Ωn+1\Omega\subset\mathbb{R}^{n+1}.

Lemma A.4.

(cf. Theorem 1.1 in [41]) There exists a positive constant K=K(n,λ,Λ)K=K(n,\lambda,\Lambda) such that if p,qΩnp,q\in\Omega\subset\mathbb{R}^{n} satisfy |pq|12d(q)|p-q|\leq\frac{1}{2}d(q), then

G(p,q)K|pq|1n,G(p,q)\geq K|p-q|^{1-n},

where GG is the Green’s function corresponding to LAL_{A} in Ωn+1\Omega\subset\mathbb{R}^{n+1}.

Proof of Theorem 4.1.

By flattening DfD_{f}, we may work on the domain Ω={0<xn+1<L}\Omega=\left\{0<x_{n+1}<L\right\}. We will only focus on proving the estimate (4.1) on the portion of the boundary, Γ0:={xn+1=0}\Gamma_{0}:=\left\{x_{n+1}=0\right\}. Let R0R_{0} be the minimum of LL and the smallest value of r0r_{0} for which the conclusions of Lemma 4.3, Lemma A.2, and Lemma A.3 hold. Evidently, R0R_{0} depends only on the Dini modulus of A()A(\cdot), the C1,DiniC^{1,\text{Dini}} modulus of ff, and other universal parameters. Since the upper bound in (4.1) is a consequence of [41, Theorem 3.3], we only show the proof of the lower bound.

Fix x,y{0<xn+1<L}x,y\in\left\{0<x_{n+1}<L\right\} and let r:=|xy|R0r:=|x-y|\leq R_{0}. Let x0x_{0} (resp. y0y_{0}) denote the point on Γ0\Gamma_{0} closest to xx (resp. yy), and define x:=x0+ren+1x^{*}:=x_{0}+re_{n+1} (resp. y:=y0+ren+1y^{*}:=y_{0}+re_{n+1}). Notice that d(x)=dist(x,Γ0)=xn+1d(x)=\text{dist}(x,\Gamma_{0})=x_{n+1} (resp. d(y)=dist(y,Γ0)=yn+1d(y)=\text{dist}(y,\Gamma_{0})=y_{n+1}). Consider the following scenarios:

Case 1: 0<d(x),d(y)r20<d(x),d(y)\leq\frac{r}{2}.
Since xBr(y)x\notin B_{r}(y^{*}), G(,x)G(\cdot,x) satisfies the hypotheses of Lemma 4.3 in Br(y)B_{r}(y^{*}) and vanishes at y0y_{0}. Hence, there exists C1=C1(λ,Λ,n)>0C_{1}=C_{1}(\lambda,\Lambda,n)>0 such that

G(y,x)C1rG(y,x)d(y).G(y,x)\geq\frac{C_{1}}{r}G(y^{*},x)d(y).

Let y^:=y0+r23en+1\hat{y}:=y_{0}+\frac{r}{2\sqrt{3}}e_{n+1}. By the Boundary Harnack Principle, there exists a constant C2=C2(λ,Λ,n)>0C_{2}=C_{2}(\lambda,\Lambda,n)>0 such that

G(y,x)C2G(y^,x).G(y^{*},x)\geq C_{2}G\left(\hat{y},x\right).

Notice that xB3r2(y0)x\notin B_{\frac{\sqrt{3}r}{2}}(y_{0}) since

|x0y0|2=|xy|2|xn+1yn+1|2r2r24=3r24.|x_{0}-y_{0}|^{2}=|x-y|^{2}-|x_{n+1}-y_{n+1}|^{2}\geq r^{2}-\frac{r^{2}}{4}=\frac{3r^{2}}{4}.

Therefore, by Lemma A.3, there exists a constant C3=C3(λ,Λ,m)>0C_{3}=C_{3}(\lambda,\Lambda,m)>0 such that

G(y^,x)C3r1nWr23,y0(x).G\left(\hat{y},x\right)\geq C_{3}r^{1-n}W_{\frac{r}{2\sqrt{3}},y_{0}}(x).

Applying Lemma 4.3 to Wr23,y0W_{\frac{r}{2\sqrt{3}},y_{0}} in Br(x)B_{r}(x^{*}), we find there exists a constant C4=C4(λ,Λ,n)>0C_{4}=C_{4}(\lambda,\Lambda,n)>0 such that

Wr23,y0(x)C4rWr23,y0(x)d(x).W_{\frac{r}{2\sqrt{3}},y_{0}}(x)\geq\frac{C_{4}}{r}W_{\frac{r}{2\sqrt{3}},y_{0}}(x^{*})d(x).

A crude estimate shows

|y^x||y^y0|+|y0x0|+|x0x|r23+r+r<5r2.|\hat{y}-x^{*}|\leq|\hat{y}-y_{0}|+|y_{0}-x_{0}|+|x_{0}-x^{*}|\leq\frac{r}{2\sqrt{3}}+r+r<\frac{5r}{2}.

It follows from a covering argument and Harnack’s inequality that there exists a constant C5=C5(λ,Λ,n)>0C_{5}=C_{5}(\lambda,\Lambda,n)>0 such that

Wr23,y0(x)C5Wr23,y0(y^).W_{\frac{r}{2\sqrt{3}},y_{0}}(x^{*})\geq C_{5}W_{\frac{r}{2\sqrt{3}},y_{0}}(\hat{y}).

Finally, by Lemma A.2, there exists a constant C6=C6(λ,Λ,m)>0C_{6}=C_{6}(\lambda,\Lambda,m)>0 such that

Wr23,y0(y^)C6.W_{\frac{r}{2\sqrt{3}},y_{0}}(\hat{y})\geq C_{6}.

Combining all the bounds above, and recalling that |xy|=r|x-y|=r we conclude that

G(x,y)Cr(n+1)d(x)d(y)=Cd(x)d(y)|xy|n+1.G(x,y)\geq Cr^{-(n+1)}d(x)d(y)=C\frac{d(x)d(y)}{|x-y|^{n+1}}.

Case 2: d(y)r2<d(x)d(y)\leq\frac{r}{2}<d(x).
Since |xy|=r|x-y|=r, it follows that d(x)|xy|+d(y)3r2d(x)\leq|x-y|+d(y)\leq\frac{3r}{2}. Let x^Br(y){xn+1=r2}\hat{x}\in\partial B_{r}(y)\cap\left\{x_{n+1}=\frac{r}{2}\right\} be the point closest to xx. Then d(x^)=r2d(x)3d(\hat{x})=\frac{r}{2}\geq\frac{d(x)}{3} and |x^y|=r=|xy||\hat{x}-y|=r=|x-y|. Consequently, by Case 1,

G(x^,y)Cd(x^)d(y)|x^y|n+1C3d(x)d(y)|xy|n+1.G(\hat{x},y)\geq C\frac{d(\hat{x})d(y)}{|\hat{x}-y|^{n+1}}\geq\frac{C}{3}\frac{d(x)d(y)}{|x-y|^{n+1}}.

On the other hand, by a covering argument and Harnack’s inequality, there exists a constant C1=C1(λ,Λ,n)>0C_{1}=C_{1}(\lambda,\Lambda,n)>0 such that

G(x,y)C1G(x^,y).G(x,y)\geq C_{1}G(\hat{x},y).

Case 3: r2<d(y),d(x)\frac{r}{2}<d(y),d(x).
In this case,

min{d(x)d(y)|xy|n+1,14|xy|n1}=14|xy|n1.\min\left\{\frac{d(x)d(y)}{|x-y|^{n+1}},\frac{1}{4|x-y|^{n-1}}\right\}=\frac{1}{4|x-y|^{n-1}}.

Let p=y+14(xy)p=y+\frac{1}{4}(x-y) and q=yq=y. Note that d(p)r2d(p)\geq\frac{r}{2} by convexity of the half-space {xn+1r2}\left\{x_{n+1}\geq\frac{r}{2}\right\}. Also, |pq|=r4<12d(q)|p-q|=\frac{r}{4}<\frac{1}{2}d(q). Consequently, by Lemma A.4, we have

G(p,y)=G(p,q)K|pq|1n=K4n1|xy|1n.G(p,y)=G(p,q)\geq K|p-q|^{1-n}=K4^{n-1}|x-y|^{1-n}.

On the other hand, by connecting the points pp and xx using a Harnack chain using balls of radius r8\frac{r}{8}, and applying Harnack’s inequality to the positive solution G(,y)G(\cdot,y), we conclude that there exists a positive constant C3=C3(n,λ,Λ)C_{3}=C_{3}(n,\lambda,\Lambda) such that

G(x,y)C3G(p,y).G(x,y)\geq C_{3}G(p,y).

The estimate (4.1) thus follows.

In order to address the behavior of PfP_{f} in nBR\mathbb{R}^{n}\setminus B_{R}, for large RR, we need a variation on the barrier function given in Lemma A.1. The difference between the two results is that Lemma A.1 applies to the situation for r(0,1]r\in(0,1], whereas in the Lemma A.5, r>1r>1. This is a modification of a well known result about the uniform Hölder continuity of solutions to equations with bounded measurable coefficients in domains with an exterior cone condition, e.g. [41, Lemma 7.1].

Lemma A.5.

There exists constants, C>0C>0, α(0,1]\alpha\in(0,1], and ε>0\varepsilon>0, depending on the Dini modulus and ellipticity of AA and nn, so that for all r>1r>1, and for vv as in Lemma A.1, for all |X|r+ε\left|X\right|\leq r+\varepsilon,

v(X)Cd(X)rα.\displaystyle v(X)\leq C\frac{d(X)}{r^{\alpha}}.
Proof of Lemma A.5.

First, we note that in Lemma A.1, the constant, CC, to depended upon only the Dini modulus of AA, ellipticity, and nn. The scaling argument used for r<1r<1 in Lemma A.1 will not work here because in order to have AA given in a ball of radius r>1r>1, the result at scale 11 must be applied to the coefficients A(rx)A(rx), whose Dini modulus blows up as rr is large.

Thus, instead, we can appeal to results at scale r=1r=1 that only depend on ellipticity, and then rescale the equation in 𝒜2r\mathcal{A}_{2r} to 𝒜2\mathcal{A}_{2}, which preserves ellipticity, but not the Dini modulus. This is the reason for the appearance of the factor rαr^{\alpha} for possibly α<1\alpha<1. To this end, we simply note that for v1v_{1} that solves equation A.1 with r=1r=1, v1v_{1} is Hölder continuous for some universal α(0,1]\alpha\in(0,1] in 𝒜¯2\overline{\mathcal{A}}_{2}. Thus, under rescaling, we see that as v10v_{1}\equiv 0 on B1\partial B_{1}, (e.g. [41, Lemma 1.7])

0v1(X)Cd(X)α.\displaystyle 0\leq v_{1}(X)\leq Cd(X)^{\alpha}.

Under rescaling, back to the case of vrv_{r} that solves (A.1) in 𝒜2r\mathcal{A}_{2r}, we have

0vr(X)Cd(X)αrα.\displaystyle 0\leq v_{r}(X)\leq C\frac{d(X)^{\alpha}}{r^{\alpha}}.

Now, as the domain 𝒜2r\mathcal{A}_{2r} enjoys a uniform exterior ball condition of radius r>1r>1, we can invoke the Dini property of AA to use a barrier for vv near the boundary Br\partial B_{r}. In particular, we can use a barrier in an outer annulus with inner radius 11, outer radius 22 (given in Lemma A.1), to conclude that

v(X)Cd(X)rα.\displaystyle v(X)\leq C\frac{d(X)}{r^{\alpha}}.

This, of course follows from the fact that the first estimate established in this proof that for all XX with r<|X|r+1r<\left|X\right|\leq r+1, v(X)C1rαv(X)\leq C\frac{1}{r^{\alpha}}.

Proof of Proposition 4.5.

First of all, we address the bounds for the case |XY|<R0\left|X-Y\right|<R_{0}. As

Pf(X,Y)=(νG(X,))(Y),\displaystyle P_{f}(X,Y)=(\partial_{\nu}G(X,\cdot))(Y),

we see that the bounds on PfP_{f} are immediate from Theorem 4.1.

Now, we focus on the second estimate. We may assume, without loss of generality, that X=X0=(0,f(0))X=X_{0}=(0,f(0)). Notice that

ΓfBR(X0)Pf(X0+sν(X0),Y)𝑑σ(Y)=W(X0+sν(X0)),\int_{\Gamma_{f}\setminus B_{R}(X_{0})}P_{f}(X_{0}+s\nu(X_{0}),Y)d\sigma(Y)=W(X_{0}+s\nu(X_{0})),

where WW solves the equation

{ΔW=0inDf,W=𝟙BRc(X0)onΓf,W=0on{xn+1=0}.\begin{cases}\Delta W=0\ \text{in}\ D_{f},\\ W={\mathbbm{1}}_{B_{R}^{c}(X_{0})}\ \text{on}\ \Gamma_{f},\\ W=0\ \text{on}\ \{x_{n+1}=0\}.\end{cases}

We next flatten the domain DfD_{f} by using the transformation TfT_{f} defined in (4.2). The function W~=WTf1\tilde{W}=W\circ T_{f}^{-1} then solves

{div(A(y)W~(y))=0inn×[0,L],W~=𝟙BRc(0,L)on{yn+1=L},W~=0on{yn+1=0},\begin{cases}\mathop{\textnormal{div}}\nolimits(A(y)\nabla\tilde{W}(y))=0\ \text{in}\ \mathbb{R}^{n}\times[0,L],\\ \tilde{W}={\mathbbm{1}}_{B_{R}^{c}(0,L)}\ \text{on}\ \left\{y_{n+1}=L\right\},\\ \tilde{W}=0\ \text{on}\ \{y_{n+1}=0\},\end{cases}

with A(y)(n+1)×(n+1)A(y)\in\mathbb{R}^{(n+1)\times(n+1)} uniformly elliptic and Dini continuous (depending on δ\delta, LL, mm, ω\omega). Note that 0W~10\leq\tilde{W}\leq 1 on n×[0,L]\mathbb{R}^{n}\times[0,L] by the comparison principle.

We now extend the coefficients AA to all of n+1\mathbb{R}^{n+1} in a Dini continuous fashion with the same modulus of continuity ω\omega, and denote them A^\hat{A}. The corresponding divergence form operator on n+1\mathbb{R}^{n+1} will be denoted L^:=div(A^(y))\hat{L}:=\mathop{\textnormal{div}}\nolimits(\hat{A}(y)\nabla\cdot). Note that A^\hat{A} can also be taken to satisfy the same ellipticity conditions as AA. Now suppose R>3LR>\sqrt{3}L, and let Y0=(0,L+R3)Y_{0}=(0,L+\frac{R}{\sqrt{3}}). On the annular domain 𝒜2R3(Y0)\mathcal{A}_{\frac{2R}{\sqrt{3}}}(Y_{0}), consider the function φ\varphi which solves the problem

{L^φ=0 in 𝒜2R3(Y0),φ=0 on BR3(Y0),φ=1 on B2R3(Y0).\begin{cases}\hat{L}\varphi=0\text{ in }\mathcal{A}_{\frac{2R}{\sqrt{3}}}(Y_{0}),\\ \varphi=0\text{ on }\partial B_{\frac{R}{\sqrt{3}}}(Y_{0}),\\ \varphi=1\text{ on }\partial B_{\frac{2R}{\sqrt{3}}}(Y_{0}).\\ \end{cases}

By Lemma A.5 (we can assume, without loss of generality that R>1R>1), there exists constant K=K(n,λ,Λ,ω)K=K(n,\lambda,\Lambda,\omega) such that when R>1R>1, |φ(X)|Cd(X)Rα\left|\varphi(X)\right|\leq C\frac{d(X)}{R^{\alpha}} for all X𝒜2R3(Y0)X\in\mathcal{A}_{\frac{2R}{\sqrt{3}}}(Y_{0}) with R<|X|<R+εR<\left|X\right|<R+\varepsilon. Consequently, since φ(0,L)=0\varphi(0,L)=0, we conclude that φ(0,Ls)KsRα\varphi(0,L-s)\leq\frac{Ks}{R^{\alpha}} for all s>0s>0 sufficiently small.

It remains to show that W~φ\tilde{W}\leq\varphi on ΩR:=𝒜2R3(Y0)n×[0,L]\Omega_{R}:=\mathcal{A}_{\frac{2R}{\sqrt{3}}}(Y_{0})\cap\mathbb{R}^{n}\times[0,L]. To show this, notice that ΩR\partial\Omega_{R} consists of three pieces; the first two are the flat portions consisting of the intersection of 𝒜2R3(Y0)\mathcal{A}_{\frac{2R}{\sqrt{3}}}(Y_{0}) with {yn+1=0}\left\{y_{n+1}=0\right\} and {yn+1=L}\left\{y_{n+1}=L\right\} respectively, while the third piece is the intersection of B2R3(Y0)\partial B_{\frac{2R}{\sqrt{3}}}(Y_{0}) with n×[0,L]\mathbb{R}^{n}\times[0,L]. On the flat portions, we know W~=0\tilde{W}=0 and since φ0\varphi\geq 0 by the maximum principle, we see that φW~\varphi\geq\tilde{W} on this portion of ΩR\partial\Omega_{R}. On the remaining portion of ΩR\partial\Omega_{R}, we know that φ=1\varphi=1 and since W~1\tilde{W}\leq 1 on n×[0,L]\mathbb{R}^{n}\times[0,L], we conclude that φW~\varphi\geq\tilde{W} on this piece of ΩR\partial\Omega_{R} as well. Consequently, by the maximum principle, φW~\varphi\geq\tilde{W} on ΩR\Omega_{R}. In particular, W~(0,Ls)φ(0,Ls)KsRα\tilde{W}(0,L-s)\leq\varphi(0,L-s)\leq\frac{Ks}{R^{\alpha}} for all s>0s>0 sufficiently small. Rewriting this in terms of WW, we obtain the desired estimate (4.4).

With only a few modifications, we can adapt the proof of Proposition 4.5 to also give the proof of Lemma 4.6.

Proof of Lemma 4.6.

We note that in this setting, as DfD_{f} is a Lipschitz domain, then PfP_{f} exists and is an AA^{\infty} weight as in [35], and by the above results, PfP_{f} will be more regular when restricted to BRB_{R}, as in that region, Γ(f)\Gamma(f) is C1,DiniC^{1,\textnormal{Dini}}.

We see that this time, we have

ΓfB2R(X)Pf(X+sν(X),Y)𝑑σf(Y)=W(X+sν(X)),\displaystyle\int_{\Gamma_{f}\setminus B_{2R}(X)}P_{f}(X+s\nu(X),Y)d\sigma_{f}(Y)=W(X+s\nu(X)),

where WW is the unique solution of

{ΔW=0inDfW=𝟙B2Rc(X)onΓfW=0on{xn+1=0}.\displaystyle\begin{cases}\Delta W=0\ &\text{in}\ D_{f}\\ W={\mathbbm{1}}_{B_{2R}^{c}(X)}\ &\text{on}\ \Gamma_{f}\\ W=0\ &\text{on}\ \{x_{n+1}=0\}.\end{cases}

Owing to the fact that ff is globally Lipschitz and Cρ1,Dini(B2R)C^{1,\textnormal{Dini}}_{\rho}(B_{2R}), we see that after the straightening procedure, W~\tilde{W} solves an equation on n×[0,L]\mathbb{R}^{n}\times[0,L], with coefficients, A^\hat{A}, that have been extended to all of n+1\mathbb{R}^{n+1} and that are Dini continuous in B2R×B_{2R}\times\mathbb{R}, while they are globally bounded and uniformly elliptic. We note that we are now concerned with the behavior of W~\tilde{W} at X~sen+1\tilde{X}-se_{n+1}, where for X=(x,f(x))X=(x,f(x)), X~=(x,L)\tilde{X}=(x,L). Thus, for the barrier, φ\varphi, we can now center the annular region at Y0=(x,L+R3)Y_{0}=(x,L+\frac{R}{\sqrt{3}}). As X~BR×[0,L]\tilde{X}\in B_{R}\times[0,L], it also holds that 𝒜R(Y0)\mathcal{A}_{R}(Y_{0}) is contained in B2R×B_{2R}\times\mathbb{R}, in which A^\hat{A} is Dini continuous. Thus, Lemma A.5 is applicable. The rest of the proof is the same.

References

  • [1] T. Alazard, N. Burq, and C. Zuily. On the Cauchy problem for gravity water waves. Invent. Math., 198(1):71–163, 2014.
  • [2] Thomas Alazard. Convexity and the hele-shaw equation. arXiv preprint arXiv:2003.02618, 2020.
  • [3] Thomas Alazard, Nicolas Meunier, and Didier Smets. Lyapounov functions, identities and the cauchy problem for the hele-shaw equation. arXiv preprint arXiv:1907.03691, 2019.
  • [4] David M. Ambrose. Well-posedness of two-phase Hele-Shaw flow without surface tension. European J. Appl. Math., 15(5):597–607, 2004.
  • [5] I. Athanasopoulos, L. Caffarelli, and S. Salsa. Regularity of the free boundary in parabolic phase-transition problems. Acta Math., 176(2):245–282, 1996.
  • [6] Begoña Barrios, Alessio Figalli, and Enrico Valdinoci. Bootstrap regularity for integro-differential operators and its application to nonlocal minimal surfaces. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5), 13(3):609–639, 2014.
  • [7] Richard F. Bass. Regularity results for stable-like operators. J. Funct. Anal., 257(8):2693–2722, 2009.
  • [8] Richard F. Bass and David A. Levin. Harnack inequalities for jump processes. Potential Anal., 17(4):375–388, 2002.
  • [9] Richard F. Bass and David A. Levin. Transition probabilities for symmetric jump processes. Trans. Amer. Math. Soc., 354(7):2933–2953 (electronic), 2002.
  • [10] Krzysztof Bogdan. Sharp estimates for the Green function in Lipschitz domains. J. Math. Anal. Appl., 243(2):326–337, 2000.
  • [11] L. Caffarelli, E. Fabes, S. Mortola, and S. Salsa. Boundary behavior of nonnegative solutions of elliptic operators in divergence form. Indiana Univ. Math. J., 30(4):621–640, 1981.
  • [12] Luis Caffarelli and Luis Silvestre. Regularity theory for fully nonlinear integro-differential equations. Comm. Pure Appl. Math., 62(5):597–638, 2009.
  • [13] Luis A. Caffarelli. A Harnack inequality approach to the regularity of free boundaries. I. Lipschitz free boundaries are C1,αC^{1,\alpha}. Rev. Mat. Iberoamericana, 3(2):139–162, 1987.
  • [14] Luis A. Caffarelli. A Harnack inequality approach to the regularity of free boundaries. III. Existence theory, compactness, and dependence on XX. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4), 15(4):583–602 (1989), 1988.
  • [15] Luis A. Caffarelli. A Harnack inequality approach to the regularity of free boundaries. II. Flat free boundaries are Lipschitz. Comm. Pure Appl. Math., 42(1):55–78, 1989.
  • [16] R. E. Caflisch, O. F. Orellana, and M. Siegel. A localized approximation method for vortical flows. SIAM J. Appl. Math., 50(6):1517–1532, 1990.
  • [17] Stephen Cameron. Global well-posedness for the two-dimensional Muskat problem with slope less than 1. Anal. PDE, 12(4):997–1022, 2019.
  • [18] Stephen Cameron. Global wellposedness for the 3d muskat problem with medium size slope. arXiv preprint arXiv:2002.00508, 2020.
  • [19] Héctor Chang Lara. Regularity for fully non linear equations with non local drift. arXiv:1210.4242 [math.AP], 2012.
  • [20] Héctor Chang Lara and Gonzalo Dávila. Regularity for solutions of non local parabolic equations. Calc. Var. Partial Differential Equations, 49(1-2):139–172, 2014.
  • [21] Héctor Chang-Lara and Gonzalo Dávila. Regularity for solutions of nonlocal parabolic equations II. J. Differential Equations, 256(1):130–156, 2014.
  • [22] Héctor A. Chang-Lara and Gonzalo Dávila. Hölder estimates for non-local parabolic equations with critical drift. J. Differential Equations, 260(5):4237–4284, 2016.
  • [23] Héctor A Chang-Lara and Nestor Guillen. From the free boundary condition for hele-shaw to a fractional parabolic equation. arXiv preprint arXiv:1605.07591, 2016.
  • [24] Héctor A. Chang-Lara, Nestor Guillen, and Russell W. Schwab. Some free boundary problems recast as nonlocal parabolic equations. Nonlinear Analysis, 189:111538, 2019.
  • [25] Sungwon Cho. Two-sided global estimates of the Green’s function of parabolic equations. Potential Anal., 25(4):387–398, 2006.
  • [26] Sunhi Choi, David Jerison, and Inwon Kim. Regularity for the one-phase Hele-Shaw problem from a Lipschitz initial surface. Amer. J. Math., 129(2):527–582, 2007.
  • [27] Sunhi Choi, David Jerison, and Inwon Kim. Local regularization of the one-phase Hele-Shaw flow. Indiana Univ. Math. J., 58(6):2765–2804, 2009.
  • [28] F. H. Clarke. Optimization and nonsmooth analysis, volume 5 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, second edition, 1990.
  • [29] Peter Constantin, Diego Córdoba, Francisco Gancedo, and Robert M. Strain. On the global existence for the Muskat problem. J. Eur. Math. Soc. (JEMS), 15(1):201–227, 2013.
  • [30] Peter Constantin, Francisco Gancedo, Roman Shvydkoy, and Vlad Vicol. Global regularity for 2D Muskat equations with finite slope. Ann. Inst. H. Poincaré Anal. Non Linéaire, 34(4):1041–1074, 2017.
  • [31] Antonio Córdoba, Diego Córdoba, and Francisco Gancedo. Interface evolution: the Hele-Shaw and Muskat problems. Ann. of Math. (2), 173(1):477–542, 2011.
  • [32] Diego Córdoba and Francisco Gancedo. Contour dynamics of incompressible 3-D fluids in a porous medium with different densities. Comm. Math. Phys., 273(2):445–471, 2007.
  • [33] Diego Córdoba and Francisco Gancedo. A maximum principle for the Muskat problem for fluids with different densities. Comm. Math. Phys., 286(2):681–696, 2009.
  • [34] W. Craig and C. Sulem. Numerical simulation of gravity waves. J. Comput. Phys., 108(1):73–83, 1993.
  • [35] Björn E. J. Dahlberg. Estimates of harmonic measure. Arch. Rational Mech. Anal., 65(3):275–288, 1977.
  • [36] P. Daskalopoulos and Ki-Ahm Lee. All time smooth solutions of the one-phase Stefan problem and the Hele-Shaw flow. Comm. Partial Differential Equations, 29(1-2):71–89, 2004.
  • [37] D. De Silva. Free boundary regularity for a problem with right hand side. Interfaces Free Bound., 13(2):223–238, 2011.
  • [38] Hongjie Dong, Tianling Jin, and Hong Zhang. Dini and Schauder estimates for nonlocal fully nonlinear parabolic equations with drifts. Anal. PDE, 11(6):1487–1534, 2018.
  • [39] C. M. Elliott and V. Janovský. A variational inequality approach to Hele-Shaw flow with a moving boundary. Proc. Roy. Soc. Edinburgh Sect. A, 88(1-2):93–107, 1981.
  • [40] Joachim Escher and Gieri Simonett. Classical solutions of multidimensional Hele-Shaw models. SIAM J. Math. Anal., 28(5):1028–1047, 1997.
  • [41] Michael Grüter and Kjell-Ove Widman. The green function for uniformly elliptic equations. Manuscripta Mathematica, 37(3):303–342, 1982.
  • [42] Nestor Guillen and Russell W. Schwab. Min–max formulas for nonlocal elliptic operators on euclidean space. Nonlinear Analysis, 2019.
  • [43] Nestor Guillen and Russell W Schwab. Min-max formulas for nonlocal elliptic operators. Calc. Var. Partial Differential Equations, To Appear, 2019.
  • [44] Cyril Imbert and Luis Silvestre. The schauder estimate for kinetic integral equations. arXiv preprint arXiv:1812.11870, 2018.
  • [45] Cyril Imbert and Luis Silvestre. Global regularity estimates for the boltzmann equation without cut-off. arXiv preprint arXiv:1909.12729, 2019.
  • [46] Cyril Imbert and Luis Silvestre. Regularity for the boltzmann equation conditional to macroscopic bounds. arXiv preprint arXiv:2005.02997, 2020.
  • [47] Cyril Imbert and Luis Silvestre. The weak Harnack inequality for the Boltzmann equation without cut-off. J. Eur. Math. Soc. (JEMS), 22(2):507–592, 2020.
  • [48] Tianling Jin and Jingang Xiong. Schauder estimates for solutions of linear parabolic integro-differential equations. Discrete Contin. Dyn. Syst., 35(12):5977–5998, 2015.
  • [49] Inwon Kim. Long time regularity of solutions of the Hele-Shaw problem. Nonlinear Anal., 64(12):2817–2831, 2006.
  • [50] Inwon C. Kim. Uniqueness and existence results on the Hele-Shaw and the Stefan problems. Arch. Ration. Mech. Anal., 168(4):299–328, 2003.
  • [51] Inwon C. Kim. Regularity of the free boundary for the one phase Hele-Shaw problem. J. Differential Equations, 223(1):161–184, 2006.
  • [52] D. Kinderlehrer and L. Nirenberg. Regularity in free boundary problems. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4), 4(2):373–391, 1977.
  • [53] David Kinderlehrer and Louis Nirenberg. Analyticity at the boundary of solutions of nonlinear second-order parabolic equations. Comm. Pure Appl. Math., 31(3):283–338, 1978.
  • [54] J. R. King, A. A. Lacey, and J. L. Vázquez. Persistence of corners in free boundaries in Hele-Shaw flow. volume 6, pages 455–490. 1995. Complex analysis and free boundary problems (St. Petersburg, 1994).
  • [55] Dennis Kriventsov. C 1, α\alpha interior regularity for nonlinear nonlocal elliptic equations with rough kernels. Communications in Partial Differential Equations, 38(12):2081–2106, 2013.
  • [56] N. V. Krylov and M. V. Safonov. A property of the solutions of parabolic equations with measurable coefficients. Izv. Akad. Nauk SSSR Ser. Mat., 44(1):161–175, 239, 1980.
  • [57] David Lannes. Well-posedness of the water-waves equations. J. Amer. Math. Soc., 18(3):605–654, 2005.
  • [58] R. Mikulevicius and H. Pragarauskas. On the Cauchy problem for integro-differential operators in Hölder classes and the uniqueness of the martingale problem. Potential Anal., 40(4):539–563, 2014.
  • [59] Huy Q. Nguyen and Benoît Pausader. A paradifferential approach for well-posedness of the Muskat problem. Arch. Ration. Mech. Anal., 237(1):35–100, 2020.
  • [60] David P. Nicholls and Fernando Reitich. A new approach to analyticity of Dirichlet-Neumann operators. Proc. Roy. Soc. Edinburgh Sect. A, 131(6):1411–1433, 2001.
  • [61] P. G. Saffman and Geoffrey Taylor. The penetration of a fluid into a porous medium or Hele-Shaw cell containing a more viscous liquid. Proc. Roy. Soc. London. Ser. A, 245:312–329. (2 plates), 1958.
  • [62] Ovidiu Savin. Small perturbation solutions for elliptic equations. Comm. Partial Differential Equations, 32(4-6):557–578, 2007.
  • [63] Russell W. Schwab and Luis Silvestre. Regularity for parabolic integro-differential equations with very irregular kernels. Anal. PDE, 9(3):727–772, 2016.
  • [64] Joaquim Serra. Regularity for fully nonlinear nonlocal parabolic equations with rough kernels. Calc. Var. Partial Differential Equations, 54(1):615–629, 2015.
  • [65] Michael Siegel, Russel E. Caflisch, and Sam Howison. Global existence, singular solutions, and ill-posedness for the Muskat problem. Comm. Pure Appl. Math., 57(10):1374–1411, 2004.
  • [66] Luis Silvestre. Hölder estimates for solutions of integro-differential equations like the fractional Laplace. Indiana Univ. Math. J., 55(3):1155–1174, 2006.
  • [67] Luis Silvestre. On the differentiability of the solution to the Hamilton-Jacobi equation with critical fractional diffusion. Adv. Math., 226(2):2020–2039, 2011.
  • [68] Luis Silvestre. Regularity estimates for parabolic integro-differential equations and applications. In Proceedings of the International Congress of Mathematicians—Seoul 2014. Vol. III, pages 873–894. Kyung Moon Sa, Seoul, 2014.
  • [69] Luis Silvestre. A new regularization mechanism for the Boltzmann equation without cut-off. Comm. Math. Phys., 348(1):69–100, 2016.
  • [70] E. M. Stein. Singular Integrals and Differentiability Properties of Functions. Princeton University Press, Princeton, 1971.
  • [71] Vladimir E Zakharov. Stability of periodic waves of finite amplitude on the surface of a deep fluid. Journal of Applied Mechanics and Technical Physics, 9(2):190–194, 1968.
  • [72] Zhong Xin Zhao. Uniform boundedness of conditional gauge and Schrödinger equations. Comm. Math. Phys., 93(1):19–31, 1984.