Uniform-in-time propagation of chaos for second order interacting particle systems
Abstract
We study the long time behavior of second order particle systems interacting through global Lipschitz kernels. Combining hypocoercivity method in [37] and relative entropy method in [26], we are able to overcome the degeneracy of diffusion in position direction by controlling the relative entropy and relative Fisher information together. This implies the uniform-in-time propagation of chaos through the strong convergence of all marginals. Our method works at the level of Liouville equation and relies on the log Sobolev inequality of equilibrium of Vlasov-Fokker-Planck equation.
1 Introduction
1.1 Framework
In this article, we consider the stochastic second order particle systems for indistinguishable point-particles, subject to a confining external force and an interacting kernel ,
(1.1) |
where . We take the position in , which may be the whole space or the periodic torus , while each velocity lies in . Note that denotes independent copies of Wiener processes on . We take the diffusion coefficient before the Brownian motions as a constant for simplicity. Usually one can take , where the constant denotes the friction parameter, and is the inverse temperature. In more general models, those may depend on the number of particles , the position of particles , etc.
In many important models, the kernel is given by , where is an interacting potential. A well-known example for is the Coulomb potential. We recall that the Hamiltonian energy associated to the particle system (1.1) reads as
(1.2) |
which is the sum of the kinetic energy and the potential energy defined by
(1.3) |
We recall the Liouville equation as in [26], which describes the joint distribution of the particle system (1.1) on ,
(1.4) |
We define the associated Liouville operator as
(1.5) |
As the mean field theory indicates, when , the limiting behavior of any particle in (1.1) is described by the McKean-Vlasov SDE
(1.6) |
where , denotes a standard Brownian motion on . We then denote its phase space density by and the spatial density by
The law further satisfies the mean-field equation or the nonlinear Fokker-Planck equation
(1.7) |
If , one can check that its nonlinear equilibrium satisfies the following equation
(1.8) |
where is the constant to make a probability density. In the literature, one may use the “formal equilibrium” of Eq.(1.7) at time to refer
(1.9) |
again where is the constant to make a probability density.
It is well-known that the large limit of particle system (1.1) is mathematically formalized by the notion of originating in [29]. See also the surveys for instance [36, 18, 25, 27]. Recently, many important works have been done in quantifying propagation of chaos for different kinds of first order interacting particle systems with singualr kernels. See for instance [27, 11] for detailed descriptions of recent development. However, for 2nd order systems or Newton’s dynamcis for interacting particles, the natural question is to consider the mean field limit of a purely deterministic problem, that is as in Eq. (1.1) or our setting with the diffusion only acts on the velocity variables. Thus the limiting equation (1.7) has the Laplacian term only in its velocity variable. Consequently, we cannot treat more general kernels and long time propagation of chaos for 2nd order systems by exploiting entropy dissipation as easy as those in the first-order setting for instance as in [28].
The main purposes of this article are two-fold. Firstly, we study the long time convergence of the solution to the N-particle Liouville equation (1.4) toward its unique equilibrium (i.e. the Gibbs measure) uniformly in . Secondly, we establish propagation of chaos of (1.1) (or (1.4) )toward (1.6) (or (1.7) ) uniformly in time. We combine the entropy method developed in [26] with the hypocoercivity method refined in [37] to overcome the degeneracy of diffusion in position direction. Here we deal with the cases where the interaction kernel is globally Lipschitz and the strength of diffusion is large enough. Uniform-in-time propagation of chaos cannot hold in full generality, one of the most critical issues is that the non-linear equilibrium of (1.7) might not be unique. Furthermore, for systems (1.4) with singular interactions , for instance the Coulomb interactions, even the mean-field limit/propagation of chaos for general initial data on a fixed finite time horizon, for instance , is still widely open. The recent progress can be found for instance in [23, 24, 26, 31, 7, 6, 5]. Uniform-in- convergence of (1.4) toward its equilibrium is seldom investigated for systems even with very mild singularities. Those will be the objects of our further study.
Let us briefly describe how we combine the relative entropy with hypocoercivity to establish propagation of chaos. To illustrate the ideas, we pretend all solutions are classical and there is no regularity issue when we take derivatives. We first recall the evolution of relative entropy in [26] for systems with bounded kernel ,
(1.10) | ||||
where is the normalized relative entropy defined as
and is the error term defined as
Hereinafter we write that , and . Then by integration by parts, one can rewrite (1.10) as
(1.11) | ||||
where is a -dimensional vector defined as
Inspired by the strategy used in [19] to deal with the first order dynamics on torus, we expect to find the term
in some way to use log-Sobolev inequality for the second order system (1.1). Fortunately, hypocoercivity in entropy sense in [37] motivates us to take the derivative of normalized relative Fisher information , which is defined as
where is a positive defined matrix. If we choose the matrix as the form
where are positive definite matrices to be specified later, then we obtain
(1.12) | ||||
where are two positive constants that depend on , and . The concrete form of error terms will be given in Lemma 4 and Lemma 6. Roughly speaking, time derivative of relative entropy only provides us dissipation in the direction of velocity and time derivative of relative Fisher information provides us further dissipation in the direction of position. The reference measure of (1.10) and (1.12) could be more general. For insatnce, if we replace by , we can show the convergence from towards as .
Now we combine these two quantities, relative entropy and Fisher information, to form a new “modulated energy” as
where may take or . The remaining argument is to appropriately control error terms in (1.12). When , linear structure of the Liouville equation (1.4) makes error terms vanish. When , error terms in (1.12) can be written as
(1.13) |
for suitable selection of (See Corollary 2), here . However, the terms and are too difficult to control to obtain some uniform-in-time estimates for (1.13) (See Remark 15). The key observation is that and are trivial terms if we replace by its non-linear equilibrium, then we can show
(1.14) | ||||
where are some constants that only depend on and (See Theorem 1.2). To control the error between and , we use the convergence result from to its equilibrium . There exist some results studying this type of convergence . In this article, we adapt the arguments as in [21] for with small and [13] for the case when and the interaction energy is convex to establish the following estimate
(1.15) |
where depends on and depends on initial data . We give the sketch of proof of (1.15) in Theorem 1.4 and Theorem 1.5 in the appendix. In the end, we combine (1.14) with (1.15) through 2-Wasserstein distance to conclude.
Finally, we give some comments about terms and . Unfortunately we cannot directly obtain uniform-in-time estimates about these terms, otherwise we would have
for some and then finish the proof by Gronwall’s inequality. Uniform-in-time propagation of chaos can only be expected when the limiting PDE (1.7) has a unique equilibrium. Indeed, we obtain Theorem 1.3 under the assumptions in Theorem 1.2 and some assumption on which prevents the existence of multiple equilibria of Eq.(1.7). Similar arguments/estimates have also appeared in the first-order setting. For instance, the authors of [17] obtained Li-Yau type growth estimates of with respect to the position for any fixed time horizon in particular in the case of 2d Navier-Stokes equation, with a universal constant growing with time . Those type estimates are very useful when one treats the mean field limit problem set on the whole space [35, 8].
1.2 Main results and examples
Let us fix some notations first. We denote as a phase space configuration of one particle, and , and for configurations in position, velocity and phase space of particles, respectively. We also use that and . As the same spirit, we write operator , and . Those are operators on .
For two probability measures on , we denote by theset of all couplings between and . We define Wasserstein distance as
For a measure and a positive integrable function such that , we define the entropy of with respect to as
For two probability measures and , we define the relative entropy between and as
and the normalized version of relative entropy for probability measures and on as
(1.16) |
We also define relative Fisher information between and as
where is a positive definite matrix-valued function such that for some independent on . Similarly, the normalized version of relative Fisher information between two probability measures and on reads as
(1.17) |
We abuse the notation of probability measures and as probability densities if they have densities. Finally, we give definitions of some functional inequalities that we will use in the following.
Definition 1.1.
(log-Sobolev inequality) We call that the probability measure on satisfies the log-Sobolev inequality if there exists some constant such that for all smooth function with , it holds that
(1.18) |
Definition 1.2.
(Weighted log-Sobolev inequality) Let be a smooth function on . We call that the probability measure on satisfies the weighted log-Sobolev inequality with weight if there exists some constant such that for all smooth function with , it holds that
(1.19) |
In the following, we use () to denote the (weighted) log-Sobolev inequality constant of the equilibrium measure of the limiting PDE (1.7) and to denote the uniform-in- log-Sobolev inequality constant of the stationary measure of the Liouville equation (1.4).
Now we detail assumptions about interaction potential and confining potential .
Assumption 1.
Suppose that and there exist such that .
The first condition means that goes to infinity at infinity and is bounded below. It can be implied by
for some . This expression implies that the force drags particles back to some compact set. Detailed proof can be found in [20].
The second assumption implies that the potential grows at most quadratically on .
Assumption 2.
Suppose that and there exists such that .
We also treat more general confining potentials when .
Assumption 3.
Suppose that and there exist and such that . Moreover, outside a compact domain on , we assume that satisfies
for some ;
for some positive constant .
Those conditions in Assumption 3 have been explored in [10] for kinetic Langevin process with confining potentials greater than quadratic growth at infinity. The boundedness of extend the quadratic growth condition and the other two conditions guarantee the weighted log-Sobolev inequality of (See Section 2.3). We also use multiplier method developed in [10] to deal with this type of confining potentials.
Example 1.
Some important examples have been provided in [10]. The first kind of examples is . Then we have , and . Finally, we take . Then all conditions above can be satisfied.
Example 2.
Another kind of examples is provided in [10], which shows that the limit growth of must be below the exponential growth. Observing that , and , the conditions above imply and .
Assumption 4.
Suppose that and there exists such that .
Assumption 5.
The mean field functional or the interaction energy defined as
is functional convex, i.e. for every and every ,
(1.20) |
Example 3.
Remark 1.
The harmonic interaction potential does not satisfy Assumption 5. Let us take
then for ,
which does not hold in general.
With those specific statement of assumptions as above, we can now state our main results.
Theorem 1.1.
Remark 2.
We can choose matrix as the following
where
and two constants and satisfy
Then the constant can be taken as . By the selection of , we observe that , i.e. the larger diffusion strength we have, the faster convergence from to .
Our second contribution is the uniform-in- exponential convergence from to .
Theorem 1.2.
Suppose that and satisfy one of the following two cases,
For the first case, we take as a constant matrix, then for initial data of Eq.(1.4) such that and , we have
(1.22) |
where and are explicit and independent of .
For the second case, we take as a matrix function on , then for initial data of Eq.(1.4) such that and , we have
(1.23) |
where are explicit and independent of .
Remark 3.
For the first case, we also choose as in Remark 2, but two constants and now should satisfy
then the constant can be taken as which implies , and the lower bound of diffusion constant satisfies
For the second case, we choose as
where are diagonal matrices. We choose and as
where
and two constants and satisfy
where , then the constant can be taken as , which implies , and the lower bound of diffusion constant satisfies
where .
Remark 4.
Theorem 1.2 implies that second order particle system (1.1) not only exponentially converges to its equilibrium, but also converges to the unique mean field equilibrium as . If we take , the results (1.22) and (1.23) imply that,
which offers us a kind of dynamical approach to prove the concentration of the Gibbs or stationary measure of the second order particle system around the nonlinear equilibrium of the limiting equation (1.7).
Combining the exponential convergence from to , we could replace by in last theorem so that avoid the estimates about and . Based on this observation, we establish the uniform-in-time propagation of chaos both in the sense of the Wasserstein distance and relative entropy.
Theorem 1.3.
Suppose that and satisfy one of the following two cases,
(i) satisfies Assumption 1 and 2, satisfies Assumption 4. Moreover, we assume either is small or interaction functional satisfies Assumption 5.
(ii) satisfies Assumption 1 and 3, satisfies Assumption 4 and . Moreover, we assume either is small or interaction functional satisfies Assumption 5.
Remark 5.
Remark 6.
The main difference of assumptions in Theorem 1.3 compared with those in Theorem 1.2 is that is small or the interaction functional is convex. Those two conditions make sure that exponentially converges to . The small condition of comes from Theorem 10 in [21], which obtains the exponential convergence from to by uniform log Sobolev inequality of . The convexity condition of is inspired by Theorem 2.1 in [13], this kind of condition avoid the smallness assumption on . We extend their result to more general confining potentials in the Appendix.
Remark 7.
The uniform-in-time propagation of chaos of second order particle system (1.1) has been investigated in [20] and [13]. Compared with [20] that exploits the coupling method, our result applies to more general confining potentials. In terms of Theorem 2.3 in [13], we do not need the uniform-in- log-Sobolev inequality of measure proportional to , which is not very easy to verify.
1.3 Related literature
Hypocoercivity. Hypocoercivity is an important analytical tool to study the long time behavior of Langevin dynamics and the corresponding kinetic Fokker-Planck equation. It was initiated by Villani [37] and and then later advanced by Dolbeault, Mouhot and Schmeiser in [15] and [16]. However, those now well-known results are only restricted to the one particle dynamics without any interactions. For the particle system given by the Liouville equation (1.4), the natural stationary measure is simply the Gibbs measure given by the following form
A natural problem is that whether or not the convergence rate from toward depends on the number of particles . Many researchers have contributed to this problem. Guillin, etc study the uniform in functional inequalities in [21]. Guillin and Monmarché show uniform-in- exponential decay rate in [33] and [22] by “Generalized calculus” developed in [34] and uniform log-Sobolev inequality in [21]. Guillin, etc also use type norm to show the uniform-in- exponential decay rate by hypocoercivity and uniform Poincaré inequality in [21]. These result are all restricted to potentials with smallness of . There are also some results that treat systems with singular potentials. Baudoin, Gordina and Herzog showed convergence to equilibrium by Gamma calculus in [2] with singular potentials. Lu and Mattingly constructed new Lyapunov function to show egodicity for systems (1.1) with Coulomb potential in the sense of weighted total variation distance in [32]. However, the convergence rates, if they provides one, all depend on .
Propagation of chaos for kinetic Vlasov equation. The main result presented in this article is a further development of the relative entropy method introduced in [26], where Jabin and and the 2nd author proved a quantitative propagation of chaos for Newton’s system with bounded interaction kernel in terms of relative entropy. Lacker [30] then developed an approach based on the BBGKY hierarchy and the entropy dissipation to optimize the local convergence rate of -marginals towards the limiting law. Bresch, Jabin and Soler [6] exploited the BBGKY hierarchy approach to firstly include the 2d Vlasov-Possion-Fokker-Planck case. More recently, Bresch, Jabin and Duerinckx [5] introduced a duality approach to cover the arbitrary square-integrable interaction forces at possibly vanishing temperature. Up to now, the mean field limit or the propagation of chaos results are still very limited for second order particle system with singular interaction forces. See also for the results in [23, 24, 31, 7] and the review [25] for more detailed discussions.
For long time propagation of chaos, Monmarché showed uniform-in-time propagation of chaos in the sense of Wasserstein distance of one marginal for systems with convex potentials, i.e.
for some contant independent on and . The sharp rate with for the case has also been established there. Guillin and Monmarché [22] later improved the convergence result to all marginals but without optimality in terms of of , i.e.
for some contant independent on and . Thanks to the reflection coupling method, Guillin, Bris and Monmarché [20] proved the optimal convergence rate of for all marginals with convex or non-convex interaction potentials, i.e.
for some constant independent on and , with the smallness assumption of the Lipschitz constant of interaction force . Recently, Chen, Lin, Ren and Wang [13] showed uniform-in-time propagation of chaos with functional convexity condition. Even though they do not need smallness of , they require some uniform-in-time Poincaré inequality for the solution of the limiting PDE (1.7) . To the best of our knowledge, there is no result of uniform-in-time propagation of chaos for second order systems with singular interaction forces yet. We leave this topic for our further study.
Uniform-in-time propagation of chaos cannot hold in general. One critical obstacle is that the non-linear Vlasov-Fokker-Planck equation (1.7) may have multiple equilibria and hence exhibit phrase transition. The convergence from towards prevents the phrase transition or the presence of multiple equilibria of the limiting system. There are some results about this kind of convergence but with very limited conditions about potentials. See for instance [14] and the reference therein. Villani [37] proved that converge to the Maxwellian
on with any polynomial order in the sense of norm, which requires that and is small enough. Guillin and Monmarché showed that converges to in the sense of “mean-field entropy” in [22], which defines as
(1.26) |
for probability measure , where and . Baudoin, Feng and Li [3] established that converges to with exponential decay rate in the sense of free energy combining with “relative Fisher Information” (by our notation)
(1.27) |
where is a constant matrix and . The free energy they used is defined as
(1.28) |
They used calculus to overcome the dissipation degeneracy in direction with convexity and smallness of and . Chen, Lin, Ren and Wang also exploited the quantity (1.27) to prove exponentially converges to 0 in [13] under conditions and the functional convexity of (Assumption 5). These two groups both used the so called free energy to quantify the convergence from to , i.e.
Finally, let us recall the convergence result in [21] and extend Theorem 2.1 in [13] to more general confining potentials. By [21], we have
Theorem 1.4.
We extend Theorem 2.1 in [13] to more general confining potentials,
Theorem 1.5.
We will give the sketch of proofs of these two theorems in the Appendix.
1.4 Outline of the article
The paper is then organized as follow: In Section 2, we develop the basic tools we will use throughout this article. In Section 2.1, we introduce the normalized relative Fisher information and compute its time evolution under the kinetic dynamics (1.4) and (1.7). In Section 2.2, we select the nontrivial matrix for relative Fisher information to deal with the confining potentials greater than a quadratic function at infinity, where the crucial idea “entropy multipliers” is inspired by the one particle case as in [10]. In Section 2.3, we introduce the weighted log-Sobolev inequality, which is essentially obtained with the entropy multiplier method. In Section 2.4, we prove a new Law of Large Number estimates for systems with Lipschitz interaction force . In Section 3, we give the complete proof of our main results, Theorem 1.1, 1.2 and 1.3. In the Appendix, we prove the convergence from to under some conditions on and .
2 Preliminary
Let us define some linear operators in we will use in this section. We denote on and on . The operator collects all of one order part of the Liouville operator in (1.5), i.e.
where is defined in (1.3). We write the infinitesimal generator of -particle system (1.1) as
(2.1) |
Lemma 1.
Proof.
In the next, we turn to the argument about Fisher information.
2.1 Hypocoercivity in entropy sense
In this subsection, we extend hypocoercivity in entropy sense in [37] to particle system with nontrivial interaction force. We also use more general reference measure in —– invariant measure or times tensor product of limiting measure , corresponding to uniform egodicity problem and uniform-in-time propagation of chaos problem.
In the following, we use notation and for convenience, may take or . Before tedious manipulations, we firstly derive the equation of .
Lemma 2.
Proof.
The proof is direct computation. In terms of Eq.(1.4), we have
(2.4) |
and for Eq.(1.7), we have
(2.5) |
we could understand as the difference of drift part between particle system (1.1) and McKean-Vlasov system (1.6). Combine Eq.(2.4) and Eq.(2.5), we have
(2.6) |
using identity
therefore, satisfies the equation
(2.7) |
recall , now we regard as reference measure and use Proposition 3 of [37], for vector function , we have
(2.8) |
now we rewrite Eq.(2.7) as following
we complete the proof. ∎
Remark 8.
Now let us compute the time derivation of relative Fisher Information. We omit the integration domain for convenience.
Lemma 3.
Assume that is a solution of Eq.(1.4). Assume that solves Eq.(1.7) with and . Let be linear differential operators on , where
and are to be confirmed, then
(2.10) |
where
(2.11) |
(2.12) |
Remark 9.
is a -tuple differential operator, but are -tuple differential operators. We denote and as and in the sense of coordinate . We omit the index for convenience in the following, i.e. and . Each of them can be identified with a vector field , in such a way that , so can be seen as a map valued in matrix. The inner products above should be understood as , .
Remark 10.
Let us explain the commutators we use. is a -tuple operator, understood as . But is a operator with components, understood as , and others follow. If are commutative with , the only nontrivial operators are and , we will compute them later.
Proof.
In this step, we claim
(2.13) |
here collects all terms with coefficient and reads as
(2.14) |
There terms comes from diffusion part of Eq.(1.4) and Eq.(1.7), we will deal with them in next step.
We directly take derivative and split into three terms:
(2.15) |
For the first term, we use Eq.(1.4) and integral by parts,
(2.16) |
For second term,
(2.17) |
Similarly, for third term,
(2.18) |
Combine (2.16), (2.17), (2.18), we complete the claim in this step.
In this step, we deal with the diffusion part, we claim
(2.19) |
where
(2.20) |
(2.21) |
The terms and collect all terms including commutators, we will take suitable operators and to simplify these commutators.
For the first term of (2.14), we use integral by parts with respect to Lebesgue measure,
(2.22) |
Recall we denote , we continue the first term of (2.22)
(2.23) |
Let us explain the notation we use: should be understood as
and
Moreover, we take the conjugate operator of w.r.t measure , we have
(2.24) |
For the second term of (2.14), we rewrite it as
and
then we have
(2.25) |
Similarly, for the fourth term of (2.14), up to the exchange of and , we have
(2.26) |
Finally, all of left terms are third and fifth term in (2.14) and underlined term in (2.22), we collect them as below,
(2.27) |
We deal with the first term of (2.27) as following,
(2.28) |
combine with the last two term of (2.27), we have
(2.29) |
After gathering (2.23), (2.25), (2.26) and (2.29), we find
(2.30) |
Based on the conjugate rule (2.24) for all similar terms, we complete the proof. ∎
Corollary 1.
Let , if is commutative with and , we have
(2.31) |
and
(2.32) |
Proof.
We recall that
(2.33) |
For the second term of (2.33), we direct compute the commutator , i.e.
by if . For the first term of (2.33), we understand the commutator as the row of operators
(2.34) |
then we have
(2.35) |
here we understand as a -tuple vector reads as , which can be operated by . Recall
then we have
(2.36) |
Up to change the position of and , we complete the proof. ∎
2.2 Entropy multipliers
In order to deal with more general potentials and , we develop the method of entropy multipliers for -particle relative Fisher information. We recommend Part 1, section 8 in [37] and [10] for “one particle version” without interaction potentials, and they only consider the invariant measure of single particle Fokker Planck equation as reference measure. Now let us consider a weight matrix to distort the relative Fisher Information, i.e.
we evolve this quantity along time in the following lemma, many ideas of manipulation are similar with Lemma 3.
Lemma 4.
Assume that is a solution of Eq.(1.4). Assume that solves Eq.(1.7) with and . Let be differential operators on , where
and are to be confirmed. Let be a matrix valued function smooth enough for all variables, then
(2.37) |
where
(2.38) |
(2.39) |
Remark 11.
The notations of inner product and commutator operators appeared above are the same as in Remark 9 and 10. Let us explain the notations associated with we used above. We denote are -tuple vectors, then it is reasonable to multiply with them. should be understood as
in the sense of coordinate . The derivative of (i.e. ) is taken componentwise.
Proof.
We use the similar argument with Lemma 3. We directly take derivative and obtain,
(2.40) |
Observe that the last term
(2.41) |
is new. Next we use the similar step in Lemma 3 to go on.
In this step, we claim the following
(2.42) |
where collects all terms with coefficient , which reads as
(2.43) |
Remark 12.
Let us explain some notations we use. Since , and are all -tuple vectors, it is reasonable to multiple them with . After that, , and are -tuple vectors.
For the first term of (2.40), recall Liouville operator (1.5) we have
(2.44) |
For the second term and third term of (2.40), using Eq.(2.3), we have
(2.45) |
In this step, we focus on the term as before, whose terms all come from diffusion. Now we claim,
(2.47) |
where
(2.48) |
(2.49) |
The terms and collect all terms including commutators, we will take suitable operators and to simplify these commutators.
For the first term of (2.43), using integral by parts with respect to Lebesgue measure,
(2.50) |
Recall we denote , we continue the first term of (2.50)
(2.51) |
For the second terms of (2.50), we rewrite it as
(2.52) |
and we continue the first term of (2.52),
now we have
Similarly, for the fourth term of (2.43), since is symmetric, using its conjugation in ,
up to the exchange of and , we have
(2.53) |
Finally, all of left terms are third and fifth term in (2.43), and underlined term in (2.50), we collect them as below,
(2.54) |
We deal with the first term of (2.54) as following,
(2.55) |
by similar argument with (2.28) and (2.29) in Lemma 3, we have
(2.56) |
Until now, let us collect all terms don’t appear in the Lemma 3:
(2.57) |
We compute the first term more precisely,
(2.58) |
using the conjugate relationship w.r.t. measure , we have
(2.59) |
recall , then we have
(2.60) |
Combine with last three terms of (2.57), all of remaining terms compared with (2.30) in the proof of Lemma 3 are
(2.61) |
Together with (2.42) in , we finish the proof. ∎
Corollary 2.
If we take as a block positive defined matrix, i.e.
(2.62) |
where are matrix-valued functions smooth enough, and take , then we have
(2.63) |
where is a matrix reads as
(2.64) |
and
Remark 13.
A easy computation shows that
(2.65) |
which is a matrix.
Proof.
The main idea is to analyse every term in (2.37). Let us compute what commutator is firstly. For each component of operator , we have
(2.66) |
and
(2.67) |
In other word, we have
(2.68) |
Then we regard as a quadratic form with matrix
(2.69) |
Since is a positive defined matrix, must be negative, so we just keep it. Moreover, are commutative with , only nontrivial terms in (2.38) and (2.39) are first line two terms. We recall and in Corollary 1, and we regard as a quadratic form with following matrix,
(2.70) |
For the last line in (2.37), we recall duality of Liouville operator (2.1), then we have
(2.71) |
2.3 Weighted log Sobolev inequality
In this section, we establish the weighted Log-Sobolev inequality for nonlinear equilibrium defined by Eq.(1.8), then we extend to the weighted -particle version by verifying tensorized invariance of weighted Log-Sobolev inequality. Before that, let us talk about the situation of first order particle system. Guillin et al. consider the uniform-in-time propagation of chaos with the following limiting equation on in [19],
(2.72) |
where is a probability density. An very useful observation in [19] is that, if there exists some constant such that
(2.73) |
for initial data of Eq.(2.72) on , then they propagate this property to all time uniformly, i.e.
(2.74) |
After controlling the upper bound of by standard energy estimates, they obtain the upper bound of and , which is essential for the proof of Theorem 1 in [19]. Another important observation in [19] is that satisfies Log-Sobolev inequality uniformly in as a result of perturbation of uniform distribution by (2.74) (See Proposition 5.1.6, [1]). These two facts help them obtain uniform-in-time propagation of chaos even for Biot-Savart kernel. But in the case of Vlasov-Fokker-Planck equation (1.7), the situation becomes totally different. The best we can expect to initial data of Eq.(1.7) is
(2.75) |
for some constant . The lack of positive lower bound of (2.75) makes the uniform-in-time upper bound of fail by the same strategy in [19], that is the reason why we replace the reference measure by .
Inspired by the argument of Gibbs measure for one particle in [10],
(2.76) |
where is the Hamiltonian defined on and is partition function. We omit the temperature constant in the following and recall some related results in [10].
Definition 2.1.
satisfies the following weighted Log-Sobolev inequality in if there exists some constant s.t. for all smooth enough with :
(2.77) |
The weighted Log-Sobolev inequality (2.77) associates with a new second order operator on ,
(2.78) |
which is symmetric in and satisfies
The following theorem tells us how to verifying the weighted Log-Sobolev inequality for suitable condition of function .
Theorem 2.1.
Assume that goes to infinity at infinity and that there exists such that .
(1) If satisfies the weighted Log-Sobolev inequality (2.77), then, there exists a Lyapunov function, i.e. a smooth function such that for all , two positive constant and such that
(2.79) |
A natural choice of Lyapunov function is with , we refer [9] and [10] to readers for more details. A simple perturbation argument in [1] could extend the weighted Log-Sobolev inequality to .
Proposition 1.
Assume that satisfies the weighted Log-Sobolev inequality (2.77), let be a probability measure with density with respect to such that for some constant , then satisfies
(2.81) |
Proof.
We use the following lemma in [1] (Page240, Lemma 5.1.7),
Lemma 5.
Let on some open interval be convex of class . For every bounded or suitably integrable measurable function with values in ,
(2.82) |
Now we verify the invariance of weighted Log-Sobolev constant, by which we extend weighted Log-Sobolev inequality (2.77) to the measure of tensorized form .
Proposition 2.
Assume that satisfy the weighted Log-Sobolev inequality (2.77) with constant and , then satisfies
(2.83) |
with .
Proof.
We denote
and of course , then
Observe that
Next we estimate ,
for terms on the right hand side, observe the weight only concerns the first variable,
and use Cauchy-Schwarz inequality, we have
Similarly,
then we finish the proof. ∎
Corollary 3.
2.4 Large deviation estimates
In this subsection, we deal with the error terms we mentioned before. For relative entropy, we recall Lemma 1 and the error term reads as,
(2.85) |
For relative fisher information, we recall Corollary 2 and we estimate the following term,
(2.86) |
Our main tool comes from Jabin-Wang’s large derivation type estimates for propagation of chaos for singular kernels in series of paper [26], [28]. We refer Theorem 3 in [26] or Theorem 3 and Theorem 4 in [28] to readers for more details to this kind of technique, now we apply it to our cases — bounded or lipschtiz kernels.
The following lemma gives more precise analysis to error term (2.86). Here we take .
Lemma 6.
Let be a positive defined matrix function which takes as (2.62). Assume that are block diagonal positive defined matrices, i.e. , and , where and are positive defined matrices. Then we have
(2.87) |
where is diffusion constant, and
(2.88) |
(2.89) |
(2.90) |
Remark 14.
Here we understand or as a matrix, and are dimensional vectors, we omit the summation about these components for convenience. The meaning of this lemma is to divide the error term of relative Fisher Information into norm of and small terms we can absorb by negative part in (2.63).
Proof.
We directly compute and use the Young inequality. For each component in position direction, we have
(2.91) |
we denote these three lines above by
where
is a matrix,
is a dimensional vector and
is a dimensional vector. Recall that we take as a block matrix
and are diagonal matrices. Hence, we use integral by parts for each component of the first term of (2.91), we have
(2.92) | ||||