Large Deviations Principles for Coulomb gases at intermediate temperature regime
Abstract
This paper deals with Coulomb gases at an intermediate temperature regime. We define a local empirical measure and identify a critical temperature scaling. We show that if the scaling of the temperature is supercritical, the local empirical measure satisfies an LDP with an entropy-based rate function. We also show that if the scaling of the temperature is subcritical, the local empirical measure satisfies an LDP with an energy-based rate function. In the critical temperature scaling regime, we derive an LDP-type result in which the "rate function" features the competition of entropy and energy terms.
1 Introduction
Coulomb gases are a system of particles of the same charge that interact via a repulsive kernel, and are confined by an external potential. Let with and let
(1) |
where
(2) |
is the Coulomb kernel for , i.e. satisfies for ,
(3) |
where is a constant that depends only on . Often, Coulomb gases at non-zero temperature are considered, these are modeled by a point process whose density is given by the Gibbs measure associated to the Hamiltonian:
(4) |
where
(5) |
In this notation is the inverse temperature (which may depend on ).
As long as we have that the empirical measure
(6) |
converges (weakly in the sense of probability measures) almost surely under the Gibbs measure to where is the minimizer of the mean-field limit
(7) |
among probability measures.
Coulomb gases have a wide range of applications in Statistical Mechanics and Random Matrix Theory, among other areas, see [9, 29] for a more in-depth discussion.
The most fundamental LDP in log gases is found in [3]. Adapting their results to our setting, it was proved that in the regime and the push-forward of by satisfies an LDP at speed with rate function given by
(8) |
This result was originally motivated by Hermitian Random Matrix Theory.
An analogous statement in dimension was proved in [7]. In [15] the authors deal with a general repulsive interaction in dimension In [26], the authors derive an LDP for the eigenvalues of some non-symmetric random matrices.
As mentioned before, the regime is substantially different since the empirical measure does not converge to the equilibrium measure. We call this the high temperature regime. Nevertheless, it is possible to identify the limit of the empirical measures as the thermal equilibrium measure:
(9) |
where is given by Definition 2.2, and the minimum is taken over probablity measures. Moreover, the push-forward of the Gibbs measure by the empirical measure satisfies an LDP at speed with rate function
(10) |
This result can be found in [17], which also treats Coulomb gases on compact manifolds.
In our setting, the intensity and sign of the charge of the particles are fixed; in reference, [8], however, the authors also consider the case of the intensity of the charges being a random variable which takes values in . Having positive and negative charges implies that there are attractive interactions, which are harder to deal with.
A widely studied question in Coulomb gases is that of the fluctuations of the difference between and In order to understand these fluctuations, it is convenient to multiply this difference by a test function the resulting object is called the first order statistic:
(11) |
In [23] it was proved that in two dimensions (under mild technical additional conditions) converges in law to a Gaussian random variable with mean
(12) |
and variance
(13) |
In this notation, is the support of the equilibrium measure, and is the harmonic extension of outside i.e. the only continuous function which agrees with in up to the boundary and is harmonic and bounded in A related, very similar result was obtained simultaneously in [5]. Analogous results were obtained in one dimension in [6] and [21], generalizing the work of [20], [32], and [10]. In [4], the authors derive local laws and moderate deviation bounds. In [19], the authors derive a CLT for linear statistics of ensembles at high temperature, in this reference, the authors also derive concentration inequalities. In [31], the author deals with linear statistics replacing with the thermal equilibrium measure. All of the references just mentioned, except [31] and [19] deal with proportional to for (in our notation), or constant in . This paper wanders into the mainly unexplored territory of Coulomb gases at general temperature regimes and high dimensions.
Coulomb gases are also widely studied due to their connection with Random Matrix Theory, see [13, 14, 12, 11]for recent developments. Most of the problems studied in connection with Random Matrix Theory are in dimensions and , therefore the result in this paper (which holds only in dimensions and higher) is not applicable to that setting. However, the main result in this paper is applicable to quaternionic Gaussian ensembles.
Since the equilibrium measure typically has compact support, there are particles in a bounded domain in and so, typically the particles are at distance of each other. After applying a dilation of magnitude to Euclidean space, one observes individual particles. An LDP at speed for Coulomb gases at this scale was obtained in [22], and the rate function combines two terms: one comes from the Hamiltonian and the other one is related to entropy. Similar results were obtained in [18] for hyper-singular Riesz gases, and in [24] for two-component plasmas.
Details of the convergence of to were obtained in [16]. In this reference, the authors also study the relation between the electric energy and norms on probability measures. One of their results concerning the convergence of to is the following: If , then under mild additional assumptions, there exist constants depending on and only such that, for any and
(14) |
we have
(15) |
where is the Wasserstein distance (see [16]).
The main contribution of this paper is to clarify the relationship between temperature scales and length scales for the mesoscopic behavior of particle systems with Coulomb interactions. The way to do this is to look at rare events at a mesoscale and understand them by means of an LDP. An important idea in this work is to exploit the different scaling relations satisfied by the Coulomb energy and the entropy. This work also exploits the smearing technique, used for example in [16, 22, 27, 28]. A large part of this work is devoted to simplifying expressions for partition functions, derived via a variational characterization.
At a macroscopic scale, it has been well-known that the temperature and the energy compete if the energy is of order , in the sense that the empirical field (the macroscopic observable) satisfies an LDP that involves energy and entropy terms. At a microscopic scale, it was recently proved in [22] that the energy and the entropy compete if the temperature is of order (for ), in the sense that the tagged empirical field (the microscopic observable) satisfies an LDP that involves energy and entropy terms. This raises the natural question: given a length scale between the macroscopic and microscopic, is there a temperature regime in which the temperature and entropy compete (in the sense that there is an LDP containing energy and entropy terms)? This paper answers this question.
Given a length scale, we will identify a critical temperature regime. Of course, this problem is equivalent to identifying a critical length scale given a temperature regime. This last approach was taken in [1]. However, despite analyzing the interplay between temperature and length scale, this work and [1] are pretty much independent. [1] deals with the tagged empirical field: an observable obtained by averaging the empirical field over a certain region. This observable is fundamentally different from the local empirical measure. Furthermore, the identified in [1] does not coincide with the "critical length scale" identified in our work. The main results in this work and in [1] are independent: our results do not follow from [1], and the results in [1] do not follow from our results. The techniques used are also fundamentally different. This work is not an attempt to prove any conjecture in [1].
A significant part of this work is devoted to computing, with high precision, certain partition functions. This is similar to obtaining a Laplace principle, as in [17]. However, the techniques in [17] would only allow us to obtain the leading order term in the partition function. This would be greatly insufficient to conclude, and so it is necessary to take a different approach in order to identify the next-order terms.
2 Main definitions and statement of main results
This section defines the most important objects for the rest of the paper and states the main results.
We begin with definitions related to the empirical measure.
Definition 2.1.
Given with
(16) |
we denote
(17) |
In order to make the notation more clear, we will often write
(18) |
instead of
(19) |
Given and we denote by
(20) |
We will also use the notation
(21) |
Let
(22) |
We now define the main observable of this paper: the local empirical measure
(23) |
Even though depends on and , we will sometimes omit this dependence for ease of notation and simply write . Note that is a measure with support contained in and with mass which we expect remains bounded if is distributed according to .
This paper deals with the empirical measure at a mesoscopic scale, i.e. at a scale where
(24) |
We choose the name mesoscopic because the scale is macroscopic, while the scale is microscopic, i.e. a scale which is of the same order of magnitude as the distance between particles. Without loss of generality, we assume that we blow up around the origin. For the general case, we may simply consider a modified potential.
The idea is to define a mesoscopic observable. This definition is inspired by interpolating between the empirical measure: , and the empirical field: . The factor of in the dilation corresponds to a mesoscale, while the normalizing factor of is necessary to obtain a bounded nonzero quantity. Note that the local empirical measure is, in general, not a probability measure but only a positive measure. The local empirical measure is more similar to the empirical measure in the sense that it converges to a continuous measure.
We now define the most basic functionals used in the paper: energy and entropy. We also define a few modifications of the functionals which will be used throughout the paper.
Definition 2.2.
Given two measures and , the relative entropy is defined as
(25) |
The entropy of a measure is defined as , where denotes the Lebesgue measure on
In the remainder of the paper, we commit the abuse of notation of not distinguishing between a measure and its density.
Definition 2.3.
The electric energy of a measure is defined as
(26) |
and
(27) |
where
(28) |
Given a measurable set , we will also use the notation
(29) |
where
(30) |
Definition 2.4.
We define the free energy of a measure as
(31) |
We define the thermal equilibrium measure as
(32) |
where denotes the set of probability measures on . More generally, we will use the notation for the set of probability measures on .
For existence, uniqueness and basic properties of see [2].
For existence, uniqueness and basic properties of see for example [30] and references therein.
We proceed with a few definitions regarding measures.
Definition 2.5.
Given a measurable set , we denote the space of measures on which are either of bounded variation or have a definite sign. We also define, for any
(34) |
Definition 2.6.
Given a measurable set , and a measure on , we define the bounded Lipschitz norm of , denoted as
(35) |
where denotes the set of Lipschitz functions on whose absolute value is bounded by , and Lipschitz constant is also smaller than .
Unless otherwise specified, any distance between measures will refer to the bounded Lipschitz norm. In particular, given we define
(36) |
We recall that the bounded Lipschitz norm metricizes the topology of weak convergence.
Definition 2.7.
Let we define
(37) |
where denotes the set of positive measures on a set .
We will now introduce the rate functions for the different LDP’s. These rate functions are based on the entropy functional, the energy functional, or both.
Definition 2.8.
Given a domain and a scalar we define the function defined for an absolutely continuous measure on as
(38) |
Definition 2.9.
Given a measure , we define the measure as
(39) |
For a measure defined on we denote by
(40) |
where the infimum is taken over such that
(41) |
We also define
(42) |
In this paper, we deal with general a general potential . We only impose some regularity and growth conditions, which we make precise in the next definition.
Definition 2.10.
We call a potential , with admissible if:
-
1.
.
-
2.
(43) -
3.
(44) for all .
-
4.
(45) for some , in a neighborhood of , defined as the support of .
-
5.
.
-
6.
.
Remark 2.1.
If is admissible, the equilibrium measure is bounded and has compact support, see [30].
Finally, before stating the main result, we recall the definition of rate function and Large Deviations Principle (LDP).
Definition 2.11.
(Rate function) Let be a metric space (or a topological space). A rate function is a l.s.c. function it is called a good rate function if its sublevel sets are compact.
Definition 2.12 (LDP).
Let be a sequence of Borel probability measures on and a sequence of positive reals such that Let be a good rate function on The sequence is said to satisfy a Large Deviations Principle (LDP) at speed with (good) rate function if for every Borel set the following inequalities hold:
(46) |
where and denote respectively the interior and the closure of a set Formally, this means that
The main result of this paper is the following theorem:
Theorem 2.1.
Assume that and the potential are admissible. Let with Assume that is bounded away from inside its support. Let
(47) |
and assume that
(48) |
Then:
-
If (subcritical regime) then the push-forward of by satisfies an LDP in the topology of weak convergence at speed and rate function
(49) -
If (supercritical regime) then the push-forward of by satisfies an LDP in the topology of weak convergence at speed and rate function
(50) -
If (critical regime) and then
(51) Similarly,
(52)
Remark 2.2.
The rate functions have the same minimizer in all cases: . This is clear, since in this temperature regime, the empirical measure concentrates on the thermal equilibrium measure for all scales larger than , as was proved in [25]. The typical event, therefore, is trivial; and it is a rare event that deserves to be looked at.
Remark 2.3.
Nearly all hypotheses in Theorem 2.1 are essential. The hypothesis that , however is not. It is an artifact of the proof, and it is needed to bound a specific error term. Bounding this error term is necessary if one uses the regularization procedure, i.e. it is needed to bound the difference between the energy of a discrete probability measure and a continuous one. This technique is very common in the field. We expect that a similar result will be true for , but proving this would require essentially different techniques.
Remark 2.4.
It is natural to ask if there is an analog of Theorem 2.1 in the extreme cases
(53) |
In the case
(54) |
Theorem 2.1 has a very natural generalization, as mentioned in the introduction. It was proved in [17] that satisfies an LDP at speed with rate function
(55) |
The case
(56) |
is substantially different, because at a microscopic scale, we do not observe a continuous distribution but rather individual particles. A similar problem was treated in [22]. Even though the result is substantially different, it has a similar flavor, since the authors prove an LDP at speed in which the rate function contains the sum of an entropy term and an electric energy term.
3 Additional definitions
We proceed with a few additional definitions related to the empirical measure.
Definition 3.1.
Let be fixed, and We define such that
(57) |
where
(58) |
and
(59) |
Let
(60) |
and
(61) |
Similarly, let
(62) |
Definition 3.2.
Given an integer and we denote by the set of measures which are purely atomic with weight i.e.
(63) |
Given a measure an integer a region and we define
(64) |
where
We also define
(65) |
where the infimum is taken over such that
(66) |
The definition of is almost the same as but omitting the diagonal inside the square in the computation of the Coulomb energy. This modification allows for the quantity to be finite for atomic measures inside the cube.
Given we also define
(67) |
where the inf is taken over all such that
(68) |
We generalize the definition of to a more general setting in which the background measure is not necessarily constant out of . Given a set and a background measure
(69) |
We now define an analog of for measures that are not absolutely continuous. Given a measurable set , a positive measure on , is defined for a measure on as
(70) |
We also introduce the notation.
(71) |
Remark 3.1.
Note that
(72) |
Note also that for any , has the scaling relation
(73) |
and therefore has the scaling relation
(74) |
Lastly, we introduce notation that will be used throughout the work.
Remark 3.2 (Notation).
Given , we denote by the uniform probability measure on .
4 Preliminary results
In this section, we will prove some preliminary results needed for the main Theorem.
We begin with a splitting formula around the thermal equilibrium measure, which is an analog of the usual splitting formula (see for example [28]).
Proposition 4.1.
The Hamiltonian can be split into:
(75) |
where
(76) |
Proof.
See [1]. ∎
Definition 4.1.
We also need the following piece of information about , which can be deduced from [2], Theorem 1.
Remark 4.1.
Let and assume that , and the potential is admissible, then
(79) |
We proceed to prove some elementary properties about the rate functions in Theorem 2.1.
Claim 4.1.
For any , the function is a convex (in ) rate function.
Proof.
Since convexity and l.s.c. are immediate from the convexity and l.s.c. of ent, we need only show that is positive for any . Throughout the proof, we will use the notation
(80) |
Using Jensen’s inequality, the convexity of and doing a first-order Taylor expansion of we have
(81) |
∎
The following claim is standard and can be found, for example, in [25].
Lemma 4.1.
The energy is l.s.c. w.r.t. to weak convergence.
With the help of Lemma 4.1, we can prove some elementary properties about .
Lemma 4.2.
For any and any measure in such that , the infimum in the definition of is achieved.
Proof.
Let be a minimizing sequence for
(82) |
Note that
(83) |
Hence, modulo a subsequence,
(84) |
weakly in for some . By l.s.c. of we have
(85) |
∎
We now prove that the function is a convex rate function for any .
Claim 4.2.
For any , the function is a convex rate function.
Proof.
We first prove convexity. Let be measures on such that . Let
(86) |
and
(87) |
Then, using the convexity of we have
(88) |
This proves the convexity of . We now turn to prove that is l.s.c. Since it is clearly positive, this will conclude the proof. Let be a measure in such that and let be a sequence of measures in such that
(89) |
weakly in the sense of measures. Let
(90) |
Note that
(91) |
Then by precompactness, we have that the sequence is precompact in the weak topology (note that we are not claiming precompactness for convergence in the BL metric, which is clearly not true in general). Let be such that
(92) |
It is easy to see that and agree in the interior of Note also that
(93) |
is a positive measure, and therefore it can be used as a test function in the definition of . Then, using l.s.c. of we have
(94) |
∎
We will now prove that the rate functions are good.
Claim 4.3.
For any the function is a good rate function, i.e. sublevel sets are precompact in the topology of weak convergence of measures.
Proof.
Consider the sublevel sets
(95) |
We will prove that there exists such that if
(96) |
then
(97) |
which will imply the desired compactness. Let
(98) |
Using Jensen’s inequality, we have
(99) |
Since as we have that there exists such that if Hence, is precompact in the topology of weak convergence. ∎
We now prove that is a good rate function.
Claim 4.4.
For any the function is a good rate function, i.e. sublevel sets are precompact in the topology of weak convergence of measures..
Proof.
Let be such that
(100) |
Let
(101) |
Since we are assuming equation (100), we have that converges, modulo a subsequence (not relabelled) weakly in the topology. Hence the restriction to , converges weakly in the topology. In particular,
(102) |
Since is a positive measure, equation (102) implies that
(103) |
which implies that modulo a subsequence (not relabelled) converges in the topology of weak convergence of probability measures.
∎
5 Proof of upper bound
In this section, we prove the upper bound of Theorem 2.1. Recall that we use the notation
(104) |
with
(105) |
Proof of Theorem 2.1, upper bound.
We begin by using the splitting formula for the thermal equilibrium measure (Proposition (4.1)). Let and , then
(106) |
In order to pass from the third to the fourth line, we have used that
(107) |
since for any
(108) |
We now treat each of the terms in the last line of equation (106) individually. The second term is the easier, and we will will deal with it at the end of this section. More specifically, we will prove the following lemma:
Lemma 5.1.
Let and . Then
(110) |
The analysis of the first term is more delicate, and we deal with it in section . The result we prove is the following:
Lemma 5.2.
Let , let and be an integer smaller than or equal to . Then
(111) |
Furthermore, for any such that we have
(112) |
We will now finish the proof of the upper bound in Theorem 2.1 using Lemmas 5.1 and 5.2. We start with the last line of equation (106):
(113) |
Using results from [27], or from [1], we know that
(114) |
using the hypothesis that we have that
(115) |
Bounding this error term (and bounding a similar error term in the upper bound) is the only step in which we use the hypothesis that .
Note that, if then
(116) |
and so
(117) |
And finally, if then
(118) |
and so
(119) |
∎
This concludes the proof of the upper bound of Theorem 2.1. We now turn to the proof of the auxiliary lemmas (Lemmas 5.1 and 5.2). We start with Lemma 5.1, which we restate here for convenience:
Lemma 5.3.
Let and . Then
(120) |
Proof.
Using Sanov’s theorem and the scaling relation of , we have that
(121) |
where the infimum is taken over such that and Note that we may rewrite equation (121) as
(122) |
where the infimum is taken over all such that
We first determine the optimal in the minimization problem of equation (122) for a given . This can be done by adding a Lagrange multiplier for the constraint of mass and then computing the Euler Lagrange equations. The solution is that the minimizer is given by
(123) |
where is given by
(124) |
Hence we have that, for each
(125) |
6 Proof of Lemma 5.2
In this section, we prove Lemma 5.2, which we restate here for convenience:
Lemma 6.1.
Let , let and be an integer smaller than or equal to . Then
(129) |
Furthermore, for any such that we have
(130) |
The idea is that, on the one hand, given our choice of dilation, will converge to a continuous measure on every compact set. This implies that we can replace the infimum over purely atomic measures with the infimum over absolutely continuous measures in On the other hand, will converge to on compact sets, so we can replace the background measure with We will now make this intuition more rigorous.
Proof.
Step 1
We claim that
(131) |
To see this, let
(132) |
and
(133) |
with , and .
Note that depends only on and , since is uniformly bounded in for large enough, with a bound that depends only on and .
Using the hypothesis that
(138) |
we have that
(139) |
which implies, using the scaling relations of and , that
(140) |
Step 2 We now prove the second part of the claim: that for any such that we have
(141) |
We will first prove that
(142) |
To this end, let
(143) |
For any let be such that
(144) |
Taking as a test function in the definition of and using Remark 4.1 we have
(145) |
Since is arbitrary, we can conclude that
(146) |
We now turn to prove
(147) |
To this end, let
(148) |
where is minimized over measures satisfying and which are supported in
Note that
(149) |
Then, since
(150) |
we have that
(151) |
weakly in for some . It is easy to check that a.e. Using l.s.c. of we then have that
(152) |
Step 3
We now prove the first part of the statement of Lemma 5.2. In view of equation (140), we will prove that for any and any measure on such that ,
(153) |
To this end, let be such that
(154) |
we assume that the infimum is achieved for clarity of exposition, otherwise, we could prove the claim up to an arbitrary error by taking a minimizing sequence.
Since we have that as tends to
(155) |
weakly in the sense of measures, for some Let
(156) |
where is minimized over supported in
Note that
(157) |
therefore, for a subsequence
(158) |
weakly in for some . Therefore we can use as a test function in the definition of and get
(159) |
∎
7 Proof of lower bound
This section is devoted to proving the lower bound of the LDP’s of Theorem 2.1. The approach will be to construct a family of point configurations that has correct energy and sufficient volume.
We start with a lemma, which builds upon a construction found in unpublished class notes by Sylvia Serfaty.
Lemma 7.1.
Let be probability measures on a compact set such that
(160) |
, and
(161) |
Assume that is uniformly equi-continuous and bounded away from uniformly in and . Then for every there exists a family of configurations
(162) |
such that
-
(163) for any
-
(164) -
(165) -
•
There exists such that
(166) for .
We will also require the following lemma, which deals with approximating certain partition functions.
Lemma 7.2.
Let with for each . Let such that
(167) |
Assume that
(168) |
also that
(169) |
and that there exists such that
(170) |
and
(171) |
Let
(172) |
Then for we have
(173) |
where and are independent of
For we have
(174) |
where is independent of
Proof.
We will divide the proof in 3 steps. The idea of the proof is that using the variational formulation of the partition function, as well as the splitting formula for the equilibrium measure (Proposition (4.1)), we can reduce the integral in equation (172) to
(175) |
This is done in step 1. Steps 2 and 3 simplify this expression, and show that either the electric energy or the entropy dominates, depending on whether or
Step 1
We start with the characterization
(176) |
in this equation,
(177) |
where
(178) |
and
(179) |
This is a particular case of a characterization of the partition function which is valid in general, see for example [27].
We now define
(180) |
where
(181) |
The reason for introducing the extra factors of is that we need to normalize the total charge outside the cube to be . We then have that probability measures satisfy a splitting formula around , analogous to equation (75).
We also have that satisfies the EL equation
(182) |
for some constant In order to find we can multiply equation (182) by integrate, and use that has integral . The result is
(183) |
Plugging in as a test function in equation (176) we get
(184) |
The negative term of order is due to the fact that there are pair of particles, and not pairs.
Finally, note that the last term in equation (184) is (up to a negligible error) equivalent to
(185) |
where
(186) |
and the minimum is taken over satisfying .
Using the splitting formula for the thermal equilibrium measure (Proposition 4.1) we have that
(187) |
where the infimum is taken over all measures on which are positive and such that
(188) |
Step 2
This step is divided into two cases. The case and the case . First we deal with the case .
Substep 2.1: Regime .
In this case, we claim that
(189) |
where is given by (67). The proof will be divided into further subsubsteps.
Subsubstep 2.1.1
We now begin the proof of the claim. Using the scaling relations
(190) |
and
(191) |
we can rewrite equation (187) as
(192) |
where the infimum is taken over all such that
(193) |
Next we will argue that we can deal with instead of since we make a small error when approximating by . This will be the subject of the next subsubstep.
Substep 2.1.2
First of all, note that the constraint can be replaced by the constraint while making a negligible error because of equation (169). We now introduce the functionals and , defined for measures and on as
(194) |
and
(195) |
where the infimum in equations (194) and (195) is taken over all such that
(196) |
Given and , let
(197) |
and similarly, let be defined as
(198) |
where the infimum is taken over all such that
(199) |
We assume that the infimum is achieved for clarity of exposition. Otherwise, we could repeat the argument up to an arbitrarily small error. Then we can use as a test function in equation (194) and get
(200) |
Similarly, we can use as test function in equation (194) and get
(201) |
Then, since the points are at distance at least from , we have by Lemma 9.1, that, for
(203) |
(see Remark 3.2 for notation).
Using Cauchy-Schwartz we get
(204) |
Using now the hypothesis that has regularity, along with Lemmas 9.1, 9.2, 9.3, 9.4 we have that
(205) |
where depends on and .
On the other hand, it is easy to see that
(206) |
where depends on . Therefore
(207) |
where depends on and . Similarly,
(208) |
where depends on and .
This implies that
(209) |
where depends on .
We have proved that we can deal with instead of since we make a small error when approximating by . The last subsubstep will consist in proving that
(210) |
Substep 2.1.3
The proof of equation (210) will consist in taking a minimizing sequence of the problem in the RHS, and modifying it so that it is a valid test function to the problem in the LHS.
Let be such that
(211) |
and
(212) |
Then for every there exists such that
(213) |
and
(214) |
Now take a truncated such that (LABEL:errorinenergy) and (213) hold with an error in the right hand side, and in addition
(215) |
Note that exists because the sequence
(216) |
is bounded, and by Dominated Convergence Theorem, its integral converges to the integral of as .
Now define as
(217) |
Note that
(218) |
where depends on and but does not depend on Since we are in the regime we have that
(219) |
and therefore
(220) |
Using as a test function in the definition of , and appealing once again to Remark 4.1 we have that
(221) |
Since are arbitrary, we conclude
(222) |
The proof of substep 2.1 is now complete.
Substep 2.2 Now we deal with the case In this case we go back to working in unreescaled coordinates.
We start with formula (187). Since in the regime we expect the term
(223) |
to be negligible, we focus on the remaining part of the functional, i.e.
(224) |
where the infimum is taken over all measures on which are positive and such that
(225) |
The minimizer in equation (224) can be easily found by adding a Lagrange multiplier for the mass constraint. It can be easily checked that the unique minimizer of (224) in the corresponding space is given by , where
(226) |
where
(227) |
Using the identity
(228) |
valid for any , we have that
(229) |
It can be checked that, as a consequence of equation (169), , and therefore we may use the approximation . Proceeding as in the proof of Lemma 5.1 and using equation (169), we have that
(230) |
Note that
(231) |
since we are in the regime . Using again formula (187) and switching to rescaled coordinates, we have
(232) |
where is independent of
Lemma 7.2 is proved for .
Step 3
This step only deals with the case Once again we work with rescaled coordinates.
We now claim that for any measure on such that we have
(233) |
In other words, we claim that we can drop the mass constraint. We now prove the claim. Since clearly
(234) |
we will prove that
(235) |
In order to prove this claim, we reformulate the definition of as as
(236) |
where the infimum is taken over all such that is supported in and
Let and let be such that is supported in , and
(237) |
Let
(238) |
and let
(239) |
Let be a sequence such that tends to monotonically, and
(240) |
Define
(241) |
where denotes the Lebesgue measure. Then it’s easy to see that
(242) |
and
(243) |
Therefore
(244) |
Since is arbitrary, we conclude that
(245) |
∎
We can now complete the proof of Theorem 2.1 by giving the lower bound.
Proof of Theorem 2.1, lower bound.
We start with the case Let be a measure on Using a density argument, we may assume that and Let and let be as in Lemma 7.1 with and , (rounded to an integer) and . Note that equation (168) is satisfied with this choice of . We claim that equation (165) implies that there exists which depends on such that for any ,
(246) |
This is because for
(247) |
we have
(248) |
(249) |
Since we are in the regime we have
(250) |
Therefore
(251) |
Since is arbitrary, we can conclude that
(252) |
Now we proceed with the case Let be a positive measure in , let and let be as in Lemma 7.1 with and , (rounded to an integer) and . Then, starting as in the previous case, we have
(253) |
We then have that
(254) |
Recalling that
(255) |
and (rounded to an integer), we have that
(256) |
By construction we have that
(257) |
Combining the last equation with Remark 4.1 we have
(258) |
and therefore
(259) |
Since is arbitrary, we can conclude.
∎
8 Proof of statement about regime
In this section, we prove the third part of Theorem 2.1, which we repeat here for convenience: If (critical regime) and then
(260) |
Similarly,
(261) |
Before starting the proof, we note that since we are in the critical regime, we have .
Proof of inequality.
Let be a positive measure in , let and let be as in Lemma 7.1 with and , (rounded to an integer) and . Then we have that
(262) |
Using equation (192), and the hypothesis that we have
(263) |
where the infimum is taken over such that
(264) |
and are given by equations (40) and (65) respectively. We have used that, if then
(265) |
where depends on The proof of this statement is the same as the proof that
(266) |
where depends on , see the proof of Lemma 7.2, step 2.1 (in fact, in the critical regime, we have that and ).
Since and are arbitrary, we have
(268) |
In particular, this implies
(269) |
∎
We now turn to the proof of the inequality:
Proof of inequality.
We start with equation (192), which in the critical regime reads
(270) |
where the infimum in line of the last equation is taken over all such that
(271) |
We proceed by writing
(272) |
Letting tend to and using Remark 4.1 we have that
(273) |
It’s well known that ent is l.s.c. in for fixed Therefore
(274) |
We will also use a property of which we prove at the end of this section: we will show that
(275) |
We now prove equation (275), used in the proof and restated here for convenience.
Lemma 8.1.
Let be a measure on such that and , and be such that
(277) |
Then
(278) |
Proof.
Let , and let
(279) |
(see Remark 3.2 for notation). We claim that
(280) |
where depends on . To see this, let
(281) |
where the minimum is taken over such that
(282) |
Then we can use as a test function in the definition of and get
(283) |
Using Lemmas 9.1, 9.2, 9.3, 9.4, we have that
(284) |
where depends on . We, therefore, get that
(285) |
where depends on . Note that
(286) |
therefore we are left with proving that
(287) |
To see this, let
(288) |
We assume that the infimum is achieved for clarity of exposition. Otherwise, we would repeat the argument up to an arbitrarily small error. Let
(289) |
where the minimum is taken over such that
(290) |
Then we can use as a test function in the definition of and get
(291) |
Note that as tends to and tends to , converges weakly to , therefore
(292) |
and
(293) |
This implies that
(294) |
and since clearly
(295) |
we conclude that
(296) |
∎
9 Appendix A
In this appendix, we prove some fundamental properties of the smearing technique and energy minimizers. Loosely speaking, the smearing technique consists in studying properties about by analyzing instead the more regular measure where is a measure that approximates a Dirac delta on a scale .
We start by recalling a few facts about smearing and electric energy. These are standard and can be found, for example, in [16], [22], or [27]. The proof uses that is superharmonic in its domain, and harmonic away from
Lemma 9.1.
The next lemma can also be found in [16] (or verified by direct computation).
Lemma 9.2.
Let , then for ,
(299) |
For ,
(300) |
Lemma 9.3.
Lemma 9.4.
Let for Let for Let be a measure with an density. Then there exists which depends only on such that
(302) |
10 Appendix B
We will now prove Lemma 7.1, which we restate here for convenience.
Lemma 10.1.
Let be probability measures on a compact set such that
(303) |
, and
(304) |
Assume that is uniformly equi-continuous and bounded away from uniformly in and . Then for every there exists a family of configurations
(305) |
such that
-
(306) for any
-
(307) -
(308) -
•
There exists such that
(309) for .
Proof.
Step 1: Definition
First, we subdivide into cubes of size and center for to be determined later.
Let either
(310) |
or
(311) |
chosen so that
(312) |
The procedure for determining the point configuration of points is: is chosen at random from where is the cube minus a boundary layer of width , is chosen at random from
(313) |
Then, for the point is chosen at random from
(314) |
In other words,
(315) |
We set for some to be determined later. For small enough, the procedure is well defined, in the sense that it is possible to choose points in this way.
For small enough, any satisfies
(316) |
We immediately get that for some . We now prove that these configurations have the right volume and energy.
Step 2: Volume Estimate
To give intuition, we first treat the case as the uniform measure on . In this case, we have
(317) |
where are constants which depend only on . On the other hand, the volume of all configurations with exactly points in cube is given by
(318) |
By Sanov’s theorem, we have that
(319) |
For a general we have that the volume of all configurations with exactly points in cube is given by
(320) |
and that by Sanov’s theorem
(321) |
On the other hand, we can estimate
(322) |
where depends on . Using the hypothesis that is uniformly equi-continuous, we have that for any any there exists such that if we have
(323) |
for any .
Hence, we have
(324) |
Step 3: Energy Estimate
The idea for the energy estimate will be to prove that
(325) |
is typically pointwise small. Then the smallness of the energy will be a consequence of the finite mass of the measures and
Let Then we can write
(326) |
For any note that the minimum distance from to is given by and the maximum distance from to is given by , for some which depends on and . For the rest of the proof, we assume w.l.o.g. that
(327) |
then
(328) |
for some absolute constant .
Using the hypothesis that we get
(329) |
where depends on . Since is integrable at the origin and is compact, we have
(330) |
where depends on and .
Using again the hypothesis that we have
(331) |
where depends on and .
For the second term in equation (326) term, we will instead work with
(332) |
where for some to be determined later (see Remark 3.2 for notation). Note that by Lemma 9.1, and because we have
(333) |
Note also that
(334) |
where is a constant that depends on and . Hence
(335) |
where is a (new) constant that depends on and .
Putting everything together, we get
(336) |
where depends on and and depends, in addition, on .
Hence
(337) |
Making small enough after having chosen while keeping we have that for any we can find parameters such that
(338) |
which implies that
(339) |
∎
11 Acknowledgements
I thank Sylvia Serfaty for her guidance during this project. I thank Ofer Zeitouni and Thomas Leblé for useful conversations.
References
- [1] S. Armstrong and S. Serfaty, Local laws and rigidity for Coulomb gases at any temperature, Annals of Probability, in press-arXiv preprint arXiv:1906.09848, (2019).
- [2] , Thermal approximation of the equilibrium measure and obstacle problem, arXiv preprint arXiv:1912.13018, (2019).
- [3] G. B. Arous and A. Guionnet, Large deviations for wigner’s law and voiculescu’s non-commutative entropy, Probability theory and related fields, 108 (1997), pp. 517–542.
- [4] R. Bauerschmidt, P. Bourgade, M. Nikula, and H.-T. Yau, Local density for two-dimensional one-component plasma, Communications in Mathematical Physics, 356 (2017), pp. 189–230.
- [5] , The two-dimensional coulomb plasma: quasi-free approximation and central limit theorem, Advances in Theoretical and Mathematical Physics, 23 (2019), pp. 841–1002.
- [6] F. Bekerman, T. Leblé, S. Serfaty, et al., Clt for fluctuations of -ensembles with general potential, Electronic Journal of Probability, 23 (2018).
- [7] G. Ben Arous and O. Zeitouni, Large deviations from the circular law, ESAIM: Probability and Statistics, 2 (1998), pp. 123–134.
- [8] T. Bodineau and A. Guionnet, About the stationary states of vortex systems, Annales de l’Institut Henri Poincare (B) Probability and Statistics, 35 (1999), pp. 205–237.
- [9] A. Borodin, I. Corwin, and A. Guionnet, Random matrices, in IAS/Park City Mathematics Series, vol. 26, American Mathematical Soc., 2019.
- [10] G. Borot and A. Guionnet, Asymptotic expansion of matrix models in the one-cut regime, Communications in Mathematical Physics, 317 (2013), pp. 447–483.
- [11] P. Bourgade, L. Erdős, and H.-T. Yau, Bulk universality of general -ensembles with non-convex potential, Journal of mathematical physics, 53 (2012), p. 095221.
- [12] P. Bourgade, L. Erdös, and H.-T. Yau, Edge universality of beta ensembles, Communications in Mathematical Physics, 332 (2014), pp. 261–353.
- [13] P. Bourgade, L. Erdős, H.-T. Yau, et al., Universality of general -ensembles, Duke Mathematical Journal, 163 (2014), pp. 1127–1190.
- [14] P. Bourgade, H.-T. Yau, and J. Yin, Local circular law for random matrices, Probability Theory and Related Fields, 159 (2014), pp. 545–595.
- [15] D. Chafaï, N. Gozlan, P.-A. Zitt, et al., First-order global asymptotics for confined particles with singular pair repulsion, The Annals of Applied Probability, 24 (2014), pp. 2371–2413.
- [16] D. Chafai, A. Hardy, and M. Maïda, Concentration for coulomb gases and coulomb transport inequalities, Journal of Functional Analysis, 275 (2018), pp. 1447–1483.
- [17] D. García-Zelada, A large deviation principle for empirical measures on polish spaces: Application to singular gibbs measures on manifolds, in Annales de l’Institut Henri Poincaré, Probabilités et Statistiques, vol. 55, Institut Henri Poincaré, 2019, pp. 1377–1401.
- [18] D. P. Hardin, T. Leblé, E. B. Saff, and S. Serfaty, Large deviation principles for hypersingular riesz gases, Constructive Approximation, 48 (2018), pp. 61–100.
- [19] A. Hardy and G. Lambert, Clt for circular beta-ensembles at high temperature, Journal of Functional Analysis, 280 (2021), p. 108869.
- [20] K. Johansson et al., On fluctuations of eigenvalues of random hermitian matrices, Duke mathematical journal, 91 (1998), pp. 151–204.
- [21] G. Lambert, M. Ledoux, C. Webb, et al., Quantitative normal approximation of linear statistics of -ensembles, The Annals of Probability, 47 (2019), pp. 2619–2685.
- [22] T. Leblé and S. Serfaty, Large deviation principle for empirical fields of log and riesz gases, Inventiones mathematicae, 210 (2017), pp. 645–757.
- [23] , Fluctuations of two dimensional coulomb gases, Geometric and Functional Analysis, 28 (2018), pp. 443–508.
- [24] T. Leblé, S. Serfaty, and O. Zeitouni, Large deviations for the two-dimensional two-component plasma, Communications in Mathematical Physics, 350 (2017), pp. 301–360.
- [25] D. Padilla-Garza, Concentration inequality around the thermal equilibrium measure of coulomb gases, Journal of Functional Analysis, 284 (2023), p. 109733.
- [26] D. Petz and F. Hiai, Logarithmic energy as an entropy functional, Contemporary Mathematics, 217 (1998), pp. 205–221.
- [27] N. Rougerie and S. Serfaty, Higher-dimensional coulomb gases and renormalized energy functionals, Communications on Pure and Applied Mathematics, 69 (2016), pp. 519–605.
- [28] E. Sandier, S. Serfaty, et al., 2d coulomb gases and the renormalized energy, The Annals of Probability, 43 (2015), pp. 2026–2083.
- [29] S. Serfaty, Microscopic description of log and coulomb gases, arXiv preprint arXiv:1709.04089, (2017).
- [30] , Systems of points with coulomb interactions, in Proceedings of the International Congress of Mathematicians: Rio de Janeiro 2018, World Scientific, 2018, pp. 935–977.
- [31] , Gaussian fluctuations and free energy expansion for 2d and 3d coulomb gases at any temperature, arXiv preprint arXiv:2003.11704, (2020).
- [32] M. Shcherbina, Fluctuations of linear eigenvalue statistics of matrix models in the multi-cut regime, Journal of Statistical Physics, 151 (2013), pp. 1004–1034.