Non-Asymptotic Concentration of Magnetization in the Curie-Weiss Model at Subcritical Temperatures
Abstract.
In this short paper, we obtain non-asymptotic concentration results for magnetization of the Curie-Weiss model at subcritical temperatures, which leads to a diffusion limit theorem of the scaled and centered magnetization driven by a Metropolis-Hasting algorithm. These results are complementary to results at supercritical and critical temperatures in [Bierkens and Roberts(2017)].
Key words and phrases:
Curie-Weiss Model, Concentration, diffusion limit1. Introduction
Curie-Weiss model is among the simplest ferromagnetic models in statistical mechanics for quantifying phase transition, see detailed explanations in Chapter IV of [Ellis(2006)]. Mathematically, Curie-Weiss model is defined by the following probability measure on ,
(1) |
with
where is the normalizing constant (also known as the partition function), represents the reciprocal of the absolute temperature, and is the strength of external applied magnetic field. For each state , a key quantity magnetization, denoted as , is defined as . Then, the Hamiltonian function can be expressed as,
A common practice in computing the partition function, thus the entire probability distribution, is to use Markov chain Monte Carlo (MCMC) methods, and Metropolis-Hastings algorithm is one of such methods that is found to be successful in many cases. In [Bierkens and Roberts(2017)], it is shown that magnetization, under proper a centering and scaling transformation, the Markov chain generated from the Metropolis-Hastings algorithm has a diffusion limit for the critical ( and ) and supercritical () temperatures. The derivation depends on results non-asymptotic concentration derived in these two cases. In the paper, we are able to obtain a similar non-asymptotic concentration at subcritical temperatures (( and ), thus obtain a similar diffusion limit.
2. Diffusion approximations to Magnetization
2.1. Concentration of meanetization
It is known, see e.g. section IV.4 in [Ellis(2006)], that the megnetization concentrates around , the unique minimum point of the following quantity,
(2) |
In [Ellis(2006)], the analysis of concentration and identification of are obtained through a large deviations principle analysis, see also [Bovier(2006)] for a different and simplified derivation. Furthermore, satisfies the following Curie-Weiss equation
(3) |
which can also be written as . Quantitatively, the concentration can be elaborated by the following key result from [Chatterjee(2007)],
Proposition 1 ( [Chatterjee(2007)]).
For all , and ,
(4) |
2.2. Metropolis-Hastings Algorithm
A typical implementation of the Metropolis-Hasting algorithm for sampling from a distribution known up to the partition function uses a random walk to generate a proposal, then accept or reject the proposal according the ratio of the density function of the target distribution, and this does not require the calculation of the partition function.
The random walk on under consideration randomly picks (with probability each) one coordinate and flip its sign. Therefore, the transition probability takes the form of
where the set . To ensure reversibility hence convergence, the Metropolis-Hastings step accepts a proposed move generated by the random walk with probability
The equality is due the fact that in the case that is a proposed move to .
2.3. Centered and Scaled Magnetization
The key quantity that will be studied is the following centered and scaled magnetization,
It was termed shifted and renormalized magnetization in [Bierkens and Roberts(2017)]. Following Metropolis-Hastings algorithm, the scaled Markov chain, denoted by , has the transition probability,
with
Let represent the stationary continuous time Markov chain that jumps at rate with transition probability , and its stationary distribution .
2.4. Diffusion Approximation Theorem
In [Bierkens and Roberts(2017)], analysis has been provided for the diffusion limit of the centered and scaled magnetization, as approaches infinity in the critical ( and ) and supercritical () temperature. In this note, we extend their result to the subcritical phase and , and establish the following result.
Theorem 2.
Suppose and , jumps at rate , then converge weakly in to , where is the stationary Ornstein-Uhlenbeck process defined by
(5) |
with
The key to proving Theorem 2 is to establish non-asymptotic concentration results, which will be carried out in the next section.
3. Non-Asymptotic Concentration Inequality for Sub-Critical Phase
When , (The case of , can be dealt similar by symmetry), it is known, see e.g. Chapter IV in [Ellis(2006)], there are potentially three roots to the Curie-Weiss equation (3), with being the largest, and two other roots being negative. It was pointed in both [Ellis(2006), Chapter VI] and [Bierkens and Roberts(2017)], in the case of , the other two roots are not global minimum. The following lemma decides the order of derivatives of the two sides of (3) at .
Lemma 3.
when and .
Proof.
Define , hence and . Note that is the slope of the left hand side of the Curie-Weiss equation (3), which is in an affine form. is the derivative of the right hand side, and it is increasing in . is the point where these two derivatives are the same. When , the derivative of the left hand side will remain to be , on the other hand, the derivative of the right hand side will increase in . In the following, we will show that . Therefore, the equality of the two sides must happen after , i.e., , and the result follows.
At , write both sides of the Curie-Weiss equation (3) as functions of . We have, the left hand side takes the form of , and the right hand side . Hence, , and . So, for and . Furthermore, . Thus we conclude , i.e. . ∎
Lemma 4.
There exist and satisfying and , such that
(6) |
holds for all , with . Meanwhile,
(7) |
holds for all .
Proof.
Denote . Then the above lemmas allow us to obtain the following result, which extends Lemma 5 in [Bierkens and Roberts(2017)] to subcritical temperature.
Lemma 5.
Let . For , , and any , we have,
Proof.
The following lemma is known in [Bierkens and Roberts(2017)].
Lemma 6.
Let . For . we have, for ,
for any .
Proof of Theorem 2.
The proof follows the basic arguments of that of Theorem 1 in [Bierkens and Roberts(2017)], along with the non-asymptotic concentration result in Lemma 5, and we reproduce the key steps for completeness. The basic approach is to compare the action on function in the domain of the generators of and the diffusion process (5). Suppose that is the generator of and is the one for the diffusion. Then, we know that,
and
for . We have,
where the first inequality follows from the definitions and second inequality follows from Taylor expansion and the convergence is the consequence of estimations of the first, second and higher order terms in Lemma 6. And
follows from Lemma 5. Then the desired result follows from Corollary 4.8.7 in [Ethier and Kurtz(2009)]. ∎
References
- [Bierkens and Roberts(2017)] J. Bierkens and G. Roberts. A piecewise deterministic scaling limit of lifted metropolis–hastings in the curie–weiss model. Ann. Appl. Probab., 27(2):846–882, 04 2017. doi: 10.1214/16-AAP1217. URL https://doi.org/10.1214/16-AAP1217.
- [Bovier(2006)] A. Bovier. Statistical Mechanics of Disordered Systems: A Mathematical Perspective. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 2006. doi: 10.1017/CBO9780511616808.
- [Chatterjee(2007)] S. Chatterjee. Stein’s method for concentration inequalities. Probability Theory and Related Fields, 138(1):305–321, 2007. doi: 10.1007/s00440-006-0029-y. URL https://doi.org/10.1007/s00440-006-0029-y.
- [Ellis(2006)] R. Ellis. Entropy, Large Deviations, and Statistical Mechanics. Classics in Mathematics. Springer, 2006. ISBN 9783540290599. URL https://books.google.com/books?id=jNxKUjihQRYC.
- [Ethier and Kurtz(2009)] S. Ethier and T. Kurtz. Markov Processes: Characterization and Convergence. Wiley Series in Probability and Statistics. Wiley, 2009. ISBN 9780470317327. URL https://books.google.com/books?id=zvE9RFouKoMC.