A stochastic analysis approach to lattice Yang–Mills at strong coupling
Abstract.
We develop a new stochastic analysis approach to the lattice Yang–Mills model at strong coupling in any dimension , with t’ Hooft scaling for the inverse coupling strength. We study their Langevin dynamics, ergodicity, functional inequalities, large limits, and mass gap.
Assuming for the structure group , or for , we prove the following results. The invariant measure for the corresponding Langevin dynamic is unique on the entire lattice, and the dynamic is exponentially ergodic under a Wasserstein distance. The finite volume Yang–Mills measures converge to this unique invariant measure in the infinite volume limit, for which Log-Sobolev and Poincaré inequalities hold. These functional inequalities imply that the suitably rescaled Wilson loops for the infinite volume measure has factorized correlations and converges in probability to deterministic limits in the large limit, and correlations of a large class of observables decay exponentially, namely the infinite volume measure has a strictly positive mass gap. Our method improves earlier results or simplifies the proofs, and provides some new perspectives to the study of lattice Yang–Mills model.
2010 Mathematics Subject Classification:
37A25; 39B62; 60H101. Introduction
The purpose of this paper is to apply stochastic analysis and ergodic theory for Markov processes to study the lattice Yang–Mills model with structure group . In particular, we will consider the Langevin dynamics of these models, and under explicit strong coupling assumptions, we will prove uniqueness of invariant measures in infinite volume, log-Sobolev and Poincaré inequalities, with some application in large limits of Wilson loops and exponential decay of correlations.
Lattice discretizations of the Yang–Mills theories were first proposed in the physics literature by Wilson [Wilson1974] which lead to well-defined Gibbs measures on collections of matrices. We refer to [Chatterjee18] for a nice review on the Yang–Mills model and its gauge invariant discretization as well as the fundamental questions for the model. Among the literature we only mention that approximate computations of the Wilson loop expectations as the size of the structure group becomes large was first suggested by ’t Hooft [tHooft1974], where the Yang–Mills Hamiltonian is multiplied by (known as the ’t Hooft scaling), which is closely related to our present article.
The problems we discuss in this paper have been of interest and studied for decades in mathematical physics. A closely related earlier paper is by Osterwalder–Seiler [OS1978], which showed that for the lattice Yang–Mills theory, when the coupling is sufficiently strong, the cluster expansion (or high-temperature expansion in statistical mechanics language) for the expectation values of local observables (i.e. bounded functions of finitely many edge variables) is convergent, uniformly in volume. The proof of this convergent cluster expansion was sketched in [OS1978] since it follows similarly as [GJS1973] for model (and also [MR389080]); in fact it is simpler than the model in [GJS1973] since the fields are bounded in lattice Yang–Mills theory. Moreover, as explained in [OS1978], the existence of a mass gap (exponential clustering) follows from convergence of the cluster expansion, so do existence of the infinite volume limit and analyticity of Schwinger functions in the inverse coupling. Uniqueness of infinite volume limit should also follow from cluster expansion, see e.g. [MR990999] for the case of the model. We also refer to the book [MR785937] for these expansion techniques and results. As for the large limits, in the recent papers, factorization property of the Wilson loop expectations was proved in [Cha, Corollary 3.2] and [Jafar] under the assumption that is sufficiently small.
Given the earlier work, we revisit these problems in this article for a number of reasons. First of all, the earlier work [OS1978] didn’t consider ’t Hooft scaling, but if we translate their results into ’t Hooft scaling where the Hamiltonian is multiplied by then their condition amounts to requiring to be small. However, to our best knowledge, under the ’t Hooft scaling uniqueness was not known for in a fixed small neighborhood of the origin when is arbitrarily large (see for instance the discussion after [Cha, Theorem 3.1]); this is the reason that [Cha] and [Jafar] formulated their large results on a sequence of -dependent finite volumes. One aim of this paper is to establish uniqueness of infinite volume measures for in a fixed and explicit small neighborhood of the origin which is uniform in , which allows us to prove the existence of a mass gap and large limits of Wilson loops directly in infinite volume for this range of .
Secondly, as another motivation of this paper, we develop new methods based on stochastic analysis and give new proofs to these results. In these methods, the curvature properties of the Lie groups are better exploited via the verification of the Bakry–Émery condition. In particular, this allows us to perform more delicate calculations and obtain more explicit smallness condition on inverse coupling. As another novelty we study the Langevin dynamics (or stochastic quantization) and we prove uniqueness of the infinite volume measures by showing that the dynamic on the entire has a unique invariant measure. To this end we employed coupling methods for our stochastic dynamics, which is a variant of Kendall–Cranston’s coupling. Such stochastic coupling arguments were used earlier in the stochastic analysis on manifolds, but to our best knowledge this appears to be the first time that such coupling arguments are used in the setting of statistical physics or lattice quantum field theory models with manifold target spaces. For our coupling arguments we will also need to introduce suitable weighted distances on the product manifolds, and in our calculations a subtle comparison between the weight parameter and the curvature plays a key role in order to obtain ergodicity.
As the third motivation, it appears to us that some of the proofs in this paper are simpler. For instance, the large results on Wilson loops follow quickly from the Poincaré inequality, which simply comes from the Bakry–Émery condition. Our proof of exponential decay relies on some earlier ideas of Guionnet–Zegarlinski [GZ] together with our explicit bounds on commutators between derivatives and Markov generators on Lie groups. This seems to be simpler than cluster expansion, or at least provides some new perspectives.
1.1. Lattice Yang–Mills
We first recall the basic setup and definitions of the model.
Let be a finite dimensional lattice with side length and unit lattice spacing, and we will consider various functions on it with periodic boundary conditions. We will sometimes write for short. We say that a lattice edge of is positively oriented if the beginning point is smaller in lexographic order than the ending point. Let (resp. ) be the set of positively (resp. negatively) oriented edges, and denote by , the corresponding subsets of edges with both beginning and ending points in . Define and let and denote the starting point and ending point of an edge , respectively.
We write for the Lie group or and for the associated Lie algebra or . Note that we always view as a real manifold (even for ), and as a real vector space, and we will write .
To define the lattice Yang–Mills theory we need more notation, for which we closely follow [Cha] and [SSZloop].
A path is defined to be a sequence of edges with and for . The path is called closed if . A plaquette is a closed path of length four which traces out the boundary of a square. Also, let be the set of plaquettes whose vertices are all in , and be the subset of plaquettes such that the beginning point of is lexicographically the smallest among all the vertices in and the ending point of is the second smallest.
The lattice Yang-Mills theory (or lattice gauge theory) on for the structure group , with the inverse coupling constant, is the probability measure on the set of all collections of -matrices, defined as
(1.1) |
with {equ}[e:defS] S(Q) =defNβRe∑_p∈P^+_Λ_L Tr(Q_p), where is the normalizing constant, for a plaquette , and is the Haar measure on . Note that for the edges and are negatively oriented, so throughout the paper we define for , where denotes the edge with orientation reversed. Also, is the real part, which can be omitted when .
1.2. Main Results
We will assume the following in our main results on lattice Yang–Mills.
Assumption 1.1.
Suppose that
Assumption 1.1 is equivalent to the following strong coupling assumption:
(1.2) |
Define the (product) topological space , which will serve as our infinite volume configuration space. By Tychonoff’s theorem is compact. For each we define the distance on by {equ}[e:rho-inf] ρ_∞,a^2(Q,Q’)=def∑_e∈E^+1a—e—ρ^2(Q_e,Q_e’), with being the distance from to in . Here the distances for different choices of give equivalent topologies, and we just write when there’s no confusion. is then a Polish space w.r.t. . By standard results in topology, the topology induced by is equivalent with the product topology on .
We can easily extend the measure to the infinite volume configuration space by periodic extension, which is still denoted as . Namely, we can construct a random variable with law given by and extend the random variable periodically, and the law of the periodic extension gives the desired extension of measure. Since and are compact, form a tight set.
We will consider the Langevin dynamic on , formally given by
{equ}[e:YM-formal]
dQ = ∇S (Q) dt + 2dB ,
with being independent Brownian motions on .
This is formal since we will need to “extend” to infinite volume
in a suitable sense. More precisely, the Langevin dynamic we consider is the following SDE system parametrized by :
{equs}[eq:YM in]
dQ_e
&= -12Nβ∑_p∈P,p≻e(Q_p-Q_p^*)Q_edt
-12(N-1)Q_edt+2dB_eQ_e , if G=SO(N) ,
dQ_e
= -12Nβ∑_p∈P,p≻e( (Q_p-Q_p^*) - 1NTr(Q_p-Q_p^*) I_N) Q_edt
-N2-1NQ_edt+2dB_eQ_e, if G=SU(N).
Here is a collection of independent Brownian motions on the Lie algebra of ,
and the terms linear in arise from Casimir elements of the Lie algebras; we will review
these in Section 2.
Remark 1.1.
We note that the above SDE (in finite volume) was used earlier in [SSZloop] to derive the loop equations (i.e. Dyson–Schwinger or Makeenko–Migdal equations) for Wilson loops of the model (1.1). These loop equations also hold for any infinite volume tight limit of the measures, and in particular for the unique invariant measure for satisfying (1.2) as given in Theorem 1.2.
The study of a quantum field theory of the form (1.1) via a dynamic (LABEL:e:YM-formal) is also called stochastic quantization as first proposed by [Nelson66, ParisiWu].
We will prove that there exists a unique probabilistically strong solutions to SDE (LABEL:eq:YM_in) starting from any initial data in in Proposition LABEL:lem:4.7. Hence the solutions form a Markov process in and the related semigroup is denoted by .
Our first main result is as follows.
Theorem 1.2 (Uniqueness and ergodicity).
Under Assumption 1.1, the following statements hold.
(1) The invariant measure of the Markov semigroup for the Langevin dynamic (LABEL:eq:YM_in) is unique. We denote this invariant measure by .
(2) Furthermore, every tight limit of is the same, and the whole sequence converges to as .
(3) Finally, the Markov semigroup is exponentially ergodic in the following sense: there exists a constant such that for any {equ}[e:WnuP] W_2^ρ_∞,a(νP_t,μ^ym_N,β) ⩽C(a) e^-~K_S t, t⩾0, for some which only depends on the constant , , and (in particular ).
Here is the Wasserstein distance w.r.t. given for any
with being the set of couplings between and . Remark that can be explicitly given by (LABEL:e:tildeK) below and gives a lower bound of spectral gap for in Wasserstein distance. In Theorem 1.4 we will see that gives a lower bound of spectral gap in .
Remark 1.3.
The periodic boundary condition in the definition of is not essential. By the same argument as in Theorem LABEL:th:in1 the tight limit of when changing the periodic boundary condition to Dirichlet or other boundary conditions is also the invariant measure of the SDE (LABEL:eq:YM_in), hence, is the same as .
We remark that uniqueness for small could possibly also be proven using the method of Dobrushin, see e.g. [Dobrushin1970]. To this end one would also need to consider the related Wasserstein metric with respect to the Riemannian distance similarly as we do in this paper. However as we understand such an argument has not been carried out in detail for lattice Yang Mills in the literature. Here we give a proof based on a new idea which is a variant of Kendall–Cranston’s coupling used earlier in the stochastic analysis on manifold.
The idea for the proof of Theorem 1.2 is to use finite dimensional approximation, for which we construct a suitable coupling and find a suitable distance such that the associated Wasserstein distance between the two finite dimensional approximations starting from different initial distributions decays exponentially fast in time with uniform speed.
We define the cylinder functions by {equ}[e:cylQ] C^∞_cyl(Q) ={ F: F=f(Q_e_1,…, Q_e_n),n∈N, e_i∈E^+, f∈C^∞(G^n)}. We then obtain the following log-Sobolev inequality for based on Bakry–Émery’s criterion.
Theorem 1.4 (Log-Sobolev inequality).
Under Assumption 1.1, the log-Sobolev inequality holds for the measure in Theorem 1.2. Namely, for all cylinder functions with ,
(1.3) |
This implies the Poincaré inequality, i.e. for all cylinder functions , {equ}[e:PoinYM] μ^ym_N,β(F^2)⩽1KS∑_e∈E^+μ^ym_N,β(—∇_e F—^2)+μ^ym_N,β(F)^2, with the gradient w.r.t. the variable .
Theorem 1.4 follows from Theorem 1.2 and Corollary LABEL:co:lo, which states the log-Sobolev inequality for every tight limit of . The RHS of (1.3) is the Dirichlet form associated with the Langevin dynamic (see Proposition LABEL:th:D). Hence, (1.3) holds for the functions in the domain of Dirichlet form by lower-semicontinuity.
As some simple applications of the Poincaré inequality, we show certain “susceptibility” bounds on the field and , see Corollaries LABEL:co:c and LABEL:co:cc. These examples demonstrate how to choose suitable functions in these functional inequalities to yield interesting bounds for the model.
Log-Sobolev and Poincaré inequalities in Theorem 1.4 follow by checking the Bakry–Émery criteria [MR889476, BakryLedoux] directly for the finite dimensional approximation on the product manifolds. As the Ricci curvatures of the target manifolds are given by positive constants, so are the Ricci curvatures of the configuration space (i.e. the product manifolds). The Hessian of the Hamiltonian could also be bounded by the Ricci curvatures in the strong coupling regimes.
As a corollary of Theorem 1.4 we obtain the following large properties of the Wilson loops. For the rest of this paper, by a loop we mean an equivalent class of closed paths (as defined in Section 1.1), where the equivalence relation is given by cyclic permutations for any , and it has no two successive edges of the form . We will always assume that a loop is non-empty, i.e. has positive number of edges. Given a loop , recall that the Wilson loop variable is defined as {equ}[e:loop] W_ℓ=defTr(Q_e_1Q_e_2⋯Q_e_n) .
Corollary 1.5 (Large limit of Wilson loops).
Under Assumption 1.1, for every Wilson loop (LABEL:e:loop), writing and for the variance and expectation under the measure in Theorem 1.2, one has {equ}[e:varW] Var(1NW_ℓ)⩽n(n-3)KSN, G=SO(N); Var(1NW_ℓ)⩽4n(n-3)KSN, G=SU(N) . In particular, we obtain the convergence {equ}[e:EW] —WℓN-EWℓN—→0 as N→∞ in probability, and the factorization property of Wilson loops, i.e. for any loops
Corollary 1.5 is proven in Section LABEL:sec:3.2. Our proof is novel which is based on the Poincaré inequality (LABEL:e:PoinYM). Note that our formulation of the result is different from [Cha] and [Jafar] in which the factorization property of Wilson loops was obtained by taking a sequence of increasing finite lattices , considering the correlations of Wilson loops over , and taking infinite volume limit simultaneously as the large limit when sending . In our approach, we work directly in infinite volume, which seems more natural. The subtlety here, as mentioned above and also explained in [Cha], is that the ’t Hooft coupling places instead of in front of the Hamiltonian so one would require to be sufficiently small to obtain the infinite volume limit, which would appear to be problematic when taking the large limit afterwards. However, thanks to our precise smallness condition on in (1.2), we can take small uniformly in . This also allows us to derive bounds on the variances of Wilson loops which are explicit in terms of . Our proof based on the Poincaré inequality which follows from the Bakry–Émery condition also appears to be simpler than the arguments in aforementioned previous work.
Furthermore, we obtain the following exponential decay property of the covariance. Consider and we write for the set of the edges depends on. Let denote the cardinality of . We define
where is introduced in (LABEL:e:cylQ) and is introduced in Section LABEL:sec:3.1. We also write for the distance between , which is given by the nearest distance between the vertices in and .
Corollary 1.6 (Mass gap).
Note that and in the above corollary can be chosen to be Wilson loops, or functions of an arbitrary number of Wilson loops, which are of particular interest in physics. We also remark that exponential decay of correlations (together with certain center symmetry conditions) is also related to Wilson’s area law for Wilson loops (see [MR4278289, Theorem 2.4]). The proof of Corollary 1.6 is given in Section LABEL:sec:3.2.
We conclude this subsection by some brief comments on the challenges or subtleties in the proofs of the above results. One of the important ingredients in the proofs is to estimate the Hessian or the general second order derivatives for the interaction defined in (LABEL:e:defS). Note that a term of the form in the interaction would “appear” to be of order , which would be too large for us to obtain the desired results. In our proofs we will properly arrange terms and apply certain properties of the Lie groups and we will show that the relevant second order derivatives are actually at most of order . See the explanations before Lemma LABEL:lem:4.1 and the proof of Theorem 1.2 in Section LABEL:sec:uniqueYM for more details. This is one of the crucial reasons which allow us to compare the Ricci curvatures of the Lie groups and Hessians to verify the Bakry–Émery condition by choosing small, and also prove ergodicity using a suitable weighted distance.
1.3. Relevant literature and possible directions
The study of properties of lattice gauge theories recently attracts much interest. Besides the aforementioned work by [Cha] and [Jafar], [Chatterjee16] computed the leading terms of free energies, [MR3861073] provided an elaborate description of loop expectations in the planar setting, and [ChatterjeeJafar] derived expansions in the case at strong coupling. Wilson loops (and also Wilson lines when coupled with Higgs) for gauge theories with finite structure groups were studied in [MR4107931, FLV20, FLV21, Cao20, forsstromAbelian, Adhikari2021]; see also [GS21] for the case. Moreover, exponential correlation decay for lattice gauge theories with finite abelian structure groups was obtained by [Forsstrom2021] using coupling argument, and for finite non-abelian structure groups this was proved by [DdhikariCao].
Our present article provides a new approach to study these models via stochastic analysis and dynamical perspective; see also [SSZloop] for a new derivation of loop i.e. Makeenko–Migdal equations for Wilson loops by such methods.
Remark 1.7.
Here by “stochastic analysis approach”, we do not mean the stochastic analysis approach for 2D Yang–Mills in continuum developed earlier by [GKS89] and [Driver89] (See Def. 3.3 therein) in which parallel translations (which are related with Wilson loops) are formulated as stochastic differential equations. See [MR3982691] and references therein for more recent literature in this direction. Our Yang–Mills SDE on the other hand is the stochastic dynamic for the connection fields on a lattice with fixed spacing, which is along the line of stochastic quantization.
We remark that the choice of constant positive curvature Lie groups and in this article is a technical simplification for demonstrating our method, and it should apply as well for other compact target spaces with constant or non-constant positive curvatures. For instance it should apply to a lattice Yang–Mills model coupled with a Higgs field which takes values in a sphere in (i.e. rotator model) via a gauge-covariant derivative term, whose action takes the form .
It would certainly be interesting to show if log-Sobolev inequalities still hold when the lattice spacing vanishes, in the situations where the continuum limits of these models are shown or expected to exist. In this direction, on the two dimensional torus, the continuum limit of lattice approximations of the Yang–Mills measures on 1-forms was recently obtained by Chevyrev [Chevyrev19YM], who also showed that a certain class of Wilson loop observables of this random 1-form coincide in law with the corresponding observables under the Yang–Mills measure in the sense of [Levy03]. Note that the Langevin dynamics for Yang–Mills models on the two and three dimensional continuous torus were recently constructed in [CCHS2d, CCHS3d] (see [ChevyrevReview] for a review of these results), and as mentioned in [CCHS2d] it would be interesting to show that the lattice dynamics of the type (LABEL:e:YM-formal) converge to the processes constructed in the above papers in two and three dimensions. For some of the recent progress along this direction, see the proof of log-Sobolev inequalities for the and sine-Gordon models [RolandPhi4, RolandSG], and the 1D nonlinear -model (see [AnderssonDriver, Hairer16, StringManifold]) for which the log-Sobolev inequality, ergodicity and non-ergodicity (depending on the curvature of the target manifolds) were obtained in [RWZZ17, CWZZ18].
Notation. Given a Polish space , we write for the space of continuous functions from to . We use to denote all the probability measures on with Borel -algebra.
Acknowledgments. We would like to thank Scott Smith, Feng-Yu Wang, Xicheng Zhang and Xin Chen for very helpful discussions. H.S. gratefully acknowledges financial support from NSF grants DMS-1954091 and CAREER DMS-2044415. R.Z. gratefully acknowledges financial support from the NSFC (No. 11922103). X.Z. is grateful to the financial supports by National Key R&D Program of China (No. 2020YFA0712700) and the NSFC (No. 12090014, 12288201) and the support by key Lab of Random Complex Structures and Data Science, Youth Innovation Promotion Association (2020003), Chinese Academy of Science. R.Z. and X.Z. are grateful to the financial support by the DFG through the CRC 1283 “Taming uncertainty and profiting from randomness and low regularity in analysis, stochastics and their applications”
2. Notation and preliminaries
In this section we collect some notation and standard facts about Riemannian geometry, Lie groups and Brownian motions.
Riemannian manifolds. Let be a Riemannian manifold of dimension . We denote by the space of real-valued smooth functions on . For we denote by the tangent space at with inner product . For , we write or for the differentiation of along at . For a smooth curve the tangent vector along is defined by
Let be the Levi-Civita connection, which is a bilinear operation associating to vector fields and a vector field . Recall that depends on only via for (e.g. [MR1138207, Remark 2.3]).
For , we denote by the gradient vector field of . We also write for the Hessian. It can be calculated in the following ways {equ}[e:hess] Hess_f(X,Y) =defHess(f) (X,Y) = ⟨ ∇_X ∇f, Y⟩ = X(Y f ) - (∇_X Y) f . It is a two-tensor: for any so depends only on and . Since Levi-Civita connection is torsion-free, is symmetric in .
The Riemann curvature tensor associated to vector fields is an operator defined by
Let be an orthonormal basis of . The Ricci curvature tensor is defined by {equ}[e:ricci] Ric(X,Y)=∑_i=1^d⟨R(X,W_i)W_i,Y⟩_T_xM and is independent of the choice of . Note that depends on only via for .
Let be a geodesic. A smooth vector field is called a Jacobi field along if . For any and , there exists a Jacobi field along satisfying and (c.f. [CE75, Section 1.5], [Wang, Section 0.4] ).
Lie groups and algebras. For any matrix we write for the conjugate transpose of . Let and be the space of real and complex matrices.
For Lie groups , , we write the corresponding Lie algebras as , respectively. Every matrix in one of these Lie groups satisfies , and every matrix in one of these Lie algebras satisfies . Here denotes the identity matrix.
We endow with the Hilbert-Schmidt inner product {equ}[e:HS] ⟨ X,Y⟩ = ReTr(X Y^*) ∀X,Y∈M_N(C). We restrict this inner product to our Lie algebra , which is then invariant under the adjoint action. In particular for or we have . Note that since we have , and for any .
Below is always understood as or . Every induces a right-invariant vector field on , and for each , is just given by since is a matrix Lie group. Indeed, given any , the curve is well approximated near by up to an error of order .
The inner product on induces an inner product on the tangent space at every via the right multiplication on . Hence, for , we have , and their inner product is given by . This yields a bi-invariant Riemannian metric on .
For any function and , the right-invariant vector field induced by acts on at by the right-invariant derivative {equ}[e:Xf] ~X f (Q)= ddt—_t=0 f(e^tX Q). We have {equ} ~[X,Y] = [~X, ~Y] , namely, ([X,Y] Q ) f (Q) = [XQ, YQ] f(Q), where the is the Lie bracket on on the LHS and the vector fields commutator on the RHS. Also, for the Levi-Civita connection we have {equ}[e:F27] ∇_~X ( ~Y) =12 ~[X,Y] .
We refer the above facts to [AGZ, Appendix F], e.g. Lemma F.27 therein.
Brownian motions. Denote by and the Brownian motions on a Lie group and its Lie algebra respectively. The Brownian motion is characterized by
(2.1) |
By [Levy11, Sec. 1.4], the Brownian motions and are related through the following SDE: {equ}[e:dB] dB = dB ∘B = dB B + cg2 B dt, where is the Stratonovich product, and is in the Itô sense. Here the constant is determined by where is an orthonormal basis of . Moreover, by [Levy11, Lem. 1.2], 111Note that in [Levy11, Lem. 1.2], the scalar product differs from (LABEL:e:HS) by a scalar multiple depending on and , so we accounted for this in the expression for above. {equ}[e:c_g] c_so(N) = -12(N-1), c_su(N) = -N2-1N .
Denote by the Kronecker function, i.e. if and otherwise. For any matrix , we write for its th entry. The following holds by straightforward calculations (see e.g. [SSZloop, (2.5)]): \minilabe:BB {equs}[2] d⟨ B^ij, B^kℓ⟩ &=12(δ_ikδ_jℓ-δ_iℓδ