Combinatorial Approach for Factorization of Variance and Entropy in Spin Systems
Abstract
We present a simple combinatorial framework for establishing approximate tensorization of variance and entropy in the setting of spin systems (a.k.a. undirected graphical models) based on balanced separators of the underlying graph. Such approximate tensorization results immediately imply as corollaries many important structural properties of the associated Gibbs distribution, in particular rapid mixing of the Glauber dynamics for sampling. We prove approximate tensorization by recursively establishing block factorization of variance and entropy with a small balanced separator of the graph. Our approach goes beyond the classical canonical path method for variance and the recent spectral independence approach, and allows us to obtain new rapid mixing results. As applications of our approach, we show that:
-
1.
On graphs of treewidth , the mixing time of the Glauber dynamics is , which recovers the recent results of Eppstein and Frishberg [EF21] with improved exponents and simpler proofs;
-
2.
On bounded-degree planar graphs, strong spatial mixing implies mixing time of the Glauber dynamics, which gives a faster algorithm than the previous deterministic counting algorithm by Yin and Zhang [YZ13].
1 Introduction
Spin systems, also known as undirected graphical models, are important models for describing the joint distribution of interacting random variables. Spin systems were first studied in statistical physics but have been widely used in many other areas including computer science, social network, and biology.
Consider a general spin system defined on a graph . The model describes a distribution, called Gibbs distribution, over all spin configurations where each vertex is assigned a spin from a finite set . Given functions characterizing pairwise interactions and measuring external bias, the Gibbs distribution associated with a spin system is defined as
(1) |
where is a normalizing constant known as the partition function.
We mention two classical examples of spin systems which have been extensively studied. The first is the hardcore model of independent sets. The Gibbs distribution is supported over the collection of all independent sets of where each has density , where . Observe that this corresponds to , , and in Eq. 1 with being the indicator vector. Another example is random vertex colorings where the Gibbs distribution is the uniform distribution over all proper -colorings of . This corresponds to , , and in Eq. 1.
We study the problem of sampling from the Gibbs distribution of a spin system. In particular, we consider the single-site Glauber dynamics (also called Gibbs sampling) which is perhaps the simplest and most popular Markov chain Monte Carlo (MCMC) algorithm for sampling from a high-dimensional distribution. The Glauber dynamics is an ergodic Markov chain where in each iteration we choose a vertex uniformly at random, and update the spin conditional on the spin values of all other vertices. The mixing time of Glauber dynamics is the smallest such that, starting from any initial configuration , the distribution of after steps is -close to the target distribution in total variation distance.
Bounding the mixing time of Glauber dynamics is challenging even for simplest spin systems like the hardcore model or random colorings. One common method for establishing mixing time bounds of Markov chains is by proving associated functional inequalities such as the Poincaré inequality or the standard/modified log-Sobolev inequality. For a function , the expectation of with respect to a Gibbs distribution is defined as . The variance and entropy functionals are defined as , and for non-negative . For Glauber dynamics specifically, the Poincaré inequality can be expressed equivalently in the form of
(2) |
where on the right-hand side is the variance of under the conditional distribution of where all other vertices are fixed to be some , and takes expectation over . The inequality Eq. 2 is called Approximate Tensorization (AT) of variance. The Poincaré inequality, or equivalently AT of variance, implies that Glauber dynamics mixes in steps.
One can also consider the entropy analog of Eq. 2, known as Approximate Tensorization (AT) of entropy:
(3) |
AT of entropy is a much stronger property. It implies the modified log-Sobolev inequality for the Glauber dynamics, and gives a sharper mixing time bound (see Lemma 2.7) which is optimal if is constant.
Both Eqs. 2 and 3 were explicitly mentioned and carefully studied in [CMT15], though they were implicitly used in even earlier works, see e.g. [Mar99, GZ03, Ces01]. On a high level, both Eqs. 2 and 3 say that the global fluctuation of a function, quantified as variance or entropy, is always controlled by the sum of local fluctuations at each single variable, which is intuitively true when all variables are sufficiently independent from each other. Indeed, one has in Eqs. 2 and 3 with the equality holds iff is a product distribution. See also [BCŠV22, KHR22] for recent applications of AT in learning and testing.
Establishing AT of variance and entropy is a challenging task even for simple distributions. For variance, the canonical path approach is a common way of proving the Poincaré inequality (i.e., bounding the spectral gap) in the setting of spin systems. The high-level idea is to construct a family of canonical paths or more generally a multi-commodity flow between each pair of configurations and then use the congestion of the flow to establish Eq. 2. Canonical paths have found many successful applications such as matchings [JS89], ferromagnetic Ising model [JS93], and bipartite perfect matchings [JSV04]. However, constructing canonical paths is not easy at all and usually involves some specific technical complications for each problem.
Meanwhile, for entropy it is much more difficult to establish AT or other related functional inequalities like the standard/modified log-Sobolev inequality. In many cases they are proved analytically, relying on the topology being the lattice [Mar99, GZ03, Ces01]. It was also known that for high-temperature models such as under the Dobrushin uniqueness, AT of entropy holds with [CMT15, Mar19].
Recently, the spectral/entropic independence approach was introduced [ALO20, AJK+22] and becomes a powerful tool for establishing AT of both variance and entropy. For many families of spin systems it achieves in Eqs. 2 and 3 and thus shows optimal mixing of Glauber dynamics. For example, for the hardcore model one obtains AT of variance and entropy with when by a sequence of recent works [ALO20, CLV20, CLV21, CFYZ21, AJK+22, CFYZ22, CE22], where denotes the maximum degree and is the tree uniqueness threshold [Wei06]. It was known that Glauber dynamics can be exponentially slow when [MWW09] and hence for some graphs. The critical value in fact pinpoints a computational phase transition, see [Wei06, Sly10, SS14, GŠV16] for more discussions. Though the spectral independence approach works well on general graphs for proving optimal mixing time, it does not apply when the mixing time is a larger polynomial instead of nearly linear. For example, for the hardcore model on trees the Glauber dynamics always mixes in polynomial time for all [JSTV04] but (constant) spectral independence fails for large .
In this paper, we ask if there is a natural and direct way of proving AT beyond canonical paths or spectral independence, especially when we have extra knowledge of the underlying graph structure. We present a simple combinatorial approach for proving AT of variance and entropy based on the existence of balanced separators of the graph, see Propositions 3.1 and 4.6. Using this approach we are able to obtain new rapid mixing results as immediate corollaries for certain classes of graphs such as bounded-treewidth graphs or planar graphs. We are also able to derive many previous results in a simple and straightforward way in contrast to the detailed technical proofs that were known previously. For example, we can easily deduce within one page that Glauber dynamics for -colorings on complete -ary trees is rapidly mixing for all , see Proposition 3.7.
Our proof approach is nicely explained for graphs of bounded treewidth. The treewidth of a graph characterizes the closeness of a graph to a tree, and is an important parameter for getting fixed parameter tractable algorithms for many graph problems. For the hardcore model and random colorings on bounded-treewidth graphs, we obtain rapid mixing of Glauber dynamics as immediate consequences of Propositions 3.1 and 4.6, which improves the results in [EF21] with better exponents. We remark that we can also obtain similar results for more general spin systems but for simplicity we only state them for the hardcore model and vertex colorings which are the most commonly studied examples .
Theorem 1.1.
Let be an -vertex graph of treewidth . The mixing time of the Glauber dynamics for sampling from the hardcore model on with fugacity is .
Theorem 1.2.
Let be an -vertex graph of maximum degree and treewidth . For any , the mixing time of the Glauber dynamics for sampling uniformly random -colorings of is .
Previously, [EF21] presented mixing time bounds for hardcore model where and for random -colorings. Our mixing time bounds are better in the exponents and our proof is much simpler avoiding the technical construction of multi-commodity flows as done in [EF21]. We note that for bounded-treewidth graphs one can exactly compute the partition function and thus sample in time , see e.g. [YZ13, WTZL18] and the references therein. However, the performance of the Glauber dynamics is still unclear for such graphs which is the focus of this paper.
We establish Theorems 1.1 and 1.2 by factorizing variance recursively using a balanced separator of the graph; this is inspired by previous works on trees [MSW04] and lattice [Ces01, CP21]. Since has treewidth , it has a balanced separator of size at most . More specifically, there exists a partition such that , , and there is no edge between and . Given such we are able to establish a block factorization of variance
(4) |
where is a constant independent of . Since the size of is bounded, we can factorize the variance on into single vertices. Then by recursively applying Eq. 4 to and respectively, we are able to prove AT of variance with constant . We present our universal proof approach for AT in Propositions 3.1 and 4.6, and summarize basic tools for establishing such block factorization Eq. 4 in Sections 3.2 and 4.2.2.
We note that [EF21] also used balanced separators to construct multi-commodity flows and obtained similar results. In comparison, our approach establishes AT in a more direct way and is arguably simpler in nature. We remark that we could also establish AT of entropy, but since we are not aiming to get an optimal exponent in the mixing time we choose to work with variance which is much easier for calculation.
As another application of our approach, we consider planar graphs and show that the Strong Spatial Mixing (SSM) property (Definition 5.2) implies nearly optimal mixing time of Glauber dynamics. SSM is an important structural property characterizing the exponential decay of correlations between two subsets of variables as their graph distance grows. There is a rich literature in establishing rapid mixing of Glauber dynamics from SSM but mostly only for special classes of graphs such as lattice, see e.g. [Mar99, GZ03, Ces01, DSVW04, GMP05]. Here we consider general bounded-degree planar graphs with no restriction on the underlying topology. Previously, [YZ13] presented a determinstic counting algorithm under the same assumptions with large polynomial running time. Our result can be viewed as a faster sampling algorithm with nearly linear running time. We state our result for the hardcore model but it extends easily to general spin systems.
Theorem 1.3.
Let be an -vertex planar graph of maximum degree . Consider the hardcore model on with fugacity . If SSM holds, then the mixing time of the Glauber dynamics is .
In fact, we prove Theorem 1.3 more generally for all graphs of polynomially bounded local treewidth, see Theorem 5.3. This is a class of graphs such that all local balls of radius have treewidth . This includes for examples bounded-treewidth graphs, planar graphs, or graphs with polynomial neighborhood growth such as regions in (see Theorem 5.7). We remark that the result in [YZ13] holds only for graphs of linearly bounded local treewidth and thus our result holds for a larger class of graphs. We establish AT of entropy by combining SSM and local structure of the underlying graph. In particular, we show a new low-diameter decomposition of graphs (see Lemma 5.4) based on a classical result of Linial and Saks [LS93], which allows us to focus on subgraphs of diameter assuming SSM. Note that here we have to work with entropy in order to get an mixing time.
Paper organization.
After giving preliminaries in Section 2, we present our new framework for establishing approximate tensorization in Section 3. We prove Theorems 1.1 and 1.2 in Section 4 for Glauber dynamics on bounded-treewidth graphs. We prove Theorem 1.3 more generally for graphs of bounded local treewidth in Section 5. We present missing proofs of basic tools for approximate tensorization and block factorization in Section 6.
Acknowledgments.
The author thanks Kuikui Liu, Eric Vigoda, and Thuy-Duong Vuong for stimulating discussions. The author thanks Eric Vigoda for helpful comments on the manuscript.
2 Preliminaries
2.1 Spin systems
We consider general families of spin systems, also known as undirected graphical models. Let be a graph. Every vertex is associated with a finite set of spins (colors, labels). Let denote the product space of all spin assignments called configurations. Let be a collection of edge interactions such that for every edge , is a function mapping spins of the two endpoints to a non-negative weight, characterizing the interaction between neighboring vertices. Let be a collection of external fields such that for every vertex , assigns a weight to each color representing the bias towards each color. The induced Gibbs distribution is given by
where denotes the color of a vertex and denotes the (partial) spin assignment of vertices in , and is the partition function defined as
We assume so that the Gibbs distribution is well-defined.
For a subset let denotes the marginal distribution projected on . We say a spin configuration on some subset is a (feasible) pinning if . For any pinning on , the conditional Gibbs distribution , where the configuration on is fixed to be , can be viewed as another spin system on the graph . For any we further define to be the marginal on under the conditional distribution .
The Glauber dynamics is one of the simplest and most popular Markov chain Monte Carlo algorithms for sampling from the Gibbs distribution of a spin system. In each step of Glauber dynamics, we pick a vertex uniformly at random and update the spin value conditional on the configuration of all other vertices, i.e., from the conditional marginal . Two distinct configurations are said to be adjacent iff they differ in the spin value at a single vertex. Hence, for Glauber dynamics configurations only move to adjacent ones. We assume that under any pinning , the Glauber dynamics for the conditional distribution is irreducible; namely, one feasible configuration can move to another through a chain of feasible configurations where consecutive pairs are adjacent. This is necessary for the Glauber dynamics to be ergodic and is naturally true for many spin systems of interest, including the hardcore model, random -colorings with , etc.
2.2 Block factorization of variance and entropy
Consider the Gibbs distribution of a spin system defined on a graph with state space .
Definition 2.1.
Let be a function.
-
•
The expectation of with respect to is defined as
-
•
The variance of with respect to is defined as
-
•
For non-negative , the entropy of with respect to is defined as with the convention that .
We often omit the subscript and write when the underlying distribution is clear from context.
Let be a subset of vertices. For any pinning on , the expectation, variance, and entropy of a function with respect to the conditional Gibbs distribution is denoted as , , and ; in particular, we view all of them as functions mapping a full configuration to a real, which depends only on the pinning .
The following lemma summarizes several basic and important properties of variance and entropy in spin systems. The proof can be found in [MSW04] and the references therein.
Lemma 2.2 ([MSW04, Eq. (3), (4), (5)]).
Let be a function.
-
(1)
For any subsets , it holds that ;
-
(2)
For where are pairwise disjoint subsets such that every distinct pair of are disconnected in , it holds that ;
-
(3)
For any subsets such that there are no edges between and , it holds that .
All three properties (1), (2), (3) hold with variance as well.
Definition 2.3.
Let be a collection (possibly multiset) of subsets of . We say that satisfies -factorization of variance (resp., entropy) with multiplier if for every function (resp., ) it holds
(5) |
Remark 2.4.
Following [BGGŠ22], we call the “multiplier” instead of “constant” since it may depend on .
Remark 2.5.
The block factorization of variance/entropy can be defined more generally for weighted blocks; we refer to [CP21] for more details.
The -factorization corresponds to the heat-bath block dynamics for sampling from : In each iteration, we pick uniformly at random and update from the conditional distribution . More specifically, the -factorization of variance is equivalent to the Poincaré inequality of such block dynamics and the -factorization of entropy implies the modified log-Sobolev inequality (but the converse is not true). See [CMT15, CP21].
Definition 2.6.
We say that satisfies approximate tensorization (AT) of variance (resp., entropy) with multiplier if for every function (resp., ) it holds
(6) |
Observe that AT is exactly -factorization with . Establishing AT with a small allows us to conclude rapid mixing of Glauber dynamics, see [CLV21] and the references therein.
Lemma 2.7.
Suppose it holds that . If satisfies AT of variance with multiplier , then the mixing time of Glauber dynamics is . If satisfies AT of entropy with multiplier , then the mixing time of Glauber dynamics is .
2.3 Separator decomposition
We prove AT via a divide-and-conquer argument. To accomplish this we need the following separator decomposition, slightly modified from [YZ13]. Such ideas also appeared in many previous works to obtain fixed-parameter tractable algorithms in graphs of bounded treewidth.
Definition 2.8 (Separator Decomposition, [YZ13]).
For a graph , a separator decomposition tree of is a rooted tree satisfying the following conditions:
-
•
Every node of is a pair where is a subset of vertices and is a separator of ;
-
•
The root node of is a pair ;
-
•
For every non-leaf node , the children of are connected components of and their separators;
-
•
Every leaf of is a pair .
A separator decomposition tree is said to be balanced if for every internal node and every child of , it holds that .
Remark 2.9.
Observe that all separators appearing in form a partition of .
It is easy to see that the height of a balanced separator decomposition tree is .
Lemma 2.10.
Suppose is an -vertex graph with . If is a balanced separator decomposition tree of , then the height of is less than .
Proof.
Assume . Let be a leaf of at distance from the root node . Since all separators are balanced, we have , contradiction. ∎
3 Combinatorial Approach for Approximate Tensorization
3.1 Approximate tensorization via separator decomposition
Our main step for establishing approximate tensorization of variance and entropy is given by the following proposition. Roughly speaking, if we can find a (small) separator of the underlying graph whose removal disconnects , then we can factorize variance/entropy into the block and all connected components of , whose sizes are significantly smaller if the separator is balanced. Given a (balanced) separator decomposition tree, we can continue this process for each smaller block, and in the end tensorize into single vertices.
Proposition 3.1.
Let be a spin system defined on a graph with associated Gibbs distribution . Suppose that is a separator decomposition tree of satisfying:
-
1.
(Block Factorization for Decomposition) For every node , there exists , such that for any function we have
(7) For all leaves we take .
-
2.
(Approximate Tensorization for Separators) For every node , there exists , such that for any function we have
(8)
Then the Gibbs distribution satisfies approximate tensorization of variance with multiplier given by
where the maximum is taken over all nodes of , and the product is over all nodes in the unique path from the root to . Namely, for every function we have
Remark 3.2.
The entropy version of Proposition 3.1 is also true and the proof is exactly the same with replacing .
The proof of Proposition 3.1 is straightforwardly applying properties of variance and entropy given in Lemma 2.2. We use it as our basic strategy for obtaining meaningful AT bounds in many applications.
Proof of Proposition 3.1.
The lemma follows by decomposing the variance level by level on the separator decomposition tree by Lemmas 2.2, 7 and 8. More specifically, for the root we have that
where every is a connected component of and we can factorize without loss since it is a product distribution (Lemma 2.2). In particular, (together with its separator) is a child of in . Continue the process for each child , we obtain
where every is a connected component of . So in the end we obtain
where for each ,
where is the unique node such that (see Remark 2.9) and the product runs through all on the unique path from to . This shows the lemma. ∎
3.2 Tools for factorization of variance and entropy
To apply Proposition 3.1, one needs to establish block factorization of variance/entropy for decomposition Eq. 7 and approximate tensorization for separators Eq. 8. In this subsection, we summarizes known and gives new results for factorization of variance and entropy in a very general setting, which are useful for showing Eqs. 7 and 8. The lemmas in this subsection are suitable for establishing AT or block factorization for disjoint blocks; for overlapping blocks we also need Lemma 4.10 from Section 4.2.2.
Two-variable factorization with weak correlation
We first consider AT for two variables. Let and be two random variables with joint distribution , fully supported on finite state spaces and respectively. For applications such as proving Eq. 7, would represent a block of vertices, namely and , and we consider the joint distribution of under an arbitrary pinning outside .
Denote the marginal distribution of by , and for let be the distribution of conditioned on . We define the marginal distribution and for the conditional distribution in the same way.
We use , , and to denote the expectation, variance, and entropy of some function or under the distribution , and use , , and to denote the variance and entropy under the conditional distribution . As before, and represent the expectation of and where is chosen from .
We first show AT for when the correlation between and is bounded pointwisely. The following lemma is implicitly given in [Ces01, DPPP02]. Here we give a self-contained, simplified proof with an improved constant. The lemma below can also be applied to the main results from [Ces01, DPPP02].
Lemma 3.3 ([Ces01, Proposition 2.1] and [DPPP02, Lemma 5.1 & 5.2]).
Suppose there exists a real such that for all ,
(9) |
Then we have
(10) | ||||
(11) |
The proof of Lemma 3.3 can be found in Section 6.1.
Two-variable factorization with strong correlation
Our next result allows stronger correlations between and , at a cost of larger constants for AT.
Lemma 3.4.
Suppose there exist reals with such that
(12) | ||||
(13) |
Then we have
(14) | ||||
(15) |
where .
Remark 3.5.
We remark that the constant for entropy factorization Eq. 15 is not optimal since depends logarithmically on the size of state space, . Getting rid of this is an interesting open question.
Multi-variable factorization
Finally, we consider approximate tensorization for multiple variables, which is helpful for establishing Eq. 8 when the size of the separator is bounded.
Let be random variables where is fully supported on finite for each . The joint distribution of , over the product space is denoted by . For two disjoint subsets and a partial assignment with , let denote the conditional marginal distribution on conditioned on that the variables in are assigned values from . In particular, denotes the marginal on . We write and when the underlying set is for simplicity.
As before, we write as the conditional entropy of on given and let be its expectation where is drawn from ; the variance versions are defined in the same way.
Lemma 3.6.
Suppose there exists a real such that for every with , every with , every with , and every with and , it holds
Then we have
(16) | ||||
(17) |
where .
Observe that for , Lemma 3.6 recovers Lemma 3.4 for the special case . The proofs of Lemmas 3.4 and 3.6 can be found in Section 6.2, which are simple applications of the spectral independence approach based on [AL20, ALO20, FGYZ21].
3.3 Example
Here we give a simple example as an application of Proposition 3.1 and tools from Section 3.2. We show polynomial mixing time of Glauber dynamics for sampling -colorings on a complete -ary tree for all and . This was known previously, see [GJK10, LM11, LMP09, TVVY12, SZ17] for even sharper results. However, Proposition 3.1 allows us to establish this fact in a more straightforward manner, avoiding technical complications such as constructing canonical paths or Markov chain decomposition.
Proposition 3.7 ([GJK10], see [LM11, LMP09, TVVY12, SZ17] for sharper results).
Let and be integers. Suppose is a complete -ary tree of height , and denote the number of vertices by . The mixing time of the Glauber dynamics for sampling uniformly random -colorings of is .
Proof.
We apply Proposition 3.1. We may assume that since otherwise rapid mixing follows by standard path coupling arguments [Jer03]. The separator decomposition tree can be obtain from the original tree : every node of is of the form where is the subtree rooted at and the single-vertex set is a separator of . In particular, we have in Eq. 8 for each separator . We claim that in Eq. 7 for each node . To see this, consider two pinnings where receives colors and , and we can couple all children of with probability when neither nor is used. Hence, we can couple the whole subtree with the same probability, implying
Thus, we deduce from Lemma 3.4 that . By Proposition 3.1 we obtain that AT of variance holds with multiplier
The mixing time then follows from Lemma 2.7. ∎
4 Rapid Mixing for Graphs of Bounded Treewidth
In this section we consider graphs of bounded treewidth. It is well-known that all bounded-treewidth graphs have a balanced separator decomposition tree with separators of bounded size.
Lemma 4.1 ([RS86, Gru12]).
If is a graph of treewidth , then there exists a balanced separator decomposition tree for such that for every node in it holds .
4.1 Hardcore model
In this subsection we prove Theorem 1.1 for the hardcore model via Proposition 3.1. As a byproduct, we also show mixing time of the Glauber dynamics on arbitrary planar graphs with no restriction on the maximum degree, see Theorem 4.4.
To apply Proposition 3.1, we need to establish block factorization for every decomposition Eq. 7 and approximate tensorization for all separators Eq. 8, which are given by the following two lemmas.
Lemma 4.2.
Consider the hardcore model on a graph with fugacity . Let be subsets with . For any pinning on , the conditional hardcore Gibbs distribution satisfies -factorization of variance with constant . In particular, for every function it holds
(18) |
Proof.
Lemma 4.3.
Consider the hardcore model on a graph with fugacity . Let be a subset with . For any pinning on , the conditional hardcore Gibbs distribution satisfies approximate tensorization of variance with constant . In particular, for every function it holds
(19) |
Proof.
We are now ready to prove Theorem 1.1.
Proof of Theorem 1.1.
We deduce the theorem from Proposition 3.1. Since the graph has treewidth , there exists a balanced separator decomposition tree by Lemma 4.1 where all separators have size at most . The height of is by Lemma 2.10. Block factorization for decomposition Eq. 7 is shown by Lemma 4.2 with for each node . Approximate tensorization for separators Eq. 8 is shown by Lemma 4.3 with for each separator . Thus, we conclude from Proposition 3.1 that the hardcore Gibbs distribution satisfies approximate tensorization of variance with multiplier
The mixing time then follows from Lemma 2.7. ∎
As a byproduct, we also show mixing of Glauber dynamics for the hardcore model on any planar graph. See [BKMP05, Hei20, EF21] which can obtain similar results.
Theorem 4.4.
Suppose is an -vertex planar graph. The mixing time of the Glauber dynamics for sampling from the hardcore model on with fugacity is .
Proof.
We apply Proposition 3.1. It is well-known that every planar graph has a balanced separator of size , such that each connected component of has size at most ; see [LT79]. We can further find balanced separators for each component, and construct a balanced separator decomposition tree recursively. By Lemma 4.2, for every node we have block factorization for decomposition Eq. 7 with constant
By Lemma 4.3, for every separator we have approximate tensorization for separators Eq. 8 with constant
Therefore, we obtain from Proposition 3.1 that AT of variance holds with multiplier
The mixing time then follows from Lemma 2.7. ∎
4.2 List colorings
In this subsection we prove Theorem 1.2 from the introduction for colorings. We consider the more general setting of list colorings, where each vertex is associated with a list of available colors, and every list coloring assigns to each vertex a color from its list such that adjacent vertices receive distinct colors. The Glauber dynamics is ergodic for list colorings if for all . One handy feature of list colorings is that any pinning on a subset of vertices induces a new list coloring instance on the induced subgraph , for which still holds for all unpinned where is the new list of available colors conditioned on ; see, e.g., [GKM15].
Theorem 4.5.
Let be a graph of maximum degree and treewidth . Suppose each vertex is associated with a color list such that . The mixing time of the Glauber dynamics for sampling uniformly random list colorings of is .
4.2.1 A generalized version of Proposition 3.1
For list colorings we are not able to directly apply Proposition 3.1 to prove Theorem 4.5. Unlike the hardcore model where a vertex can always be unoccupied under any pinning of its neighbors, the lack of such “universal” color makes it hard to establish block factorization for decomposition Eq. 7 using tools from Section 3.2. In this subsection we present a stronger version of Proposition 3.1 which allows us to establish block factorization more easily for list colorings. Our applications in Section 5 also requires this more general version.
For subsets and an integer , define
to be the ball around of radius in , and define
to be the portion of the ball contained in . In Proposition 4.6 below, we replace by in all suitable places in Proposition 3.1, which makes it easier for us to establish block factorization for decomposition Eq. 20 for hard-constraint models like list colorings.
Proposition 4.6.
Let be a spin system defined on a graph with associated Gibbs distribution . Let be an integer. Suppose that is a separator decomposition tree of satisfying
-
1.
(Block Factorization for Decomposition) For every node , there exists , such that for any function we have
(20) For all leaves we take .
-
2.
(Approximate Tensorization for Separators) For every node , there exists , such that for any function we have
(21) -
3.
(Bounded Coverage) There exists such that for any vertex ,
(22)
Then the Gibbs distribution satisfies approximate tensorization of entropy with multiplier given by
where the maximum is taken over all nodes of , and the product is over all nodes in the unique path from the root to . Namely, for every function we have
Remark 4.7.
The variance version of Proposition 4.6 is also true and the proof is exactly the same.
Remark 4.8.
Observe that if then and we have in Eq. 22, so we recover exactly Proposition 3.1.
Proof of Proposition 4.6.
The proof is the same as for Proposition 3.1 by decomposing the entropy level by level on the separator decomposition tree . We similarly obtain
where for each ,
where the product runs through all on the unique path from to . This establishes the proposition. ∎
The following lemma is helpful for establishing bounded coverage Eq. 22 when applying Proposition 4.6.
Lemma 4.9.
Under the assumptions of Proposition 4.6:
-
•
If has maximum degree , then ;
-
•
If and is a balanced separator decomposition tree, then .
Proof.
If has maximum degree , we have for any that
where the second inequality is because all separators form a partition of (see Remark 2.9).
4.2.2 Block factorization via marginal distributions
Note that to apply Proposition 4.6, we need to establish the block factorization for decomposition Eq. 20 where the two blocks and overlap with each other. In particular, we can no longer apply Lemmas 3.4 and 3.3 directly since fixing the spin assignment in one block greatly changes the conditional distribution on the other block because of the overlap. Instead, we show block factorization for the marginal distribution for the two blocks and , where we essentially exclude the overlap part. It turns out these two notions of block factorization are equivalent to each other, and for the latter we are able to apply tools from Section 3.2 since the blocks are now disjoint.
We show this equivalence in a general setting. Let be three random variables taking values from finite state spaces respectively. Their joint distribution is denoted by . Denote the marginal distribution for by and similarly for other choices of subsets of variables. We establish block factorization for into two blocks and from approximate tensorization for the marginal distribution . More precisely, we say satisfies -factorization of entropy with constant if for every function , it holds
(23) |
We say satisfies -factorization (i.e., approximate tensorization) of entropy with constant if for every function , it holds
(24) |
where the reference distribution is the marginal distribution over the state space and in particular will be viewed as a function of (instead of ). The variance versions of Eqs. 23 and 24 are defined in the same way with replacing .
We show that these two notions of block factorization of entropy are equivalent to each other.
Lemma 4.10.
The joint distribution satisfies -factorization of entropy (resp., variance) with constant if and only if the marginal distribution satisfies -factorization (i.e., approximate tensorization) of entropy (resp., variance) with constant .
The proof of Lemma 4.10 can be found in Section 6.3.
4.2.3 Proof of Theorem 4.5
We apply Proposition 4.6 with , where we take in Eq. 22 by Lemma 4.9. We establish block factorization for decomposition Eq. 20 and approximate tensorization for separators Eq. 21 in the following two lemmas.
Lemma 4.11.
Consider list colorings on a graph of maximum degree where each vertex is associated with a color list such that . Let be subsets with . For any pinning on , the uniform distribution over list colorings conditioned on satisfies -factorization of variance with constant . In particular, for every function it holds
(25) |
Proof.
Let . We first prove -factorization for the marginal distribution on via Lemma 3.4. Let is an arbitrary pinning on which is feasible under . For any partial list coloring on that is feasible under , we claim that
(26) |
Note that must also be feasible under since and are not adjacent, and we can extend to a full list coloring by the assumption for all . By the chain rule, it suffices to show that for any and we have
(27) |
Consider a resampling procedure: starting from a random full list coloring generated from , we first resample all (free) neighbors of and then . When resampling neighbors, the probability of avoiding color is at least since there are at least two available colors for each neighbor by assumption, and when resampling the probability of getting is at least , thus establishing Eq. 27 and consequently Eq. 26.
We deduce from Eq. 26 that for any two pinnings on , it holds
Therefore, by Lemma 3.4 we have that satisfies -factorization of variance with constant . Finally, by Lemma 4.10 we conclude that satisfies -factorization of variance with the same constant, and Eq. 25 follows by taking expectation over . ∎
Lemma 4.12.
Consider list colorings on a graph of maximum degree where each vertex is associated with a color list such that . Let be a subset with . For any pinning on , the uniform distribution over list colorings conditioned on satisfies approximate tensorization of variance with constant . In particular, for every function it holds
(28) |
Proof.
Fix a pinning on some subset and consider two distinct vertices . For any two colors that are available to , there exists a color that is always available to no matter or . This is because: either all neighbors of are pinned and we can choose any color available to ; or has at least one free neighbor and thus at least three available colors, and we choose one of them which is not or . For this color , we have for , which was already shown in the proof of Lemma 4.11 (see Eq. 27). This implies that
The lemma then follows from an application of Lemma 3.6, and taking expectation over we obtain Eq. 28 as claimed. ∎
We are now ready to prove Theorem 4.5 and Theorem 1.2 from the introduction.
Proof of Theorem 4.5.
We deduce the theorem from Proposition 4.6. Since the graph has bounded treewidth, there exists a balanced separator decomposition tree by Lemma 4.1 such that each separator has size at most . The height of is at most by Lemma 2.10. We pick and hence we can take in Eq. 22 by Lemma 4.9. Block factorization for decomposition Eq. 20 is shown by Lemma 4.11 with for each node . Approximate tensorization for separators Eq. 21 follows from Lemma 4.12 with for each separator , noting that . Thus, we conclude from Proposition 4.6 that the uniform distribution of list colorings satisfies approximate tensorization of variance with multiplier
The mixing time then follows from Lemma 2.7. ∎
Proof of Theorem 1.2.
If then optimal mixing of Glauber dynamics follows from standard path coupling arguments [Jer03]. Otherwise, the theorem follows from Theorem 4.5. ∎
5 Rapid Mixing via SSM for Graphs of Bounded Local Treewidth
In this section we prove Theorem 1.3 from the introduction. We consider families of graphs of bounded local treewidth, which include planar graphs as a special case. We establish rapid mixing of Glauber dynamics under SSM for all such families.
Graphs of bounded local treewidth are defined as follows.
Definition 5.1 (Bounded Local Treewidth).
Let be a graph and be reals. We say has polynomially bounded local treewidth if it satisfies the following diameter-treewidth property: for any subgraph of , it holds
Examples of families of graphs that have bounded local treewidth include:
-
•
Graphs of bounded treewidth;
- •
-
•
Graphs of bounded growth, see Section 5.2.
The strong spatial mixing property characterizes the decay of correlations in a quantitative way. Roughly speaking, it says that in a spin system the correlation between the spin on a vertex and the configuration on a subset decays exponentially fast with their graph distance, and such decay holds uniformly under any pinning on any subset of vertices.
Definition 5.2 (Strong Spatial Mixing (SSM)).
Consider a spin system on a graph with Gibbs distribution . Let and be reals. We say the spin system satisfies the strong spatial mixing property with exponential decay rate if for all pinning on , for any vertex and feasible spin , and for any subset and two feasible configurations on , it holds
Our main result in this section is stated as follows. We state it only for the hardcore model but it extends naturally to other spin systems as long as Glauber dynamics is ergodic under any pinning.
Theorem 5.3.
Let be an -vertex graph of maximum degree . Suppose that has bounded local treewidth with constant parameters , and suppose that the hardcore model on with fugacity satisfies SSM with constant parameters . Then the mixing time of the Glauber dynamics for the hardcore Gibbs distribution on is .
5.1 Proof of Theorem 5.3
For graphs of bounded local treewidth, if the diameter is mildly bounded (say poly-logarithmically), then the treewidth is also bounded. However, in general the diameter of can be arbitrarily large. We need the following low-diameter decomposition of graphs which allows us to focus on subgraphs of small diameters and thus of small treewidth. We remark that Lemma 5.4 holds for arbitrary graphs with no restriction on the local treewidth or maximum degree.
Lemma 5.4.
Let be an -vertex graph where . For any integer , there exists a partition of the vertex set satisfying the following conditions:
-
1.
For each , we have ;
-
2.
For each vertex , we have .
The proof of Lemma 5.4 is postponed to Section 5.3, which is based on a classical low-diameter decomposition result of Linial and Saks [LS93].
To obtain block factorization of entropy from Lemma 5.4, we need the strong spatial mixing property, which is given by the following lemma.
Lemma 5.5.
Suppose SSM holds with constant parameters and . There exists a constant such that for any subsets and any , if then it holds for every function that,
(29) |
Proof.
First observe that it suffices to establish the inequality under an arbitrary pinning on and then Eq. 29 follows by taking expectation over . By Lemma 4.10, it then suffices to prove block factorization of entropy for the marginal distribution for the two blocks and . Suppose where and let be any feasible configuration on . For each define and let be the configuration restricted on . By SSM we have that for any two feasible configurations on ,
where we pick for large enough constant and we assume so that the last two inequalities hold. Similarly, we also have
Then, we deduce from Lemma 3.3 that the marginal distribution satisfies -factorization of entropy with constant (again we assume so that the first inequality holds). Eq. 29 then follows from Lemma 4.10 and averaging over . ∎
We now present the proof of Theorem 5.3.
Proof of Theorem 5.3.
By Lemma 5.5, there exists a constant such that Eq. 29 holds for all subsets and for whenever . We apply Lemma 5.4 to for , and suppose the resulting partition is . For each let , and so , and . Then we deduce from Eq. 29 that for every function ,
(30) |
where the last inequality is due to and that for any (Lemma 2.2). The good news are that each ball has diameter and thus treewidth, and every vertex is contained in many balls; both properties are guaranteed by Lemma 5.4. Thus, to prove approximate tensorization of entropy for , it suffices to prove it for each ball .
We apply Proposition 4.6 to show AT for each (to be more precise, by Proposition 4.6 we get AT uniformly under any pinning outside and then we take expectation over ). The balanced separator decomposition tree is given by Lemma 4.1. Observe that the size of each separator is since the treewidth of is , and the height of the decomposition tree satisfies by Lemma 2.10 since all separators are balanced. We take in Proposition 4.6 for some sufficiently large constant , such that block factorization for decomposition Eq. 20 holds with for each node ; this again follows from Lemma 5.5 where we have and , and Eq. 29 holds whenever . Also in Eq. 22 by Lemma 4.9. Therefore, we conclude from Proposition 4.6 that every satisfies approximate tensorization of entropy with multiplier
(31) |
where is the AT multiplier in Eq. 21 for the ball and we take maximum over all separators.
Observe that, for each node in the decomposition tree we have
for some constant when is sufficiently large. Define to be the maximum of optimal AT multipliers over all subsets of vertices of size at most ; namely, for all with , it holds for every function that
Note that is monotone increasing. Thus, we obtain from Eq. 31 that
(32) |
Combining Eqs. 30 and 32 and Lemma 5.4, we obtain
More generally, for any subset of size where is some fixed constant, we have for every function that
This is shown by the same arguments applied to under an arbitrary pinning outside and then taking expectation. Hence, we have established the following recursive bound: for all ,
(33) |
We now solve Eq. 33. By choosing large enough, we may assume that for all ,
We prove by induction that
(34) |
Eq. 34 is trivial for . Suppose Eq. 34 is true for all . Then we have
Thus, we deduce from the recursive bound Eq. 33 that
establishing Eq. 34.
Therefore, we have . In particular, the hardcore Gibbs distribution on satisfies AT with multiplier , and the mixing time follows from Lemma 2.7. ∎
5.2 Rapid mixing via SSM for graphs of bounded growth
In this subsection we consider graphs with polynomially bounded neighborhood growth.
Definition 5.6 (Bounded Growth).
Let be a graph and be reals. We say has polynomially bounded growth if for any vertex and any integer it holds
Observe that any family of graphs of bounded growth also have bounded maximum degree and bounded local treewidth. To see the latter, for any nontrivial subgraph of a graph of bounded growth, let be any vertex of and , and we have
(35) |
showing that has bounded local tree width. Thus, we deduce from Theorem 5.3 that for graphs of bounded growth, SSM implies mixing of Glauber dynamics. We can actually improve this mixing time bound slightly.
Theorem 5.7.
Let be an -vertex graph. Suppose that has bounded growth with constant parameters , and suppose that the hardcore model on with fugacity satisfies SSM with constant parameters . Then the mixing time of the Glauber dynamics for the hardcore Gibbs distribution on is .
Proof.
Following the proof of Theorem 5.3, we have that for every function ,
which is Eq. 30. For bounded-growth graphs, we have similarly as Eq. 35 that
where is some fixed constant, and the last inequality is due to Lemma 5.4. With the same definition of , we deduce that
where the last inequality follows from Lemma 5.4. This allows us to obtain the following recursive bound: for all where is some fixed constant,
(36) |
Solving Eq. 36, we can get . Thus, AT of entropy holds with multiplier , and we get the mixing time bound from Lemma 2.7. ∎
5.3 Low-diameter decomposition
In this subsection we present the proof of Lemma 5.4.
Given a graph and a partition of the vertex set into clusters, define the quotient graph to be the graph with vertex set where two clusters are adjacent iff there exists such that . Namely, is the graph obtained from by contracting every cluster into a single vertex and connect two vertices iff the two clusters are adjacent. We need the following classical result of low-diameter decomposition due to Linial and Saks [LS93].
Lemma 5.8 ([LS93, Theorem 2.1]).
Let be an -vertex graph where . There exists a partition of the vertex set into clusters such that the following conditions hold:
-
1.
For each , we have ;
-
2.
The quotient graph has chromatic number at most .
Remark 5.9.
Proof of Lemma 5.4.
Let be the graph with vertex set where two vertices are adjacent iff their graph distance is at most . By Lemma 5.8, there exists a partition for the graph such that
-
1.
For each , we have ;
-
2.
The quotient graph has chromatic number at most .
We claim that such a partition satisfies our requirements.
Fix . For , since the diameter of is at most , there exists a path in connecting and of length at most . By replacing every edge in in with a path in of length at most , we obtain a path in of length at most . In particular, this new path is contained in the induced subgraph . Hence, the distance between all pairs of in is at most . Also, note that every vertex in is at distance at most from in . Thus, the diameter of is at most , verifying the first condition.
For the second condition, take an arbitrary vertex and consider all clusters at distance at most from . Then all these clusters have pairwise distance at most , meaning that they form a clique in the quotient graph . Since the quotient graph is -colorable, the size of this clique is at most , implying the second condition. ∎
6 Proofs for Variance and Entropy Factorization
In this section, we give missing proofs from Sections 3.2 and 4.2.2.
6.1 Proof of Lemma 3.3
In this subsection we present the proof of Lemma 3.3. Our proof here avoids some technical difficulties appeared in [Ces01, Proposition 2.1] or [DPPP02, Lemma 5.2], and allows us to get a slightly better constant for AT.
Proof of Lemma 3.3.
We notice that factorization of entropy Eq. 11 implies factorization of variance Eq. 10 with the same constant by a standard linearization argument (plugging into Eq. 11 and then taking ), see [CMT15]. Thus, it suffices to consider only entropy and prove Eq. 11.
By the law of total entropy (Lemma 6.5) we have
Hence, Eq. 11 is equivalent to that
(37) |
It suffices to show Eq. 37.
Without loss of generality we may assume that . Thus, . Let us define
(38) |
Then by definition we have
Let such that . If , then by the inequality for all and , we deduce that at the point
(39) |
Meanwhile, if then it must hold since is non-negative, and hence Eq. 39 still holds at with both sides equal to 0 (recall ). Thus, we obtain that
(40) |
where we use . Note that is the covariance of the two functions and .
Let us now define a probability distribution over by , i.e., for all . Note that is indeed a distribution since . We also define the marginal distributions and the conditional distributions similarly as for . Observe that for each , we have
Then, we deduce that
(41) |
Let and , so is a partition of . Define and in the same way with respect to . Recall that the condition in Eq. 9 is equivalent to that for all ,
Hence, we obtain that
The same upper bound holds for as well. Meanwhile,
and the same bound also holds for . Therefore, plugging into Eq. 41, we obtain that
(42) |
By Pinsker’s inequality, we have
(43) |
where the last equality follows from the observation that and similarly .
6.2 Proofs of Lemmas 3.4 and 3.6
Before presenting the proofs of these two lemmas, we first give some definitions and lemmas that are needed.
Our proof of approximate tensorization is based on the spectral independence approach. The following definitions and theorem are taken from [FGYZ21] which builds upon [AL20, ALO20] (see also [CGŠV21]).
Definition 6.1 (Influence Matrix, [FGYZ21]).
Let be a distribution over a finite product space . Fix a subset and a feasible partial assignment with . For any with , we define the (pairwise) influence of on conditioned on by
where are chosen from such that and .
The (pairwise) influence matrix is defined with entries given as above and also with for for diagonal entries.
Definition 6.2 (Spectral Independence, [FGYZ21]).
We say a distribution over a finite product space is -spectrally independent, if for every , every subset of size , and every feasible partial assignment with , the spectral radius of the influence matrix satisfies
Theorem 6.3 ([FGYZ21, Theorem 3.2]).
Let be a distribution over a finite product space . Let be a sequence of reals such that for each . If is -spectrally independent, then the spectral gap of the Glauber dynamics for sampling from satisfies
We also need the following well-known comparison between the spectral gap and the standard log-Sobolev constant (which holds more generally for any Markov chain).
Lemma 6.4 ([DSC96, Corollay A.4]).
Let be a distribution over a finite product space . Suppose is the spectral gap of the Glauber dynamics for sampling from , and is the standard log-Sobolev constant of it. Then we have
where . In particular, if has positive density on at least two assignments in then and we have
Finally, we need the following lemma for deriving AT from the spectral gap and the standard log-Sobolev constant.
Lemma 6.5 ([CMT15, Proposition 1.1]).
Let be a distribution over a finite product space . Suppose is the spectral gap of the Glauber dynamics for sampling from , and is the standard log-Sobolev constant of it. Then we have
We are now ready to prove Lemmas 3.4 and 3.6.
Proof of Lemma 3.4.
The spectral radius of the influence matrix of , as defined in Definition 6.1, is upper bounded by
Therefore, by Theorem 6.3 the spectral gap of the Glauber dynamics is at least , and by Lemma 6.4 the standard log-Sobolev constant is at least . We then deduce the lemma from Lemma 6.5. ∎
Remark 6.6.
Another way of proving Lemma 3.4 is by viewing the (pairwise) influence matrix as the Dobrushin dependency/influence matrix, and showing the Glauber dynamics is contractive with respective to some weighted Hamming distance with the weight vector given by the principle eigenvector of ; the lower bound on the spectral gap then follows from [LP17, Theorem 13.1] or [Che98].
Proof of Lemma 3.6.
By definition we see that is -spectrally independent, where for each we have
by considering the norm (absolute row sum) of the influence matrices. Hence, we deduce from Theorem 6.3 that the spectral gap of the Glauber dynamics is lower bounded by
Furthermore, by Lemma 6.4 the standard log-Sobolev constant is lower bounded by
Finally, the lemma follows from an application of Lemma 6.5. ∎
6.3 Proof of Lemma 4.10
Proof of Lemma 4.10.
Recall, the marginal distribution satisfies -factorization (i.e., approximate tensorization) of entropy with constant if
(44) |
We claim that Eq. 44 is equivalent to
(45) |
where the underlying distribution is . To see this, observe that every function is in one-to-one correspondence to a function depending only on by the relationship . By definitions, we have , for all , and for all , implying Eqs. 44 and 45 are equivalent.
Therefore, it suffices to show that Eq. 45 is equivalent to that satisfies -factorization of entropy with constant , i.e.,
(46) |
It is trivial that Eq. 46 implies Eq. 45. For the other direction, suppose Eq. 45 is true. Since is a function depending only on and , we have from Eq. 45 that
Then, we deduce from Lemma 2.2 that
where the last inequality is due to (this can be seen by considering functions depending only on in Eq. 45). ∎
References
- [AJK+22] Nima Anari, Vishesh Jain, Frederic Koehler, Huy Tuan Pham, and Thuy-Duong Vuong. Entropic independence: optimal mixing of down-up random walks. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing (STOC), pages 1418–1430, 2022.
- [AL20] Vedat Levi Alev and Lap Chi Lau. Improved analysis of higher order random walks and applications. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing (STOC), pages 1198–1211, 2020.
- [ALO20] Nima Anari, Kuikui Liu, and Shayan Oveis Gharan. Spectral independence in high-dimensional expanders and applications to the hardcore model. In Proceedings of the 61st Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 1319–1330, 2020.
- [BCŠV22] Antonio Blanca, Zongchen Chen, Daniel Štefankovič, and Eric Vigoda. Complexity of high-dimensional identity testing with coordinate conditional sampling. arXiv preprint arXiv:2207.09102, 2022.
- [BGGŠ22] Ivona Bezáková, Andreas Galanis, Leslie Ann Goldberg, and Daniel Štefankovič. Fast sampling via spectral independence beyond bounded-degree graphs. In Proceedings of the 49th International Colloquium on Automata, Languages, and Programming (ICALP), volume 229, 2022.
- [BKMP05] Noam Berger, Claire Kenyon, Elchanan Mossel, and Yuval Peres. Glauber dynamics on trees and hyperbolic graphs. Probability Theory and Related Fields, 131(3):311–340, 2005.
- [Bod98] Hans L Bodlaender. A partial -arboretum of graphs with bounded treewidth. Theoretical computer science, 209(1-2):1–45, 1998.
- [CE22] Yuansi Chen and Ronen Eldan. Localization schemes: A framework for proving mixing bounds for Markov chains. In Proceedings of the 63rd Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 110–122. IEEE, 2022.
- [Ces01] Filippo Cesi. Quasi-factorization of the entropy and logarithmic Sobolev inequalities for Gibbs random fields. Probability Theory and Related Fields, 120(4):569–584, 2001.
- [CFYZ21] Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. Rapid mixing of Glauber dynamics via spectral independence for all degrees. In Proceedings of the 62nd Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 137–148. IEEE, 2021.
- [CFYZ22] Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. Optimal mixing for two-state anti-ferromagnetic spin systems. In Proceedings of the 63rd Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 588–599. IEEE, 2022.
- [CGŠV21] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Rapid mixing for colorings via spectral independence. In Proceedings of the 32nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1548–1557, 2021.
- [Che98] Mu-Fa Chen. Trilogy of couplings and general formulas for lower bound of spectral gap. In Probability towards 2000, pages 123–136. Springer, 1998.
- [CLV20] Zongchen Chen, Kuikui Liu, and Eric Vigoda. Rapid mixing of Glauber dynamics up to uniqueness via contraction. In Proceedings of the 61st Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 1307–1318, 2020.
- [CLV21] Zongchen Chen, Kuikui Liu, and Eric Vigoda. Optimal mixing of Glauber dynamics: Entropy factorization via high-dimensional expansion. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing (STOC), pages 1537–1550, 2021.
- [CMT15] Pietro Caputo, Georg Menz, and Prasad Tetali. Approximate tensorization of entropy at high temperature. Annales de la Faculté des sciences de Toulouse: Mathématiques, 24(4):691–716, 2015.
- [CP21] Pietro Caputo and Daniel Parisi. Block factorization of the relative entropy via spatial mixing. Communications in Mathematical Physics, 388(2):793–818, 2021.
- [DH04] Erik D Demaine and MohammadTaghi Hajiaghayi. Equivalence of local treewidth and linear local treewidth and its algorithmic applications. In Proceedings of the 15th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 840–849, 2004.
- [DPPP02] Paolo Dai Pra, Anna Maria Paganoni, and Gustavo Posta. Entropy inequalities for unbounded spin systems. The Annals of Probability, 30(4):1959–1976, 2002.
- [DSC96] Persi Diaconis and Laurent Saloff-Coste. Logarithmic Sobolev inequalities for finite Markov chains. The Annals of Applied Probability, 6(3):695–750, 1996.
- [DSVW04] Martin Dyer, Alistair Sinclair, Eric Vigoda, and Dror Weitz. Mixing in time and space for lattice spin systems: A combinatorial view. Random Structures & Algorithms, 24(4):461–479, 2004.
- [EF21] David Eppstein and Daniel Frishberg. Rapid mixing of the hardcore Glauber dynamics and other Markov chains in bounded-treewidth graphs. arXiv preprint arXiv:2111.03898, 2021.
- [Epp99] David Eppstein. Subgraph isomorphism in planar graphs and related problems. Journal of Graph Algorithms and Applications, 3(3):1–27, 1999.
- [FGYZ21] Weiming Feng, Heng Guo, Yitong Yin, and Chihao Zhang. Rapid mixing from spectral independence beyond the Boolean domain. In Proceedings of the 32nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1558–1577. SIAM, 2021.
- [GJK10] Leslie Ann Goldberg, Mark Jerrum, and Marek Karpinski. The mixing time of Glauber dynamics for coloring regular trees. Random Structures & Algorithms, 36(4):464–476, 2010.
- [GKM15] David Gamarnik, Dmitriy Katz, and Sidhant Misra. Strong spatial mixing of list coloring of graphs. Random Structures & Algorithms, 46(4):599–613, 2015.
- [GMP05] Leslie A. Goldberg, Russell Martin, and Mike Paterson. Strong spatial mixing with fewer colors for lattice graphs. SIAM Journal on Computing, 35(2):486–517, 2005.
- [Gru12] Hermann Gruber. On balanced separators, treewidth, and cycle rank. Journal of Combinatorics, 3(4):669–681, 2012.
- [GŠV16] Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Inapproximability of the partition function for the antiferromagnetic Ising and hard-core models. Combinatorics, Probability and Computing, 25(4):500–559, 2016.
- [GZ03] Alice Guionnet and Bogusław Zegarliński. Lectures on Logarithmic Sobolev Inequalities, pages 1–134. Springer, Berlin, 2003.
- [Hei20] Marc Heinrich. Glauber dynamics for colourings of chordal graphs and graphs of bounded treewidth. arXiv preprint arXiv:2010.16158, 2020.
- [HW17] Daniel J Harvey and David R Wood. Parameters tied to treewidth. Journal of Graph Theory, 84(4):364–385, 2017.
- [Jer03] Mark Jerrum. Counting, Sampling and Integrating: Algorithms and Complexity. Lectures in Mathematics, ETH Zürich. Birkhäuser Basel, 2003.
- [JS89] Mark Jerrum and Alistair Sinclair. Approximating the permanent. SIAM Journal on Computing, 18(6):1149–1178, 1989.
- [JS93] Mark Jerrum and Alistair Sinclair. Polynomial-time approximation algorithms for the Ising model. SIAM Journal on Computing, 22(5):1087–1116, 1993.
- [JSTV04] Mark Jerrum, Jung-Bae Son, Prasad Tetali, and Eric Vigoda. Elementary bounds on Poincaré and log-Sobolev constants for decomposable Markov chains. The Annals of Applied Probability, 14(4):1741–1765, 2004.
- [JSV04] Mark Jerrum, Alistair Sinclair, and Eric Vigoda. A polynomial-time approximation algorithm for the permanent of a matrix with nonnegative entries. Journal of the ACM (JACM), 51(4):671–697, 2004.
- [KHR22] Frederic Koehler, Alexander Heckett, and Andrej Risteski. Statistical efficiency of score matching: The view from isoperimetry. In Proceedings of the 11th International Conference on Learning Representations (ICLR), 2022.
- [LM11] Brendan Lucier and Michael Molloy. The Glauber dynamics for colorings of bounded degree trees. SIAM Journal on Discrete Mathematics, 25(2):827–853, 2011.
- [LMP09] Brendan Lucier, Michael Molloy, and Yuval Peres. The Glauber dynamics for colourings of bounded degree trees. In International Workshop on Approximation Algorithms for Combinatorial Optimization, pages 631–645. Springer, 2009.
- [LP17] David A Levin and Yuval Peres. Markov chains and mixing times, volume 107. American Mathematical Soc., 2017.
- [LS93] Nathan Linial and Michael Saks. Low diameter graph decompositions. Combinatorica, 13(4):441–454, 1993.
- [LT79] Richard J Lipton and Robert Endre Tarjan. A separator theorem for planar graphs. SIAM Journal on Applied Mathematics, 36(2):177–189, 1979.
- [Mar99] Fabio Martinelli. Lectures on Glauber dynamics for discrete spin models. In Lectures on probability theory and statistics, pages 93–191. Springer, 1999.
- [Mar19] Katalin Marton. Logarithmic sobolev inequalities in discrete product spaces. Combinatorics, Probability and Computing, 28(6):919–935, 2019.
- [MSW04] Fabio Martinelli, Alistair Sinclair, and Dror Weitz. Glauber dynamics on trees: boundary conditions and mixing time. Communications in Mathematical Physics, 250(2):301–334, 2004.
- [MWW09] Elchanan Mossel, Dror Weitz, and Nicholas Wormald. On the hardness of sampling independent sets beyond the tree threshold. Probability Theory and Related Fields, 143(3):401–439, 2009.
- [Ree03] Bruce A Reed. Algorithmic aspects of tree width. Recent advances in algorithms and combinatorics, pages 85–107, 2003.
- [RS86] Neil Robertson and Paul D. Seymour. Graph minors. II. Algorithmic aspects of tree-width. Journal of algorithms, 7(3):309–322, 1986.
- [Sly10] Allan Sly. Computational transition at the uniqueness threshold. In Proceedings of the 51st Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 287–296, 2010.
- [SS14] Allan Sly and Nike Sun. The computational hardness of counting in two-spin models on -regular graphs. The Annals of Probability, 42(6):2383–2416, 2014.
- [SZ17] Allan Sly and Yumeng Zhang. The Glauber dynamics of colorings on trees is rapidly mixing throughout the nonreconstruction regime. The Annals of Applied Probability, pages 2646–2674, 2017.
- [TVVY12] Prasad Tetali, Juan C Vera, Eric Vigoda, and Linji Yang. Phase transition for the mixing time of the Glauber dynamics for coloring regular trees. The Annals of Applied Probability, pages 2210–2239, 2012.
- [Wei06] Dror Weitz. Counting independent sets up to the tree threshold. In Proceedings of the 38th Annual ACM Symposium on Theory of Computing (STOC), pages 140–149, 2006.
- [WTZL18] Pengfei Wan, Jianhua Tu, Shenggui Zhang, and Binlong Li. Computing the numbers of independent sets and matchings of all sizes for graphs with bounded treewidth. Applied Mathematics and Computation, 332:42–47, 2018.
- [YZ13] Yitong Yin and Chihao Zhang. Approximate counting via correlation decay on planar graphs. In Proceedings of the 24th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 47–66. SIAM, 2013.