Filling random cycles
Abstract.
We compute the asymptotic behavior of the average-case filling volume for certain models of random Lipschitz cycles in the unit cube and sphere. For example, we estimate the minimal area of a Seifert surface for a model of random knots first studied by Millett. This is a generalization of the classical Ajtai–Komlós–Tusnády optimal matching theorem from combinatorial probability. The author hopes for applications to the topology of random links, random maps between spheres, and other models of random geometric objects.
1. Introduction
1.1. Main results
This paper introduces a new kind of average-case isoperimetric inequality. Given a -cycle on , in any of a number of geometric measure theory senses, its filling volume is the minimal mass of a chain whose boundary is . The well-known Federer–Fleming isoperimetric inequality [FF] states that for all -cycles ,
However, one might expect that most cycles of given mass are much easier to fill.
Unfortunately, as explained to the author by Robert Young, a geometrically meaningful probability measure on the space of all cycles of mass may be too much to ask for. The issue is one of picking a scale: say we are trying to build a random 1-Lipschitz curve in a finite-dimensional space. If the curve is to fluctuate randomly at scale , then over time it will only travel a distance on the order of . Thus there is no way of ensuring random behavior in a scale-free way. This idea of decomposing a finite-mass cycle into pieces at different scales can be made precise using the notion of a corona decomposition, as in [Jones] (in dimension 1) and [Young] (in higher dimensions).
On the other hand, there are a number of ways of, and a number of motivations for, building random “space-filling” cycles of mass which look essentially trivial on balls of radius . Our main theorems characterize three models of this form which exhibit similar isoperimetric behavior, and we hypothesize that this behavior should be generic for models which are random at many scales, including the largest—an idea which may have a precise Fourier-analytic formulation.
This isoperimetric behavior is described in codimension by the function
Theorem A.
Let be a -cycle on obtained by sampling oriented great -spheres independently from the uniform distribution on the oriented Grassmannian . Then there are constants depending on and such that
(1.1) |
Moreover, is concentrated around its mean: there are constants depending on and such that
(1.2) |
From (1.2) we see that the spread of the distribution around the mean is at most on the order of ; in codimensions , this is small compared to the mean:
If a model of random codimension- cycles of mass satisfies (1.1) and (1.2), we say it exhibits AKT statistics, in honor of Ajtai, Komlós, and Tusnády, who discovered this phenomenon in the case of zero-cycles.
By rescaling the picture, we can make this result more interpretable. Let . Then the corresponding process in the -sphere of radius generates a cycle of mass which is evenly spread throughout the sphere, so that a 1-ball intersects one of the great -spheres in on average. With that rescaling, the mass of an optimal filling becomes
Informally speaking, to meet its match, the average point in has to travel a distance (in codimension 1), (in codimension 2), or (otherwise) times the distance to its closest neighbor.
We also prove similar results for the cube:
Theorem B.
Let be a relative -cycle on obtained by sampling planes independently from the uniform distribution on the space of oriented -planes which intersect nontrivially. Then exhibits AKT statistics.
There may be reasonable disagreement as to which distribution is the uniform one in this context; to prove (1.1) it suffices to require that it be uniform on each subset of (isometric to two copies of a polytope) consisting of parallel planes, but to prove (1.2) we also need to assume that it behaves reasonably with respect to the manifold structure on (for example, is a positive density or has finite support).
In fact, the only thing used here about is that almost all of its “slices” along coordinate -planes consist of independent uniformly distributed points. This means that there are a number of other possible models that can be fit into this framework. However, the following requires a separate proof:
Theorem C.
Let be a sequence of -dimensional oriented pseudomanifolds with vertices and at most simplices incident to any given simplex. Let be a -cycle on obtained by sending each vertex of to a uniformly random point in and extending linearly. Then exhibits AKT statistics.
In the context of this theorem, the constants in (1.1) depend on , , and , but not on the shapes of the pseudomanifolds (which can therefore also be randomized). The case , describes the “random jump” model of random knots and links introduced by Millett [Mil]. Moreover, by a theorem of Hardt and Simon [HS], the optimal filling of such a knot or link (after a slight rounding of corners) is a embedded surface. In particular:
Corollary 1.3.
For some , the minimal Seifert surface of a knot produced using random jumps has area between and with high probability.
1.2. Motivation
The methods we use to prove Theorems A, B, and C can be easily extended to other i.i.d. samples of simple shapes on various spaces. However, the investigation is mainly motivated by the desire to analyze topological invariants of random geometric objects such as links and maps. Models of such objects tend to produce random cycles which are similarly trivial at small scales, but are more difficult to sample because they cannot be easily written in terms of i.i.d. parameters.
Random knots and links
There have been a number of proposed models of random knots and links; see [E-Z] for a detailed survey. Several of these models are “spatial” in the sense that they produce random knotted curves in space, and one supposes that these may exhibit AKT statistics for filling area. As mentioned above, we show this for Millett’s random jump model, but it may also be true for random polygonal walks with shorter segments as well as random grid walks, perhaps with some restrictions on segment length.
Given two random curves in a certain model, one may want to understand the distribution of their linking number. Since this will usually be zero on average, the first interesting question is about the second moment. Linking number can be computed as the intersection number of one curve with a filling of the other, thus one may expect that two random curves of length which exhibit AKT statistics have expected squared linking number or .
However, this is not the case for the Millett model: the second moment of the linking number between two random jump curves of length is [ABD+, FK]. Similarly, one may take the setup of Theorem A for and as a model of a random link and try to understand the total linking number, that is, the sum of the signed linking numbers of all pairs of circles. This is then the intersection number of the chain with its own filling. Here it is easy to see (as pointed out by Matthew Kahle) that the second moment of the distribution is once again .
In both cases, this seeming incongruity perhaps boils down once again to the issue of multiple scales: random jump curves and great circles only “see” the largest scales, but the lower bound on filling volume in codimension 2 comes from looking on many different scales at once. One may perhaps get a different answer most easily by analyzing the linking number of an asymmetric model: a random jump curve and a random walk of total length made of smaller segments.
Random maps
Another way of producing a random (framed) link is as the preimage of a generic point under a random map . In fact, the self-linking number of this link is the Hopf invariant of the map, which is itself a natural subject for investigation since it is a complete topological invariant of such maps.
One natural model of -Lipschitz random maps is a uniformly random simplicial map from a triangulation of at scale to a tetrahedron. The maximal self-linking number of such a map is , cf. [Gro]; on the other hand, the heuristics above would suggest that the second moment of the linking number of the random model is between and .
These ideas may have applications in topological and geometric data analysis, see [FKW].
1.3. Methods
The cases of Theorems A and B are, up to minor adjustments, a classical theorem in combinatorial probability:
Theorem 1.4 (Ajtai, Komlós, and Tusnády [AKT]).
Let and be two sets of independent, uniformly distributed random points in , and let be the transportation cost between and , that is, the total length of an optimal matching. Then there are constants such that with high probability,
Since the original geometric proof in [AKT] of the most subtle case , this and related results have been proved many other times, often by applying Fourier analysis; see [BL2] for further references and [Tal3] for a detailed treatment of certain analytic approaches. Another beautiful geometric proof of the upper bound on the sphere is due to Holden, Peres, and Zhai [HPZ].
The proofs of Theorems A and B in general are obtained by applying the results to -dimensional slices of the cube and sphere. This is the reason that the results depend only on the codimension, and for the critical nature of codimension 2. The lower bound in (1.1) is obtained directly by integrating the lower bounds on these slices. The upper bound is obtained via a dual result on differential forms; this kind of technique was already used in [AKT] for the proof of the lower bound for the square. Finally, (1.2) is proved using the notion of concentration of measure due originally to Gromov and Milman [GM]; see [Led] for an extensive modern treatment.
Theorem C is proved similarly, except that slices no longer consist of i.i.d. points. Even this small amount of dependence complicates the argument considerably. We use ad hoc combinatorial arguments to overcome this, but one might hope to generalize, for example by applying a variant of Stein’s method, to a version of Theorem 1.4 in the presence of dependence (one approach, which only gives upper bounds, is discussed in [BL2, §5]).
Structure of the paper
Section 2 introduces necessary ideas and results from geometric measure theory, and Section 3 discusses the classical AKT theorem. In Sections 4 and 5, the upper and lower bounds in Theorems A and B are proved using tools that may generalize to other models of random cycles. In Section 6, we discuss the extra ideas needed to prove Theorem C. Finally, Section 7 discusses the concentration of the distributions in these theorems around their mean.
Acknowledgements
I would like to thank Matthew Kahle and Robert Young for a large number of helpful discussions over a span of three years. Yevgeny Liokumovich provided a crucial reference; Shmuel Weinberger asked a question which inspired Theorem C and gave other helpful comments. Finally, I would like to thank both Robert Young and the anonymous referee for calling out a great deal of sloppy and lazy writing in the initial draft. I was partially supported by NSF individual grant DMS-2001042.
2. Definitions and preliminaries
2.1. Cycles and currents
There are a number of useful ways to define chains and cycles from the point of view of topology and geometric measure theory. Algebraic topology typically uses singular -chains: formal linear combinations of continuous maps from the -simplex to a topological space (“singular simplices”). We will usually restrict our attention to Lipschitz simplices (that is, requiring the maps to be Lipschitz) on a Riemannian manifold . By Rademacher’s theorem, a Lipschitz simplex is differentiable almost everywhere and so has a well-defined volume or mass,
We can then extend by linearity to define the mass of a Lipschitz chain.
A more general notion of chain is that of a normal current. A -dimensional current on a manifold is simply a functional on (smooth) differential forms, which we think of as integration over the current. For example:
-
•
Every Lipschitz chain defines a current via .
-
•
Every compactly supported -form defines a current via .
We will write the value of on either as or as , since currents should be thought of as generalized domains of integration. The boundary operator is defined via Stokes’ theorem: for a current ,
The mass of a -current on , which agrees with the same notion on Lipschitz chains, is defined to be
Here is the supremum of the value of over all frames of unit vectors. For a general current, the mass of course need not be finite. A current is normal if and both have finite mass; in particular any cycle (current with empty boundary) of finite mass is normal.
2.2. Fillings and duality
Now, if is a normal current such that , we call it a filling of . The filling volume of is
which is always finite by the work of Federer and Fleming. The following is an instance of the Hahn–Banach theorem:
Proposition 2.1.
Let be a manifold. Then a normal -current in with has a filling of mass if and only if for every with , .
More generally, for any closed set , write for the vector space of forms whose restriction to is zero. Let be a normal -current with supported on , that is, such that for any -form . Then has a filling relative to (that is, a -current such that is supported on ) of mass if and only if for every with , .
In other words, the filling volume of a cycle can be redefined as
in both the absolute and the relative case. Our proofs of the upper bounds in Theorems A and B will be based on this proposition rather than constructing fillings directly.
Of course, knowing that a nice Lipschitz cycle has a filling which is a normal current is not very satisfying—after all, normal currents can still be very strange. Luckily, given a normal current filling, we can upgrade it to a Lipschitz chain (at the cost of multiplying the mass by a constant) using the following classical theorem:
Theorem 2.2 (Federer–Fleming deformation theorem [FF, Thm. 5.5]).
There is a constant such that the following holds. Let be a normal current in . Then for every we can write , where
-
(1)
.
-
(2)
.
-
(3)
.
-
(4)
is a polyhedral cycle which can be expressed as an -linear combination of -cells in the cubical unit lattice in .
-
(5)
If is a Lipschitz chain, then so are and .
-
(6)
If is a Lipschitz chain, then so is .
If is a normal current filling a Lipschitz chain , then is a Lipschitz chain filling whose mass is only greater by a multiplicative constant .
It is not hard to upgrade the deformation theorem to manifolds, although the resulting constants will depend on the manifold and its metric; see for example [EPC+, Theorem 10.3.3].
2.3. Slicing
An important property of normal currents, introduced in [FF, §3], is the ability to take “slices” by hyperplanes to produce currents in lower dimensions. We follow the exposition of F. Morgan [Mor, 4.11], who follows Federer [Fed, §4.2.1].
Let be a Lipschitz function on a manifold . Given a -current on and a differential -form , define the -current by
In particular, this makes sense when is a measurable function, for example the characteristic function of a set . In that case we can write for the restriction of to .
Given a Lipschitz function , the slice of at is defined by
(2.3) |
If is a normal current, then is a normal current for almost all . In addition, we have the following standard properties:
-
(1)
If is defined by integration over a (rectifiable) set , then is defined by integration over .
-
(2)
.
-
(3)
A quick calculation from the definitions yields an additional property:
Proposition 2.4.
If is a -form, then
Proof.
Start with the equalities
Subtract one from the other; then applying Stokes’ theorem and the Leibniz rule on the left and (2.3) on the right, we get the desired identity. ∎
In particular, by inductively slicing in different directions, we get the following:
Proposition 2.5.
Let , and let be an -dimensional current on . Given , let be the plane . Then there are -dimensional currents , normal for almost all , such that
In addition, given an -index and a function ,
3. A variation on the Ajtai–Komlós–Tusnády theorem
The results of this paper are a generalization of Theorem 1.4. Properly, the theorem of Ajtai, Komlós, and Tusnády [AKT] is in the case ; their paper also asserts the case , which is easy and later proved and extended in several directions by Talagrand [Tal1, Tal2]. The case is elementary, and the proof along with a vast array of strengthenings and generalizations can be found in [BL1] by Bobkov and Ledoux. Here we need a slight variation.
Theorem 3.1.
Generate a cycle of mass in by selecting independent, uniformly distributed points in . Then there are constants such that
Remark 3.2.
Suppose that is a Riemannian ball diffeomorphic to and has a volume form. Then by the main theorem of [BMPR] (extending results of Moser [Moser] and Banyaga [Bany]), there is a diffeomorphism between the two which multiplies the volume form by a constant. Therefore Theorem 3.1 also holds with respect to Lebesgue measure on , with constants depending on the ratio of the volumes and the bilipschitz constant of this diffeomorphism.
Moreover, given a smooth family of Riemannian balls, [BMPR] indicates that there is a smooth family of such diffeomorphisms. Therefore, if the family is compact, one can find uniform constants for the whole family.
Proof.
There are two differences here from the results as they are typically presented in the probability literature, where the problem consists of matching two sets of random points of the same cardinality: first, the number of positive and negative points may not match; second, we are allowed to match points to the boundary as well as to points of the opposite orientation.222In fact, a somewhat similar, but more complicated modification was studied by Shor [Shor]. We briefly explain how to modify the original proofs to deal with this.
Clearly, the possibility of matching to the boundary cannot make the upper bounds worse. Let’s say without loss of generality there are more positive points. To obtain the upper bound for , we may simply ignore some arbitrary set of “extra” positive points, matching all the others first. By the central limit theorem, the expected number of extra points is , so the extra mass generated by matching them all to the boundary of the cube does not change the asymptotic answer.
For the lower bound in the case , we use the same stratagem of ignoring the “extra” points to create a new cycle with an equal number of positive and negative points. From the original proof in [AKT], we know that there is a 1-Lipschitz function which is zero on and such that with high probability. Since with high probability the number of extra points is , and the values of lie between and , we also know that with high probability.
The lower bound in the case is easy to see: conditional on any distribution of the positive points, most negative points will be at distance from every positive point and the boundary, where is a constant depending on .
In the case , the filling is unique up to a constant: the unique filling supported away from zero has density at . We use arguments found in [BL1, §3] to give estimates on .
The upper bound is a simple calculation:
The lower bound comes from the following classical fact, found in [BL1] as Lemma 3.4:
Lemma 3.3.
Given independent mean zero random variables ,
Let be the th chosen point. Then applying the lemma to , we get
and therefore . ∎
4. Proof of the upper bound
To prove the upper bound in Theorems A and B, we will use Stokes’ theorem; that is, we use the fact that for a cycle ,
(4.1) |
In fact, since is a cycle, only depends on . To bound this quantity, we first note that any can be decomposed into a sum of “basic” forms of the form
where is a function , for each subset .
Lemma 4.2.
For any exact form (resp., ), there is a form (resp., ) given by
such that , and for each , .
Proof.
We prove this by induction on and , keeping constant. In the base case , we can take the function to be the fiberwise integral along one of the coordinates.
To do the inductive step in the relative case, we follow the usual proof of the Poincaré lemma with compact support, following [BottTu, §1.4]. Fix a smooth bump function which is 0 near 0 and 1 near 1. By applying the lemma one dimension lower, we get a form with and . Then for
where is the projection to the -cube along . Notice that
This gives us a bound on each in terms of the and as well as the derivatives of .
For the non-relative version, we follow the same proof, mutatis mutandis, taking
Here and in the next section, by a random -cycle we mean a random variable taking values in -cycles, that is, a measure on the space of -cycles. We follow the convention, common in probability theory, of blurring the distinction between a measure on a set of objects and an object randomly drawn from that measure.
Theorem 4.3.
Let be a random -cycle in or in which satisfies the condition that for some ,
(4.4) |
for almost all -planes parallel to one of the coordinate -planes. Then
(4.5) |
where is the constant from Lemma 4.2.
Proof.
For a -form , depends only on . Therefore to estimate (4.1) it is enough to show that for any -form with , there is a -form such that and .
By Lemma 4.2, we can choose
such that for every . Then for ranging over all these choices of antidifferentials,
By linearity of expectation,
Proof of Theorem B, upper bound..
Let be a cycle in obtained by sampling i.i.d. planes from a distribution on the set of oriented -planes which intersect nontrivially with , such that the distribution is uniform (with respect to Lebesgue measure on the corresponding polytope in ) on each set of parallel planes.
Proof of Theorem A, upper bound..
We use the fact that the transverse intersection of an oriented great -sphere with an oriented great -sphere is a pair of antipodal points with opposite orientations. Therefore, if is a cycle obtained by sampling oriented great -spheres independently from the uniform distribution, then for any great -sphere , with probability 1
where the are i.i.d. uniform points on .
Consider as a subset of , with standard unit basis vectors . Let be the preimage of the cube under central projection (that is, projection along lines through the origin) to the plane . If is large enough, the interiors of the cover . Each is disjoint from its antipodal set ; therefore for any great -sphere , with probability 1 consists of i.i.d. uniform points. By Remark 3.2,
where is considered as a cycle in .
Note that central projection sends great spheres to hyperplanes. Therefore, by Theorem 4.3, for each and sign,
(4.6) |
Set a partition of unity subordinate to which is invariant with respect to the involution, that is, such that . To prove the theorem, it is enough, given a -form with , to show that for some (and therefore every) with ,
But note that
Therefore it suffices to find an antidifferential and a bound separately for each . Therefore (4.6) suffices to prove the theorem. ∎
5. Proof of the lower bound
Theorem 5.1.
Let be a random Lipschitz -cycle in such that for almost every -plane
the slice satisfies
where is an function. Then
Proof.
Let be a normal current filling such that , for any . Then for almost all , there is a slice which fills , and
By Proposition 2.5,
Since this is true for every , and by linearity of expectation,
Proof of Theorem B, lower bound..
Let be a cycle in obtained by sampling i.i.d. planes from a distribution on the set of oriented -planes which intersect nontrivially with , such that the distribution is uniform (with respect to Lebesgue measure on the corresponding polytope in ) on each set of parallel planes. Assume, perhaps by permuting coordinates, that this distribution is not concentrated on planes of the form
As in the proof of the upper bound, it follows that for every , consists of i.i.d. positive and negative points with probability 1. Moreover, the probability of a random plane intersecting inside depends only on the direction of and not on . Thus
where depends on the distribution but not on .Thus by Theorems 3.1 and 5.1,
Proof of Theorem A, lower bound..
Let be a cycle in obtained by sampling independent uniformly distributed great -spheres. It suffices to show that for some compact submanifold ,
where is considered as a cycle in .
Recall that for any great -sphere , with probability 1
where the are i.i.d. uniform points on . Let be a great -sphere and the -sphere consisting of points farthest from . We use to indicate the -neighborhood of the set ; then deformation retracts to along the orthogonal retraction . We let where is some closed ball in which does not include any point and its antipode.
Notice that is foliated by equal-volume patches of great -spheres which retract to points , and also does not include any point and its antipode. By Remark 3.2, there is a bilipschitz diffeomorphism from to which sends each to a plane of the form
in a volume-preserving way (up to a constant). Therefore, for each ,
and applying Theorem 5.1, we obtain the result. ∎
6. Proof of Theorem C
Before we prove Theorem C, we must give a more precise statement.
By an oriented -pseudomanifold we mean a -dimensional simplicial complex with the following properties:
-
•
It is pure, i.e. every simplex is contained in an -dimensional simplex.
-
•
Every -simplex comes with an orientation such that the sum of all the oriented -simplices is a cycle in .
Note that this is considerably wider than the usual definition: it is just enough so that if is equipped with the standard simplexwise metric, any Lipschitz map from to a metric space defines a Lipschitz -cycle in .
We say has geometry bounded by if every -simplex intersects at most others.
With these definitions, we restate Theorem C:
Theorem.
Let be an oriented -pseudomanifold with vertices and geometry bounded by . Let be a -cycle on obtained by sending each vertex of to a uniformly random point in and extending linearly. Then there are constants depending on and such that
(6.1) |
The concentration result will be proved in the next section.
As with Theorem B, the proof is a direct application of Theorems 4.3 and 5.1. To apply these theorems, we need to understand the filling volumes of slices of , which is more complicated in this case because while the points are identically distributed, they are not entirely independent. We establish the upper bound in Lemma 6.3; this depends only on the fact that every point is independent of all but a constant number of others. For the lower bound in Theorem 6.13, the argument is more subtle: even if every point is correlated with only one other, such pairs could be very close and have opposite signs; then the least filling would be much smaller than for independent points. Accordingly, we have to show that most correlated points are still far apart. Together, these two bounds complete the proof.
In this section as before, fix the notation
We start by analyzing the slice .
Lemma 6.2.
Let . Then the slice is the sum of random -chains which are identically distributed on according to a distribution depending on and . Moreover:
-
(i)
is invariant with respect to the involution sending a chain to ;
-
(ii)
on .
-
(iii)
Each is independent of all but at most other .
Since the distribution of is invariant under permuting coordinates, this holds for any -dimensional slice in a coordinate direction.
Proof.
The distribution in question is the intersection of a random linear -simplex in with . Property (i) is obvious from this, and (iii) follows since a pair of are independent whenever the two corresponding simplices do not intersect. To see (ii), consider the function
sending each linear -simplex to its intersection with . This function is measure-preserving by definition, and its restriction to
is 1-Lipschitz. Therefore, by the coarea formula, the density function of is given by the -dimensional Hausdorff measure of point preimages. Thus it is enough to bound for each .
Let be the set of linear -simplices with vertices in which pass through , and let
(Here each vertex is translated by .) Notice that
All these translates are disjoint and their union is a subset of . Therefore, again by the coarea formula,
This completes the proof of (ii). ∎
Condition (iii) gives a dependency graph of degree between the . This graph has an -coloring, giving a partition of into disjoint subsets such that for , the are i.i.d.
6.1. Upper bound
This lemma gives the upper bound to plug into Theorem 4.3:
Lemma 6.3.
For every ,
Proof.
We estimate by separately considering each summand
These summands consist of i.i.d. negative and positive points.
Write for the probability measure on given by
If is a random 0-cycle in with positive and negative points distributed according to , then the AKT upper bound holds for : by [BL2, equation (12)], for some constant independent of the measure,
(6.4) |
To reduce to this situation, we note that while is a cycle (and therefore has an equal number of negative and positive points), may not be. We produce cycles for by adding up to additional i.i.d. points distributed according to . We add each point to for two different , with opposite signs, so that
Each is a 0-cycle consisting of at most positive and negative i.i.d. points. Therefore, by (6.4),
6.2. Lower bound
For the lower bound, we begin by showing that correlated points in are usually not very close; this relationship is summarized in Lemma 6.5. We use this as an ingredient in the proof of Theorem 6.13, which retraces some of the steps in the original proof of the AKT theorem.
In many places, we refer to the function and set defined in the proof of Lemma 6.2. Also, given a subset , we write
for the respective sets of -dimensional chains.
Lemma 6.5.
Assume that . Let and let and be random variables whose values are the intersections with of two -simplices and of whose vertices, some of which are shared, are chosen uniformly at random from . Let be a cube of side length . Then
(6.6) |
Remark 6.7.
A more careful analysis based on the same principle shows that
Proof.
The idea is this: suppose that and share a -face , and let and be the non-shared vertices of and . If the intersections of and with are close to each other, then either the angle between and is small (forcing to be close to the -plane containing ), or is close to . We would like to show that neither of these happens too often.
We will do this case in detail; the analysis when and share a lower-dimensional face is similar.
First, we show that is not very often too close to . This is easy to establish globally; the tricky bit is showing that it’s true after conditioning on being supported in . Let be the distance from to the hyperplane . The following lemma will be proved later.
Lemma 6.8.
Let and let be a set of positive measure. Let be a simplex with vertices chosen uniformly at random from , and let be its th face. Then
In particular,
(6.9) |
Now we must show that when , doesn’t very often land near . The point is that the difference depends on the angle between and , and the distribution of this angle is not too concentrated anywhere; this is true for any given . Fix with , and let be the set of points such that . Note that is contained in the intersection of a -plane with and hence its -dimensional Hausdorff norm is at most some . Now we would like to use the coarea formula to integrate with respect to . For this we need the following estimate, to be proved later:
Lemma 6.10.
Given a linear -simplex such that at least one of its -faces is at distance at least from ,
Now we prove the lemmas.
Proof of Lemma 6.10.
From Figure 1, we see that for every , there is a unit vector such that , for . Therefore
Proof of Lemma 6.8.
Let be the event that , and let be the event that . Bayes’ rule states that
We will bound by giving upper bounds for and and a lower bound for .
We first show that
(6.12) |
In fact, in this inequality, may be any constraint on the first coordinates of each vertex of (note that depends only on those coordinates).
We start by fixing the first coordinates of each vertex. Given a simplex with vertices in , let be the projection onto the first coordinates, and fix a simplex with vertices in . Let contain the last coordinates of simplices in whose intersection with is . Write to mean the set of translates of simplices in by points in . Notice that
-
(1)
,
-
(2)
the volume of is proportional to , and
-
(3)
contains .
Therefore, the probability that a given point of is contained in is at most , and this in turn bounds
Integrating over possible values of gives (6.12).
Now we show that . Choosing points in uniformly at random induces a probability measure on the set of -planes, whose density at a plane is proportional to . This density is bounded above by some , and so the set of planes whose distance from is at most has measure .
Finally we must give a lower bound for , that is, the volume of
For this, we note that when and have all coordinates in ,
This is a translate of a set which is independent of and . Taking all the translates with respect to vectors in , we see that contains a set whose volume depends linearly on and is easily seen to be positive.
Thus overall we get . ∎
Finally we have the tools we need to prove the lower bound for Theorem C.
Theorem 6.13.
For every ,
Proof.
Write . We are trying to find a lower bound for the expected filling length of a certain distribution on -cycles in which is defined as a sum of a large number of identically distributed points. This is very similar to the original AKT theorem, in which the points are, in addition, independent and uniformly distributed. In [AKT], the lower bound is proved using the dual definition of filling volume given in (4.1), which we restate here for a -cycle in :
For the case , Ajtai, Komlós and Tusnády construct a -Lipschitz function whose integral over is usually at least . The filling volume is then bounded below by the ratio of the integral to the Lipschitz constant.
When , we construct the same way as in [AKT], but on a modified point set in order to overcome two issues:
-
(1)
The points of are not uniformly distributed. We use a bilipschitz rescaling of the domain to make sure that they are.
-
(2)
The points of are not independent. We instead construct the function to have a large integral over a large independent subset, and then use Lemma 6.5 to show that the integral over the remaining points is small.
For and , we use the same reduction steps, but the function is somewhat simpler. We start by more precisely describing the common argument and then give the detailed construction of for each case.
Let be the chain-valued random variables corresponding to intersections of -simplices of with . Recall that or , with identically distributed according to a density which is bounded below on . Moreover, this bound is uniform with respect to . Thus, following Remark 3.2, there is a uniformly bilipschitz family of diffeomorphisms
which send this density to a constant times the standard volume form. In particular, Lemma 6.5 still holds for the after applying the diffeomorphism. We now write for .
In each case, consider an -coloring of the dependency graph between the , with the colored subsets ordered from largest to smallest. Note that each of the is correlated with for at most values of .
We write . In each case, we show that is hard to fill by constructing a Lipschitz function such that
Then we show that for each is small enough that it does not affect the overall asymptotics.
In each case, the function is constructed as a sum of simpler functions. Given a cube , let be the function supported on whose graph is a symmetric pyramid with base and height 1. We also write for the cube in the lattice of side length which contains . The function will consist of a sum of scaled copies of , each reflecting the “imbalance” of positive and negative points on the cube . The main difference between different dimensions is the scale of these cubes: for , the cubes are at the smallest scale (comparable to the average distance between neighboring points), for they are at the largest scale (comparable to 1), and for we use cubes at many scales, as in the original proof of [AKT].
In each case, we construct by means of an auxiliary function
(in the case , each is associated to many summands at different scales, which we consider separately). This may not satisfy the desired upper bound on the Lipschitz constant, so we remove some of the summands where they are too concentrated to produce . Whenever is independent from , , and so for every we can write
By Lemma 6.5, this correlation is not too high, and therefore we can bound each of the summands. By the construction of , this also bounds .
If we tune everything correctly, we get that for each ,
giving a lower bound on .
Case
We split the cube into subcubes of side length , and let
where is the cube with side length whose vertex closest to the origin is . Note that is -Lipschitz.
We can think of the number of points landing in each subcube as independent Poisson processes which we stop once their sum reaches roughly
By the law of large numbers, the stopping time will be very close to
and very nearly of the subcubes will contain exactly one point. Of these, with high probability, at least will be contained in the middle third of their subcube. Therefore,
On the other hand, given , , and such that is correlated with , Lemma 6.5 tells us that
and therefore
Since this is small compared to , this shows that
Case
This case is broadly similar, but we build the function in a more complicated way, following the original proof of [AKT]. For an integer , let be the square of side length whose lower left corner is at , and write . We write
Notice that is always nonnegative: roughly speaking, it measures the square of the “imbalance” of positive and negative points in . In particular, it’s not hard to see that , and therefore .
On the other hand, the derivative of is on average, but can be much larger in some places. To remedy this, Ajtai, Komlós, and Tusnády introduced a “stopping time” rule, building as the sum of some, but not all of the . We do not need to give the exact definition, remarking only that the function satisfies
(6.14) | ||||
(6.15) |
Case
We split the interval into equal regions, with to be determined later. Write , and let
We obtain the desired function by replacing with
for some sufficiently large . Then and .
On the other hand, for and any and such that and are correlated, by Lemma 6.5, , and therefore
For some large enough , depending on and but not on ,
Thus , completing the proof. ∎
7. Concentration of measure
In this section, we show that when , the size of the filling tends to concentrate around its mean. That is, we show that (1.2) holds in the case of Theorems A, B, and C. We first prove this in the case of Theorem 3.1. The main tool is the concentration of measure in high-dimensional balls, an idea due to Gromov and Milman [GM] and of wide importance in probability theory [Led]. We follow the exposition due to Bobkov and Ledoux [BL1, §7.1] which covers the 1-dimensional case; the higher-dimensional cases are essentially the same although they do not seem to appear explicitly in the literature.
Theorem 7.1.
In particular, the standard deviation of is at most . In other words, for , converges to 1 as .
Proof.
Equip with the metric
and let . Define by
and by
This map is -Lipschitz since when every point moves by a tiny amount , the distance is in the domain and in the range.
Define the concentration function of a metric measure space of total measure 1 to be
where is the -neighborhood of the set . The key observation of Gromov and Milman [GM, Thm. 4.1] is that
where is the first nonzero eigenvalue of the Laplacian on . Since the spectrum of a product of manifolds is the sum of its spectra, is constant on powers of . The map is measure-preserving, so it follows that
Therefore, for any -Lipschitz function ,
By Chebyshev’s inequality, the same, modulo constants, holds for the mean (see also [Led, Prop. 1.10]). ∎
To adapt this proof for the case of Theorems A, B, and C, we just have to change the space : take
for Theorem A | ||||
for Theorem B | ||||
In the first two cases, the map is constructed as before. In the last case, for a -pseudomanifold with vertex set , sends to the image of the linear immersion of with vertices .
In each case, it is easy to see that if the space of -cycles is given the filling volume metric, then is -Lipschitz. Therefore, the rest of the proof is identical to that of Theorem 7.1.
References
- [ABD+] J. Arsuaga, T. Blackstone, Y. Diao, E. Karadayi, and M. Saito, Linking of uniform random polygons in confined spaces, J. Phys. A 40 (2007), no. 9, 1925–1936.
- [AKT] M. Ajtai, J. Komlós, and G. Tusnády, On optimal matchings, Combinatorica 4 (1984), no. 4, 259–264.
- [Bany] A. Banyaga, Formes-volume sur les variétés à bord, Enseign. Math. (2) 20 (1974), 127–131.
- [BL1] S. Bobkov and M. Ledoux, One-dimensional empirical measures, order statistics, and Kantorovich transport distances, Mem. Amer. Math. Soc. 261 (2019), no. 1259, v+126.
- [BL2] by same author, A simple Fourier analytic proof of the AKT optimal matching theorem, arXiv:1909.06193 [math.PR], 2019.
- [BMPR] M. Bruveris, P. W. Michor, A. Parusiński, and A. Rainer, Moser’s theorem on manifolds with corners, Proc. Amer. Math. Soc. 146 (2018), no. 11, 4889–4897.
- [BottTu] R. Bott and L. W. Tu, Differential forms in algebraic topology, Graduate Texts in Mathematics, no. 82, Springer, 1982.
- [EPC+] D. Epstein, M. Paterson, J. Cannon, D. Holt, S. Levy, and W. P. Thurston, Word processing in groups, Jones and Bartlett, 1992.
- [E-Z] Ch. Even-Zohar, Models of random knots, J. Appl. Comput. Topol. 1 (2017), no. 2, 263–296.
- [Fed] H. Federer, Geometric measure theory, Die Grundlehren der mathematischen Wissenschaften, Band 153, Springer-Verlag New York Inc., New York, 1969.
- [FF] H. Federer and W. H. Fleming, Normal and integral currents, Ann. of Math. (2) 72 (1960), 458–520.
- [FK] E. Flapan and K. Kozai, Linking number and writhe in random linear embeddings of graphs, J. Math. Chem. 54 (2016), no. 5, 1117–1133.
- [FKW] P. Franek, M. Krčál, and H. Wagner, Solving equations and optimization problems with uncertainty, J. Appl. Comput. Topol. 1 (2018), no. 3-4, 297–330.
- [GM] M. Gromov and V. D. Milman, A topological application of the isoperimetric inequality, American Journal of Mathematics 105 (1983), no. 4, 843–854.
- [Gro] M. Gromov, Homotopical effects of dilatation, J. Differential Geom. 13 (1978), no. 3, 303–310.
- [HPZ] N. Holden, Y. Peres, and A. Zhai, Gravitational allocation on the sphere, Proc. Natl. Acad. Sci. USA 115 (2018), no. 39, 9666–9671.
- [HS] R. Hardt and L. Simon, Boundary regularity and embedded solutions for the oriented Plateau problem, Ann. of Math. (2) 110 (1979), no. 3, 439–486.
- [Jones] P. W. Jones, Rectifiable sets and the traveling salesman problem, Invent. Math. 102 (1990), no. 1, 1–15.
- [Led] M. Ledoux, The concentration of measure phenomenon, Mathematical Surveys and Monographs, vol. 89, American Mathematical Society, Providence, RI, 2001.
- [Marko] J. F. Marko, Linking topology of tethered polymer rings with applications to chromosome segregation and estimation of the knotting length, Phys. Rev. E 79 (2009), 051905.
- [Mil] K. C. Millett, Monte Carlo explorations of polygonal knot spaces, Knots in Hellas ’98 (Delphi), Ser. Knots Everything, vol. 24, World Sci. Publ., River Edge, NJ, 2000, pp. 306–334.
- [Mor] F. Morgan, Geometric measure theory: A beginner’s guide, fourth ed., Elsevier/Academic Press, Amsterdam, 2009.
- [Moser] J. Moser, On the volume elements on a manifold, Trans. Amer. Math. Soc. 120 (1965), 286–294.
- [Shor] P. W. Shor, The average-case analysis of some on-line algorithms for bin packing, Combinatorica 6 (1986), no. 2, 179–200.
- [Tal1] M. Talagrand, Matching random samples in many dimensions, Ann. Appl. Probab. 2 (1992), no. 4, 846–856.
- [Tal2] by same author, The transportation cost from the uniform measure to the empirical measure in dimension , Ann. Probab. 22 (1994), no. 2, 919–959.
- [Tal3] by same author, Upper and lower bounds for stochastic processes: Modern methods and classical problems, Ergebnisse der Mathematik und ihrer Grenzgebiete. 3. Folge. A Series of Modern Surveys in Mathematics, vol. 60, Springer, Heidelberg, 2014.
- [Tanaka] F. Tanaka, Gauge Theory of Topological Entanglements. I: — General Theory —, Progress of Theoretical Physics 68 (1982), no. 1, 148–163.
- [Young] R. Young, Quantitative nonorientability of embedded cycles, Duke Math. J. 167 (2018), no. 1, 41–108.