This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

institutetext: School of Natural Sciences, Institute for Advanced Study,
1 Einstein Drive, Princeton, NJ 08540 USA

Algebras, Regions, and Observers

Edward Witten
Abstract

In ordinary quantum field theory, one can define the algebra of observables in a given region in spacetime, but in the presence of gravity, it is expected that this notion ceases to be well-defined. A substitute that appears to make sense in the presence of gravity and that also is more operationally meaningful is to consider the algebra of observables along the timelike worldline of an observer. It is known that such an algebra can be defined in quantum field theory, and the timelike tube theorem of quantum field theory suggests that such an algebra is a good substitute for what in the absence of gravity is the algebra of a region. The static patch in de Sitter space is a concrete example in which it is useful to think in these terms and to explicitly incorporate an observer in the description.

1 Introduction

In ordinary quantum mechanics, we do not usually incorporate the observer as part of the system. That is fine for many purposes, but in the presence of gravity, one has to take into account the fact that the observer gravitates. In some contexts, this is unimportant, because the observer’s gravity is negligible. For example, in an asymptotically flat spacetime, an observer more or less at rest in the asymptotic region is likely to have negligible gravitational influence on whatever is in the interior of the spacetime. On the other hand, in a closed universe, where the gravitational flux due to the observer has “nowhere to go,” one might expect that it may be essential to include the observer in the description.

We will discuss an example of this – based on the paper CLPW . But we begin with more general considerations. In ordinary quantum field theory, one can attach an algebra 𝒜𝒰{\mathcal{A}}_{\mathcal{U}} of observables to a rather general open set 𝒰{\mathcal{U}} in spacetime. In the presence of gravity, there are potentially two problems with this notion. First, when spacetime fluctuates, we may generically have difficulty saying just what we mean by the spacetime region 𝒰{\mathcal{U}}. Second, it is not clear what is the logic of discussing an algebra 𝒜𝒰{\mathcal{A}}_{\mathcal{U}} unless there is someone who can make the observations corresponding to elements of that algebra. In the absence of gravity, we do not usually worry about this question; we just assume, in effect, that some observer external to the system has the relevant capability. But in the presence of gravity, anticipating that sometimes it will be necessary to include the observer in the description, we should take seriously the idea that it is only well-motivated to discuss an algebra of observables if this is the algebra of observables accessible to someone.

But what algebra is accessible to an observer? We will discuss this question in the context of ordinary quantum field theory, without gravity, but hoping to learn some lessons that are useful when gravity is included. We make use of two classic but not so well known results about ordinary quantum field theory. The first result Borch , described in section 2, enables one to define an algebra of observables along the timelike worldline of an observer. The second result, described in section 3, is the “timelike tube theorem” Borch2 ; Araki ; Stroh ; SW , which among other things says that, in the absence of gravity, in a real analytic spacetime, the algebra of observables along the observer’s worldline is the same as (roughly) the algebra of observables in the region causally accessible to the observer. The first result is more elementary, and more general (since it does not require a hypothesis of real analyticity), but in a real analytic spacetime, it can potentially be viewed as a corollary of the second.

The two results indicate that instead of discussing the algebra of observables in a spacetime region, we could discuss the algebra generated by the quantum fields along the observer’s worldline. In a theory of gravity, the algebra of observables along the worldline would appear to be both better defined and operationally more meaningful than the algebra of a region.

In section 4, following CLPW , we consider the “static patch” in de Sitter space as an example in which it is necessary to explicitly include an observer in order to define a sensible algebra of observables. The algebra defined after taking the observer explicitly into account turns out to be a von Neumann algebra of Type II1. This gives an abstract explanation of why “empty de Sitter space” is a state of maximum entropy, and in what sense the density matrix of empty de Sitter space is maximally mixed.

Presumably, in a full theory of the world, an observer cannot be added from outside but must emerge as part of the theory. In that context, what it means to include an observer in the description is that one considers a “code subspace” of states in which the observer is present, and one defines operators that are well-defined on the code subspace, though they would not be well-defined on all states of the theory. For considerations of the present article, however, however, it is not necessary to have such a full theory in hand.

Refer to caption
Figure 1: In the absence of gravity, one can consider any chosen open set 𝒰{\mathcal{U}} in a spacetime MM, and define the algebra of observables in that region.

2 The Algebra Accessible to an Observer

In ordinary quantum field theory in a spacetime MM, we can arbitrarily specify any open set 𝒰M{\mathcal{U}}\subset M and define an algebra 𝒜𝒰{\mathcal{A}}_{\mathcal{U}} of operators in 𝒰{\mathcal{U}} (fig. 1). In the presence of gravity, since spacetime fluctuates, it does not make sense to talk about the region 𝒰{\mathcal{U}} unless we have an invariant way to identify it. For example, in an asymptotically flat spacetime, in the presence of a black hole, we could talk about the region outside the black hole horizon. That is invariantly defined and presumably makes sense at least perturbatively even when spacetime fluctuates. We could introduce an observer who is more or less at rest near infinity and describe the region outside the horizon as the region visible to this observer. However, as already noted, in an asymptotically flat universe we do not expect that it is essential to incorporate the observer in the description.

If we assume the existence of an observer, we can invariantly identify various regions in spacetime. For example, if the observer carries a clock, then as in fig. 2, we can discuss the region that is visible to the observer prior to a given time, or, alternatively, the region that is causally accessible to the observer in a stated time interval (meaning that the observer can both see and influence this region during the interval in question).

Refer to caption
Figure 2: The worldline of an observer, with (a) the region of spacetime that is visible to the observer prior to a given time, or (b) the region that the observer can both see and influence in a given time interval. Time runs vertically and the boundaries of the chosen time intervals are marked by the black dots.

But what can an observer actually measure? Here we will assume a very simple model in which the observer is described by a timelike worldline, and what the observer can measure are simply the quantum fields along this worldline.111See Unruh for a classic discussion in this framework, with a somewhat different motivation. This seems like a rather minimal model of what an observer is, and one could well worry that it is too crude. A realistic observer would presumably also carry measuring equipment, and a recording device, and would have access to operators that act on all that. But it turns out that in ordinary quantum field theory without gravity, the rather crude model in which an observer is just characterized by a worldline and the observables are the quantum fields along the worldline is sufficient, for many purposes.

This model raises two immediate questions:

(1) Can well-defined operators be defined by smearing a quantum field along a timelike worldline?

(2) Given a “yes” answer to the first question, what is the algebra generated by these operators?

Let us elaborate a bit on the first question. We are accustomed in quantum field theory to talking about “local operators” ϕ(x)\phi(x), but a local operator is not really a Hilbert space operator, since acting on a Hilbert space state it takes us out of Hilbert space. In the case of the vacuum state Ω\Omega in Minkowski space, this is clear from the fact that |ϕ(x)|Ω|2=|\phi(x)|\Omega\rangle|^{2}=\infty or equivalently

Ω|ϕ(x)ϕ(x)|Ω=,\langle\Omega|\phi(x)\phi(x)|\Omega\rangle=\infty, (1)

due to a short distance singularity. Since the leading short distance singularity is universal, it is also true that |ϕ(x)|Ψ|2=|\phi(x)|\Psi\rangle|^{2}=\infty for any state Ψ\Psi in any spacetime MM. The problem has nothing to do with the state Ψ\Psi, and purely reflects the fact the product ϕ(x)ϕ(x)\phi(x)\cdot\phi(x) is not well-defined, since in fact a more general product ϕ(y)ϕ(x)\phi(y)\cdot\phi(x) is singular for yxy\to x. This singularity is governed by the operator product expansion (OPE).

If we could measure ϕ(x)\phi(x), the answer would be one of its eigenvalues, but since ϕ(x)\phi(x) maps us out of Hilbert space, it does not have eigenvectors or eigenvalues, and we cannot measure it. What are actually measureable are suitable smeared versions of ϕ(x)\phi(x). Which ones? Suppose we are going to smear a real scalar field ϕ(x)\phi(x) over a set SS to get a smeared “operator” ϕf=Sdμf(x)ϕ(x),\phi_{f}=\int_{S}{\mathrm{d}}\mu f(x)\phi(x), where f(x)f(x) is a complex-valued smooth function with support in SS. If ϕf\phi_{f} is actually going to make sense as an operator, the smearing has to be such that the ϕ(x)ϕ(x)\phi(x^{\prime})\cdot\phi(x) OPE singularity is integrable, when smeared in this fashion.

For example, spatial smearing will only succeed for an operator of rather low dimension. In 𝖣=d+1{\sf D}=d+1 spacetime dimensions, with space coordinates x\vec{x} and a time coordinate tt, spatial smearing at, say, t=0t=0, produces an expression ϕf=t=0ddxf(x)ϕ(x,0)\phi_{f}=\int_{t=0}{\mathrm{d}}^{d}\vec{x}f(\vec{x})\phi(\vec{x},0), where f(x)f(\vec{x}) is a smooth function of the spatial coordinates x\vec{x} that we can assume to have compact support. Does ϕf\phi_{f} make sense as an operator? In computing the product ϕf¯ϕf\phi_{\overline{f}}\cdot\phi_{f}, we run into the integral

t=t=0dxdxf¯(x)f(x)ϕ(x,0)ϕ(x,0).\int_{t^{\prime}=t=0}{\mathrm{d}}\vec{x}{\mathrm{d}}\vec{x}^{\prime}\overline{f}(\vec{x}^{\prime})f(\vec{x})\phi(\vec{x}^{\prime},0)\phi(\vec{x},0). (2)

For illustrative purposes, let us consider first the case of a conformal field theory, though with minor modifications, the following remarks apply much more widely. If ϕ\phi is a conformal field of dimension Δ\Delta, then the leading singularity in the operator product ϕ(x,0)ϕ(x,0)\phi(\vec{x}^{\prime},0)\phi(\vec{x},0) for xx\vec{x}^{\prime}\to\vec{x} is proportional to |xx|2Δ|\vec{x}^{\prime}-\vec{x}|^{-2\Delta}, where Δ\Delta is the dimension of the operator ϕ\phi. The condition for this singularity to be integrable when inserted in (2) is 2Δ<d2\Delta<d. If ϕ\phi is a free scalar field, then Δ=(𝖣2)/2=(d1)/2\Delta=({\sf D}-2)/2=(d-1)/2, and the condition 2Δ<d2\Delta<d is satisfied.222In dimension 𝖣=2{\sf D}=2, a free scalar has Δ=0\Delta=0 and the condition Δ<d/2\Delta<d/2 is satisfied by any normal ordered polynomial :ϕn(x)::\phi^{n}(x):. This was important in early work on constructive field theory Jaffe . But what about, say, QCD, in the real world with d=3d=3? QCD is asymptotically free, so short distance singularities have the behavior just discussed up to logarithms, which are inessential except in the borderline case Δ=d/2\Delta=d/2. In QCD, the smallest value of Δ\Delta for any gauge-invariant operator is 3, corresponding to a quark bilinear such as q¯q\overline{q}q. So the condition 2Δ<d=32\Delta<d=3 is never satisfied and QCD is an example of a theory in which no true operator can be produced by smearing of a “local operator” in space.

Smearing in Euclidean space is only slightly better. If we try to define a smeared operator ϕf=d𝖣xf(x)ϕ(x)\phi_{f}=\int{\mathrm{d}}^{\sf D}xf(x)\phi(x) (where now the integration is over all 𝖣{\sf D} coordinates of Euclidean spacetime), we will run into the operator product singularity d𝖣xd𝖣xf¯(x)f(x)ϕ(x)ϕ(x).\int{\mathrm{d}}^{\sf D}x{\mathrm{d}}^{\sf D}x^{\prime}\overline{f}(x^{\prime})f(x)\phi(x^{\prime})\phi(x). This is integrable if and only if 2Δ<𝖣2\Delta<{\sf D}, a weaker condition but one that again cannot be satisfied in QCD, for example.

How then do we get true operators by smearing of “local operators”? The secret is smearing in real time. Though smearing in space is only effective in favorable cases, smearing in real time turns a “local operator” of any dimension into a true operator. This rather old result Borch was originally proved directly on the basis of the Wightman axioms of quantum field theory for the case that the timelike curve is a timelike geodesic in Minkowski space.

To understand the result, we begin again with the case of a conformal field theory. Suppose that at x=0\vec{x}=0, we smear a “local operator” ϕ(x,t)\phi(\vec{x},t) by a compactly supported function f(t)f(t) that depends only on tt. Thus we define ϕf(0)=dtf(t)ϕ(0,t)\phi_{f}(0)=\int{\mathrm{d}}t\,f(t)\phi(0,t). In evaluating |ϕf|Ω|2|\phi_{f}|\Omega\rangle|^{2}, we now run into the operator product dtdtf¯(t)f(t)ϕ(0,t)ϕ(0,t)\int{\mathrm{d}}t^{\prime}{\mathrm{d}}t\overline{f}(t^{\prime})f(t)\phi(0,t^{\prime})\phi(0,t). If ϕ\phi has dimension Δ\Delta, the leading OPE singularity is ϕ(t)ϕ(t)(ttiε)2Δ\phi(t^{\prime})\phi(t)\sim(t^{\prime}-t-{\mathrm{i}}\varepsilon)^{-2\Delta}, so we have to consider the integral

dtdtf¯(t)f(t)1(ttiε)2Δ.\int{\mathrm{d}}t^{\prime}\,{\mathrm{d}}t\,\overline{f}(t^{\prime})f(t)\frac{1}{(t^{\prime}-t-{\mathrm{i}}\varepsilon)^{2\Delta}}. (3)

There are also subleading OPE singularities; they have the same form with different exponents and can be treated just as we are about to describe. The integral is obviously well-defined for ε>0\varepsilon>0, and we want to show that no divergence appears in the limit ε0\varepsilon\to 0. For this, we write333If 2Δ2\Delta\in{\mathbb{Z}}, the following formula has to be slightly modified with a logarithmic factor on the right hand side. The derivation otherwise proceeds in the same way.

1(ttiε)2Δ=Cnntn(ttiε)n2Δ,\frac{1}{(t^{\prime}-t-{\mathrm{i}}\varepsilon)^{2\Delta}}=C_{n}\frac{\partial^{n}}{\partial t^{n}}(t^{\prime}-t-{\mathrm{i}}\varepsilon)^{n-2\Delta}, (4)

for any integer n>0n>0, with a constant CnC_{n}. Inserting this in the integral (3) and integrating by parts nn times, we replace the original integral with

(1)nCndtdtf¯(t)f[n](t)(ttiε)n2Δ.(-1)^{n}C_{n}\int{\mathrm{d}}t^{\prime}\,{\mathrm{d}}t\,\overline{f}(t^{\prime})f^{[n]}(t)\,(t^{\prime}-t-{\mathrm{i}}\varepsilon)^{n-2\Delta}. (5)

For large enough nn, this is manifestly convergent for ε0\varepsilon\to 0.

In a general quantum field theory, consider the operator product expansion

ϕ(0,t)ϕ(0,t)αhα(ttiε)ϕα(0,t).\phi(0,t^{\prime})\phi(0,t)\sim\sum_{\alpha}h_{\alpha}(t^{\prime}-t-{\mathrm{i}}\varepsilon)\phi_{\alpha}(0,t). (6)

The coefficient functions hα(ttiε)h_{\alpha}(t^{\prime}-t-{\mathrm{i}}\varepsilon) are holomorphic for Im(tt)<0{\mathrm{Im}}(t^{\prime}-t)<0. This follows from positivity of energy. Normally one can assume that the singularities of the hαh_{\alpha} are bounded444An example of a local operator that would not satisfy this condition is eiϕ(x)e^{{\mathrm{i}}\phi(x)}, where ϕ\phi is a scalar field in spacetime dimension 𝖣>2{\sf D}>2. by a power law, |hα(tt)|<Cα|tt|nα|h_{\alpha}(t^{\prime}-t)|<C_{\alpha}|t^{\prime}-t|^{-n_{\alpha}}, for some Cα,nαC_{\alpha},n_{\alpha}. In the context of the Wightman axioms of quantum field theory, the correlation functions are usually assumed to be tempered distributions, which implies such a bound. Alternatively, if a theory is conformally invariant or asymptotically free in the ultraviolet, and the operators considered behave as fields of definite dimension in the ultraviolet (modulo logarithms in the asymptotically free case), this again implies such a bound. Given holomorphy in the lower half tt^{\prime} plane and a power law bound,555The precise mathematical statement is that if f(z)f(z) is a function holomorphic in the lower half plane, then a necessary and sufficient condition for the boundary values of f(z)f(z) along the real axis to define a distribution is a bound |f(z)|<k|Imz|c|f(z)|<k|{\mathrm{Im}}\,z|^{-c} for some constants k,ck,c. For an elementary proof (and a bound on the distribution) see Proposition 4.2 in BF . See also Theorem 1.1 in Straube for a dd-dimensional generalization. one can imitate the previous derivation to show the finiteness of

dtdtf¯(t)f(t)ϕ(0,t)ϕ(0,t)limε0αdtdtf¯(t)f(t)hα(tt+iε)ϕα(t).\int{\mathrm{d}}t^{\prime}{\mathrm{d}}t\overline{f}(t^{\prime})f(t)\phi(0,t^{\prime})\phi(0,t)\sim\lim_{\varepsilon\to 0}\sum_{\alpha}\int{\mathrm{d}}t^{\prime}{\mathrm{d}}t\overline{f}(t^{\prime})f(t)h_{\alpha}(t^{\prime}-t+{\mathrm{i}}\varepsilon)\phi_{\alpha}(t). (7)

For this, one writes each hα(ttiε)h_{\alpha}(t^{\prime}-t-{\mathrm{i}}\varepsilon) as the nthn^{th} derivative with respect to tt^{\prime}, for some nn, of a function kα(ttiε)k_{\alpha}(t^{\prime}-t-{\mathrm{i}}\varepsilon) that remains continuous (though not smooth) for ε0\varepsilon\to 0, and then one integrates by parts as in the derivation of eqn. (5).

What happens if we consider an arbitrary timelike curve γ\gamma in Minkowski space, not necessarily a geodesic? Parametrize γ\gamma by the proper time τ\tau and let d(τ,τ)d(\tau^{\prime},\tau) be the signed proper distance666The proper distance in Minkowski space between points along γ\gamma labeled by τ\tau^{\prime} and by τ\tau is the proper time elapsed along a geodesic between the two points (not along the path γ\gamma). To get the signed proper distance d(τ,τ)d(\tau^{\prime},\tau), we multiply the proper distance by +1+1 if τ>τ\tau^{\prime}>\tau and by 1-1 if τ>τ\tau>\tau^{\prime}. in Minkowski space between points on γ\gamma labeled by τ\tau^{\prime} and by τ\tau. The effect of replacing a timelike geodesic by an arbitrary timelike curve is to replace ττiε\tau^{\prime}-\tau-{\mathrm{i}}\varepsilon in the preceding formulas by d(τ,τ)iεd(\tau^{\prime},\tau)-{\mathrm{i}}\varepsilon. Since d(τ,τ)d(\tau^{\prime},\tau) is a smooth function and d(τ,τ)ττ+𝒪((ττ)2),d(\tau^{\prime},\tau)\sim\tau^{\prime}-\tau+{\mathcal{O}}((\tau^{\prime}-\tau)^{2}), this does not substantially affect the preceding analysis. The singularities at ττ\tau^{\prime}\to\tau are of the same general form and are harmless.

What happens if we replace Minkowski space by a general spacetime MM? Intuitively, one would not expect this to matter, since everything is determined by short distance behavior. In a curved spacetime, the operator product expansion becomes more complicated, with curvature dependent terms; see HW for an axiomatic discussion. But the extra terms have similar singularities to what we have already considered, so one would expect the same result. For a proof, under reasonable assumptions about correlation functions in a curved spacetime, see K . See also further discussion in Fewster , SW .

3 The Timelike Tube Theorem

3.1 The Timelike Envelope and the Timelike Tube Theorem

In section 2, we learned that one can define operators by smearing a quantum field along the timelike worldline of an observer. Therefore we can consider the algebra generated by such operators (or more precisely by bounded functions of such operators). But what are the algebras that we make this way? In the context of quantum field theory without gravity, this question is answered by the “timelike tube theorem.” This theorem was originally formulated for timelike geodesics in Minkowski space Borch2 ; Araki . It was generalized to free field theories in curved spacetime in Stroh . For a version of the theorem suitable for non-free theories in curved spacetime, see SW ; SW2 .

Refer to caption
Figure 3: The timelike envelope (𝒰){\mathcal{E}}({\mathcal{U}}) of an open set 𝒰{\mathcal{U}} consists of all points that can be reached by deforming a timelike curve in 𝒰{\mathcal{U}} keeping its endpoints fixed. In general, one considers all possible timelike curves in 𝒰{\mathcal{U}}, but in the case depicted, it suffices to consider segments of the particular timelike curve γ\gamma.

If 𝒰{\mathcal{U}} is an open set in spacetime, its “timelike envelope” (𝒰){\mathcal{E}}({\mathcal{U}}) consists of all points that can be reached by deforming timelike curves in 𝒰{\mathcal{U}} through a family of timelike curves, keeping the endpoints fixed (fig. 3). (Thus (𝒰){\mathcal{E}}({\mathcal{U}}) is contained in the intersection J+(𝒰)J(𝒰)J^{+}({\mathcal{U}})\cap J^{-}({\mathcal{U}}) of the past and future of 𝒰{\mathcal{U}}, but in general it is smaller. See section 3.3 for an example.) The timelike tube theorem asserts that the algebra of operators777For a more precise explanation of what algebra is intended here, see section 3.2. in 𝒰{\mathcal{U}} is the same as the algebra of operators in the possibly much larger region (𝒰){\mathcal{E}}({\mathcal{U}}). We will aim to give at least a hint of why this is true, after first explaining an implication, noted in Stroh .

Suppose that we are actually interested in a timelike curve γ\gamma, possibly one of finite extent with endpoints q,pq,p or possibly an infinite or semi-infinite curve. The timelike envelope (γ){\mathcal{E}}(\gamma) consists of all points that can be reached by deforming γ\gamma through a family of timelike curves, keeping γ\gamma fixed near its ends. If γ\gamma has endpoints, they are omitted from (γ){\mathcal{E}}(\gamma) to ensure that (γ){\mathcal{E}}(\gamma) is an open set. We can thicken γ\gamma (minus its endpoints, if any) to an open set 𝒰{\mathcal{U}}, in such a way that the timelike envelope (𝒰){\mathcal{E}}({\mathcal{U}}) does not depend on 𝒰{\mathcal{U}} – it is the same as the timelike envelope (γ){\mathcal{E}}(\gamma) of γ\gamma. See fig. 3. The timelike tube theorem says that the algebra 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) of operators in region 𝒰{\mathcal{U}} does not really depend on 𝒰{\mathcal{U}} but only on γ\gamma. It coincides with 𝒜((γ)){\mathcal{A}}({\mathcal{E}}(\gamma)), the algebra of operators in (γ){\mathcal{E}}(\gamma). So we can define an algebra for every (possibly bounded) timelike curve γ\gamma. That in itself is not a surprise, since we arrived at the same conclusion in a more direct way888The timelike tube theorem depends on real analyticity of spacetime, as discussed shortly, and the reasoning in section 2 does not, so the previous analysis was more general. in section 2. But it is nice to see the consistency with what we learned from more elementary arguments.

Strictly speaking, the algebra associated to a curve γ\gamma in section 2 consists of the quantum fields smeared along γ\gamma, while the timelike tube theorem gives us an algebra 𝒜(γ){\mathcal{A}}(\gamma) of operators that can be defined in an arbitrarily small neighborhood of γ\gamma. Presumably, these two algebras coincide (assuming that we include all possible local operators in the construction of section 2, including derivatives of operators). A proof might start with an axiomatic characterization of local operators at a point in terms of a limit of the operators in a small ball around that point. Were the two algebras to differ, one might argue that as the notion of an observer characterized by an infinitely thin worldline is an idealization, the physically more relevant algebra would be the one provided by the timelike tube theorem.

The interpretation that we want to give is that the algebra 𝒜(γ){\mathcal{A}}(\gamma) of operators supported on a curve γ\gamma is a good stand-in for the algebra 𝒜(){\mathcal{A}}({\mathcal{E}}) of the spacetime region {\mathcal{E}} associated to γ\gamma. The algebra 𝒜(γ){\mathcal{A}}(\gamma) is more operationally meaningful than 𝒜(){\mathcal{A}}({\mathcal{E}}), since it is more directly what an observer can measure. And it is also better defined in the presence of gravity than 𝒜(){\mathcal{A}}({\mathcal{E}}) would appear to be. Once one takes into account fluctuations in spacetime, it is hard to see how one would directly define 𝒜(){\mathcal{A}}({\mathcal{E}}), but as long as an observer is included in the description, the algebra of observables along the observer’s worldline seems to be a meaningful notion, even when the observer’s worldline fluctuates. So 𝒜(γ){\mathcal{A}}(\gamma) seems like a good substitute for the algebras associated to open sets that one considers in the absence of gravity.

To try to gain some idea of why the timelike tube theorem is true, let us consider the classical limit. Suppose we have a reasonable relativistic wave equation like the Klein-Gordon equation (+m2)ϕ=0(\Box+m^{2})\phi=0, where \Box is the wave operator, or possibly a nonlinear modification of this equation. In fig. 4, we are given a solution in one region of a spacetime MM, and we want to predict the solution in a larger region. We consider two cases. In fig. 4(a), the solution is given in a “spacelike pancake” 𝒰{\mathcal{U}}, and we want to extend it over the domain of dependence D(𝒰)D({\mathcal{U}}) of 𝒰{\mathcal{U}}. In fig. 4(b), the solution is given in a “timelike tube” 𝒰{\mathcal{U}} and we want to extend the solution over the corresponding “timelike envelope” (𝒰){\mathcal{E}}({\mathcal{U}}).

There is a basic asymmetry between the two cases.999 To be precise, this asymmetry holds in any spacetime dimension 𝖣>2{\sf D}>2. In 𝖣=2{\sf D}=2, spacetime has one dimension of space and one dimension of time, and there is a perfect symmetry between the two cases of fig. 4. In 𝖣>2{\sf D}>2, there are more space dimensions than time dimensions, and there is no such symmetry. The counterexample illustrated in fig. 5 fails in 𝖣=2{\sf D}=2 because in 𝖣=2{\sf D}=2, the solution with a delta function source along a worldline \ell does not blow up along \ell, but instead is discontinuous across \ell. Starting with the solution on one side of \ell, one can drop the discontinuity and smoothly continue the solution across \ell. In 𝖣>2{\sf D}>2, a solution with such a delta function source blows up along \ell and there is no way to remove the singularity without changing the solution in the original region 𝒰{\mathcal{U}}. The Holmgren uniqueness theorem of classical partial differential equations101010For an accessible exposition of this theorem, see chapter 5 of Smoller . asserts that the extension over the larger region is unique, if it exists, both in fig. 4(a) and in fig. 4(b). But existence is more special and only holds in fig. 4(a).

Refer to caption
Figure 4: Two cases in which the solution of a hyperbolic wave equation is given in one open set (the dark shaded region) and one wishes to extend the solution over a larger open set (the more lightly shaded region). Time runs vertically and space runs horizontally. In (a), the solution is given in a “spacelike pancake” 𝒰{\mathcal{U}} and one wishes to extend it over the domain of dependence D(𝒰)D({\mathcal{U}}). In (b), the solution is given in a “timelike tube” 𝒰{\mathcal{U}}, and one wishes to extend it over the “timelike envelope” (𝒰){\mathcal{E}}({\mathcal{U}}). In dimension 2, there is a perfect symmetry between these two cases, but in higher dimension, there is no such symmetry.
Refer to caption
Figure 5: This picture illustrates the obstruction to existence in the setting of fig. 4(b). Starting with initial data in an open set 𝒰{\mathcal{U}}, if one tries to extend a given solution in spatial directions over the timelike envelope (𝒰){\mathcal{E}}({\mathcal{U}}), it may develop singularities that will prevent the existence of the extended solution. In fact, this is the generic behavior. The singularity might arise on the worldline of a point charge that passes through (𝒰){\mathcal{E}}({\mathcal{U}}) but not through 𝒰{\mathcal{U}}, as sketched here.

A simple counterexample (fig. 5) shows that an existence result cannot possibly hold in fig. 4(b). Let \ell be a timelike curve that passes through the region (𝒰){\mathcal{E}}({\mathcal{U}}) but not through 𝒰{\mathcal{U}}. Consider a solution of the equation (+m2)ϕ=0(\Box+m^{2})\phi=0 with a delta function source along \ell. If we consider Maxwell’s equations rather than the Klein-Gordon equation, then \ell can be the worldline of a point charge. The field ϕ\phi blows up along \ell (in dimension 𝖣>2{\sf D}>2; see footnote 9). Starting with the solution in the region 𝒰{\mathcal{U}}, there is no way to extend it over (𝒰){\mathcal{E}}({\mathcal{U}}) as a solution, since if one tries to do this, one encounters the blowup along \ell. The equation is not obeyed along \ell, because of the delta function source.

The existence and uniqueness result in the case of fig. 4(a) is the basis for much of physics. It says that the solution can be predicted from initial data – physics is causal. But by contrast the uniqueness result without a guarantee of existence in fig. 4(b) is not usually useful at the classical level because generically the extension over (𝒰){\mathcal{E}}({\mathcal{U}}) of a solution on 𝒰{\mathcal{U}} does not exist and it is very hard to predict when it does.

Suppose, however, that we are doing quantum field theory and for simplicity consider a free field ϕ\phi with the action

I=12Md𝖣xg(DμϕDμϕm2ϕ2).I=\frac{1}{2}\int_{M}{\mathrm{d}}^{\sf D}x\sqrt{g}\left(-D_{\mu}\phi D^{\mu}\phi-m^{2}\phi^{2}\right). (8)

In this case, we can view ϕ\phi as an operator-valued solution of the Klein-Gordon equation (+m2)ϕ=0(\Box+m^{2})\phi=0. If we are studying this quantum field theory on MM, then the field ϕ(x)\phi(x) does exist throughout MM and therefore existence of the extension from 𝒰{\mathcal{U}} to (𝒰){\mathcal{E}}({\mathcal{U}}) is not an issue.

But what does uniqueness mean? In some sense, uniqueness means “the field ϕ(x)\phi(x) for x(𝒰)x\in{\mathcal{E}}({\mathcal{U}}) is uniquely determined by ϕ(y)\phi(y) for y𝒰y\in{\mathcal{U}}.” As explained by Borchers and Araki in the early 1960’s, the quantum meaning of this statement is really “ϕ(x)\phi(x) for x(𝒰)x\in{\mathcal{E}}({\mathcal{U}}) is contained in the algebra generated by ϕ(y)\phi(y) for y𝒰y\in{\mathcal{U}}” or equivalently the operator algebras of the two regions are the same:

𝒜((𝒰))=𝒜(𝒰).{\mathcal{A}}({\mathcal{E}}({\mathcal{U}}))={\mathcal{A}}({\mathcal{U}}).

This is the timelike tube theorem.

Classically, we could just as well add higher order terms to the action and consider a field that satisfies a nonlinear partial differential equation. Holmgren uniqueness applies equally well to such an equation. Quantum mechanically, although a free quantum field can be viewed as an operator-valued solution of a classical field equation, that is not the case for a non-free quantum field, because of issues involving renormalization. Accordingly, proofs of the timelike tube theorem in non-free theories require additional ingredients, beyond what is needed in free theories. An approach that works for non-free theories in curved spacetime was presented in SW (see SW2 for an informal account).

In this discussion, we have skipped over two important points. First, we have not been precise about what is the algebra that is governed by the timelike tube theorem. See section 3.2. Second, we have omitted to explain a key assumption in the classical Holmgren uniqueness theorem and the quantum timelike tube theorem. These theorems hold in real analytic spacetimes. Real analyticity is not needed for existence and uniqueness in the setting of fig. 4(a), but it is needed for uniqueness in the setting of111111Actually, it suffices for the spacetime to be, in some coordinate system, real analytic in the time direction Tataru . In free field theory, this is also true for the timelike tube theorem Stroh . fig. 4(b). Is this restriction important for the application of the timelike tube theorem to semiclassical gravity? In semiclassical gravity, one is usually studying the behavior of quantum fields in a background spacetime that in practice is normally real analytic. For instance, in section 4, we consider the static patch in de Sitter space as an example in which it is important to explicitly include an observer in order to define a sensible algebra of observables, and thus in which the timelike tube theorem provides important motivation. This is a typical example in which the starting point is a real analytic spacetime. After picking a semiclassical starting point, one then expands around it. In perturbation theory, the metric certainly fluctuates away from the real analytic starting point, but one expects that in perturbation theory, the gravitational field can be treated like any other quantum field in a real analytic background. Thus it appears that for typical applications of the timelike tube theorem to semiclassical gravity, the restriction of the theorem to real analytic spacetimes is not a problem.

3.2 The Additive Algebra

For a region 𝒰{\mathcal{U}} in spacetime, what precisely is the algebra of operators to which the timelike tube theorem applies? The smallest reasonable candidate is the algebra generated by local operators in 𝒰{\mathcal{U}} (or more precisely, by bounded functions of smeared local operators). If 𝒰{\mathcal{U}} is contractible, this is the natural algebra of observables in 𝒰{\mathcal{U}} in a generic quantum field theory. However, if 𝒰{\mathcal{U}} is not contractible, then in general, as we discuss shortly, there can be additional operators in 𝒰{\mathcal{U}}, such as Wilson operators defined on noncontractible loops in 𝒰{\mathcal{U}}, that cannot be constructed from local operators in 𝒰{\mathcal{U}}.

The algebra generated by local operators has been called the additive algebra 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}) casini1 ; casini2 . One can then reserve the name 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) for the possibly larger algebra of all operators in 𝒰{\mathcal{U}}. The timelike tube theorem is really a theorem about 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). Thus a more precise statement of the theorem than we have given so far is 𝒜add(𝒰)=𝒜add((𝒰)){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}({\mathcal{U}})). This is clear from the proofs, which involve studying algebras of local operators. However, as we will discuss, while the distinction between 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) and 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}) exists in quantum field theory in general, there are reasons to believe that this distinction is absent in any theory that emerges at long distances from a full model of quantum gravity.

The motivation for the phrase “additive algebra” is as follows. Let {\mathcal{B}} be a small open ball in 𝒰{\mathcal{U}}. Then as {\mathcal{B}} is contractible, there is no distinction between 𝒜(){\mathcal{A}}({\mathcal{B}}) and 𝒜add(){\mathcal{A}}_{\mathrm{add}}({\mathcal{B}}). Now cover 𝒰{\mathcal{U}} by open balls α{\mathcal{B}}_{\alpha}, with α\alpha ranging over some set SS. The additive algebra of 𝒰{\mathcal{U}} is the same as the algebra generated by the 𝒜(α){\mathcal{A}}({\mathcal{B}}_{\alpha}), αS\alpha\in S:

𝒜add(𝒰)=αS𝒜(α).{\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})=\vee_{\alpha\in S}{\mathcal{A}}({\mathcal{B}}_{\alpha}). (9)

In what situation will we have 𝒜(𝒰)𝒜add(𝒰){\mathcal{A}}({\mathcal{U}})\not={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}), where 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) is the algebra of all possible operators in 𝒰{\mathcal{U}}, not necessarily built from local operators? For a typical example, consider a theory with a U(1){\mathrm{U}}(1) gauge field AA, with curvature F=dAF={\mathrm{d}}A. Let \ell be a closed loop in 𝒰{\mathcal{U}}, and consider the Wilson operator W(A)=exp(iA)W_{\ell}(A)=\exp({\mathrm{i}}\oint_{\ell}A). Is this operator contained in 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})? The answer will be “yes” if \ell is the boundary of an oriented two-manifold D𝒰D\subset{\mathcal{U}}, for then W(A)=exp(iDF)W_{\ell}(A)=\exp({\mathrm{i}}\int_{D}F) is a bounded function of a smeared local field in 𝒰{\mathcal{U}}, namely DF\int_{D}F. But even if \ell is not a boundary in 𝒰{\mathcal{U}}, the answer can still be “yes,” for a more subtle reason that was explained in Harlow . Consider a theory that has a charge 1 field ϕ\phi. Then for an open path γ𝒰\gamma\subset{\mathcal{U}}, say with endpoints q,pq,p, one can consider the gauge-invariant operator Vγ(A)=ϕ¯(p)exp(iγA)ϕ(q).V_{\gamma}(A)=\overline{\phi}(p)\exp({\mathrm{i}}\int_{\gamma}A)\phi(q). In the case that γ\gamma is a very short path, with pp very near qq, VγV_{\gamma} can be expanded in terms of local operators at qq and so is contained in the additive algebra of any open set containing γ\gamma. If \ell is a closed loop, then W(A)W_{\ell}(A) can be “cut” in the sense that if one omits from \ell a number of open intervals of size ε\varepsilon, with the portion retained then being a disjoint union of closed intervals γi\gamma_{i}, i=1,,ni=1,\cdots,n, then W(A)W_{\ell}(A) appears as the most singular contribution in the operator product i=1nVγi(A)\prod_{i=1}^{n}V_{\gamma_{i}}(A) for ε0\varepsilon\to 0. Hence, in this situation W(A)W_{\ell}(A) is contained in the additive algebra 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}) for any region 𝒰{\mathcal{U}} containing \ell.

However, if the theory has no field of unit charge, and \ell is not a boundary in 𝒰{\mathcal{U}}, then W(A)W_{\ell}(A) is contained in 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) but not in 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). This phenomenon is certainly not limited to U(1){\mathrm{U}}(1) gauge theory. In a theory with any gauge group GG, if some representation RR of GG is “missing,” in the sense that no local operator of the theory transforms in this representation, then for suitable regions 𝒰{\mathcal{U}} one will have 𝒜add(𝒰)𝒜(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})\not={\mathcal{A}}({\mathcal{U}}). A Wilson loop in the representation RR can be present in 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) but not in 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). A similar role can be played by a pp-form gauge field BB with p>1p>1. In a theory that has such a field, one can consider the operator QS=exp(iSB)Q_{S}=\exp({\mathrm{i}}\int_{S}B), where SS is a pp-cycle in 𝒰{\mathcal{U}}. If SS is not a boundary in 𝒰{\mathcal{U}} and the theory does not have a string or membrane that couples to BB (which would enable one to “cut” the operator QSQ_{S} in small pieces), then QSQ_{S} is contained in 𝒜(U){\mathcal{A}}(U) but not in 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). There is also an analog of this for magnetic charges (and their analogs for strings and membranes), with ’t Hooft operators instead of Wilson operators.

We can now easily give an example that shows that it is necessary to specify that the timelike tube theorem applies to 𝒜add{\mathcal{A}}_{\mathrm{add}}, not to 𝒜{\mathcal{A}}. Consider U(1){\mathrm{U}}(1) gauge theory without charged fields on a spacetime ×S1{\mathbb{R}}\times S^{1} with product metric, where {\mathbb{R}} parametrizes time and S1S^{1} is parametrized by an angular variable ϕ\phi. In this spacetime, there is a nontrivial Wilson operator W=exp(iA)W_{\ell}=\exp({\mathrm{i}}\oint_{\ell}A), where \ell is a closed loop that wraps around the S1S^{1}. Let pp and qq be two points at the same value of ϕ\phi but different values of tt, and let γ\gamma be the timelike geodesic that runs from qq to pp at fixed tt. If 𝒰{\mathcal{U}} is a small neighborhood of γ\gamma, there is no nontrivial Wilson operator in 𝒰{\mathcal{U}}, so 𝒜(𝒰)=𝒜add(𝒰){\mathcal{A}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). But if pp is separated in time from qq by more than the circumference of the S1S^{1}, then the region (𝒰){\mathcal{E}}({\mathcal{U}}) includes a circle that wraps all the way around the S1S^{1}, so 𝒜((𝒰))𝒜add((𝒰)){\mathcal{A}}({\mathcal{E}}({\mathcal{U}}))\not={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}({\mathcal{U}})). Thus in this example, we have 𝒜add(𝒰)=𝒜add((𝒰)){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}({\mathcal{U}})), as guaranteed by the timelike tube theorem, but 𝒜(𝒰)𝒜((𝒰)){\mathcal{A}}({\mathcal{U}})\not={\mathcal{A}}({\mathcal{E}}({\mathcal{U}})).

Let us say that a quantum field theory which may have gauge fields or pp-form gauge fields for p>1p>1 is “complete” if the electrically and magnetically charged particles, strings, and membranes coupling to those gauge fields are a maximal possible set consistent with all principles of low energy physics including Dirac quantization of electric and magnetic charge. Complete theories are precisely the ones with 𝒜add(𝒰)=𝒜(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}({\mathcal{U}}) for all 𝒰{\mathcal{U}}, so we might also call them “additive.” It is believed that a quantum field theory that emerges at low energies from a full-fledged (ultraviolet complete) theory of quantum gravity is always complete in this sense. This was originally suggested based on experience with string theory Polchinski . Further arguments were given in BS , and the fullest known argument is based on holographic duality HO .

The claim that a theory that emerges from a full quantum gravity theory will be additive or complete has another interpretation. A theory with “missing” charges (or strings or membranes) has a pp-form global symmetry in the language of GKSW (where pp depends on what is missing). So the statement that a full-fledged theory of quantum gravity is complete can be viewed as an extension to pp-form global symmetry with p>1p>1 of the statement that there are no global symmetries in a full theory of quantum gravity. For discussion of this statement, see for example BS ; EW .

In the spirit of the present article, one can motivate as follows the idea that an ultraviolet-complete theory of quantum gravity should have no missing charges. We assume that a complete gravity theory can be approximated, in an appropriate class of states, by an ordinary quantum field theory weakly coupled to gravity. Suppose that this ordinary quantum field theory is not complete, for example because it has a U(1){\mathrm{U}}(1) gauge field but no state of charge 1. Then if a loop \ell is not a boundary in spacetime, the Wilson operator W=exp(iA)W_{\ell}=\exp({\mathrm{i}}\oint_{\ell}A) is not measurable by any observer living in the spacetime and described by the theory. After all, given that \ell is not a boundary, WW_{\ell} cannot be measured by measuring the curvature F=dAF={\mathrm{d}}A. To measure WW_{\ell} when \ell is not a boundary requires measuring the interference between different histories of a charge 1 particle that differ by the homology class of the worldline of the particle. But the assumption that the theory has no state of charge 1 means that such an experiment is not possible using the resources available in the theory.

What, then, do we mean in claiming that WW_{\ell} (or its hermitian part) is an “observable”? In ordinary quantum mechanics, we consider the observer to be external to the system, and we make no particular assumption about what resources the observer can use in probing the system. From that point of view, the observer might be able to introduce a massive charged particle to probe the system and measure WW_{\ell}. The observer, after all, is described by a more complete theory of the universe that might have the necessary charged particle. An ultraviolet-complete theory of quantum gravity, however, is expected to describe all that there is in the universe that it describes, with no way to add anything from outside. In the context of such a theory, the observer and any apparatus used by the observer must already be described by the theory. So in an ultraviolet complete theory of quantum gravity with a “missing” charge, we would be in the awkward situation that the theory would enable us to define an “observable” WW_{\ell} that would be well-defined in an appropriate, long distance limit, but that no one, in principle, could measure.

3.3 What Else?

In addition to the timelike tube theorem, which asserts that 𝒜add(𝒰)=𝒜add((𝒰)){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}({\mathcal{U}})), one has causality, which, denoting the domain of dependence of a set 𝒰{\mathcal{U}} as D(𝒰)D({\mathcal{U}}), asserts that 𝒜(𝒰)=𝒜(D(𝒰)){\mathcal{A}}({\mathcal{U}})={\mathcal{A}}(D({\mathcal{U}})). If the difference between 𝒜{\mathcal{A}} and 𝒜add{\mathcal{A}}_{\mathrm{add}} is unimportant, these statements can be combined, relating 𝒜(𝒰){\mathcal{A}}({\mathcal{U}}) to the algebra of a set that in general is larger than either (𝒰){\mathcal{E}}({\mathcal{U}}) or D(𝒰)D({\mathcal{U}}). This was noted by Araki Araki in one of the original papers on the timelike tube theorem. However, Araki actually conjectured a further generalization. In the context of quantum fields in Minkowski space, Araki wrote, “ … If \ell is a snake-like line, connecting two mutually timelike points P1P_{1} and P2P_{2} but everywhere spacelike, and {\mathcal{B}} is a tube around \ell, of a small diameter, then it is rather likely that any solution of the [massless scalar] wave equation vanishing in {\mathcal{B}} might always vanish in the double light cone spanned by P1P_{1}, and P2P_{2}. If such a conjecture turns out to be true, then we immediately have the corresponding theorem for R()R({\mathcal{B}}).” In our terminology, the proposal is that if Δ\Delta is the double cone or causal diamond with vertices P1P_{1} and P2P_{2}, then 𝒜add()=𝒜add(Δ){\mathcal{A}}_{\mathrm{add}}({\mathcal{B}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{B}}\cup\Delta). This follows from arguments in Araki’s paper if it is true that any solution of the massless scalar wave equation vanishing in {\mathcal{B}} also vanishes in Δ\Delta. This statement does not follow from Holmgren uniqueness in any obvious way and its validity remains unclear sixty years later.

In general, what extension of the timelike tube theorem is conceivable? For any set 𝒰{\mathcal{U}} in a spacetime MM, one writes 𝒰{\mathcal{U}}^{\prime} for the set of points in MM that are spacelike separated from 𝒰{\mathcal{U}}. Then 𝒰′′=(𝒰){\mathcal{U}}^{\prime\prime}=({\mathcal{U}}^{\prime})^{\prime} is called the causal completion of 𝒰{\mathcal{U}}. For any open set 𝒰{\mathcal{U}}, since 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}) commutes with 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime}), clearly any statement that 𝒜add(𝒰)=𝒜add(𝒱){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{V}}) for some open set 𝒱{\mathcal{V}} implies that 𝒜add(𝒱){\mathcal{A}}_{\mathrm{add}}({\mathcal{V}}) commutes with 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime}). If one is hoping to make a general statement that would apply to any quantum field theory, this really forces 𝒱𝒰′′{\mathcal{V}}\subset{\mathcal{U}}^{\prime\prime}, since local operators outside of 𝒰′′=(𝒰){\mathcal{U}}^{\prime\prime}=({\mathcal{U}}^{\prime})^{\prime} typically do not commute with 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime}). So the most optimistic statement that one might hope for would be 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}).

For example, if 𝒰{\mathcal{U}} is a timelike curve in Minkowski space, then (𝒰)=𝒰′′{\mathcal{E}}({\mathcal{U}})={\mathcal{U}}^{\prime\prime}, and therefore the timelike tube theorem does indeed give 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}). In general, (𝒰)𝒰′′{\mathcal{E}}({\mathcal{U}})\subset{\mathcal{U}}^{\prime\prime}, but 𝒰′′{\mathcal{U}}^{\prime\prime} can be strictly bigger. Indeed, in Araki’s example, ′′{\mathcal{B}}^{\prime\prime} is strictly bigger than (){\mathcal{E}}({\mathcal{B}}). His conjecture would be a special case of 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}), in a situation in which that does not follow directly from the known statement of the timelike tube theorem.

A statement along the lines of 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}) would be very attractive, as it would combine ordinary causality with the timelike tube theorem. Extending from a spacelike pancake to its domain of dependence (fig. 4(a)) and from a timelike tube to its timelike envelope (fig. 4(b)) can be regarded as two different cases of extending from 𝒰{\mathcal{U}} to 𝒰′′{\mathcal{U}}^{\prime\prime}. However, there are some easy counterexamples to 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}).

One type of counterexample121212This example is relatively well known but the original reference is not clear. A version of the example of fig. 6 was discussed in Stroh . arises if 𝒰{\mathcal{U}} is not connected. In Minkowski space in any even dimension 𝖣{\sf D}, let 𝒪{\mathcal{O}} be a small neighborhood of the origin. Let 𝒰{\mathcal{U}} be any open set that contains points to the future of 𝒪{\mathcal{O}} and also points to the past of 𝒪{\mathcal{O}}, but none that can be reached from 𝒪{\mathcal{O}} by a null geodesic. Then 𝒪𝒰′′{\mathcal{O}}\subset{\mathcal{U}}^{\prime\prime}, so 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}) would predict that 𝒜add(𝒪)𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{O}})\subset{\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). In the theory of a massless free scalar field ϕ\phi, however, this statement is false. By Huygen’s principle along with commutativity at spacelike separation, the commutator function G(x,y)=[ϕ(x),ϕ(y)]G(x,y)=[\phi(x),\phi(y)] in that theory is supported on the light cone (for 𝖣{\sf D} even), so it vanishes for x𝒪x\in{\mathcal{O}}, y𝒰y\in{\mathcal{U}}. Thus in this situation, 𝒜add(𝒪){\mathcal{A}}_{\mathrm{add}}({\mathcal{O}}) commutes with 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}), rather than being contained in 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}).

Refer to caption
Figure 6: In the spacetime ×S1{\mathbb{R}}\times S^{1}, with {\mathbb{R}} parametrizing the “time,” \ell is a spacelike curve that spirals around from a point P1P_{1} to a point P2P_{2} that is to its future. 𝒪{\mathcal{O}} is an open set in the “gap” between P1P_{1} and P2P_{2}. If {\mathcal{B}} (not drawn) is an open set that is a small “thickening” of \ell, then a right-moving null geodesic through 𝒪{\mathcal{O}} does not intersect {\mathcal{B}}.

In two dimensions, a small variant of this gives a counterexample with 𝒰{\mathcal{U}} connected. In fact, the setup is closely related to that suggested by Araki. Consider the spacetime M=×S1M={\mathbb{R}}\times S^{1}, with a product metric, where {\mathbb{R}} parametrizes time and S1S^{1} parametrizes space. Let the point P2P_{2} be slightly to the future of P1P_{1}, and let \ell be a spacelike curve that starts at P1P_{1}, spirals once around S1S^{1} to the right, and ends at P2P_{2}. Let {\mathcal{B}} be an open set that is a slight thickening of \ell, and let Δ\Delta be the causal diamond with vertices P1,P2P_{1},P_{2}. Finally, let 𝒪Δ{\mathcal{O}}\subset\Delta be an open set small enough that any right-moving null geodesic that intersects 𝒪{\mathcal{O}} does not intersect {\mathcal{B}} (fig 6). Then 𝒪Δ′′{\mathcal{O}}\subset\Delta\subset{\mathcal{B}}^{\prime\prime}. In two-dimensional quantum field theory, it is possible to have a local field JJ with the property that [J(x),J(y)][J(x),J(y)] vanishes unless the points xx and yy are connected by a right-moving null geodesic. For example, JJ could be a right-moving conserved current (or a chiral component of the stress tensor) in a conformally invariant theory. Let ff be a smearing function supported in 𝒪{\mathcal{O}}, and consider the operator Jf=d2xf(x)J(x)𝒜add(𝒪)𝒜add(′′)J_{f}=\int{\mathrm{d}}^{2}xf(x)J(x)\in{\mathcal{A}}_{\mathrm{add}}({\mathcal{O}})\subset{\mathcal{A}}_{\mathrm{add}}({\mathcal{B}}^{\prime\prime}). This operator commutes with 𝒜add(){\mathcal{A}}_{\mathrm{add}}({\mathcal{B}}), rather than being contained in 𝒜add(){\mathcal{A}}_{\mathrm{add}}({\mathcal{B}}), as one would expect based on 𝒜add()=𝒜add(′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{B}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{B}}^{\prime\prime}).

In the context of the present article, we are not primarily interested in applying the timelike tube theorem or a hypothetical generalization to an arbitrary open set. We are primarily interested in observables along the timelike worldline γ\gamma of an observer. The timelike tube theorem tells us that it does not matter whether we consider a timelike curve γ\gamma or an open set that is a suitable slight thickening of it, so we will express the following in terms of the algebra associated to a curve. The example just discussed can be slightly modified to give a counterexample in that context. If P2P_{2} is farther to the future of P1P_{1} than was assumed so far, then the curve γ\gamma that wraps around the S1S^{1} (in general any number of times) en route from P1P_{1} to P2P_{2} can be timelike, but almost null. Then the timelike envelope (γ){\mathcal{E}}(\gamma) is a small neighborhood of γ\gamma, but its causal completion ′′{\mathcal{B}}^{\prime\prime}, which is the same as γ′′\gamma^{\prime\prime} if {\mathcal{B}} is suitably chosen, wraps all the way around the S1S^{1}. The statement 𝒜add((γ))=𝒜add(γ′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}(\gamma))={\mathcal{A}}_{\mathrm{add}}(\gamma^{\prime\prime}) is false in general just as before.

In this example, γ′′\gamma^{\prime\prime} is the same as the causal diamond131313Here Δ(γ)=J+(γ)J(γ)\Delta(\gamma)=J^{+}(\gamma)\cap J^{-}(\gamma) is the intersection of the past and future of γ\gamma, or equivalently J(P2)J+(P1)J^{-}(P_{2})\cap J^{+}(P_{1}). Δ(γ)\Delta(\gamma) with vertices P1,P2P_{1},P_{2}, so in particular (γ)Δ(γ){\mathcal{E}}(\gamma)\subsetneqq\Delta(\gamma). Thus, this example shows that in the statement of the timelike tube theorem, in general (γ){\mathcal{E}}(\gamma) cannot be replaced with141414Similarly, (γ)Δ(γ){\mathcal{E}}(\gamma)\subsetneqq\Delta(\gamma) in a simply-connected spacetime M=×S2M={\mathbb{R}}\times S^{2}, with metric ds2=dt2+dx2+dy2+ϵdz2{\mathrm{d}}s^{2}=-{\mathrm{d}}t^{2}+{\mathrm{d}}x^{2}+{\mathrm{d}}y^{2}+\epsilon{\mathrm{d}}z^{2}, with x2+y2+z2=1x^{2}+y^{2}+z^{2}=1, ϵ1\epsilon\ll 1. Embedding ×S1{\mathbb{R}}\times S^{1} in ×S2{\mathbb{R}}\times S^{2} at z=0z=0 and choosing γ\gamma as before, again Δ(γ)\Delta(\gamma) is much larger than (γ){\mathcal{E}}(\gamma) if P2P_{2} is sufficiently far to the future of P1P_{1}. However, as in Araki’s example, it is not clear in this case whether it is always true that 𝒜add(γ)=𝒜add(Δ(γ)){\mathcal{A}}_{\mathrm{add}}(\gamma)={\mathcal{A}}_{\mathrm{add}}(\Delta(\gamma)). Free field theory does not provide an immediate counterexample. Δ(γ)\Delta(\gamma).

So some simple generalizations of the timelike tube theorem are false in a general quantum field theory, and others, such as Araki’s conjecture, have unclear status. However, the following somewhat fanciful remarks come to mind. Let us go back to the example in which 𝒰{\mathcal{U}} is not connected and is, for example, the union of a small ball q{\mathcal{B}}_{q} around a point qq and a small ball p{\mathcal{B}}_{p} around a point pp to its future. (Similar remarks apply in the other examples.) In this case, for 𝖣{\sf D} even, massless free field theory provides a counterexample to 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}), but the argument was limited to massless free field theory and would not generalize in an obvious way to any other theory. Consider then in a more generic theory an observer151515In this paragraph, unlike the rest of the article, we consider an observer probing the spacetime from outside, not an observer who propagates on a worldline in the spacetime. who has the capability to manipulate the quantum fields in an arbitrary fashion, but only in 𝒰=pq{\mathcal{U}}={\mathcal{B}}_{p}\cup{\mathcal{B}}_{q}. A strategy this observer can follow is to inject into spacetime a probe in the region q{\mathcal{B}}_{q}, directed on a trajectory γ\gamma that will carry it to p{\mathcal{B}}_{p}, and equipped and programmed to make some chosen measurement and record the result. Then the observer retrieves the probe at p{\mathcal{B}}_{p} and reads the answer. Does this construction along with the timelike tube theorem prove that 𝒜add((γ))𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}(\gamma))\subset{\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})? Though we assume no particular limit on the technological capabilities of the observer, we assume that whatever happens in a part of spacetime to which the observer does not have direct access is governed by the theory that is being probed. Thus in particular the probe that the observer injects into spacetime in q{\mathcal{B}}_{q} must be something that can be built in this theory. A complex probe cannot be built in free field theory, so there is no tension between existence of this protocol and our earlier observations about free field theory. However, in a sufficiently complex theory – possibly any theory that encompasses the Standard Model, for example – this protocol does indeed hint that 𝒜add((γ))𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}(\gamma))\subset{\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). Actually, the observer can choose γ\gamma at will (assuming that the probe can be equipped with a rocket engine and programmed to travel on a pre-chosen trajectory), and more generally the observer could inject into the system several probes with pre-chosen trajectories γi\gamma_{i} from (q){\mathcal{B}}(q) to (p){\mathcal{B}}(p), each designed and programmed to carry out a particular quantum operation, and then the observer can collect the probes in (p){\mathcal{B}}(p) and process their (possibly quantum) output at will. All this suggests that in a theory that is sufficiently complex, possibly in the end 𝒜add(𝒰)=𝒜add(𝒰′′){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}^{\prime\prime}) in general.

3.4 Causal Wedge Reconstruction

There is actually a manifestation of the timelike tube theorem that has been much discussed in the literature. This is causal wedge reconstruction, or HKLL reconstruction, in the context of the AdS/CFT correspondence BDHM ; B ; Bal ; HKLL ; HKLL2 ; KLL ; HMPS ; Mor ; H .

In the AdS/CFT correspondence, one considers a conformal field theory on a globally hyperbolic spacetime MM of dimension 𝖣{\sf D}. AdS/CFT duality says that an appropriate conformal field theory on MM is equivalent to a gravitational theory formulated on a spacetime XX that has MM for its conformal boundary, and that is globally hyperbolic in the asymptotically AdS sense. In fact, in AdS/CFT duality, one has to consider all possible XX’s that have a given conformal boundary MM, but for our purposes here, we can assume that a particular XX is important. Causal wedge reconstruction applies in a semiclassical situation in which XX can be viewed, in leading order, as a definite spacetime in which quantum fields are propagating.

Refer to caption
Figure 7: A region 𝒰{\mathcal{U}} in the boundary MM of XX, and a slight thickening 𝒰X{\mathcal{U}}_{X} of 𝒰{\mathcal{U}} in the bulk spacetime XX, chosen so that X(𝒰)=(𝒰X){\mathcal{E}}_{X}({\mathcal{U}})={\mathcal{E}}({\mathcal{U}}_{X}).

Let SS be a Cauchy hypersurface in MM, BB an open set in SS, and 𝒰{\mathcal{U}} the domain of dependence of BB in MM. Then 𝒰{\mathcal{U}} is its own timelike envelope in MM, so the timelike tube theorem, applied directly to the region 𝒰{\mathcal{U}} in the CFT on MM, says nothing of interest. However, if MM is the conformal boundary of XX and 𝒰M{\mathcal{U}}\subset M, then it makes sense to consider the timelike envelope X(𝒰){\mathcal{E}}_{X}({\mathcal{U}}) of 𝒰{\mathcal{U}} in XX (fig. 7). In its simplest form, causal wedge reconstruction then asserts that 𝒜add(𝒰)=𝒜add(X(𝒰)){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}})={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}_{X}({\mathcal{U}})). This can be viewed as a composite of two statements: (1) the basic AdS/CFT duality expressing local operators in MM as limits of local operators in XX, and (2) the timelike tube theorem applied to quantum fields in XX.

To explain this, we can proceed as follows. First, it is possible to thicken 𝒰{\mathcal{U}} slightly to an open set 𝒰XX{\mathcal{U}}_{X}\subset X such that X(𝒰)=(𝒰X){\mathcal{E}}_{X}({\mathcal{U}})={\mathcal{E}}({\mathcal{U}}_{X}). The timelike tube theorem says that 𝒜add(𝒰X){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}_{X}) does not depend on the precise choice of 𝒰X{\mathcal{U}}_{X}, and always equals 𝒜add((𝒰X)){\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}({\mathcal{U}}_{X})). Since this is the case, we can take a limit in which 𝒰X{\mathcal{U}}_{X} becomes arbitrarily “thin.” The basic AdS/CFT relation between local operators on XX and local operators on MM can be interpreted as a statement

lim𝒰X𝒰𝒜add(𝒰X)=𝒜add(𝒰),\lim_{{\mathcal{U}}_{X}\downarrow{\mathcal{U}}}{\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}_{X})={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}), (10)

where lim𝒰X𝒰\lim_{{\mathcal{U}}_{X}\downarrow{\mathcal{U}}} refers to a limit in which the bulk open set 𝒰X{\mathcal{U}}_{X} collapses down to the boundary open set 𝒰{\mathcal{U}}. On the left hand side, 𝒜add(𝒰X){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}_{X}) is the algebra generated by bulk local operators in the bulk region 𝒰X{\mathcal{U}}_{X}, and on the right hand side, 𝒜add(𝒰){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}) is the algebra generated by boundary local operators in the boundary region 𝒰{\mathcal{U}}. Eqn. (10) is a way to express the fact that boundary local operators are the boundary limits of bulk local operators. Since the timelike tube theorem tells us that 𝒜add(𝒰X)=𝒜add((𝒰X))=𝒜add(X(𝒰)){\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}_{X})={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}({\mathcal{U}}_{X}))={\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}_{X}({\mathcal{U}})) regardless of the choice of 𝒰X{\mathcal{U}}_{X}, the limit in eqn. (10) is trivial and we learn that

𝒜add(X(𝒰))=𝒜add(𝒰).{\mathcal{A}}_{\mathrm{add}}({\mathcal{E}}_{X}({\mathcal{U}}))={\mathcal{A}}_{\mathrm{add}}({\mathcal{U}}). (11)

In a globally hyperbolic spacetime that satisfies the null energy condition, X(𝒰){\mathcal{E}}_{X}({\mathcal{U}}) is the causal wedge of 𝒰{\mathcal{U}}, and eqn. (11) is the usual statement of causal wedge reconstruction. However, this requires some explanation,161616The following proof is not needed in the rest of the article. Facts used in the proof are explained, for example, in Wald ; GaoWald ; LightRays . On compactness of spaces of causal curves, see for instance sections 2 and 3.3 of LightRays ; on the fact that a causal curve that satisfies a promptness condition is a null geodesic without focal points, see sections 5.1 and 5.2 of that article; on the completeness of a null geodesic in an asymptotically AdS spacetime XX whose ends are on the conformal boundary of XX, see section 7.3; on the fact that a complete null geodesic along which the null energy condition is satisfied as a strict inequality must have focal points, see section 8.2. since the usual definition of the causal wedge is slightly different. The causal wedge of 𝒰{\mathcal{U}}, which we will denote as 𝒞(𝒰){\mathcal{C}}({\mathcal{U}}), is usually defined to consist of all points rXr\in X that are contained in a causal curve γX\gamma\subset X between two points q,p𝒰q,p\in{\mathcal{U}}, say with pp to the future of qq. (Equivalently, 𝒞(𝒰){\mathcal{C}}({\mathcal{U}}) is the intersection of the past and future of 𝒰{\mathcal{U}} in XX.) Instead, X(𝒰){\mathcal{E}}_{X}({\mathcal{U}}) contains all points rXr\in X that are contained in a causal curve γX\gamma\subset X between points q,p𝒰q,p\in{\mathcal{U}} such that γ\gamma can be deformed, through a family of causal curves with fixed endpoints, to a causal curve entirely in MM (and therefore in 𝒰{\mathcal{U}}). Thus to show that X(𝒰)=𝒞(𝒰){\mathcal{E}}_{X}({\mathcal{U}})={\mathcal{C}}({\mathcal{U}}), we have to show that any causal curve γX\gamma\subset X that starts and ends at points q,p𝒰q,p\in{\mathcal{U}} can be deformed, through a family of causal curves with those fixed endpoints, to a causal curve in MM. To prove this, let qp{\mathcal{R}}_{qp} be the space of causal curves in XX with initial and final endpoints q,pq,p. Pick a causal curve ζqp𝒰\zeta_{qp}\subset{\mathcal{U}} from qq to pp, and let 𝒯ζ{\mathcal{T}}_{\zeta} be the space of causal curves in XX with initial endpoint qq and any final endpoint sζqps\in\zeta_{qp}. A causal curve from qq to ss can be converted to a causal curve from qq to pp by gluing on the segment ζsp\zeta_{sp} of ζqp\zeta_{qp}. This operation maps 𝒯ζ{\mathcal{T}}_{\zeta} continuously to qp{\mathcal{R}}_{qp}; on the other hand, qp𝒯ζ{\mathcal{R}}_{qp}\subset{\mathcal{T}}_{\zeta}. So a causal curve γX\gamma\subset X with endpoints q,pq,p can be deformed in qp{\mathcal{R}}_{qp} to a curve entirely in MM if and only if it can be so deformed in 𝒯ζ{\mathcal{T}}_{\zeta}. Hence, to prove that X(𝒰)=𝒞(𝒰){\mathcal{E}}_{X}({\mathcal{U}})={\mathcal{C}}({\mathcal{U}}), it suffices to show that any causal curve γX\gamma\subset X with endpoints q,pq,p can be deformed in 𝒯ζ{\mathcal{T}}_{\zeta} to a curve entirely in MM. This question is topological in nature: is γ\gamma in the connected component of 𝒯ζ{\mathcal{T}}_{\zeta} that contains curves that lie entirely in MM? So the answer is invariant under an infinitesimal perturbation of XX, and we can make such a perturbation to ensure that the null energy condition is satisfied in XX as a strict inequality (not as an equality). We will see later exactly where in XX the perturbation should be made. On 𝒯ζ{\mathcal{T}}_{\zeta}, one can define the following continuous function φ\varphi: if γ𝒯γ\gamma_{*}\in{\mathcal{T}}_{\gamma} has endpoints q,sζqpq,s\in\zeta_{qp}, then φ(γ)\varphi(\gamma_{*}) is the proper time elapsed from qq to ss along ζqp\zeta_{qp}. An important detail is that we allow the case of a causal curve that consists of only one point. Thus in particular 𝒯ζ{\mathcal{T}}_{\zeta} contains a point qq_{*} that corresponds to a causal curve consisting only of the point qq. The point qq_{*} is the absolute minimum of φ\varphi, since φ(q)=0\varphi(q_{*})=0, and at other points in 𝒯ζ{\mathcal{T}}_{\zeta}, φ>0\varphi>0. The purpose of including the point qq_{*} in the definition of 𝒯ζ{\mathcal{T}}_{\zeta} is to ensure that 𝒯ζ{\mathcal{T}}_{\zeta} is compact; this compactness follows from the fact that the initial endpoint of a curve γ𝒯ζ\gamma_{*}\in{\mathcal{T}}_{\zeta} is fixed and its final endpoint ss ranges over the compact set ζqp\zeta_{qp}, along with the fact that in a globally hyperbolic spacetime, the space of causal curves with specified endpoints is compact. Compactness of 𝒯ζ{\mathcal{T}}_{\zeta} ensures that in every connected component of 𝒯ζ{\mathcal{T}}_{\zeta}, the nonnegative function φ\varphi has an absolute minimum. Any γ𝒯ζ\gamma\in{\mathcal{T}}_{\zeta} can be deformed in 𝒯ζ{\mathcal{T}}_{\zeta} to the minimum of φ\varphi in its connected component. We will conclude the proof by showing that the point qq_{*} is the unique local minimum of the function φ\varphi, implying that the local minimum of φ\varphi to which γ\gamma can be deformed in 𝒯ζ{\mathcal{T}}_{\zeta} is actually the absolute minimum qq_{*}. This in particular implies that γ\gamma can be deformed in 𝒯ζ{\mathcal{T}}_{\zeta} to a causal curve entirely in MM, as we wished to show. To show that φ\varphi has no other local minimum, we note that any point in 𝒯ζ{\mathcal{T}}_{\zeta} other than qq_{*} is a nontrivial causal curve γ~\widetilde{\gamma} with distinct endpoints in MM. In general, if the initial point of a causal curve γ~\widetilde{\gamma} is specified (here qq) and the final point of the curve is constrained by a condition of “promptness” (here γ~\widetilde{\gamma} is supposed to end along ζqp\zeta_{qp}, and the condition for it to be a local minimum of φ\varphi means that it arrives on ζqp\zeta_{qp} sooner than any nearby causal curve from qq; this is the promptness condition), then γ~\widetilde{\gamma} must be a null geodesic without focal points. However, in an asymptotically AdS spacetime, any null geodesic between distinct points on the conformal boundary is complete in both directions. As remarked earlier, we can assume that the null energy condition in XX is satisfied along γ~\widetilde{\gamma} as a strict inequality. A null geodesic that is complete in both directions and along which the null energy condition is satisfied as a strict inequality always has focal points. So the function φ\varphi can have no local minimum other than its absolute minimum qq_{*}, completing the proof.

4 An Algebra of Observables For De Sitter Space

Finally, we turn to a concrete example in which, in order to define a sensible algebra of observables, it is necessary to include an observer in the description CLPW . De Sitter space in 𝖣{\sf D} dimensions or dS𝖣{\mathrm{dS}}_{\sf D} is the maximally symmetric solution of Einstein’s equations with a positive cosmological constant. It can be described by the metric

ds2=dt2+R2cosh2(t/R)dΩ2,{\mathrm{d}}s^{2}=-{\mathrm{d}}t^{2}+R^{2}\cosh^{2}(t/R){\mathrm{d}}\Omega^{2}, (12)

where RR is the radius of curvature and dΩ2{\mathrm{d}}\Omega^{2} is the metric of a round sphere S𝖣1S^{{\sf D}-1} of unit radius. This sphere is compact, so dS𝖣{\mathrm{dS}}_{\sf D} is an example of a closed universe. At time tt, the sphere has radius R(t)=Rcosh(t/R),R(t)=R\cosh(t/R), so it grows exponentially for t+t\to+\infty or tt\to-\infty. The exponential growth for t+t\to+\infty is believed to be a good approximation to what is currently beginning to happen in the real world.

In the 1970’s, Gibbons and Hawking GH studied de Sitter space as a simple example of a spacetime with a cosmological horizon – in which an observer cannot see the whole universe. They attached a temperature and entropy to the de Sitter horizon, as Bekenstein and Hawking had done not long before for the horizon of a black hole. The thermal interpretation is most obvious in Euclidean signature, where dS𝖣{\mathrm{dS}}_{\sf D} becomes simply a 𝖣{\sf D}-sphere S𝖣S^{\sf D}, with metric

dsE2=dτ2+R2cos2(τ/R)dΩ2.{\mathrm{d}}s^{2}_{E}={\mathrm{d}}\tau^{2}+R^{2}\cos^{2}(\tau/R){\mathrm{d}}\Omega^{2}. (13)

In ordinary quantum field theory in de Sitter space (and also in the presence of semiclassical gravity), there is a natural de Sitter state ΨdS\Psi_{\mathrm{dS}} such that correlation functions in this state can be obtained by analytic continuation from Euclidean signature. Let CC be a great circle in S𝖣S^{{\sf D}} and let λ\lambda be a length parameter along CC. CC is an orbit of a U(1){\mathrm{U}}(1) symmetry of S𝖣S^{\sf D} (which is uniquely determined if we say that its fixed point set is a copy of S𝖣2S^{{\sf D}-2}). If we normalize the generator HH of this U(1){\mathrm{U}}(1) to act as /λ\partial/\partial\lambda along CC, then HH obeys exp(2πRH)=1\exp(-2\pi RH)=1. When continued to Lorentz signature, this leads to the striking statement that correlation functions in the state ΨdS\Psi_{\mathrm{dS}} have a thermal interpretation at the de Sitter temperature TdS=1/βdST_{\mathrm{dS}}=1/\beta_{\mathrm{dS}}, where βdS=2πR\beta_{\mathrm{dS}}=2\pi R GH ; FHN . The thermal interpretation of de Sitter space has been extensively explored for nearly half a century. For a small sampling of the relevant literature, see Sewell ; Maeda ; BoussoOne ; BoussoTwo ; Banks ; BanksFischler ; BanksOne ; BanksTwo ; BFTwo ; SusskindA ; Susskind ; DF ; BD ; SB ; DST .

Refer to caption
Figure 8: The Penrose diagram of de Sitter space. Past infinity is at the bottom; future infinity is at the top. The left and right boundaries represent timelike geodesics that can be viewed as the trajectories of observers who remains at rest on the south or north pole of S𝖣1S^{{\sf D}-1}, respectively. The diagonals represent the past and future horizons of those observers; the unshaded triangles on the left and right are the static patches that are causally accessible to these observers.

In Lorentz signature, de Sitter space is conveniently understood via a Penrose diagram (fig. 8). The great circle CC continues in Lorentz signature to a hyperbola that has two components, each of them a geodesic. Coordinates can be chosen so that these geodesics make up the left and right boundaries of the Penrose diagram. Given an observer traveling on a geodesic in dS𝖣{\mathrm{dS}}_{\sf D}, we can assume that the worldline of this observer is, say, the left boundary of the diagram. The observer then has past and future horizons which are the diagonals in the picture. The region causally accessible to the observer (the region the observer can see and also can influence) is bounded by these diagonals, along with the left boundary of the diagram. A similar region of the diagram is causally accessible to an observer on the right boundary. The operator HH generates a symmetry of the Penrose diagram; with a suitable choice of sign, it maps the region accessible to the left observer forward in time and the region accessible to the right observer backwards in time. Near the bifurcate horizon where the two diagonals meet, HH looks like the generator of a Lorentz boost.

If tt measures the time along the left boundary, then on the left boundary of the figure, H=/tH=\partial/\partial t. So it is natural for an observer whose worldline is the left boundary to interpret HH as a generator of time translations. With this interpretation, since HH is a symmetry of the region causally accessible to the left observer, this region is time-independent and thus “static.” This is why the region causally accessible to an observer in de Sitter space has been called a “static patch.” However, this view of de Sitter space as being “static” is highly observer-dependent. In a global view, the form (12) of the de Sitter metric shows that dS𝖣{\mathrm{dS}}_{\sf D} is expanding exponentially both toward the future and toward the past, so globally one would definitely not call dS𝖣{\mathrm{dS}}_{\sf D} “static.”

An ordinary quantum field theory in de Sitter space has a Hilbert space 0{\mathcal{H}}_{0} of quantum states. In such a theory, we associate to the static patch (or any region) an algebra 𝒜0{\mathcal{A}}_{0} of observables consisting of operators on 0{\mathcal{H}}_{0} that act on the quantum fields in the region in question. The algebra of any local region, and in particular the algebra 𝒜0{\mathcal{A}}_{0} of the static patch, is a possibly unfamiliar Type III von Neumann algebra. This is an algebra with an infinite amount of quantum entanglement built in, giving an abstract explanation of the fact that entanglement entropy is ultraviolet divergent in quantum field theory. Including weakly coupled gravitational fluctuations does not qualitatively change the picture. We simply include the weakly coupled gravitational field as one more field in the construction of 0{\mathcal{H}}_{0} and 𝒜0{\mathcal{A}}_{0}. What does really change the picture is that in a closed universe, such as de Sitter space, the isometries have to be treated as constraints. In the case of the static patch, the important constraint is the Hamiltonian HH. Imposing HH as a constraint means that we should replace 𝒜0{\mathcal{A}}_{0} by 𝒜0H{\mathcal{A}}_{0}^{H}, its invariant subalgebra. But that does not work: the invariant subalgebra is trivial. Roughly, that is because anything that commutes with HH can be averaged over all the thermal fluctuations and replaced by its thermal average, a cc-number. A technical statement is that there are no nontrivial invariants in 𝒜0{\mathcal{A}}_{0} because HH generates the modular automorphism group of the state ΨdS\Psi_{\mathrm{dS}} for the algebra 𝒜0{\mathcal{A}}_{0} Sewell , and therefore acts ergodically.

To get a reasonable algebra of observables, we include an observer in the analysis. Of course, as noted in the introduction, in principle an observer should really be described by the theory, not injected from outside. What it means to include an observer is that we consider a “code subspace” of states in which an observer is present in the static patch, and then we consider operators that can be defined in the low energy effective field theory in this code subspace, though they are not well-defined on the whole Hilbert space.

Should we be surprised that we need to include the observer in the analysis to get a sensible answer? As was also noted in the introduction, a gravitating system in a closed universe is the situation in which we are most likely to need to explicitly incorporate the observer in the analysis. That is exactly the situation here because de Sitter space is a simple model of a closed universe, that is, a universe with compact spatial sections.

Once we include an observer in the analysis, there is a rationale to study a particular static patch, namely the region that is causally accessible to that given observer. Moreover, the timelike tube theorem tells us that the algebra 𝒜0{\mathcal{A}}_{0} of observables in the causally accessible region can be interpreted as the algebra generated by the quantum fields along the observer’s worldline. Thus the algebra 𝒜0{\mathcal{A}}_{0} of the static patch becomes, in a sense, operationally meaningful once the observer is included.

It turns out that to get a sensible answer, it suffices to consider a minimal model in which the observer is characterized just by a clock with Hamiltonian

Hobs=q.H_{\mathrm{obs}}=q. (14)

It is physically reasonable to assume that the observer’s energy is bounded below. The precise lower bound will not be important and we will just take it171717More realistically, the observer would have a mass mm and minimum energy mc2mc^{2}. If the observer is minimally coupled to gravity, then the observer worldline will be a geodesic. In the presence of an observer, the relevant static patch is the region causally accessible to the observer. to be 0. With that choice, the effect of including the observer is to modify the Hilbert space by

01=0L2(+),{\mathcal{H}}_{0}\to{\mathcal{H}}_{1}={\mathcal{H}}_{0}\otimes L^{2}({\mathbb{R}}_{+}), (15)

where qq is a multiplication operator on the positive half-line +{\mathbb{R}}_{+}. The algebra is likewise extended from 𝒜0{\mathcal{A}}_{0} to

𝒜1=𝒜0B(L2(+)).{\mathcal{A}}_{1}={\mathcal{A}}_{0}\otimes B(L^{2}({\mathbb{R}}_{+})). (16)

The last factor is the (Type I) algebra of all bounded operators on L2(+)L^{2}({\mathbb{R}}_{+}).

Finally the constraint becomes the total Hamiltonian of the quantum fields plus the observer:

HH^=H+Hobs.H\to\widehat{H}=H+H_{\mathrm{obs}}. (17)

The “correct” algebra of observables taking account of the presence of the observer is therefore

𝒜=𝒜1H^,{\mathcal{A}}={\mathcal{A}}_{1}^{\widehat{H}}, (18)

that is, the H^\widehat{H}-invariant part of 𝒜1{\mathcal{A}}_{1}. To be more exact, this is the algebra of observables accessible to the observer in the limit GN0G_{N}\to 0. In higher order in GNG_{N}, it will be necessary to incorporate direct couplings between the observer and the quantum fields; the constraint operator will not be a simple sum. Of course, the notion of the algebra of a spacetime region is presumably only well-defined in the limit GN0G_{N}\to 0; the timelike tube theorem suggests that in going beyond that limit, we should reinterpret 𝒜{\mathcal{A}} as the algebra of observables along the observer’s worldline. As explained presently, the important conclusions that we will draw about 𝒜{\mathcal{A}} are expected to be robust against perturbative corrections in GNG_{N}.

The answer (18) makes sense, unlike the previous one. The reason is that once an observer is present, we can “gravitationally dress” any operator to the observer’s world-line. It is easiest to explain this if we momentarily ignore the lower bound q0q\geq 0. In fact, in the field of a black hole, there is a problem that is mathematically quite similar to what we are discussing here, but without the lower bound on qq CLPW . If we do ignore the lower bound on qq, then the Hilbert space including the observer is 1=0L2(){\mathcal{H}}_{1}^{\prime}={\mathcal{H}}_{0}\otimes L^{2}({\mathbb{R}}), and the algebra is 𝒜1=𝒜0B(L2()){\mathcal{A}}_{1}^{\prime}={\mathcal{A}}_{0}\otimes B(L^{2}({\mathbb{R}})). Here B(L2())B(L^{2}({\mathbb{R}})) is generated by (bounded functions of) qq and p=iddqp=-{\mathrm{i}}\frac{{\mathrm{d}}}{{\mathrm{d}}q}. If Θ\Theta is the projection operator onto states with q0q\geq 0, then 1=Θ1{\mathcal{H}}_{1}=\Theta{\mathcal{H}}_{1}^{\prime}, 𝒜1=Θ𝒜1Θ{\mathcal{A}}_{1}=\Theta{\mathcal{A}}_{1}^{\prime}\Theta. Similarly the H^\widehat{H}-invariant subalgebras 𝒜=𝒜1H^{\mathcal{A}}={\mathcal{A}}_{1}^{\widehat{H}}, 𝒜=𝒜1H^{\mathcal{A}}^{\prime}={\mathcal{A}}_{1}^{\prime}{}^{\widehat{H}} are related by 𝒜=Θ𝒜Θ{\mathcal{A}}=\Theta{\mathcal{A}}^{\prime}\Theta. So 𝒜{\mathcal{A}} is easily constructed once we understand 𝒜{\mathcal{A}}^{\prime}.

To construct 𝒜{\mathcal{A}}^{\prime}, we reason as follows. For any 𝖺𝒜0{\sf a}\in{\mathcal{A}}_{0}, the operator

𝖺^=eipH𝖺eipH\widehat{\sf a}=e^{{\mathrm{i}}pH}{\sf a}e^{-{\mathrm{i}}pH}

commutes with the constraint H^=H+q\widehat{H}=H+q. One more operator that commutes with the constraint is qq itself (or equivalently H-H, which equals qq modulo the constraint). It follows from a classic result of Takesaki Takesaki that by coincidence was proved in the early days of black hole thermodynamics that (1) there are no more operators in 𝒜1{\mathcal{A}}_{1}^{\prime} that commute with the constraint, and (2) the algebra 𝒜{\mathcal{A}}^{\prime} that is generated by the operators 𝖺^\widehat{\sf a} and qq is actually a von Neumann algebra of Type II, with a trivial center (that is, its center consists only of cc-numbers). In the case of the static patch in de Sitter space, we want to impose the constraint q0q\geq 0, so the appropriate algebra is not 𝒜{\mathcal{A}}^{\prime} but 𝒜=Θ𝒜Θ{\mathcal{A}}=\Theta{\mathcal{A}}\Theta. This is an algebra of Type II1, again with trivial center.

A basic introduction to the relevant facts about von Neumann algebras can be found in CLPW , among other places. For much more depth, see Sorce . An important fact for our purposes is that a Type II algebra 𝒜{\mathcal{A}} (unlike one of Type III, which we would have in the absence of gravity) has a trace, that is a complex-valued linear function obeying Tr𝖺𝖻=Tr𝖻𝖺{\mathrm{Tr}}\,{\sf a}{\sf b}={\mathrm{Tr}}\,{\sf b}{\sf a}, for all 𝖺,𝖻𝒜{\sf a},{\sf b}\in{\mathcal{A}}. Moreover, the trace in a Type II algebra is positive, in the sense that Tr𝖺𝖺>0{\mathrm{Tr}}\,{\sf a}{\sf a}^{\dagger}>0 for all 𝖺𝒜{\sf a}\in{\mathcal{A}}.

A von Neumann algebra with trivial center is called a factor. Thus the above analysis implies that in the limit GN=0G_{N}=0, the algebra of observables in the static patch is a factor of Type II1. A factor is the von Neumann algebra analog of a simple Lie group. A simple Lie group is rigid, in the sense that it has has no infinitesimal deformations. That is not true for a non-simple Lie group; for example, the symmetry group of n{\mathbb{R}}^{n} is non-simple, and can be deformed, by an arbitrarily small perturbation of the commutation relations in its Lie algebra, to the symmetry group SO(n+1){\mathrm{SO}}(n+1) of SnS^{n}. A similar statement holds for von Neumann algebras: an algebra with a non-trivial center can potentially be deformed to an algebra of a different type, while making the center smaller,181818For example, the Type III algebra described by Leutheusser and Liu in the GN0G_{N}\to 0 limit in the field of a black hole LL ; LLtwo has a nontrivial center and is modified by perturbative corrections to a factor of Type II CLPW ; Witt . but a factor is rigid and has no infinitesimal deformations. Though we have only analyzed the algebra of observables in the static patch in the limit GN0G_{N}\to 0, because the answer that we obtained is a factor, perturbative corrections in GNG_{N} are not expected to modify the algebra up to isomorphism (they will modify the commutation relations among operators along the observer’s worldline, but not the isomorphism class of the algebra that those operators generate). Nonperturbatively, matters are unclear and indeed it is quite unclear whether quantum de Sitter space makes sense nonperturbatively. If quantum de Sitter space does make sense nonperturbatively, then one expects to describe it by a finite-dimensional Hilbert space BanksOne ; BanksTwo , and the algebra will have to be of Type I.

A Type II algebra does not have pure states, but because it does have a trace, familiar ideas like density matrices and entropies make sense for a state of such an algebra Segal ; LW . To a global state Ψ\Psi of de Sitter space plus the observer, reduced to the static patch, one can associate a density matrix ρ𝒜\rho\in{\mathcal{A}}, characterized by

Ψ|𝖺|Ψ=Tr𝖺ρ,𝖺𝒜.\langle\Psi|{\sf a}|\Psi\rangle={\mathrm{Tr}}\,{\sf a}\rho,~{}~{}\forall{\sf a}\in{\mathcal{A}}. (19)

Therefore, we can define a von Neumann entropy

S(ρ)=Trρlogρ.S(\rho)=-{\mathrm{Tr}}\,\rho\log\rho. (20)

There is no such definition in the absence of gravity, because without gravity, the observables in the static patch constitute the Type III algebra 𝒜0{\mathcal{A}}_{0}. The fact that gravity turns the Type III algebra into a Type II algebra gives an abstract explanation for why entropy is better defined in the presence of gravity than in ordinary quantum field theory. However, from a physical point of view, Type II entropy is a renormalized entropy from which an infinite constant has been subtracted.

As already remarked, if we put no constraint on the observer’s energy, we get an algebra of Type II; if we assume the observer’s energy is bounded below, we get an algebra of Type II1. In a Type II1 algebra, the trace is defined for all elements of the algebra, while in an algebra of Type II, the trace is finite only for a dense subset of elements of the algebra. In particular, for Type II1, the identity element has a finite trace and we can normalize the trace so that the trace of the identity is 1, while in Type II, the trace of the identity element is ++\infty.

For our purposes, the important difference is that a Type II1 algebra has a state of maximum possible entropy, which is the “maximally mixed” state with density matrix ρ=1\rho=1. This is consistent with Trρ=1{\mathrm{Tr}}\,\rho=1, assuming that the trace has been appropriately normalized. Evaluating S(ρ)=TrρlogρS(\rho)=-{\mathrm{Tr}}\,\rho\log\rho for ρ=1\rho=1, we see that the state with ρ=1\rho=1 has entropy 0. It is not difficult to prove that all other states have negative entropy (for example, see CLPW ; LW ). This tells us the meaning of the subtraction that is involved in defining the entropy of a state of a Type II1 algebra so that S(ρ)=0S(\rho)=0 for ρ=1\rho=1: the constant that is subtracted is the maximum possible entropy. In general, entropy of a state of a Type II1 algebra is the entropy difference of that state relative to the entropy of the maximum entropy state. The maximum entropy state with ρ=1\rho=1 is the Type II1 analog of a maximally mixed state in ordinary quantum mechanics, in which the density matrix is a multiple of the identity. By contrast, there is no upper bound on the entropy of a state of a Type II algebra.

It is felicitous that a Type II1 algebra has a state of maximum entropy, because in fact de Sitter space is believed to have a state of maximum entropy – namely “empty de Sitter space,” with all the entropy in the cosmological horizon Maeda ; BoussoOne ; BoussoTwo . So to get a reasonable model of de Sitter space, it is important to assume that the observer’s energy is bounded below, which is a more reasonable assumption anyway, thereby ensuring that the algebra of observables is of Type II1, not Type II. One can explicitly construct a state ΨmaxdS\Psi_{\mathrm{max}}\in{\mathcal{H}}_{\mathrm{dS}} that has maximum entropy (and thus density matrix ρ=1\rho=1) when reduced to the static patch. In fact, one can pick Ψmax=ΨdSeβdSq/2βdS\Psi_{\mathrm{max}}=\Psi_{\mathrm{dS}}\otimes e^{-\beta_{{\mathrm{dS}}}q/2}\sqrt{\beta_{\mathrm{dS}}}. Thus empty de Sitter space, tensored with a state in which the observer’s energy has a thermal distribution at the de Sitter temperature, has maximum entropy from the point of view of the Type II1 algebra. In this sense the analysis based on the Type II1 algebra agrees with the claim that empty de Sitter space has maximum entropy.

We can now compare with some further claims in the previous literature. First of all, since the maximum entropy state has ρ=1\rho=1, it has a “flat entanglement spectrum” (all eigenvalues of the density matrix are equal) and accordingly the Rényi entropies all vanish:

Sα(ρ)=11αlogTrρα=0.S_{\alpha}(\rho)=\frac{1}{1-\alpha}\log\,{\mathrm{Tr}}\,\rho^{\alpha}=0. (21)

This matches with a result that has been found using Euclidean path integrals to analyze the Rényi entropies of the static patch DST . Given the assertion that de Sitter space has a state of maximum entropy, this result is what one should expect. In ordinary quantum mechanics, the maximum entropy state of a system is “maximally mixed,” with a “flat entanglement spectrum” (the density matrix is a multiple of the identity and all its eigenvalues are equal) and its Rényi entropies are independent of α\alpha.

Now, suppose that the observer makes a measurement with two outcomes that correspond to the projection operators Π\Pi and 1Π1-\Pi. The probabilities of the two outcomes are TrΠ{\mathrm{Tr}}\,\Pi and Tr(1Π)=1TrΠ{\mathrm{Tr}}\,(1-\Pi)=1-{\mathrm{Tr}}\,\Pi. All values 0TrΠ10\leq{\mathrm{Tr}}\,\Pi\leq 1 are possible. If the outcome corresponding to Π\Pi is observed, then after this measurement, the density matrix is

σ=1TrΠΠ.\sigma=\frac{1}{{\mathrm{Tr}}\,\Pi}\Pi.

Since the two eigenvalues of σ\sigma are 0 and 1/TrΠ1/{\mathrm{Tr}}\,\Pi, one has σlogσ=σlog(1/TrΠ)\sigma\log\sigma=\sigma\log(1/{\mathrm{Tr}}\,\Pi) so the entropy after the observation is

S(σ)=Trσlogσ=log(1/TrΠ).S(\sigma)=-{\mathrm{Tr}}\,\sigma\log\sigma=-\log(1/{\mathrm{Tr}}\,\Pi).

The entropy reduction from knowing the outcome is therefore ΔS=log(1/TrΠ)\Delta S=\log(1/{\mathrm{Tr}}\,\Pi), and this is related to the probability p=TrΠp={\mathrm{Tr}}\,\Pi of the given outcome by

p=eΔS.p=e^{-\Delta S}.

However, the probability of a (low entropy) energy EE fluctuation of the static patch is

p=eβdSE,p=e^{-\beta_{\mathrm{dS}}E},

according to the thermal interpretation of de Sitter space. Since also p=eΔSp=e^{-\Delta S}, we must have for consistency of the two descriptions

eβdSE=eΔS.e^{-\beta_{\mathrm{dS}}E}=e^{-\Delta S}.

In other words, “‘thermal” suppression of a fluctuation can be understood as purely entropic suppression. This is surprising, but it has been argued before on other grounds, notably by considering the case that the “fluctuation” is a small black hole at the center of the static patch Susskind .

Which part of this is unexpected? The formula p=eΔSp=e^{-\Delta S} for the probability of an outcome is an inevitable consequence of having a maximum entropy state in which all states are equally probable. In other words, if all states are equally likely, then the probability of a given outcome is just proportional to the number of microstates that are compatible with that outcome. Here we use language appropriate for an ordinary quantum system with a finite-dimensional Hilbert space. For a system described by a Type II1 algebra, the number of microstates compatible with any given outcome is infinite, and one has to express the argument in terms of traces, as we did earlier. In short, the surprise is not that p=eΔSp=e^{-\Delta S}, which one should expect for a maximum entropy state, but that after coupling to gravity and including the observer, the thermal state ΨdS\Psi_{\mathrm{dS}} can be promoted to a maximal entropy state Ψmax\Psi_{\mathrm{max}}.

Let us see explicitly how this happens at the level of correlation functions.191919An error in a previous claim about this matter was pointed out by G. Penington. First we recall some basic facts about time-dependent correlation functions in a thermal ensemble. The time dependence of an operator is defined in the usual way by 𝖺(t)=eiHt𝖺eiHt{\sf a}(t)=e^{{\mathrm{i}}Ht}{\sf a}e^{-{\mathrm{i}}Ht}. A typical time-dependent two-point function is

F(t)=𝖺(t)𝖻(0)β=1ZtreβH𝖺(t)𝖻(0)=1ZtreβHeiHt𝖺eiHt𝖻.F(t)=\langle{\sf a}(t){\sf b}(0)\rangle_{\beta}=\frac{1}{Z}{\mathrm{tr}}\,e^{-\beta H}{\sf a}(t){\sf b}(0)=\frac{1}{Z}{\mathrm{tr}}\,e^{-\beta H}e^{{\mathrm{i}}Ht}{\sf a}e^{-{\mathrm{i}}Ht}{\sf b}. (22)

Here tr{\mathrm{tr}} is the trace in the Hilbert space of a thermal system with Hamiltonian HH, and ZZ is the partition function. It follows immediately from the definition that F(t)F(t) is holomorphic in a strip 0Imtβ0\geq\,{\mathrm{Im}}\,t\geq-\beta and moreover that the boundary value at Imt=β{\mathrm{Im}}\,t=\-\beta is the thermal correlator with the opposite ordering of the operators. In other words, let

G(t)=𝖻(0)𝖺(t)β=1ZtreβH𝖻eiHt𝖺eiHt.G(t)=\langle{\sf b}(0){\sf a}(t)\rangle_{\beta}=\frac{1}{Z}{\mathrm{tr}}\,e^{-\beta H}{\sf b}e^{{\mathrm{i}}Ht}{\sf a}e^{-{\mathrm{i}}Ht}. (23)

This function is related to FF by

G(t)=F(tiβ).G(t)=F(t-{\mathrm{i}}\beta). (24)

The precise meaning of this statement is that there is a function holomorphic in the strip 0Imtβ0\geq{\mathrm{Im}}\,t\geq-\beta whose boundary value on the upper boundary is FF, while its boundary value on the lower boundary is GG.

Let us express this in terms of Fourier transforms of the two functions. Suppose that

F(t)\displaystyle F(t) =dω2πeiωtf(ω)\displaystyle=\int_{-\infty}^{\infty}\frac{{\mathrm{d}}\omega}{2\pi}e^{-{\mathrm{i}}\omega t}f(\omega) (25)
G(t)\displaystyle G(t) =dω2πeiωtg(ω).\displaystyle=\int_{-\infty}^{\infty}\frac{{\mathrm{d}}\omega}{2\pi}e^{-{\mathrm{i}}\omega t}g(\omega). (26)

Then eqn. (24) becomes

g(ω)=eβωf(ω).g(\omega)=e^{-\beta\omega}f(\omega). (27)

These facts about an ordinary quantum system also hold in de Sitter space for time-dependent correlation functions in the state ΨdS\Psi_{\mathrm{dS}}, though more sophisticated proofs are required. To make precisely the same argument for thermal correlators in the static patch of de Sitter space that one can make for an ordinary thermal system, one should have a Hilbert space that describes the static patch, and tr{\mathrm{tr}} should be the trace in this Hilbert space. Such a “one-sided” Hilbert space does not exist for quantum fields in de Sitter space, because the algebra of the static patch is of Type III, not Type I. Instead one only has a global Hilbert space describing all of de Sitter space. To deal with this situation requires more careful arguments using Tomita-Takesaki theory Sewell .

What happens after coupling to gravity? As described earlier, we introduce an observer with energy qq, and canonical momentum p=id/dqp=-{\mathrm{i}}{\mathrm{d}}/{\mathrm{d}}q. Let Θ(x)\Theta(x) be the function that is 1 for x0x\geq 0 and vanishes for x<0x<0, so that Θ=Θ(q)\Theta=\Theta(q) is the projection operator onto states of q0q\geq 0. To get a model of de Sitter space with weakly coupled gravity, we replace operators 𝖺{\sf a}, 𝖻{\sf b} of the original thermal system by gravitationally dressed versions

𝖺^=ΘeipH𝖺eipHΘ,𝖻^=ΘeipH𝖻eipHΘ.\widehat{\sf a}=\Theta e^{{\mathrm{i}}pH}{\sf a}e^{-{\mathrm{i}}pH}\Theta,~{}~{}~{}\widehat{\sf b}=\Theta e^{{\mathrm{i}}pH}{\sf b}e^{-{\mathrm{i}}pH}\Theta. (28)

If we do not include the projection operators Θ\Theta, the algebra would be of Type II, with no maximum entropy state. Including the projection operators gives a Type II1 algebra with a maximum entropy state. As claimed earlier, this state is

Ψmax=ΨdSΨβ(q),\Psi_{\mathrm{max}}=\Psi_{\mathrm{dS}}\otimes\Psi_{\beta}(q), (29)

where ΨdS\Psi_{\mathrm{dS}} is the natural de Sitter invariant state in the absence of gravity, and

Ψβ(q)={eβq/2βq00q<0\Psi_{\beta}(q)=\begin{cases}e^{-\beta q/2}\sqrt{\beta}&q\geq 0\cr 0&q<0\end{cases} (30)

is a state in which the observer’s energy has a thermal distribution. Time dependence is introduced in the usual way, by, for example, 𝖺^(t)=eiHt𝖺^eiHt\widehat{\sf a}(t)=e^{{\mathrm{i}}Ht}\widehat{\sf a}e^{-{\mathrm{i}}Ht}.

In the limit that gravity is weakly coupled, the de Sitter analogs of F(t)F(t), G(t)G(t) are supposed to be

F^(t)\displaystyle\widehat{F}(t) =Ψmax|𝖺^(t)𝖻^|Ψmax=Ψmax|𝖺eiHteipHΘ(q)eipH𝖻|Ψmax\displaystyle=\langle\Psi_{\mathrm{max}}|\widehat{\sf a}(t)\widehat{\sf b}|\Psi_{\mathrm{max}}\rangle=\langle\Psi_{\mathrm{max}}|{\sf a}e^{-{\mathrm{i}}Ht}e^{-{\mathrm{i}}pH}\Theta(q)e^{{\mathrm{i}}pH}{\sf b}|\Psi_{\mathrm{max}}\rangle (31)
G^(t)\displaystyle\widehat{G}(t) =Ψmax|𝖻^𝖺^(t)|Ψmax=Ψmax|𝖻eipHΘ(q)eipHeiHt𝖺|Ψmax.\displaystyle=\langle\Psi_{\mathrm{max}}|\widehat{\sf b}\widehat{\sf a}(t)|\Psi_{\mathrm{max}}\rangle=\langle\Psi_{\mathrm{max}}|{\sf b}e^{-{\mathrm{i}}pH}\Theta(q)e^{{\mathrm{i}}pH}e^{{\mathrm{i}}Ht}{\sf a}|\Psi_{\mathrm{max}}\rangle. (32)

We have used the facts that HΨmax=0H\Psi_{\mathrm{max}}=0, and ΘΨmax=Ψmax\Theta\Psi_{\mathrm{max}}=\Psi_{\mathrm{max}}. We observe now that

eipHΘ(q)eipH=Θ(qH)e^{-{\mathrm{i}}pH}\Theta(q)e^{{\mathrm{i}}pH}=\Theta(q-H) (33)

so eqn. (31) simplifies to

F^(t)\displaystyle\widehat{F}(t) =Ψmax|𝖺eiHtΘ(qH)𝖻|Ψmax\displaystyle=\langle\Psi_{\mathrm{max}}|{\sf a}e^{-{\mathrm{i}}Ht}\Theta(q-H){\sf b}|\Psi_{\mathrm{max}}\rangle (34)
G^(t)\displaystyle\widehat{G}(t) =Ψmax|𝖻Θ(qH)eiHt𝖺|Ψmax.\displaystyle=\langle\Psi_{\mathrm{max}}|{\sf b}\Theta(q-H)e^{{\mathrm{i}}Ht}{\sf a}|\Psi_{\mathrm{max}}\rangle. (35)

We expect F^(t)=G^(t)\widehat{F}(t)=\widehat{G}(t), since either of these functions is supposed to be Tr𝖺^(t)𝖻{\mathrm{Tr}}\,\widehat{\sf a}(t){\sf b}, where Tr{\mathrm{Tr}} is the trace of the Type II1 algebra (as opposed to the trace in the Hilbert space of an underlying thermal system, which has been denoted tr{\mathrm{tr}} – and which anyway does not really exist in the case of de Sitter space).

Now let us write a Fourier-transformed version of these last two equations. In the formula for F^(t)\widehat{F}(t), we see that a contribution that varies with tt as eiωte^{-{\mathrm{i}}\omega t} comes from intermediate states with H=ωH=\omega, but in the formula for G^(t)\widehat{G}(t), such a contribution comes from states with H=ωH=-\omega. The upshot of this is that Fourier-transformed formulas can be written for F^\widehat{F} and G^\widehat{G} that are just analogous to those of eqn. (25), but with an extra factor involving the expectation value in the state Ψβ\Psi_{\beta} of Θ(qω)\Theta(q\mp\omega):

F^(t)\displaystyle\widehat{F}(t) =dω2πeiωtf(ω)Ψβ|Θ(qω)|Ψβ\displaystyle=\int_{-\infty}^{\infty}\frac{{\mathrm{d}}\omega}{2\pi}e^{-{\mathrm{i}}\omega t}f(\omega)\langle\Psi_{\beta}|\Theta(q-\omega)|\Psi_{\beta}\rangle (36)
G^(t)\displaystyle\widehat{G}(t) =dω2πeiωtg(ω)Ψβ|Θ(q+ω)|Ψβ.\displaystyle=\int_{-\infty}^{\infty}\frac{{\mathrm{d}}\omega}{2\pi}e^{-{\mathrm{i}}\omega t}g(\omega)\langle\Psi_{\beta}|\Theta(q+\omega)|\Psi_{\beta}\rangle. (37)

Using the definition of Ψβ\Psi_{\beta}, we can find explicit formulas for uβ(ω)=Ψβ|Θ(qω)|Ψβu_{\beta}(\omega)=\langle\Psi_{\beta}|\Theta(q-\omega)|\Psi_{\beta}\rangle and vβ(ω)=Ψβ|Θ(q+ω)|Ψβv_{\beta}(\omega)=\langle\Psi_{\beta}|\Theta(q+\omega)|\Psi_{\beta}\rangle:

uβ(ω)\displaystyle u_{\beta}(\omega) ={eβωω01ω<0\displaystyle=\begin{cases}e^{-\beta\omega}&\omega\geq 0\cr 1&\omega<0\end{cases} (38)
vβ(ω)\displaystyle v_{\beta}(\omega) ={1ω0eβωω<0.\displaystyle=\begin{cases}1&\omega\geq 0\cr e^{\beta\omega}&\omega<0.\end{cases} (39)

Note that

uβ(ω)vβ(ω)1=eβω.u_{\beta}(\omega)v_{\beta}(\omega)^{-1}=e^{-\beta\omega}. (40)

We have then

F^(t)\displaystyle\widehat{F}(t) =dω2πeiωtf^(ω)\displaystyle=\int_{-\infty}^{\infty}\frac{{\mathrm{d}}\omega}{2\pi}e^{-{\mathrm{i}}\omega t}\widehat{f}(\omega) (41)
G^(t)\displaystyle\widehat{G}(t) =dω2πeiωtg^(ω)\displaystyle=\int_{-\infty}^{\infty}\frac{{\mathrm{d}}\omega}{2\pi}e^{-{\mathrm{i}}\omega t}\widehat{g}(\omega) (42)

with

f^(ω)=uβ(ω)f(ω),g^(ω)=vβ(ω)g(ω).\widehat{f}(\omega)=u_{\beta}(\omega)f(\omega),\hskip 28.45274pt\widehat{g}(\omega)=v_{\beta}(\omega)g(\omega). (43)

So using eqns. (27) and (40), we have

f^(ω)=uβ(ω)eβωg(ω)=uβ(ω)vβ(ω)1eβωg^(ω)=g^(ω),\widehat{f}(\omega)=u_{\beta}(\omega)e^{\beta\omega}g(\omega)=u_{\beta}(\omega)v_{\beta}(\omega)^{-1}e^{\beta\omega}\widehat{g}(\omega)=\widehat{g}(\omega), (44)

implying that F^(t)=G^(t)\widehat{F}(t)=\widehat{G}(t). This confirms that the coupling to gravity has converted the thermal expectation value of an operator into a trace, and moreover that this trace is the expectation value in the maximum entropy state Ψmax\Psi_{\mathrm{max}}.

In sum, we have identified a concrete example of including an observer in order to get a sensible answer in a cosmological model with a closed universe. And, at least in the example of de Sitter space, we have understood that gravity makes the notion of entropy better defined than it is in ordinary quantum field theory. It is possible CLPW to probe more deeply and show that entropy defined via the Type II1 algebra agrees, up to an additive constant independent of the state, with the generalized gravitational entropy, as usually computed via quantum extremal surfaces. It is also possible to give an analogous treatment of a black hole CLPW ; Witt ; CPW , involving in this case an algebra of Type II and therefore no upper bound on the entropy.


Acknowledgements I thank V. Hubeny, A. Jaffe, J. Kohn, H. Maxfield, R. Mazzeo, G. Penington, and A. Strohmaier for comments and advice. Research supported in part by NSF Grant PHY-2207584.

References

  • [1] V. Chandrasekharan, R. Longo, G. Penington, and E. Witten, “An Algebra of Observables for De Sitter Space,” arXiv:2206.10790.
  • [2] H. J. Borchers, “Field Operators as {\mathbb{C}}^{\infty} Functions In Spacelike Directions,” Il Nuovo Cimento 33 (1964) 1.
  • [3] H. J. Borchers, “Uber die Vollständigkeit lorentzinvarianter Felder in einer zeitartigen Röhre,” Il Nuovo Cimento 19 (1961) 787.
  • [4] H. Araki, “A Generalization Of Borchers’ Theorem,” Helv. Phys. Acta 36 (1963) 132-9.
  • [5] A. Strohmaier, “On the Local Structure of the Klein-Gordon Field on Curved Spacetimes,” Lett. Math. Phys. 54 (2000) 249-61.
  • [6] A. Strohmaier and E. Witten, “Analytic States in Quantum Field Theory on Curved Spacetimes,” arXiv:2302.02709.
  • [7] A. Strohmaier and E. WItten, “The Timelike Tube Theorem in Curved Spacetime,” arXiv:2303.16380.
  • [8] W. G. Unruh, “Notes on Black Hole Evaporation,” Phys. Rev. D14 (1976) 870.
  • [9] A. Jaffe, “Wick Polynomials at a Fixed Time,” J. Math. Phys. 7 (1966) 1250.
  • [10] H. Bostelman and C. J. Fewster, “Quantum Inequalities From Operator Product Expansions,” Commun. Math. Phys. 292 (2009) 761-95, arXiv:0812.4760.
  • [11] E. J. Straube, “Harmonic and Analytic Functions Admitting a Distribution Boundary Value,” Annali della Scuola Normale Superiore di Pisa 4e{e} Serie, 11 (1984) 559-91.
  • [12] S. Hollands and R. M. Wald, “Axiomatic Quantum Field Theory in Curved Spacetime,” Commun. Math. Phys. 293 (2010) 85-125.
  • [13] M. Keyl, “Quantum Fields on Timelike Curves,” arXiv:math-ph/0012024.
  • [14] C. J. Fewster, “Lectures on Quantum Energy Inequalities,” arXiv:1208.5399.
  • [15] J. Smoller, Shock Waves and Reaction-Diffusion Equations, second edition (Springer-Verlag, 2012).
  • [16] D. Tataru, “Unique Continuation For Operators With Partially Analytic Coefficients,” J. Math. Pures Appl. 78 (1999) 505-21.
  • [17] H. Casini, M. Huerta, J. M. Magan, and D. Pontello, “Entropic Order Parameters for the Phases of QFT,” JHEP 04 (2021) 277, arXiv:2008.11748.
  • [18] H. Casini and J. M. Magan, “On Completeness and Generalized Symmetries in Quantum Field Theory,” arXiv:2110.11358.
  • [19] D. Harlow, “Wormholes, Emergent Gauge Fields, and the Weak Gravity Conjecture,” JHEP 01 (2016) 122, arXiv:1510.07911
  • [20] J. Polchinski, “Monopoles, Duality, and String Theory,” Int. J. Mod. Phys. A 19S1 (2004) 145-56, hep-th/0304042.
  • [21] T. Banks and N. Seiberg, “Symmetries and Strings in Field Theory and Gravity,” Phys. Rev. D83 (2011) 084019, arXiv:1011.5120.
  • [22] D. Harlow and H. Ooguri, “Symmetries in Quantum Field Theory and Quantum Gravity,” Commun. Math. Phys. 383 (2021) 3, 1669-1804, arXiv:1810.05338.
  • [23] D. Gaiotto, A. Kapustin, N. Seiberg, and B. Willett, “Generalized Global Symmetries,” JHEP 02 (2015) 172, arXiv:1412.5148.
  • [24] E. Witten, “Symmetry and Emergence,” Nat. Phys. 14 (2018) 116-9, arXiv:1710.01791.
  • [25] T. Banks, M. R. Douglas, G. T. Horowitz, and E. Martinec, “AdS Dynamics From Conformal Field Theory,” hep-th/9808016.
  • [26] I. Bena, “On the Construction of Local Fields in the Bulk of AdS5 and Other Spaces,” Phys. Rev. D62 (2000) 066007, hep-th/9905186.
  • [27] V. Balasubramanian, P. Kraus and A.E. Lawrence, “Bulk Versus Boundary Dynamics in Anti-de Sitter Space-Time,’ Phys. Rev. D59 (1999) 046003, hep-th/9805171.
  • [28] A. Hamilton, D. Kabat, G. Lifschytz, and D. A. Lowe, “Holographic Representation of Local Bulk Operators,” Phys.Rev. D74 (2006) 066009, hep-th/0606141.
  • [29] A. Hamilton, D. Kabat, G. Lifschytz, and D. A. Lowe, “Local Bulk Operators in AdS/CFT: A Holographic Description of the Black Hole Interior,” Phys. Rev. D75 (2007) 106001, hep-th/0612053.
  • [30] D. Kabat, G. Lifschytz, and D. A. Lowe, “Constructing Local Bulk Observables in Interacting AdS/CFT,” Phys. Rev. D83 (2011) 106009, arXiv:1102.2910.
  • [31] I. Heemskerk, D. Marolf, J. Polchinski, and J. Sully, “Bulk and Transhorizon Measurements in AdS/CFT,” JHEP 10 (2012) 165, arXiv:1201.3664.
  • [32] I. A. Morrison, “Boundary-to-bulk Maps for AdS Causal Wedges and the Reeh-Schlieder Property in Holography,” JHEP 05 (2014) 053, arXiv:1403.3426.
  • [33] V. E. Hubeny, “Covariant Residual Entropy,” JHEP 09 (2014) 156, arXiv:1406.4611.
  • [34] R. M. Wald, General Relativity (University of Chicago, 1984).
  • [35] S. Gao and R. M. Wald, “Theorems on Gravitational Time Delay and Related Issues,” Class. Quant. Grav. 17 (2000) 4999-5008, gr-qc/0007021.
  • [36] E. Witten, “Light Rays, Singularities, and All That,” Rev. Mod. Phys. 92 (2020) 045004, arXiv:1901.03928.
  • [37] G. W. Gibbons and S. W. Hawking, “Cosmological Event Horizons, Thermodynamics, and Particle Creation,” Phys. Rev. D15 (1977) 2738-51.
  • [38] R. Figari, R. Hoegh-Krohn, and C. R. Nappi, “Interacting Relativistic Boson Fields in the De Sitter Universe With Two Space-Time Dimensions,” Commun. Math. Phys. 44 (1975) 265-278.
  • [39] G. L. Sewell, “Quantum Fields On Manifolds: PCT and Gravitationally Induced Thermal States,” Ann. Phys. 141 (1982) 201-24.
  • [40] K. Maeda, T. Koike, M. Narita, A. Ishibashi, “Upper Bound for Entropy in Asymptotically de Sitter Space-time,” Phys. Rev. D 57(6) (1998) 3503.
  • [41] R. Bousso, “Positive Vacuum Energy and the NN Bound,” JHEP 11 (2000) 038, hep-th/0012052.
  • [42] R. Bousso, “Bekenstein Bounds in de Sitter and Flat Space,” JHEP 04 (2001) 035, hep-th/0010252.
  • [43] T. Banks, “Cosmological Breaking of Supersymmetry? Or Little Lambda Goes Back to the Future, II,” Int. J. Mod. Phys. A16 (2001) 910-921, hep-th/0007146.
  • [44] T. Banks and W. Fischler, “MM-Theory Observables For Cosmological Spacetimes,” arXiv:hep-th/0102077.
  • [45] T. Banks, “More Thoughts on the Quantum Theory of Stable de Sitter Space,” arXiv:hep-th/0503066.
  • [46] T. Banks, B. Fiol, and A. Morisse, “Towards a Quantum Theory of de Sitter Space,” arXiv:hep-th/0609062.
  • [47] T. Banks and W. Fischler, “The Holographic Model Of Cosmology,” arXiv:1806.01749.
  • [48] L. Susskind, “De Sitter Holography: Fluctuations, Anomalous Symmetry, and Wormholes,” Universe 7 (2021) 464, arXiv:2106.03964.
  • [49] L. Susskind, “Black Holes Hint Towards De Sitter-Matrix Theory,” arXiv:2109.01322.
  • [50] X. Dong, E. Silverstein, and G. Torroba, “De Sitter Holography and Entanglement Entropy,” arXiv:1804.08623.
  • [51] P. Draper and S. Farkas, “De Sitter Black Holes as Constrained States in the Euclidean Path Integral,” Phys,. Rev. D105 (2022) 126022.
  • [52] T. Banks and P. Draper, “Comments on the Entanglement Spectrum of de Sitter Space,” JHEP 01 (2023) 135, arXiv:2209.08991.
  • [53] H. Lin and L. Susskind, “Infinite Temperature’s Not So Hot,” arXiv:2206.01083.
  • [54] M. Takesaki, “Duality for Crossed Products and the Structure of von Neumann Algebras of Type III,” Acta Mathematica 131 (1973) 249-310.
  • [55] J. Sorce, “Notes on the Type Classification of von Neumann Algebras,” arXiv:2302.01958.
  • [56] S. Leutheusser and H. Liu, “Causal Connectability Between Quantum Systems and the Black Hole Interior in Holographic Duality,” arXiv:2110.05497.
  • [57] S. Leutheusser and H. Liu, “Emergent Times in Holographic Duality,” arXiv:2112.12156.
  • [58] E. Witten, “Gravity and the Crossed Product,” JHEP 10 (2022) 008, arXiv:2112.12828.
  • [59] I. E. Segal, “A Note on the Concept of Entropy,” J. Math. Mech. 9 (1960) 623-9.
  • [60] R. Longo and E. Witten, “A Note On Continuous Entropy,” arXiv:2202.03357.
  • [61] V. Chandraskharan, G. Penington, and E. Witten, “Large NN Algebras and Generalized Entropy,” arXiv:2206.10780.