This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Yoga Dark Energy: Natural Relaxation and Other Dark Implications of a Supersymmetric Gravity Sector

C.P. Burgess,1,2,3 Danielle Dineen1 and F. Quevedo4
1 Department of Physics & Astronomy, McMaster University,
          1280 Main Street West, Hamilton ON, Canada.
2 Perimeter Institute for Theoretical Physics,
          31 Caroline Street North, Waterloo ON, Canada.
3 CERN, Theoretical Physics Department, Genève 23, Switzerland.
4 DAMTP, Cambridge University, Wilberforce Road, Cambridge, United Kingdom.
Abstract:

We construct a class of 4D ‘yoga’ (naturally relaxed) models for which the gravitational response of heavy-particle vacuum energies is strongly suppressed. The models contain three ingredients: (i) a relaxation mechanism driven by a scalar field (the ‘relaxon’), (ii) a very supersymmetric gravity sector coupled to the Standard Model in which supersymmetry is non-linearly realised, and (iii) an accidental approximate scale invariance expressed through the presence of a low-energy dilaton supermultiplet. All three are common in higher-dimensional and string constructions and although none suffices on its own, taken together they can dramatically suppress the net vacuum-energy density. The dilaton’s vev τ\tau determines the weak scale MWMp/τM_{\scriptscriptstyle W}\sim M_{p}/\sqrt{\tau}. We compute the potential for τ\tau and find it can be stabilized in a local de Sitter minimum at sufficiently large field values to explain the size of the electroweak hierarchy, doing so using input parameters no larger than O(60) because the relevant part of the scalar potential arises as a rational function of lnτ\ln\tau. The de Sitter vacuum energy at the minimum is order cMW81/τ4c\,M_{\scriptscriptstyle W}^{8}\propto 1/\tau^{4}, with a coefficient c𝒪(MW4)c\ll{\cal O}(M_{\scriptscriptstyle W}^{-4}). We discuss ways to achieve c1/Mp4c\sim 1/M_{p}^{4} as required by observations. Scale invariance implies the dilaton couples to matter like a Brans-Dicke scalar with coupling large enough to be naively ruled out by solar-system tests of gravity. Yet because it comes paired with an axion it can evade fifth-force bounds through the novel screening mechanism described in ArXiV:2110.10352. Cosmological axio-dilaton evolution predicts a natural quintessence model for Dark Energy, whose evolution might realize recent proposals to resolve the Hubble tension, and whose axion contributes to Dark Matter. We summarize inflationary implications and some remaining challenges, including the unusual supersymmetry breaking regime used and the potential for UV completions of our approach.

preprint: CERN-TH-2021-192dedicated: Dedicated to the memory of Steven Weinberg: a physicist’s Standard Model.

1 Introduction

The cosmological constant problem [1, 2, 3] seems hopeless. Despite years of effort and much model-building [4] no technically natural mechanism has been found that reconciles the large vacuum fluctuations associated with known particles with the small gravitational response to the vacuum revealed by the evidence for Dark Energy [5, 6]. Indeed, it is widely believed that a symmetry-based or relaxation-type [7, 8] mechanism does not exist, and this point of view has driven much of the community towards anthropic [1, 9] and/or landscape arguments (see for instance [10, 11]). Although these might ultimately prove to be the way Nature works, a proper assessment of their likelihood suffers from the absence of compelling-yet-natural alternatives with which to compare.

We here propose a class of models that we hope can provide such a point of comparison. These models are designed to address the low-energy111By this we mean we focus on how the vacuum energies of known particles (e.g. the electron) can avoid gravitating; arguably the hardest part of the problem because its solution involves changing the properties of well-measured particles at experimentally accessible energies. We leave open questions of UV completion (except where these introduce new constraints – see §3.3), but do so knowing that there are multiple ways (including supersymmetry) to suppress the vacuum energy of undiscovered particles above the weak scale. For other recent proposals for a technically natural vacuum energy see for instance [12, 13, 14]. (and so hardest) part of the cosmological constant problem and are built on the interplay of three separate ingredients, all of which seem to play important roles:

  1. (i)

    A very supersymmetric gravity sector, for which supermultiplets are split by much less than for Standard Model fields (see [15] for a discussion of some other implications and the naturality of this assumption).

  2. (ii)

    A relaxation mechanism in which a ‘relaxon’ scalar field222There is also no ‘i’ in ‘relaxon’ (as opposed to relaxion [16]) since for us this field need not be an axion. dynamically reduces the leading non-gravitational vacuum energy.

  3. (iii)

    Accidental approximate scale invariance, including the implied low-energy dilaton, τ\tau, such as is known to be a generic property of low-energy string vacua [17] and higher-dimensional supergravities more generally [18, 19, 20, 21, 22].

Although each of these ingredients has a plausible UV pedigree, we here avoid unnecessary UV baggage (like extra dimensions) and instead let all three stand on their own within a simple 4D context, with a view to better understanding the underlying mechanisms that could be at work. Indeed, this kind of phenomenological approach lends itself to the cosmological constant problem, which is at heart a low-energy problem rather than a high-energy one (for the reasons given in footnote 1). (We do examine UV completions more explicitly in §3.3, with a view to understanding the independent new constraints that having a UV provenance for these ingredients can introduce.)

The presence of the dilaton introduces a τ\tau-dependence to particle masses, so we first start with a general EFT at low energies and ask how the τ\tau-dependence associated with the vacuum energy, δVm4(τ)\delta V\sim m^{4}(\tau), can be dynamically suppressed. We then push the EFT into the UV to see how far it can go, and ask how it extends to energies near the weak scale, but well below the masses of any putative superpartners for Standard Model fields (which therefore do not appear to be supersymmetric333Despite supersymmetry playing an important role one of our first predictions therefore is a successful one: the absence of Standard-Model superpartners at the LHC. at all). We focus on whether our three ingredients suffice to adequately suppress the gravitational response as the Standard Model fields themselves are integrated out. They appear to do so, subject to a few provisos discussed below.

Because the model we propose has a number of moving parts it is instructive here to summarize the underlying reasons why it works. The first ingredient – the assumption that gravity is described by 𝒩=1{\cal N}=1 supergravity down to very low energies – is important largely because of the auxiliary fields, FAF^{\scriptscriptstyle A}, that its linear realization requires to be in the low-energy scalar potential. Although these fields do not propagate,444Concrete evidence for the importance of keeping track of non-propagating fields in low-energy EFTs comes from Quantum Hall systems, although the fields in these examples are usually topological gauge potentials [23, 24]. From this point of view it is suggestive that in known UV completions 4D auxiliary fields like FF start life as 4-form fields that often convey topological information from higher dimensions, both in string theory [25] and extra-dimensional gravity more generally [26]. they are required in order to linearly realize supersymmetry. Crucially, their presence changes the way that UV physics can enter into the low-energy potential; because supersymmetry-breaking masses necessarily themselves involve FF the contribution of virtual heavy nonsupersymmetric states to the low-energy potential tends to be δVM2F+h.c.\delta V\propto M^{2}F+\hbox{h.c.} (where MM is the UV scale) rather than directly as an FF-independent term like δVM4\delta V\propto M^{4} [15]. Even though M4M^{4} eventually arises once FF is integrated out, the form involving FF shows that the most UV-sensitive effective couplings have a reduced dimension.

The second important consequence of having low-energy auxiliary fields is the structure that their elimination imposes on the scalar potential, which comes as the usual sum and differences of squares: V=VF+VDV=V_{\scriptscriptstyle F}+V_{\scriptscriptstyle D} with

VF=eK/Mp2[KA¯BDAW¯DBW3|W|2Mp2]withDAW:=WA+KAWMp2,V_{\scriptscriptstyle F}=e^{K/M_{p}^{2}}\left[K^{\bar{A}B}{\overline{D_{\scriptscriptstyle A}W}}D_{\scriptscriptstyle B}W-\frac{3|W|^{2}}{M_{p}^{2}}\right]\quad\hbox{with}\quad D_{\scriptscriptstyle A}W:=W_{\scriptscriptstyle A}+\frac{K_{\scriptscriptstyle A}W}{M_{p}^{2}}\,, (1)

and

VD=12𝔉αβ𝔇α𝔇β,V_{\scriptscriptstyle D}=\frac{1}{2}\,{\mathfrak{F}}^{\alpha\beta}{\mathfrak{D}}_{\alpha}{\mathfrak{D}}_{\beta}\,, (2)

familiar from 𝒩=1{\cal N}=1 supergravity, where KK is the supersymmetric Kähler potential, WW is the holomorphic superpotential, 𝔉αβ{\mathfrak{F}}^{\alpha\beta} is the inverse of the real part of the holomorphic gauge kinetic function, 𝔣αβ{\mathfrak{f}}_{\alpha\beta}, and 𝔇α{\mathfrak{D}}_{\alpha} are the ‘moment maps’ for the gauge symmetries [46] (whose detailed form is not needed here). Subscripts on KK and WW denote differentiation with respect to any complex scalars ZAZ^{\scriptscriptstyle A}. This implies in particular that the dominant ‘globally supersymmetric’ term (the terms unsuppressed by 1/Mp1/M_{p}) arise as a square,

Vglob=KA¯BWA¯WB,V_{\rm glob}=K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\overline{W_{\scriptscriptstyle A}}}W_{\scriptscriptstyle B}\,, (3)

and so can vanish at its minimum very naturally.

The above observation is only useful if (1) can also be used when supergravity is coupled to systems like the Standard Model, for which the matter does not come in 𝒩=1{\cal N}=1 supermultiplets and for which the potential usually need not be positive. The generality of the above form ultimately follows from the generality of the rules for nonlinearly realizing supersymmetry described in [27], together with its coupling to supergravity [28, 29, 30]. Ref. [15] explores more concretely why generic potentials are consistent with the above supergravity form, using this general framework. When supersymmetry is nonlinearly realized (as it must be in such theories) there is always a low-energy superfield XX that is nilpotent, X2=0X^{2}=0, since this is what is required to represent the goldstino [27], and it is typically true that WX0W_{\scriptscriptstyle X}\neq 0 for this field. For systems where global supersymmetry breaks badly in the UV, for example, the positivity of (3) is consistent with the non-supersymmetric low-energy scalar potential UU not being positive because WXμ2+U/(2μ2)+W_{\scriptscriptstyle X}\simeq\mu^{2}+U/(2\mu^{2})+\cdots and so |WX|2μ4+U+|W_{\scriptscriptstyle X}|^{2}\simeq\mu^{4}+U+\cdots, since constant terms in the potential are irrelevant in global supersymmetry. Supergravity complicates things because gravity couples to all sources of energy, but also the gravity sector introduces new auxiliary fields. In what follows we imagine that XX is the only supermultiplet to descend from the UV sector555The generality of the emergence of XX in the far infrared carrying the main supersymmetry-breaking order parameter (even if supersymmetry should be partly broken in a more complicated way, including by DD-terms in the UV, say) is argued in [27] with nonzero derivative for WW, so that Vglob|WX|2V_{\rm glob}\propto|W_{\scriptscriptstyle X}|^{2}.

The relaxation mechanism is now built around the structure of the scalar potential described above. A (nonsupersymmetric) relaxon field ϕ\phi is introduced, whose mass is assumed to be a bit smaller than the electron mass (so that it survives to appear in the low-energy theory below the lightest known dangerous Standard Model field). This scalar appears in particular in WXW_{\scriptscriptstyle X}, and so long as a configuration exists for which WX=0W_{\scriptscriptstyle X}=0 then this will be a minimum for VglobV_{\rm glob}. (A very similar mechanism is also commonly at work in supersymmetric gauge theories, where charged scalars automatically seek the zero of the positive DD-term potential given in (2).) The relaxon field likes in this way to zero out the biggest (order Mp0M_{p}^{0}) contribution in (1), causing WXW_{\scriptscriptstyle X} to be Planck suppressed once gravitational interactions are included. We return below to why it remains consistent to use the formalism of nonlinearly realized supersymmetry when WXW_{\scriptscriptstyle X} is suppressed in this way.

Such a mechanism still leaves order Mp2M_{p}^{-2} contributions to (1), and because these are not positive definite they cannot as simply be removed using the same kind of relaxon mechanism. Here is where accidental scale invariance finally plays a role. Motivated by the accidental scaling symmetries known to be common in the low-energy limit of higher-dimensional supergravity, we propose that the theory comes to us with an action that is expanded in inverse powers of a large scalar field τ1\tau\gg 1,

S=S0+S1+S2+S3+,S=S_{0}+S_{1}+S_{2}+S_{3}+\cdots\,, (4)

with each term in this expansion scaling homogeneously in the sense that Snλ1nsSnS_{n}\to\lambda^{1-ns}S_{n} when gμνλgμνg_{\mu\nu}\to\lambda g_{\mu\nu} and τλsτ\tau\to\lambda^{s}\tau for constant λ\lambda. This is as would be expected if S0λS0S_{0}\to\lambda S_{0} and each successive term scales with an additional power of 1/τ1/\tau relative to the previous one. The scaling of S0S_{0} is chosen to be consistent with the scaling of the 4D Einstein-Hilbert action, SEHMp2d4xgS_{\scriptscriptstyle EH}\propto M_{p}^{2}\int{\rm d}^{4}x\,\sqrt{-g}\;{\cal R}, when written in Einstein frame.

Within a supergravity framework we imagine τ\tau being combined with an axion, 𝔞{\mathfrak{a}}, into a complex axio-dilaton field 𝒯=12(τ+i𝔞){\cal T}=\frac{1}{2}(\tau+i{\mathfrak{a}}) that, together with a spin-half field ξ\xi, forms a proper666By so doing we assume that the mass splittings in this dilaton supermultiplet are small enough that all members remain in the low-energy theory, unlike for the Standard Model sector. We verify below that the gravitational coupling of these fields do keep their splittings to be similar to those in the gravity sector. supermultiplet, TT. Invariance under the axion shift symmetry 𝔞𝔞+c{\mathfrak{a}}\to{\mathfrak{a}}+c ensures KK depends only on τ=T+T¯\tau=T+{\overline{T}} and that WW is TT-independent, and the above condition of accidental approximate scale invariance says KK admits the expansion

eK/(3Mp2)=τFk+hτ+𝒪(1/τ2)e^{-K/(3M_{p}^{2})}=\tau F-k+\frac{h}{\tau}+{\cal O}(1/\tau^{2}) (5)

where FF is possibly a scale-invariant function of other fields, and none of FF, kk, hh and so on can depend on powers of τ\tau. They can be functions of any other fields besides TT (and, as it turns out [31], potentially also on logarithms of τ\tau, as we shall see).

Now comes the final bit of magic. The scale invariance of the leading K=3Mp2ln(τF)K=-3M_{p}^{2}\ln(\tau F) term in (5) suffices to prove [32] that it is automatically of ‘no-scale’ form [33], for which KK satisfies the identity

KA¯BKA¯KB=3Mp2.K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}=3M_{p}^{2}\,. (6)

This guarantees the flatness of the potential along the directions in field space that do not appear in WW (such as TT). But even though the action is not scale invariant in the same way when the first subdominant term in (5) is kept, so K=3Mp2ln(τFk)K=-3M_{p}^{2}\ln(\tau F-k), it happens that (6) remains true [17] provided only that kk does not depend on TT. This type of accidental preservation of the no-scale structure beyond leading order in a large field expansion was first noticed in certain string compactifications [34, 35], where it is called an ‘extended no-scale structure’ and fits within the general approach described in [36]. The interplay between scale invariance and supersymmetry is more than the sum of its parts [17]: the flat potential for τ\tau gets lifted at one higher order in 1/τ1/\tau than would naively be expected.777For aficionados: this is how we evade (really, co-exist with) Weinberg’s no-go theorem [1]. Although the theorem says flat directions built on scale invariance must be lifted, it does not say by how much and so does not preclude supersymmetry making the lifting smaller than would otherwise be generic.

For the present purposes, what is nice about this last observation is that it means that the 1/Mp21/M_{p}^{2} contributions to the potential also vanish, even after the relaxon has been integrated out, leaving the final dominant result at order V1/Mp4V\propto 1/M_{p}^{4}. This is the start of the explanation for why the vacuum energy turns out to be of order Vmvac4V\simeq m_{\rm vac}^{4} where mvacMTeV2/Mpm_{\rm vac}\sim M_{\scriptscriptstyle TeV}^{2}/M_{p} and MTeVM_{\scriptscriptstyle TeV} is of order the TeV scale.

Because of the underlying scale invariance and the expansion in powers of 1/τ1/\tau, the powers of 1/Mp1/M_{p} in VV turn out to go along with powers of 1/τ1/\tau leading to a result for the potential that has size VM8/(τMp)4V\sim M^{8}/(\tau M_{p})^{4}, where MM is the generic UV scale appearing everywhere in KK and WW on dimensional grounds. The upshot is that the generic |WX|2/τ2|W_{\scriptscriptstyle X}|^{2}/\tau^{2} part of the potential – including in particular any MTeV4M_{\scriptscriptstyle TeV}^{4} contributions due to SM particles with masses MTeV1/τM_{\scriptscriptstyle TeV}\propto 1/\sqrt{\tau} – is cancelled, leaving a low-energy potential that depends on other parameters. Yet both the weak scale and the vacuum-energy scale are predicted to depend on τ\tau in a manner consistent with VMTeV4V\propto M_{\scriptscriptstyle TeV}^{4}.

The next question becomes: why should the field τ\tau be stabilized at such large values? §3.2 shows that radiative corrections generically imply the function kk can depend on lnτ\ln\tau [31], and mild assumptions about this dependence give a potential for τ\tau that is stabilized at very large values. Because these functions depend only logarithmically on τ\tau minima can arise at astronomically large values while only dialing in hierarchies amongst the parameters in kk that are of order lnτ\ln\tau. Furthermore, standard renormalization-group (RG) methods allow this minimum to be reliably explored without losing control over the underlying radiative corrections.

With this full picture in mind we can return to the question, deferred above, as to why a nonlinearly realized treatment of the Standard Model fields can be consistent even though the relaxon adjusts to ensure that WXW_{\scriptscriptstyle X} vanishes. These two conditions might normally be thought to contradict one another because it is the auxiliary field, FXF^{\scriptscriptstyle X}, for the goldstino multiplet XX, that is the measure of the size of supersymmetry breaking in the unseen sector that badly breaks supersymmetry (and thereby gives superpartners to the Standard Model large masses). In particular, the formulation of nonlinearly realized supersymmetry assumes FXF^{\scriptscriptstyle X} is a UV scale and works as an expansion in powers of 1/FX1/F^{\scriptscriptstyle X}. But in global supersymmetry the field equations usually predict that FXF^{\scriptscriptstyle X} is given by

FXKX¯XWX¯,F^{\scriptscriptstyle X}\propto K^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\overline{W_{\scriptscriptstyle X}}}\,, (7)

and so large FXF^{\scriptscriptstyle X} should be inconsistent with small or vanishing WXW_{\scriptscriptstyle X}.

We argue that there are two reasons why the above framework is nonetheless consistent. First, in supergravity FXF^{\scriptscriptstyle X} is instead determined by

FXKX¯XDXW¯=KX¯X[WX¯+KX¯W¯Mp2]F^{\scriptscriptstyle X}\propto K^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\overline{D_{\scriptscriptstyle X}W}}=K^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}\left[{\overline{W_{\scriptscriptstyle X}}}+\frac{K_{{\overline{{\scriptscriptstyle X}}}}{\overline{W}}}{M_{p}^{2}}\right] (8)

rather than by (7), and so need not vanish even if WXW_{\scriptscriptstyle X} does. Second, relaxation actually implies that WXW_{\scriptscriptstyle X} is Planck-suppressed rather than strictly zero. Both of these can be consistent with a large-FXF^{\scriptscriptstyle X} expansion, if the Planck-suppressed terms in (8) are sufficiently big.

Ultimately the suppression of the vacuum energy relative to the weak scale depends on the size of τ\tau, with τ1026\tau\sim 10^{26} proving to be consistent with the two observed hierarchies MTeVMp/τM_{\scriptscriptstyle TeV}\sim M_{p}/\sqrt{\tau} and V(MTeV2/Mp)4V\sim(M_{\scriptscriptstyle TeV}^{2}/M_{p})^{4}. However – as discussed in §3.3 – additional constraints on how large τ\tau can be arise once its UV origins are more explicit. The same UV frameworks also provide extra sources of suppression (such as warping), making the final solution likely involve a cocktail of suppressions, possibly along the lines described in §3.3.

Explicit details of the above construction are given in later sections, but an immediate consequence of the scale invariance and any successful suppression of the cosmological constant is that the dilaton field τ\tau must be very light, with a mass of order the present-day Hubble scale.888The size of the dilaton mass in this model is a special case of a general result [37] that a gravitationally coupled scalar field, Mp2(θ)2+V(θ){\cal L}\sim M_{p}^{2}(\partial\theta)^{2}+V(\theta), whose potential at its minimum successfully gives the observed dark energy density, generically predicts a mass for θ\theta that is of order the present-day Hubble scale, HH. It does so because the generic condition V′′(θ)V(θ)𝒪(1)V^{\prime\prime}(\theta)\sim V(\theta)\sim{\cal O}(1) at the minimum implies a mass m2V/Mp2m^{2}\sim V/M_{p}^{2}, which is order H2H^{2} whenever the scalar potential dominates the universal energy density. It follows that it must be cosmologically active up to the current epoch, and so predicts Dark Energy must be described by a specific type of near-scale-invariant quintessence theory [38, 39], but (remarkably) one for which both the cosmological constant and the quintessence-field mass would be technically natural.

But it gets better than this. A gravitationally coupled scalar as light as the Hubble scale should stick out in tests of gravity like social skills at a physics meeting. Indeed, the low-energy lagrangian relevant to astrophysics is explored in §4 where it is shown that the underlying scale invariance forces the dilaton τ\tau to couple to Standard Model matter as does a Brans Dicke scalar [40] (at leading order in 1/τ1/\tau – a great approximation when τ1026\tau\sim 10^{26}). And it does so with a coupling that is apparently too large to have escaped detection in precision tests of gravity in the solar system and elsewhere [41]. A more careful look, however, shows that its supersymmetric partner (the axion) can save the day, and does so because of the target-space axion-dilaton interactions also automatically predicted by the model. As explored in more detail in [42], the axion-dilaton interactions have the effect of making matter-dilaton couplings largely generate external axion fields (rather than dilaton fields), which are much less effective at altering test-particle motions within the solar system and so can escape detection. We call this mechanism ‘axion homeopathy’ because it can work for extremely small direct axion-matter couplings, provided only that these are nonzero.

Because the phenomenology of the axio-dilaton field is so crucial to the viability of such models, §5 provides a preliminary discussion of axio-dilaton cosmology and checks that the most basic things work (though without doing justice to the entirety of the constraints that a viable model must ultimately pass – further studies of structure formation and CMB properties within this framework are important to explore). However even if axio-dilaton phenomenology eventually poses challenges to the version of this approach we present here, we regard any such model-building problems within this general framework to be a good trade for progress on the (much harder) cosmological constant problem.

There are also many other ways to test this picture, such as through tests of gravity and the changes predicted in cosmology during well-measured epochs (such as the variations in fundamental masses – in Planck units – that are predicted whenever the dilaton field τ\tau varies in space and time). Intriguingly, some of these may actually help with the Hubble tension [43] by allowing particle masses (in particular mem_{e}) all to differ by a common factor at recombination relative to their values today. Such a scaling potentially exploits the mechanism described in [44], and we briefly check that the basic requirements of this mechanism can be satisfied.

So what is the catch? For the long-distance physics (below the eV scale) relevant to astrophysics, we do not see a fundamental one yet and not for want of looking. The main provisos about which we worry are described in §3 and §6 below. They start with the observation that the large value for τ\tau required to explain the hierarchies also implies a breakdown of EFT methods well below electroweak scales. It does so because τ1026\tau\sim 10^{26} implies the axion decay constant is faMp/τ10f_{a}\sim M_{p}/\tau\sim 10 eV. This need not in itself be a problem because supersymmetric extra dimensions [45] could provide a plausible UV completion at these scales, while remaining consistent with SM degrees of freedom being four-dimensional as assumed here.

The worries come once the low-energy picture is embedded into such a UV completion, because new constraints on the value of τ\tau can arise depending on precisely how this is done. For instance a natural choice in extra-dimensional models identifies τ\tau with the extra-dimensional volume modulus, but this seems to require τ<1020\tau\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{20}. Either τ\tau must arise differently in the UV completion or there must be additional sources of hierarchical suppression (or both). We explore some of the options in §3.3. Other worries include ensuring the naturalness of having the relaxon be so light; exploring the detailed stability of the nonlinearly realized supergravity form in regions for which WXW_{\scriptscriptstyle X} is Planck-suppressed; and so on.

Our presentation is organized as follows. §2 describes the main mechanism in some detail, starting by explicitly writing out the EFT applicable at energies just below the electron mass. Since all of the dangerous Standard Model particles are integrated out at this point the EFT at these scales illuminates most clearly the interplay between scale-invariance and the relaxation mechanism. Standard Model fields are then reintroduced, allowing their couplings to low-energy states to be made more precise.

Knowledge of Standard Model couplings allows a more explicit assessment of naturalness issues, such as why low-energy gravitational physics is relatively insensitive to the loops of Standard Model fields. This is the topic of §3. Since most of the hierarchies of scale in the model are set by the background value of the dilaton field, this section also shows how this field can be stabilized, along the lines described above.

§4 makes a down payment on the most pressing phenomenological challenges, including a derivation of the low-energy EFT relevant to astronomical and cosmological tests. These give the field equations used in [42] to evade the constraints on dilaton-matter couplings coming from tests of General Relativity (GR) in the solar system (whose results are merely quoted here). §5 provides a preliminary evaluation of the cosmological evolution of the axio-dilaton fields for comparison with some features of late-universe cosmology. This includes a brief discussion of potential relevance to the Hubble tension, mentioned above.

Finally §6 summarizes some of the implications of our proposal; contrasts our approach with other discussions that build on the role of scale invariance. Along the way this section outlines several topics for future investigation – such as possible implications for dark matter, inflation, baryogenesis and neutrino physics – that we do not study here.

2 The model

The goal in this section is to set up the lagrangian for a low-energy nonsupersymmetric world coupled to supergravity, for which there is a relaxation mechanism that dynamically removes the gravitational response of any vacuum energy obtained by integrating out Standard Model fields. When doing so we lean heavily on the description given in [15] of the form this lagrangian must take [27, 28, 29, 30].

The idea is two-fold: first combine supersymmetry and scale invariance with a relaxation mechanism to make the leading part of the scalar potential small and then use the ‘extended no-scale’ mechanism to minimize the damage to the scalar potential that inevitably must come once subdominant scale-invariance breaking effects are included.

2.1 SUSY scaling and relaxation

We start with a minimal formulation, to which complications are later introduced as needed. We concentrate (in the first instance) on the effective theory that applies below the electron mass in order to focus on the more difficult low-energy part of the cosmological constant problem, putting aside until later a discussion of the theory further into the UV. We do not take the low-energy potential of this theory to be particularly small because loops of heavier Standard Model fields are assumed to contribute to it in an unsupressed way, giving contributions involving the power of mm (for a field of mass mm) required on dimensional grounds (and so includes the dangerous contributions Vm4V\ni m^{4}).

2.1.1 Low-energy particle content

In this EFT some Standard Model fields remain: the photon and the neutrinos, but because these do not themselves contribute dangerous vacuum energies we do not include them explicitly in our initial description of the model’s vacuum dynamics. (They are of course included in later sections about low-energy phenomenology.) Our focus is initially to see how the new degrees of freedom we propose in this section can suppress the contribution of UV-sensitive terms in the lagrangian to the spacetime curvature.

To this end we postulate several new low-energy degrees of freedom: a very supersymmetric low-energy gravity sector (with both a massless graviton and very light gravitino); a dilaton-dilatino scalar supermultiplet T=(𝒯,ξ)T=({\cal T},\xi) that contains the low-energy pseudo-Goldstone boson for accidental scale invariance plus its supersymmetric partners. All of the above particles are assumed to be very light, and this assumption is checked ex post facto by computing their masses once the model is fully formulated.

In addition to the above fields we also add a real scalar, ϕ\phi, not to be confused with the Standard Model’s Higgs (which has been integrated out). The scalar ϕ\phi nonlinearly realizes supersymmetry (as do the photon and left-handed neutrinos), since its superpartner is assumed to be heavy and so to have already been integrated out. The relaxaton field ϕ\phi itself should be light enough to appear in the EFT below the electron mass, and were it not for this condition its role could be played by the Standard Model Higgs. We discuss in §3 the naturalness issues associated with ϕ\phi being this light.

According to the rules for nonlinearly realizing supersymmetry given in [27, 28, 29, 30, 15], we are to build our EFT using the standard rules [46, 47] for constructing a supergravity lagrangian using specific types of constrained superfields for each particle in the theory that does not have an explicit superpartner:

  • The UV supersymmetry breaking order parameter FXF^{\scriptscriptstyle X} and the spin-half goldstino field GG (ultimately eaten by the gravitino) turn out to be described by a chiral multiplet XX satisfying a nilpotent condition X2=0X^{2}=0 that allows its scalar component 𝒳X{\cal X}\in X to be expressed in terms of GG and FXF^{\scriptscriptstyle X}.

  • Scalar fields like the ‘relaxon’ ϕ\phi are represented by a superfield Φ\Phi that satisfies the constraint that the supercovariant derivative 𝒟¯(XΦ¯){\overline{{\cal D}}}(X{\overline{\Phi}}) vanishes, which says XΦ¯X{\overline{\Phi}} is left-chiral. This constraint removes both the fermionic and auxiliary field parts of Φ\Phi as independent variables, leaving only the scalar ϕ\phi as a physical degree of freedom. The constraint also implies X(Φ,Φ¯)X{\cal F}(\Phi,{\overline{\Phi}}) is left-chiral for more general functions {\cal F}. A reality condition for ϕ\phi gets represented in this framework as the constraint XΦ¯=XΦX{\overline{\Phi}}=X\Phi.

With these constraints the theory is specified at the two-derivative level by giving the Kähler potential KK, superpotential WW and gauge kinetic function 𝔣αβ{\mathfrak{f}}_{\alpha\beta} as functions of the above superfields.

2.1.2 Low-energy lagrangian

Accidental scale invariance is incorporated by assuming the theory has an expansion in powers999Because the theory is organized as an expansion in powers of 1/τ1/\tau we take the field τ\tau to be dimensionless (and so not canonically normalized). of 1/τ1/\tau, where τ:=𝒯+𝒯¯\tau:={\cal T}+{\overline{{\cal T}}}. For supersymmetric theories this means WW and eK/(3Mp2)e^{-K/(3M_{p}^{2})} should be expanded in this way, although for WW this doesn’t say much because we also assume 𝔞:=2Im𝒯{\mathfrak{a}}:=2\,\hbox{Im}\,{\cal T} enjoys an axionic shift symmetry that prevents 𝒯{\cal T} from appearing in WW at all.

Scale invariance for the leading term of the expansion of eK/(3Mp2)e^{-K/(3M_{p}^{2})} requires it to be a homogeneous function of 𝒯{\cal T} and we are free to define 𝒯{\cal T} so that this leading term is homogeneous degree one. Subsequent orders in 1/τ1/\tau break this scale invariance, as do loops built from the leading term (because the above assumptions ensure that 1/τ1/\tau plays the role of \hbar in this part of the theory). We assume all such corrections arise as integer powers of 1/τ1/\tau (apart from logarithms – more about which below).

The first terms in the expansion of the Kähler function therefore become

K3Mp2ln𝒫with𝒫(τ,X,X¯,Φ,Φ¯)=τk+hτ+\displaystyle K\simeq-3M_{p}^{2}\ln{\cal P}\quad\hbox{with}\quad{\cal P}(\tau,X,{\overline{X}},\Phi,{\overline{\Phi}})=\tau-k+\frac{h}{\tau}+\cdots (9)
andWw0(Φ)+XwX(Φ,Φ¯),\displaystyle\qquad\qquad\hbox{and}\quad W\simeq w_{0}(\Phi)+Xw_{{\scriptscriptstyle X}}(\Phi,{\overline{\Phi}})\,,

where the ellipses denote higher orders in 1/τ1/\tau and the functions kk, hh and WW are otherwise chosen to be the most general consistent with the constraints X2=X(Φ¯Φ)=0X^{2}=X({\overline{\Phi}}-\Phi)=0:

k=1Mp2{𝔎(Φ,Φ¯,lnτ)+[X𝔎X(Φ,Φ¯,lnτ)+h.c.]+X¯X𝔎XX¯(Φ,Φ¯,lnτ)},k=\frac{1}{M_{p}^{2}}\Bigl{\{}{\mathfrak{K}}(\Phi,{\overline{\Phi}},\ln\tau)+\Bigl{[}X{\mathfrak{K}}_{\scriptscriptstyle X}(\Phi,{\overline{\Phi}},\ln\tau)+\hbox{h.c.}\Bigr{]}+{\overline{X}}X{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}(\Phi,{\overline{\Phi}},\ln\tau)\Bigr{\}}\,, (10)

and similarly for hh and higher-order terms (although these are not needed in what follows, except briefly in §3). The series in 1/τ1/\tau is written explicitly so the functions kk and hh do not depend on powers of τ\tau, but they can in principle101010As we see below a logarithmic dependence on τ\tau can naturally arise once loop corrections are included. depend on lnτ\ln\tau as is indicated in (10).

Since all factors of MpM_{p} are explicit and XX has dimension mass, the function 𝔎{\mathfrak{K}} has dimension (mass)2, while 𝔎X{\mathfrak{K}}_{\scriptscriptstyle X} has dimension (mass) and 𝔎XX¯{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}} is dimensionless. (The MpM_{p}’s are chosen so that the ϕ\phi kinetic term and the ‘global supersymmetry’ term involving |wX|2|w_{\scriptscriptstyle X}|^{2} in (16) are both independent of MpM_{p}.) The superpotential functions wnw_{n} similarly have dimension (mass)3-n. Because of the constraints X2=X(Φ¯Φ)=0X^{2}=X({\overline{\Phi}}-\Phi)=0 it is always possible to rescale XX~(Φ,Φ¯)X\to\tilde{X}{\cal F}(\Phi,{\overline{\Phi}}) with {\cal F} chosen to set 𝔎XX¯=1{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}=1 (provided 𝔎XX¯{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}} does not depend on lnτ\ln\tau).

The scalar potential VFV_{\scriptscriptstyle F} obtained given these functions KK and WW is as given in (1), repeated here for convenience

VF=eK/Mp2[KA¯BDAW¯DBW3|W|2Mp2].V_{\scriptscriptstyle F}=e^{K/M_{p}^{2}}\left[K^{\bar{A}B}{\overline{D_{\scriptscriptstyle A}W}}D_{\scriptscriptstyle B}W-\frac{3|W|^{2}}{M_{p}^{2}}\right]\,. (11)

Despite appearances there is an important change here relative to ordinary supergravity: the absence of independent auxiliary fields in the constrained field Φ\Phi implies that the sums on the indices AA and BB only run over the fields zA:={T,X}z^{\scriptscriptstyle A}:=\{T,X\} and not also over Φ\Phi [29].

Working to leading nontrivial order in 1/τ1/\tau we drop h/τh/\tau and higher orders, so

𝒫τk=τ𝔎Mp2{\cal P}\simeq\tau-k=\tau-\frac{{\mathfrak{K}}}{M_{p}^{2}} (12)

and the Kähler metric and its inverse become

KAB¯3Mp2𝒫2(1kX¯kX𝒫kXX¯+kXkX¯)andKB¯A𝒫3Mp2(𝒫+𝔨2kXkX¯kX¯X),K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}\simeq\frac{3M_{p}^{2}}{{\cal P}^{2}}\left(\begin{array}[]{ccc}1&&-k_{{\overline{{\scriptscriptstyle X}}}}\\ -k_{\scriptscriptstyle X}&&{\cal P}\,k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}+k_{\scriptscriptstyle X}k_{{\overline{{\scriptscriptstyle X}}}}\end{array}\right)\quad\hbox{and}\quad K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}\simeq\frac{{\cal P}}{3M_{p}^{2}}\left(\begin{array}[]{ccc}{\cal P}+{\mathfrak{k}}^{2}&&k^{{\scriptscriptstyle X}}\\ k^{{\overline{{\scriptscriptstyle X}}}}&&k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}\end{array}\right)\,, (13)

where zA:={T,X}z^{\scriptscriptstyle A}:=\{T,X\} and subscripts on kk as usual denote differentiation. Furthermore 𝔨2:=kX¯XkXkX¯{\mathfrak{k}}^{2}:=k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{\scriptscriptstyle X}k_{{\overline{{\scriptscriptstyle X}}}}, kX:=kX¯XkX¯k^{\scriptscriptstyle X}:=k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{{\overline{{\scriptscriptstyle X}}}} and kX¯=kX¯XkXk^{{\overline{{\scriptscriptstyle X}}}}=k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{\scriptscriptstyle X} with kX¯X:=1/kXX¯k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}:=1/k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}.

At leading order the Kähler covariant derivatives for TT and XX (evaluated at X=0X=0) similarly become

DTW=KTWMp2=3w0𝒫andDXW=WX+KXWMp2=wX+3𝔎Xw0𝒫Mp2.D_{\scriptscriptstyle T}W=\frac{K_{\scriptscriptstyle T}W}{M_{p}^{2}}=-\frac{3w_{0}}{{\cal P}}\quad\hbox{and}\quad D_{\scriptscriptstyle X}W=W_{\scriptscriptstyle X}+\frac{K_{\scriptscriptstyle X}W}{M_{p}^{2}}=w_{{\scriptscriptstyle X}}+\frac{3{\mathfrak{K}}_{{\scriptscriptstyle X}}w_{0}}{{\cal P}M_{p}^{2}}\,. (14)

The useful identity KB¯AKA=z¯B¯K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}K_{\scriptscriptstyle A}=-\bar{z}^{{\overline{{\scriptscriptstyle B}}}} (that follows – see e.g. eq. (238) – from the no-scale [33] property KB¯AKB¯KA=3Mp2K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}K_{{\overline{{\scriptscriptstyle B}}}}K_{\scriptscriptstyle A}=-3M_{p}^{2}) then shows that the Einstein-frame auxiliary fields FB¯=eK/(2Mp2)KB¯ADAWF^{{\overline{{\scriptscriptstyle B}}}}=e^{K/(2M_{p}^{2})}K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}D_{\scriptscriptstyle A}W are

FT¯=eK/(2Mp2)τWMpw0τMpandFX¯=eK/(2Mp2)WXwXτ3/2,F^{{\overline{{\scriptscriptstyle T}}}}=-e^{K/(2M_{p}^{2})}\frac{\tau\,W}{M_{p}}\simeq\frac{w_{0}}{\sqrt{\tau}M_{p}}\quad\hbox{and}\quad F^{{\overline{{\scriptscriptstyle X}}}}=e^{K/(2M_{p}^{2})}W_{\scriptscriptstyle X}\simeq\frac{w_{\scriptscriptstyle X}}{\tau^{3/2}}\,, (15)

where the approximate equalities use 𝒫τ{\cal P}\simeq\tau.

Denoting (as above) zA={T,X}z^{\scriptscriptstyle A}=\{T,X\} and neglecting subdominant powers of 1/τ1/\tau one finds the potential (11) becomes (see eq. (230) of Appendix A for details)

VF1𝒫2[13𝔎X¯XwX¯wX+𝔎X¯X𝔎XT¯Mp2w0wX¯+𝔎X¯X𝔎TX¯Mp2wXw0¯3(𝔎TT¯𝔎X¯X𝔎TX¯𝔎XT¯)1+2𝔎XX¯𝔎X𝔎X¯/Mp2|w0|2Mp4].V_{\scriptscriptstyle F}\simeq\frac{1}{{\cal P}^{2}}\left[\frac{1}{3}\,{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\overline{w_{{\scriptscriptstyle X}}}}w_{{\scriptscriptstyle X}}+\frac{{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}}{M_{p}^{2}}\;w_{0}{\overline{w_{{\scriptscriptstyle X}}}}+\frac{{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}}{M_{p}^{2}}\;w_{{\scriptscriptstyle X}}{\overline{w_{0}}}-\frac{3({\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}})}{1+2{\mathfrak{K}}^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{\scriptscriptstyle X}{\mathfrak{K}}_{{\overline{{\scriptscriptstyle X}}}}/M_{p}^{2}}\;\frac{|w_{0}|^{2}}{M_{p}^{4}}\right]\,. (16)

The powers of MpM_{p} are here shown explicitly for later convenience, and follow directly from the dimensions of XX and TT and from (12). The unusual Mp4M_{p}^{-4} of the last term is an artefact of TT being dimensionless.

Notice that only the first term survives if 𝔎{\mathfrak{K}} is independent of TT, and this happens because in this case (12) becomes a no-scale model. Because 𝔎{\mathfrak{K}} depends on TT only through lnτ\ln\tau it follows that each derivative with respect to TT costs a power of 1/τ1/\tau and so the w0w_{0}wXw_{\scriptscriptstyle X} mixing terms of (16) arise at order 1/τ31/\tau^{3} while the |w0|2|w_{0}|^{2} term first appears at order 1/τ41/\tau^{4}. Contributions from the function hh in (9) involve at least one additional power of 1/τ1/\tau compared to those shown.

The kinetic terms for the physical scalars zI:={𝒯,ϕ}z^{\scriptscriptstyle I}:=\{{\cal T}\,,\phi\} are given by the second derivatives of KK in the usual way: 𝔏kinscal=gKJI¯μzJμz¯I¯{\mathfrak{L}}_{{\rm kin\,scal}}=-\sqrt{-g}\,K_{{\scriptscriptstyle J}{\overline{{\scriptscriptstyle I}}}}\partial_{\mu}z^{\scriptscriptstyle J}\partial^{\mu}\bar{z}^{{\overline{{\scriptscriptstyle I}}}} [29]. Evaluating at X=0X=0 and working to lowest order in 1/τ1/\tau gives

𝔏kinscalg\displaystyle-\frac{{\mathfrak{L}}_{{\rm kin\,scal}}}{\sqrt{-g}} =\displaystyle= KTT¯μ𝒯¯μ𝒯+KΦΦ¯μϕ¯μϕ+(KΦT¯μ𝒯¯μϕ+h.c.)\displaystyle K_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\,\partial^{\mu}{\overline{{\cal T}}}\partial_{\mu}{\cal T}+K_{\Phi{\overline{\Phi}}}\,\partial^{\mu}{\overline{\phi}}\,\partial_{\mu}\phi+\Bigl{(}K_{\Phi{\overline{{\scriptscriptstyle T}}}}\,\partial^{\mu}{\overline{{\cal T}}}\,\partial_{\mu}\phi+\hbox{h.c.}\Bigr{)}
=\displaystyle= [3Mp2𝒫2μ𝒯¯μ𝒯+3𝒫𝔎ΦΦ¯μϕ¯μϕ3𝒫2𝔎Φ(μ𝒯¯μϕ+h.c.)][1+𝒪(τ1)].\displaystyle\left[\frac{3M_{p}^{2}}{{\cal P}^{2}}\,\partial^{\mu}{\overline{{\cal T}}}\partial_{\mu}{\cal T}+\frac{3}{{\cal P}}\,{\mathfrak{K}}_{\Phi{\overline{\Phi}}}\,\partial^{\mu}{\overline{\phi}}\,\partial_{\mu}\phi-\frac{3}{{\cal P}^{2}}\,{\mathfrak{K}}_{\Phi}\Bigl{(}\partial^{\mu}{\overline{{\cal T}}}\,\partial_{\mu}\phi+\hbox{h.c.}\Bigr{)}\right]\left[1+{\cal O}\Bigl{(}\tau^{-1}\Bigr{)}\right]\,.

The leading off-diagonal kinetic term for fluctuations can be removed through a field redefinition of the form δϕδϕ+Aδ𝒯\delta\phi\to\delta\phi+A\,\delta{\cal T} where A=𝔎Φ/(τ𝔎ΦΦ¯)A={\mathfrak{K}}_{\Phi}/(\tau{\mathfrak{K}}_{\Phi{\overline{\Phi}}}) is a function of the background fields; while correcting the diagonal kinetic terms only by subdominant powers of 1/τ1/\tau.

2.1.3 Relaxation mechanism

When minimizing the potential VFV_{\scriptscriptstyle F} of (16) with respect to Φ\Phi it is the |wX|2|w_{{\scriptscriptstyle X}}|^{2} term that should dominate for large τ\tau because it is proportional to the fewest powers of 1/τ1/\tau (and the fewest powers of 1/Mp1/M_{p}).

The case 𝔎XT¯=0{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}=0

For simplicity of explanation suppose first that 𝔎XT¯=0{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}=0 (the more general case is handled below, with similar results). In this case all wXw_{\scriptscriptstyle X}– w0w_{0} cross terms vanish and the potential (16) simplifies to

VF1𝒫2[13𝔎X¯X|wX|23𝔎TT¯1+2𝔎XX¯𝔎X𝔎X¯/Mp2|w0|2Mp4],V_{\scriptscriptstyle F}\simeq\frac{1}{{\cal P}^{2}}\left[\frac{1}{3}\,{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}|w_{{\scriptscriptstyle X}}|^{2}-\frac{3{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}}{1+2{\mathfrak{K}}^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{\scriptscriptstyle X}{\mathfrak{K}}_{{\overline{{\scriptscriptstyle X}}}}/M_{p}^{2}}\;\frac{|w_{0}|^{2}}{M_{p}^{4}}\right]\,, (18)

which the supergravity auxiliary field structure has ensured is a sum of squares,111111Notice that the second term in this potential is positive when 𝔎TT¯<0{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}<0 (as we assume henceforth) and this is allowed because 𝔎TT¯{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} does not control the sign of the kinetic term for 𝒯{\cal T} (c.f. eq. (2.1.2)). with the |wX|2|w_{\scriptscriptstyle X}|^{2} term arising at order 1/τ21/\tau^{2} and the |w0|2|w_{0}|^{2} term arising suppressed by both 1/τ41/\tau^{4} and 1/Mp41/M_{p}^{4}. Of these it is the |wX|2|w_{\scriptscriptstyle X}|^{2} that contains the usual dangerous contributions to the vacuum energy, since [15] shows wXw_{\scriptscriptstyle X} receives contributions of order m2m^{2} when integrating out non-supersymmetric particles of mass mm.

Suppose however that this function has a zero for some nonzero value of Φ\Phi, such as121212For reasons given in §3.1 we assume a symmetry ϕϕ\phi\to-\phi to prevent WW from depending linearly on ϕ\phi. Even though we take the simplest quadratic case, our discussion below extends straightforwardly to any function with a non trivial zero. if

wX=g(Φ¯Φv2)=g(Φ2v2)w_{{\scriptscriptstyle X}}=g({\overline{\Phi}}\Phi-v^{2})=g(\Phi^{2}-v^{2}) (19)

near some Φ=v\Phi=v, where the last equality uses the constraint XΦ¯=XΦX{\overline{\Phi}}=X\Phi to trade Φ¯{\overline{\Phi}} for Φ\Phi and the parameter gg is dimensionless. Because the |wX|2|w_{\scriptscriptstyle X}|^{2} term is both nonnegative and the largest term in the potential, it is energetically favourable for ϕ\phi to seek the zero of wXw_{{\scriptscriptstyle X}}, thereby turning this term off and allowing the |w0|2|w_{0}|^{2} term to dominate. As noted above, this remaining term vanishes if 𝔎{\mathfrak{K}} is independent of lnτ\ln\tau, and does so because in this case the supergravity becomes a no-scale model for which TT is a flat direction.

The second ϕ\phi-derivative of the potential near this point is of order VΦΦ¯|wXΦ|2/τ2(gv)2/τ2V_{\Phi{\overline{\Phi}}}\sim|w_{{\scriptscriptstyle X}\Phi}|^{2}/\tau^{2}\sim(gv)^{2}/\tau^{2}. Comparing this to the ϕ\phi kinetic term, which – assuming 𝔎ΦΦ¯{\mathfrak{K}}_{\Phi{\overline{\Phi}}} to be order unity – has the form 𝒵μϕ¯μϕ{\cal Z}\,\partial_{\mu}{\overline{\phi}}\,\partial^{\mu}\phi with 𝒵1/τ{\cal Z}\sim 1/\tau for large τ\tau, we see that the ϕ\phi mass is of order

mϕ2VΦΦ¯𝒵ΦΦ¯(gv)2τ.m_{\phi}^{2}\sim\frac{V_{\Phi{\overline{\Phi}}}}{{\cal Z}_{\Phi{\overline{\Phi}}}}\sim\frac{(gv)^{2}}{\tau}\,. (20)

For comparison, the value of the potential at this minimum is given (because wX=0w_{{\scriptscriptstyle X}}=0 and 𝔎XT¯=0{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}=0) by

Vmin𝔎TT¯|w0|2τ2Mp4M2μW6τ4Mp4V_{\rm min}\sim\frac{{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}|w_{0}|^{2}}{\tau^{2}M_{p}^{4}}\sim\frac{M^{2}\mu_{\scriptscriptstyle W}^{6}}{\tau^{4}M_{p}^{4}} (21)

which uses 𝔎=𝔎(lnτ){\mathfrak{K}}={\mathfrak{K}}(\ln\tau) to conclude 𝔎TT¯1/τ2{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\propto 1/\tau^{2} and otherwise assumes 𝔎M2{\mathfrak{K}}\sim M^{2} and w0μW3w_{0}\sim\mu_{\scriptscriptstyle W}^{3} when Φ=v\Phi=v, where MM and μW\mu_{\scriptscriptstyle W} are generic UV scales131313As we see below these scales both turn out to be large, but are different. This difference is argued in §3 to be both natural and compatible with expectations coming from string compactifications. to be determined (with MM eventually identified with the Planck scale MpM_{p}). In later sections we see that the τ\tau-dependence of ordinary (Standard Model) particles in this scenario is also generically

mTeVMτm_{\scriptscriptstyle TeV}\sim\frac{M}{\sqrt{\tau}} (22)

and so the τ\tau-dependence of this leading contribution to the potential suggests an origin for the successful numerology Vmin(mTeV2/Mp)4V_{\rm min}\sim(m_{\scriptscriptstyle TeV}^{2}/M_{p})^{4} regardless of the value of τ\tau.

This entire picture using nonlinearly realized supersymmetry only makes sense if the FF-term of the XX multiplet is large, presumably the scale of the masses of the superpartners of Standard Model fields, and one might worry that this is not possible if WX=wXW_{\scriptscriptstyle X}=w_{{\scriptscriptstyle X}} is being arranged to vanish. This would indeed be a legitimate worry to the extent that FXwXF^{\scriptscriptstyle X}\propto w_{\scriptscriptstyle X} (as is true both in global supersymmetry and (15)). The conclusion wX=0w_{\scriptscriptstyle X}=0 turns out to be a consequence of the non-essential simplifying assumption 𝔎XT¯=0{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}=0 made above, however, so before proceeding further we first pause to relax this assumption.

More general ϕ\phi-dependence and the case 𝔎XT¯0{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}\neq 0

To show what happens when 𝔎XT¯0{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}\neq 0 we return to the potential (16), reproduced here for convenience of reference

VF1𝒫2[13𝔎X¯XwX¯wX+𝔎X¯X𝔎XT¯Mp2w0wX¯+𝔎X¯X𝔎TX¯Mp2wXw0¯3(𝔎TT¯𝔎X¯X𝔎TX¯𝔎XT¯)1+2𝔎XX¯𝔎X𝔎X¯/Mp2|w0|2Mp4],V_{\scriptscriptstyle F}\simeq\frac{1}{{\cal P}^{2}}\left[\frac{1}{3}\,{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\overline{w_{{\scriptscriptstyle X}}}}w_{{\scriptscriptstyle X}}+\frac{{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}}{M_{p}^{2}}\;w_{0}{\overline{w_{{\scriptscriptstyle X}}}}+\frac{{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}}{M_{p}^{2}}\;w_{{\scriptscriptstyle X}}{\overline{w_{0}}}-\frac{3({\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}})}{1+2{\mathfrak{K}}^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{\scriptscriptstyle X}{\mathfrak{K}}_{{\overline{{\scriptscriptstyle X}}}}/M_{p}^{2}}\;\frac{|w_{0}|^{2}}{M_{p}^{4}}\right]\,, (23)

and ask what the low energy potential for τ\tau is after extremization with respect to ϕ\phi. This extremization is also relatively simple to do when 𝔎X{\mathfrak{K}}_{{\scriptscriptstyle X}} (and so also 𝔎X¯X{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}} and 𝔎TX¯{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}) are independent of ϕ\phi (though we also relax the assumption of ϕ\phi-independence below).

In this case ϕ\phi enters into VFV_{\scriptscriptstyle F} only through wXw_{\scriptscriptstyle X} and so extremizing VFV_{\scriptscriptstyle F} with respect to ϕ\phi is equivalent to extremizing with respect to wXw_{\scriptscriptstyle X}. Since the dependence of VFV_{\scriptscriptstyle F} on wXw_{\scriptscriptstyle X} is quadratic the extremization with respect to wXw_{\scriptscriptstyle X} is easily obtained by evaluating at the saddle point

wX=3𝔎XT¯w0Mp2,w_{\scriptscriptstyle X}=-\frac{3{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}\,w_{0}}{M_{p}^{2}}\,, (24)

showing that in the general case wXw_{\scriptscriptstyle X} does not vanish, but is both Planck-suppressed and down by a power of 1/τ1/\tau (because logarithmic dependence of 𝔎{\mathfrak{K}} implies 𝔎XT¯1/τ{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}\sim 1/\tau). Using this in the potential leads to

VF3|w0|2τ2Mp4[𝔎X¯X𝔎XT¯𝔎TX¯+𝔎TT¯𝔎X¯X𝔎TX¯𝔎XT¯1+2𝔎XX¯𝔎X𝔎X¯/Mp2]=:Uτ4.V_{\scriptscriptstyle F}\simeq-\frac{3|w_{0}|^{2}}{\tau^{2}M_{p}^{4}}\left[{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}+\frac{{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}}{1+2{\mathfrak{K}}^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{\scriptscriptstyle X}{\mathfrak{K}}_{{\overline{{\scriptscriptstyle X}}}}/M_{p}^{2}}\right]=:\frac{U}{\tau^{4}}\,. (25)

Although the overall sign in this expression is negative the square bracket (and therefore also VFV_{\scriptscriptstyle F}) can have either sign depending on the properties of 𝔎{\mathfrak{K}}.

This derivation also shows why the simplifying assumption that 𝔎X¯X{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}} and 𝔎TX¯{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}} be ϕ\phi-independent is not crucial. The main point is that the dominant ϕ\phi dependence enters VFV_{\scriptscriptstyle F} at order 1/τ21/\tau^{2} and because this arises proportional to |wX|2|w_{\scriptscriptstyle X}|^{2} it always favours configurations that make wXw_{\scriptscriptstyle X} vanish. Although the subdominant 1/τ31/\tau^{3} terms in VFV_{\scriptscriptstyle F} move the minimum away from wX=0w_{\scriptscriptstyle X}=0, its displacement from zero it at most of order δwX1/τ\delta w_{\scriptscriptstyle X}\sim 1/\tau, and this is equally true whether or not ϕ\phi enters the potential through 𝔎{\mathfrak{K}} or WW. Because the dominant result is then quadratic in δwX\delta w_{\scriptscriptstyle X} it very robustly contributes VF1/τ4V_{\scriptscriptstyle F}\sim 1/\tau^{4} once ϕ\phi is eliminated. Although the detailed form of U(lnτ)U(\ln\tau) in (25) can depend on UV specifics, what is important is that the leading result is always proportional to 1/τ41/\tau^{4}.

2.1.4 Scales and supersymmetry breaking

We return now to the question of how large FXF^{\scriptscriptstyle X} is, and the consistency of using nilpotent fields. The starting point is expression (24) for the size of wXw_{\scriptscriptstyle X} which implies wXMμW3/(τMp2)w_{\scriptscriptstyle X}\sim M\mu_{\scriptscriptstyle W}^{3}/(\tau M_{p}^{2}), where we again estimate 𝔎XT¯M/τ{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}\sim M/\tau and w0μW3w_{0}\sim\mu_{\scriptscriptstyle W}^{3} on dimensional grounds, with the power of τ\tau coming from differentiating a function of lnτ\ln\tau. In this case the auxiliary field given in (15) becomes

FX¯eK/(2Mp2)KX¯XwXMμW3τ3/2Mp2.F^{{\overline{{\scriptscriptstyle X}}}}\sim e^{K/(2M_{p}^{2})}K^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}w_{\scriptscriptstyle X}\sim\frac{M\mu_{\scriptscriptstyle W}^{3}}{\tau^{3/2}M_{p}^{2}}\,. (26)

As we see in more detail in §3, the scale μX\mu_{\scriptscriptstyle X} defined by FX=μX2F^{\scriptscriptstyle X}=\mu^{2}_{\scriptscriptstyle X} sets the mass scale for e.g. scalar superpartners for SM particles, and so must be a UV scale (since it was the integrating out of these particles that necessitates use of the nilpotent formalism).

Eq. (26) for μX\mu_{\scriptscriptstyle X}, the expression mTeVM/τ1/2m_{\scriptscriptstyle TeV}\sim M/\tau^{1/2} for the SM-particle masses (also justified in more detail below) and141414We include here a factor ϵ5\epsilon^{5} in VminV_{\rm min} where ϵ<1/60\epsilon\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}1/60 – whose origins lie in the stabilization mechanism for τ\tau, as explained below eq. (78) – since this ‘order-unity’ factor matters when inferring a size for τ\tau. Vmin=ϵ5mvac4V_{\rm min}=\epsilon^{5}m_{\rm vac}^{4} for the vacuum energy determine the three input parameters μW\mu_{\scriptscriptstyle W}, MM and τ\tau. The relaxon mass mϕm_{\phi} then fixes gvgv. In particular, successful phenomenology requires

Vminϵ5M2μW6τ4Mp4(1011GeV)4,V_{\rm min}\sim\frac{\epsilon^{5}M^{2}\mu_{\scriptscriptstyle W}^{6}}{\tau^{4}M_{p}^{4}}\sim\Bigl{(}10^{-11}\;\hbox{GeV}\Bigr{)}^{4}\,, (27)

while SM scales are set (up to small dimensionless gauge and Yukawa couplings) by

mTeVMτ1/2103GeV,m_{\scriptscriptstyle TeV}\sim\frac{M}{\tau^{1/2}}\sim 10^{3}\;\hbox{GeV}\,, (28)

while superpartner masses must satisfy

μX2MμW3τ3/2Mp2>(104GeV)2.\mu_{\scriptscriptstyle X}^{2}\sim\frac{M\mu_{\scriptscriptstyle W}^{3}}{\tau^{3/2}M_{p}^{2}}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\Bigl{(}10^{4}\;\hbox{GeV}\Bigr{)}^{2}\,. (29)

These can be cast into two useful dimensionless combinations

μX2mTeV2μW3MMp2τ>100,\frac{\mu_{\scriptscriptstyle X}^{2}}{m_{\scriptscriptstyle TeV}^{2}}\sim\frac{\mu_{\scriptscriptstyle W}^{3}}{MM_{p}^{2}\sqrt{\tau}}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}100\,, (30)

and

VminμX4ϵ5τ<(1015)41060.\frac{V_{\rm min}}{\mu_{\scriptscriptstyle X}^{4}}\sim\frac{\epsilon^{5}}{\tau}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\Bigl{(}10^{-15}\Bigr{)}^{4}\sim 10^{-60}\,. (31)

Eq. (31) shows we wish to have τ\tau as large as possible, and (28) argues that this requires MM also to be chosen as large as possible. If MMp1018M\sim M_{p}\sim 10^{18} GeV then (28) implies τ1030\tau\sim 10^{30} and so (30) and (31) then give ϵ105\epsilon\sim 10^{-5} and μW4.6Mpτ1/6105Mp\mu_{\scriptscriptstyle W}\sim 4.6\,M_{p}\,\tau^{1/6}\sim 10^{5}M_{p}:

MMp,μW105Mp,τ1030andϵ105.M\sim M_{p}\,,\quad\mu_{\scriptscriptstyle W}\sim 10^{5}\,M_{p}\,,\quad\tau\sim 10^{30}\quad\hbox{and}\quad\epsilon\sim 10^{-5}\,. (32)

Although having scales larger than MpM_{p} might seem unusual, §3.3 argues why it can actually be expected for the scale μW\mu_{\scriptscriptstyle W} if the UV completion is a string compactification.

The above estimates rely heavily on eq. (25) for the scalar potential, and this (in particular the factors of ϵ\epsilon) relies on precisely how a minimum arises for VV that fixes the present-day value for τ\tau. These estimates use the stabilization mechanism described in detail in §3.2.1. The sensitivity of the above numbers to these choices can be assessed by comparing this stabilization mechanism with the alternative mechanism sketched in §3.2.2 (which so far as we can see turns out not to improve on the numbers given above).

Although SM superpartners dominantly acquire masses through their couplings to FXF^{\scriptscriptstyle X} (see §2.2) the same is not true of the gravitino, which responds to the total invariant order parameter

:=(KAB¯FAF¯B¯)1/2=[eK/Mp2KA¯BDAW¯DBW]1/2eK/(2Mp)|W|MpμW3τ3/2Mp,{\cal F}:=\Bigl{(}K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}F^{\scriptscriptstyle A}{\overline{F}}^{{\overline{{\scriptscriptstyle B}}}}\Bigr{)}^{1/2}=\Bigl{[}e^{K/M_{p}^{2}}K^{\bar{\scriptscriptstyle A}{\scriptscriptstyle B}}{\overline{D_{\scriptscriptstyle A}W}}\,D_{\scriptscriptstyle B}W\Bigr{]}^{1/2}\sim e^{K/(2M_{p})}\frac{|W|}{M_{p}}\sim\frac{\mu_{\scriptscriptstyle W}^{3}}{\tau^{3/2}M_{p}}\,, (33)

where the approximate equalities use (15), the form for KK and the earlier estimate for wXw_{\scriptscriptstyle X}. The corresponding gravitino mass is

m3/2=eK/(2Mp2)|W|Mp2MpμW3Mp2τ3/2,m_{3/2}=e^{K/(2M_{p}^{2})}\frac{|W|}{M_{p}^{2}}\sim\frac{{\cal F}}{M_{p}}\sim\frac{\mu_{\scriptscriptstyle W}^{3}}{M_{p}^{2}\tau^{3/2}}\,, (34)

which for the benchmark numerical estimates of (32) gives

100Mp2τ>(104GeV)2andm3/2100Mpτ>1014GeV=105eV.{\cal F}\sim\frac{100M_{p}^{2}}{\tau}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\Bigl{(}10^{4}\;\hbox{GeV}\Bigr{)}^{2}\quad\hbox{and}\quad m_{3/2}\sim\frac{100M_{p}}{\tau}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{-14}\;\hbox{GeV}=10^{-5}\;\hbox{eV}\,. (35)

2.1.5 Relaxon mass

The mass of the relaxing field ϕ\phi must be small enough that it is present in the low-energy EFT where the relaxation takes place. This requires it to be lighter than the lightest known particle whose vacuum energy is known to be a problem: the electron.151515Notice that we are able to take the ϕ\phi mass to be this large because technical naturalness does not care about quadratic divergences, and instead is concerned with the dependence of low-energy observables on physical quantities like renormalized masses mm (see e.g. [2] for more details). We therefore ask

mϕgvτto be smaller thanmeyemTeVyeMτ0.5MeV.m_{\phi}\sim\frac{gv}{\sqrt{\tau}}\quad\hbox{to be smaller than}\quad m_{e}\sim y_{e}m_{\scriptscriptstyle TeV}\sim\frac{y_{e}M}{\sqrt{\tau}}\simeq 0.5\;\hbox{MeV}\,. (36)

This assumes no other particles are present with masses below the ϕ\phi mass but much larger than the cosmological constant scale. The bound mϕ<mem_{\phi}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}m_{e} implies gv<yeMgv\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}y_{e}M and so for the benchmark numbers of (32)

gv<meτ1/25×1012GeVM.gv\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}m_{e}\,\tau^{1/2}\sim 5\times 10^{12}\;\hbox{GeV}\ll M\,. (37)

To further determine the size of gg requires knowing the relative sizes of vv and MM. Since §3 shows that loops involving a Standard Model particle with mass mM/τm\sim M/\sqrt{\tau} contribute δwXM2\delta w_{\scriptscriptstyle X}\sim M^{2}, naturalness requires the constant term in (19) not to be small – i.e. gv2M2gv^{2}\sim M^{2}. Demanding mϕmem_{\phi}\sim m_{e} therefore requires gye2g\sim y_{e}^{2}.

Figure 1 illustrates the different scales separating the non-linearly realized supersymmetry breaking sector to the linearly realized supersymmetric gravity sector.

Refer to caption
Figure 1: Scales and thresholds differentiating the nonlinearly realized supersymmetric sector (including the Standard Model and the relaxon field) with the gravity sector (including the graviton and axio-dilaton multiplets) in which supersymmetry is linearly realized. The line marked ‘eV’ represents the naive scale for the axion decay constant, where the gravity sector requires UV completion, possibly to a higher-dimensional supergravity. This scale is drawn above the ‘gravitino’ scale, as happens when (100) is true and b>ab>a, although these scales coincide in most of the scenarios we examine. Three different EFT thresholds are described in the main text: Threshold 1 denotes the point below which SM degrees of freedom must nonlinearly realize supersymmetry (because their superpartners are integrated out) while Threshold 2 describes the EFT for which the SM degrees of freedom are also integrated out but in which the relaxon must still be present. Threshold 3 represents the scale below which the gravity sector must be 4-dimensional and so for which our 4D EFT relevant to astrophysics and cosmology applies.

2.2 Couplings to ordinary matter

The remainder of this section fills in what this framework implies for the properties of the new fields and explores how they couple to ordinary SM fields. Along the way we justify several earlier assertions, such as that SM particles have masses proportional to τ1/2\tau^{-1/2}.

We first focus on SM couplings, returning to Dark Sector properties in §2.3. SM particles are again included using the nilpotent framework of [27, 28, 29, 30]. This involves adding new constrained superfields for each individual SM particle, along the lines of [15], including:

  • A chiral multiplet YY satisfying the constraint XY=0XY=0 for each Standard Model fermion. This constraint allows the scalar part 𝒴{\cal Y} to be expressed in terms of other fields. Although Y2Y^{2} is nonzero for fields satisfying this constraint, it happens that the constraint implies Y3=0Y^{3}=0 pointwise.

  • Standard Model gauge bosons are contained within left-chiral spinor multiplets 𝒲{\cal W} (just as if they were supersymmetric) but their fermionic partners get projected out as independent fields by the constraint X𝒲=0X{\cal W}=0.

  • The Higgs is described by a superfield HH subject to the constraint 𝒟¯(XH¯)=0{\overline{{\cal D}}}(X{\overline{H}})=0, much as was done for the ‘relaxon’ ϕ\phi above. (If it weren’t for the condition that the relaxaton field ϕ\phi should be light enough to appear in the EFT below the electron mass, its role could be played by the Standard Model Higgs itself.)

With these ingredients one simply turns the same crank as in the previous section to compute the component lagrangian. (A simpler alternate derivation of matter couplings to the dilaton is also given in §4.2.) We illustrate the main points of this construction by summarizing the example of a single charged Dirac fermion, whose presence is encoded by adding two multiplets Y±Y_{\pm} (representing, say, the left-handed electron and positron). Both multiplets are subject to the constraint XY±=0XY_{\pm}=0. The couplings of this field to the fields TT, XX and Φ\Phi is obtained as before by including Y±Y_{\pm} in the Kähler function and superpotential:

K(T,T¯,X,X¯,Φ,Φ¯,Y±,Y¯±)3Mp2ln(τk+hτ+)\displaystyle K(T,{\overline{T}},X,{\overline{X}},\Phi,{\overline{\Phi}},Y_{\pm},{\overline{Y}}_{\pm})\simeq-3M_{p}^{2}\ln\left(\tau-k+\frac{h}{\tau}+\cdots\right)
andW(T,X,Φ,Y±)w0(Φ,Y+Y)+XwX(Φ,Y+Y),\displaystyle\quad\hbox{and}\quad W(T,X,\Phi,Y_{\pm})\simeq w_{0}(\Phi,Y_{+}Y_{-})+Xw_{{\scriptscriptstyle X}}(\Phi,Y_{+}Y_{-})\,, (38)

where charge conservation requires WW to depend only on the combination Y+YY_{+}Y_{-}, and

k=1Mp2{𝔎(Φ,Φ¯,Y±,Y¯±,lnτ)+[X𝔎X(Φ,Φ¯,Y¯±,lnτ)+h.c.]+X¯X𝔎XX¯(Φ,Φ¯,lnτ)}.k=\frac{1}{M_{p}^{2}}\Bigl{\{}{\mathfrak{K}}(\Phi,{\overline{\Phi}},Y_{\pm},{\overline{Y}}_{\pm},\ln\tau)+\Bigl{[}X{\mathfrak{K}}_{{\scriptscriptstyle X}}(\Phi,{\overline{\Phi}},{\overline{Y}}_{\pm},\ln\tau)+\hbox{h.c.}\Bigr{]}+{\overline{X}}X\;{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}(\Phi,{\overline{\Phi}},\ln\tau)\Bigr{\}}\,. (39)

We assume the potential is chosen so that the auxiliary fields FY±F^{{\scriptscriptstyle Y}\pm} vanish in the vacuum, as required by unbroken electromagnetic gauge invariance. Since XY±=0XY_{\pm}=0 constrains the sclar part 𝒴±{\cal Y}_{\pm} to be a function of the fermions, the field Y±Y_{\pm} is ‘pure fluctuation’ and the lagrangian can be expanded in powers of Y±Y_{\pm}. The leading parts of the Kähler function and superpotential are

w0=𝔴0(Φ)+𝔪(Φ)Y+Y+andwX=𝔴X(Φ)+𝔫(Φ)Y+Y+,w_{0}={\mathfrak{w}}_{0}(\Phi)+{\mathfrak{m}}(\Phi)\,Y_{+}Y_{-}+\cdots\quad\hbox{and}\quad w_{{\scriptscriptstyle X}}={\mathfrak{w}}_{{\scriptscriptstyle X}}(\Phi)+{\mathfrak{n}}(\Phi)\,Y_{+}Y_{-}+\cdots\,, (40)

and Φ\Phi and Y±Y_{\pm} can be rescaled so that

𝔎\displaystyle{\mathfrak{K}} =\displaystyle= ΦΦ¯+Y+Y¯++YY¯+(cubic and higher)\displaystyle\Phi{\overline{\Phi}}+Y_{+}{\overline{Y}}_{+}+Y_{-}{\overline{Y}}_{-}+\hbox{(cubic and higher)}
𝔎X\displaystyle{\mathfrak{K}}_{\scriptscriptstyle X} =\displaystyle= κ(Φ,Φ¯)+𝔢(Y+Y¯++YY¯)+,\displaystyle\kappa(\Phi,{\overline{\Phi}})+{\mathfrak{e}}(Y_{+}{\overline{Y}}_{+}+Y_{-}{\overline{Y}}_{-})+\cdots\,, (41)

and so on.

Denoting, as before, 𝒫τ(𝔎/Mp2){\cal P}\simeq\tau-({{\mathfrak{K}}}/{M_{p}^{2}}) one finds in addition to (14) the new Kähler derivatives (evaluated at 𝒳=0{\cal X}=0)

D±W=WY±+KY±WMp2=𝔪Y+3Y¯±𝒫Mp2(𝔴0+𝔪Y+Y),D_{\pm}W=W_{{\scriptscriptstyle Y}_{\pm}}+\frac{K_{{\scriptscriptstyle Y}_{\pm}}W}{M_{p}^{2}}={\mathfrak{m}}\,Y_{\mp}+\frac{3{\overline{Y}}_{\pm}}{{\cal P}M_{p}^{2}}({\mathfrak{w}}_{0}+{\mathfrak{m}}\,Y_{+}Y_{-})\,, (42)

which automatically vanishes when fermions are set to zero. This precludes this fermion from mixing with the gravitino and also keeps these new supercovariant derivatives from contributing to the potential VFV_{\scriptscriptstyle F}, which therefore again has the form given in (16). The discussion in the previous section of the minima of this potential goes through as before, leading (at large τ\tau) again to (18) – or (25).

Denoting the fields collectively as zA:={T,za}z^{\scriptscriptstyle A}:=\{T,z^{a}\} with za:={X,Y+,Y}z^{a}:=\{X,Y_{+},Y_{-}\}, the leading parts of the Kähler metric and its inverse evaluated at X=Y±=0X=Y_{\pm}=0 generalize (13) to

KBA¯=3Mp2𝒫2(1ka¯kb𝒫kba¯+kbka¯)and soKA¯B=𝒫3Mp2(𝒫+𝔨2kbka¯ka¯b),K_{B{\overline{A}}}=\frac{3M_{p}^{2}}{{\cal P}^{2}}\left(\begin{array}[]{ccc}1&&-k_{\bar{a}}\\ -k_{b}&&{\cal P}\,k_{b\bar{a}}+k_{b}k_{\bar{a}}\end{array}\right)\quad\hbox{and so}\quad K^{\bar{A}B}=\frac{{\cal P}}{3M_{p}^{2}}\left(\begin{array}[]{ccc}{\cal P}+{\mathfrak{k}}^{2}&&k^{b}\\ k^{\bar{a}}&&k^{\bar{a}b}\end{array}\right)\,, (43)

where 𝔨2:=ka¯bka¯kb{\mathfrak{k}}^{2}:=k^{\bar{a}b}k_{\bar{a}}k_{b}. Notice that these expressions show that the matter fields do not alter the kinetic terms for the scalars ϕ\phi and 𝒯{\cal T} – leaving these still given by (2.1.2).

When 𝔎XX¯𝔎Y±Y¯±1{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}\simeq{\mathfrak{K}}_{{\scriptscriptstyle Y}_{\pm}{\overline{{\scriptscriptstyle Y}}}_{\pm}}\simeq 1 these formulae imply

KXX¯KY+Y¯+=KYY¯3τKY+YX¯3𝔢τ,K_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}\simeq K_{{\scriptscriptstyle Y}_{+}{\overline{{\scriptscriptstyle Y}}}_{+}}=K_{{\scriptscriptstyle Y}_{-}{\overline{{\scriptscriptstyle Y}}}_{-}}\simeq\frac{3}{\tau}\quad K_{{\scriptscriptstyle Y}_{+}{\scriptscriptstyle Y}_{-}{\overline{{\scriptscriptstyle X}}}}\simeq\frac{3{\mathfrak{e}}}{\tau}\,, (44)

when evaluated at 𝒳=𝒴±=0{\cal X}={\cal Y}_{\pm}=0 and to leading order in 1/τ1/\tau. The leading part of the kinetic terms for the fermion ψ±Y±\psi_{\pm}\in Y_{\pm} are given by

kinψ3𝒫(ψ¯γL / Dψ+ψ¯γR / Dψ)3τψ¯ / Dψ,{\cal L}_{{\rm kin}\,\psi}\simeq-\frac{3}{{\cal P}}\Bigl{(}{\overline{\psi}}\gamma_{\scriptscriptstyle L}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\psi+{\overline{\psi}}\gamma_{\scriptscriptstyle R}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\psi\Bigr{)}\simeq-\frac{3}{\tau}\;{\overline{\psi}}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\psi\,, (45)

though these coefficients are also ϕ\phi-dependent at subdominant orders in 1/τ1/\tau.

The fermion mass term is similarly given by the supergravity expression

massψ=12mABψ¯AγLψB+h.c.{\cal L}_{{\rm mass}\,\psi}=-\frac{1}{2}\,m_{{\scriptscriptstyle A}{\scriptscriptstyle B}}{\overline{\psi}}^{\scriptscriptstyle A}\gamma_{\scriptscriptstyle L}\psi^{\scriptscriptstyle B}+\hbox{h.c.} (46)

where

mAB=eK/(2Mp2)DADBW=eK/(2Mp2)[(A+KAMp2)DBWΓABCDCW]m_{{\scriptscriptstyle A}{\scriptscriptstyle B}}=e^{K/(2M_{p}^{2})}D_{\scriptscriptstyle A}D_{\scriptscriptstyle B}W=e^{K/(2M_{p}^{2})}\left[\left(\partial_{\scriptscriptstyle A}+\frac{K_{\scriptscriptstyle A}}{M_{p}^{2}}\right)D_{\scriptscriptstyle B}W-\Gamma^{\scriptscriptstyle C}_{{\scriptscriptstyle A}{\scriptscriptstyle B}}D_{\scriptscriptstyle C}W\right] (47)

and the target-space connection is ΓABC=gCD¯AgBD¯=KCD¯KABD¯\Gamma^{\scriptscriptstyle C}_{{\scriptscriptstyle A}{\scriptscriptstyle B}}=g^{{\scriptscriptstyle C}{\overline{{\scriptscriptstyle D}}}}\partial_{\scriptscriptstyle A}g_{{\scriptscriptstyle B}{\overline{{\scriptscriptstyle D}}}}=K^{{\scriptscriptstyle C}{\overline{{\scriptscriptstyle D}}}}K_{{\scriptscriptstyle A}{\scriptscriptstyle B}{\overline{{\scriptscriptstyle D}}}}. Our interest is in m+:=mY+Ym_{+-}:=m_{{\scriptscriptstyle Y}_{+}{\scriptscriptstyle Y}_{-}} for which we can use WY±=KY±=0W_{{\scriptscriptstyle Y}_{\pm}}=K_{{\scriptscriptstyle Y}_{\pm}}=0 leaving the only nonzero terms

m+=eK/(2Mp2)WY+YKY+YX¯F¯X¯𝔪𝒫3/23𝔢𝒫(wXτ3/2),m_{+-}=e^{K/(2M_{p}^{2})}W_{{\scriptscriptstyle Y}_{+}{\scriptscriptstyle Y}_{-}}-K_{{\scriptscriptstyle Y}_{+}{\scriptscriptstyle Y}_{-}{\overline{{\scriptscriptstyle X}}}}{\overline{F}}^{{\overline{{\scriptscriptstyle X}}}}\simeq\frac{{\mathfrak{m}}}{{\cal P}^{3/2}}-\frac{3{\mathfrak{e}}}{{\cal P}}\left(\frac{w_{\scriptscriptstyle X}}{\tau^{3/2}}\right)\,, (48)

which uses (40) and evaluates FX¯F^{{\overline{{\scriptscriptstyle X}}}} using (15). Expression (48) shows that the FXF^{\scriptscriptstyle X}-dependent term is subdominant in powers of 1/τ1/^{\prime}\tau and so can be dropped, leading to

massψ𝔪𝒫3/2(ψ¯+γLψ+h.c.)𝔪τ3/2ψ¯ψ,{\cal L}_{{\rm mass}\,\psi}\simeq-\frac{{\mathfrak{m}}}{{\cal P}^{3/2}}\Bigl{(}{\overline{\psi}}_{+}\gamma_{\scriptscriptstyle L}\psi_{-}+\hbox{h.c.}\Bigr{)}\simeq-\frac{{\mathfrak{m}}}{\tau^{3/2}}\;{\overline{\psi}}\psi\,, (49)

where the last equality assumes 𝔪{\mathfrak{m}} is real. Comparing with the kinetic term reveals the τ\tau-dependence of the physical mass to be

mψ𝔪3𝒫𝔪τ,m_{\psi}\simeq\frac{{\mathfrak{m}}}{3\sqrt{\cal P}}\sim\frac{{\mathfrak{m}}}{\sqrt{\tau}}\,, (50)

as claimed earlier.

When the dust settles what is important is that the leading dependence of mψm_{\psi} on powers of τ\tau comes completely from the factor eK/(2Mp2)e^{K/(2M_{p}^{2})} in (48), and as a result they can be traced to the Weyl rescalings g~μν=eK/(3Mp2)gμν\tilde{g}_{\mu\nu}=e^{K/(3M_{p}^{2})}g_{\mu\nu} required to put the lagrangian into Einstein frame. This is important for several reasons. First it ensures this τ\tau-dependence is universal, and so is the same for all SM particles. This is because the τ\tau dependence ultimately enters only through dimensionful parameters, and so can only do so through the quadratic term in the Higgs potential once formulated in an SUc(3)×SUL(2)×UY(1)SU_{c}(3)\times SU_{\scriptscriptstyle L}(2)\times U_{\scriptscriptstyle Y}(1) invariant way. This ensures that dimensionless physical quantities like Yukawa couplings and Higgs self-interactions remain τ\tau-independent, once expressed in terms of canonically normalized fields (see [15] for more details). As explained in more detail in (4.2.2), it also ensures that τ\tau couples to ordinary matter as does a Brans-Dicke scalar.

2.3 Dark sector properties

We now compute the main physical properties of the new dark particles in the dilaton and gravity supermultiplets: the complex scalar 𝒯{\cal T}, its dilatino partner, ξ\xi (and how this mixes with the gravitino). This section shows why these particles are light and collects predictions for their masses and couplings for later use.

2.3.1 Axio-dilaton

The leading contributions to the kinetic term for the field 𝒯{\cal T} is given by (2.1.2)

king3Mp24τ2[μτμτ+μ𝔞μ𝔞]=12(χ)2+3Mp24e2ζχ/Mp(𝔞)2,-\frac{{\cal L}_{\rm kin}}{\sqrt{-g}}\simeq\frac{3M_{p}^{2}}{4\tau^{2}}\Bigl{[}\partial^{\mu}\tau\partial_{\mu}\tau+\partial^{\mu}{\mathfrak{a}}\,\partial_{\mu}{\mathfrak{a}}\Bigr{]}=\frac{1}{2}(\partial\chi)^{2}+\frac{3M_{p}^{2}}{4}\,e^{-2\zeta\chi/M_{p}}(\partial{\mathfrak{a}})^{2}\,, (51)

where the axion field is defined by 𝔞=2Im𝒯{\mathfrak{a}}=2\,\hbox{Im}\,{\cal T} and the canonically normalized dilaton is

τ=τ0eζχ/Mpwithζ=23.\tau=\tau_{0}\,e^{\zeta\chi/M_{p}}\qquad\hbox{with}\qquad\zeta=\sqrt{\frac{2}{3}}\,. (52)

Dilaton mass

The potential for the dilaton is given by (25), VF=U(lnτ)/τ4V_{\scriptscriptstyle F}=U(\ln\tau)/\tau^{4}, and so looks like a modulated exponential potential in terms of χ\chi,

VFU(lnτ)τ4=U(χ)eλχ/Mpwithλ:=423.V_{\scriptscriptstyle F}\simeq\frac{U(\ln\tau)}{\tau^{4}}=U(\chi)\,e^{-\lambda\chi/M_{p}}\quad\hbox{with}\quad\lambda:=4\sqrt{\frac{2}{3}}\,. (53)

Whether this potential has a minimum or not depends on the function UU in and so also on the details of how 𝔎{\mathfrak{K}} depends on lnτ\ln\tau. §3.2 describes a simple choice for 𝔎(lnτ){\mathfrak{K}}(\ln\tau) that ensures there is a minimum for τ=τ0\tau=\tau_{0} (with V(τ0)>0V(\tau_{0})>0) and because UU comes as a simple rational function of lnτ\ln\tau the minimum can be at exponentially large values like τ01030\tau_{0}\sim 10^{30} assuming only that ratios amongst the parameters are order lnτ070\ln\tau_{0}\simeq 70.

Because the potential is dominantly exponential, its derivatives near the minimum are generically of the same order as the potential itself,161616This estimate is not changed by any dependence on lnτ\ln\tau of the form entertained in later sections.

V0:=V(χ0)λV0MpandV0′′:=V′′(χ0)λ2V0Mp2.V^{\prime}_{0}:=V^{\prime}(\chi_{0})\sim-\frac{\lambda V_{0}}{M_{p}}\quad\hbox{and}\quad V^{\prime\prime}_{0}:=V^{\prime\prime}(\chi_{0})\sim\frac{\lambda^{2}V_{0}}{M_{p}^{2}}\,. (54)

If V0V_{0} is to describe the present-day Dark Eneryg density (more about which in §5) then its potential energy must dominate the present-day Hubble scale, H02V0/(3Mp2)H_{0}^{2}\simeq V_{0}/(3M_{p}^{2}), and so V0(102eV)4V_{0}\simeq(10^{-2}\,\hbox{eV})^{4}. But this means the dilaton mass is also of order mχ2V′′(χ0)H02m_{\chi}^{2}\sim V^{\prime\prime}(\chi_{0})\sim H_{0}^{2} where H01032H_{0}\sim 10^{-32} eV.

It is usually difficult to get scalars to be naturally light enough to be cosmologically active, but this is actually generic for gravitationally coupled scalars whose mass comes from a potential that naturally describes Dark Energy. This makes them phenomenologically important because scalars this light are generically visible in tests of gravity. Later sections investigate some of these constraints, and how they might be avoided using the mechanism described in [42].

Axion interactions

Consider next the axionic part of 𝒯{\cal T}. Eq. (51) shows that the target-space dilaton-axion interactions can be interpreted as a dilaton-dependent axion decay constant of order171717To pin down the numerical factors requires knowing if a^\hat{a} is periodic, since then the requirement that the period be 2π2\pi removes the freedom to rescale the fields.

faMpτf_{a}\sim\frac{M_{p}}{\tau} (55)

which is of order 10310^{-3} eV for the numerical benchmarks of (32). This is an unusually small decay constant, and for traditional axion models one might expect this to cause a breakdown of the low-energy derivative expansion governs the EFT at energies of order faf_{a}.

In simple models this breakdown of EFT methods can indicate the approach of a critical point, with another field, φ\varphi, becoming light enough to combine with 𝔞{\mathfrak{a}} into a complex field φei𝔞\varphi\,e^{i{\mathfrak{a}}} that linearly realizes the axion shift symmetry in the unbroken phase. This need not be how it works in supergravity, however, since the axion lagrangian fa2(𝔞)2f_{a}^{2}(\partial{\mathfrak{a}})^{2} there often instead arises in dual form, fa2HμνλHμνλf_{a}^{-2}H^{\mu\nu\lambda}H_{\mu\nu\lambda} where H=dbH={\rm d}b is the field strength for a 2-form gauge potential bμνb_{\mu\nu} with faμ𝔞ϵμνλρνbλρf_{a}\partial_{\mu}{\mathfrak{a}}\propto\epsilon_{\mu\nu\lambda\rho}\partial^{\nu}b^{\lambda\rho}. The axion shift symmetry need not be linearly realized at scales above faf_{a}; in the dual formulation it is instead the gauge symmetry bb+dΛb\to b+{\rm d}\Lambda that is important.

But even for supergravity the scale faf_{a} usually indicates a breakdown of effective methods. In string theory (for example, taking the Type IIB axio-dilaton as representative) the decay constant is similarly suppressed, with fa/Mp1/τgsf_{a}/M_{p}\propto 1/\tau\propto g_{s} where gsg_{s} is the string coupling. In this example faf_{a} indeed indicates the breakdown of EFT methods that occurs at the string scale (which is systematically below the Planck scale when gs1g_{s}\ll 1).

Having such a small decay constant for 𝔞{\mathfrak{a}} could well indicate the advent of new physics in the gravity sector at eV scales. Supersymmetric large-extra-dimension models [45] provide concrete examples of what this physics might be, with τ\tau playing the role of the radion for the large dimensions and 𝔞{\mathfrak{a}} being its axionic partner. In this case the new physics that kicks in at eV scales is extra-dimensional: unitarity becomes restored by the tower of Kaluza-Klein modes for the dual 2-form field living in the extra dimensions. This would be consistent with the gravitino mass (35) also being at this scale, as appropriate if it were a specific KK mode of an extra-dimensional field.

Notice that the existence of such physics does not matter for applications (such as to cosmology or in the solar system) whose energies are much smaller than faf_{a}. It also need not affect the discussion of higher-energy couplings of the axio-dilaton to Standard Model particles – to the extent that extra-dimensional models are our guide – since these would be restricted to a brane within the extra dimensions and so remain largely four-dimensional.

Even if present, having extra dimensions at this scale need not undermine use of the above 4D nilpotent EFT, at least for the two most frequent applications. First, it does not affect applications (such as to cosmology or in the solar system as considered in later sections) whose energies are much smaller than faf_{a}. Second, it also need not affect the discussion of higher-energy interactions amongst Standard Model particles – at least to the extent that extra-dimensional models are our guide – since these would be restricted to a brane within the extra dimensions and so remain four-dimensional.

Axion mass

The size of the axion mass is more model-dependent, even though its shift symmetry cannot be broken without undermining the no-scale structure of the scalar potential for τ\tau. Unbroken shift symmetry makes the axion massless unless the symmetry is gauged, in which case the axion is eaten by a gauge boson to acquire nonzero mass through the Higgs mechanism.

Gauging the axion shift symmetry forces the replacement μ𝔞μ𝔞Aμ\partial_{\mu}{\mathfrak{a}}\to\partial_{\mu}{\mathfrak{a}}-A_{\mu} in the action, where AμA_{\mu} is the relevant gauge field. Working in the gauge 𝔞=0{\mathfrak{a}}=0 then turns the axion kinetic term of (51) into a gauge-boson mass term

massg=3Mp24τ2AμAμ.-\frac{{\cal L}_{\rm mass}}{\sqrt{-g}}=\frac{3M_{p}^{2}}{4\tau^{2}}\,A_{\mu}A^{\mu}\,. (56)

The size of the resulting mass depends on whether or not the gauge-boson kinetic term is τ\tau-dependent, and this depends on whether the holomorphic gauge kinetic function 𝔣{\mathfrak{f}} is a constant or proportional181818Having 𝔣T{\mathfrak{f}}\propto T implies the existence of an anomalous coupling 𝔞ϵμνλρFμνFλρ{\mathfrak{a}}\,\epsilon^{\mu\nu\lambda\rho}F_{\mu\nu}F_{\lambda\rho}, and so can only be present if there are also other charged fields to provide cancelling gauge anomalies. to TT: 𝔣=𝔣0=1/e2{\mathfrak{f}}={\mathfrak{f}}_{0}=1/e^{2} or 𝔣T{\mathfrak{f}}\propto T. The corresponding physical gauge-boson (and so also axion) mass then is of order MAMp/(𝔣τ)M_{\scriptscriptstyle A}\sim M_{p}/({\mathfrak{f}}\,\tau) and so is given by one of

MA{eMp/τ103eVif 𝔣0e2𝒪(1)Mp/τ3/21018eVif 𝔣TM_{\scriptscriptstyle A}\sim\begin{cases}{eM_{p}}/{\tau}\sim 10^{-3}\,\hbox{eV}&\hbox{if ${\mathfrak{f}}_{0}\sim e^{-2}\sim{\cal O}(1)$}\\ {M_{p}}/{\tau^{3/2}}\sim 10^{-18}\,\hbox{eV}&\hbox{if ${\mathfrak{f}}\propto T$}\end{cases} (57)

Gauging the axionic symmetry also introduces new complications to the scalar potential however, because in a supersymmetric theory it implies that KK depends on τ\tau only through the combination K=K(T+T¯𝒜)K=K(T+{\overline{T}}-{\cal A}) where 𝒜{\cal A} is the scalar superfield that contains the gauge potential AμA_{\mu}. This implies a contribution to the scalar potential involving the gauge-field auxiliary field D, of the form of a field-dependent Fayet-Iliopoulos term: δVKTD\delta V\sim K_{\scriptscriptstyle T}\hbox{D} and so after D is eliminated naively contributes to the D-term scalar potential an amount VDKTKT¯/(V_{\scriptscriptstyle D}\sim K_{\scriptscriptstyle T}K_{{\overline{{\scriptscriptstyle T}}}}/(Re 𝔣){\mathfrak{f}}). This would be 𝒪(1/τ2){\cal O}(1/\tau^{2}) if 𝔣𝔣0{\mathfrak{f}}\sim{\mathfrak{f}}_{0} and 𝒪(1/τ3){\cal O}(1/\tau^{3}) if 𝔣T{\mathfrak{f}}\propto T, which in either case would be large enough to overwhelm the 𝒪(1/τ4){\cal O}(1/\tau^{4}) term found above. This need not be a problem if other supermultiplets containing fields charged under the gauge symmetry exist (as anomaly cancellation typically requires in any case when 𝔣T{\mathfrak{f}}\propto T) since then these other fields adjust191919Indeed, there is a sense that the standard adjustment of charged fields to find the potential’s D-flat directions – such as occurs automatically in the MSSM for example – is a special case of the relaxon mechanism described above for ϕ\phi. to ensure that D=0=0.

We see from this discussion that the axion could simply be massless, or it could be eaten by the Higgs mechanism in a meal that necessarily involves the presence of other light fields (the gauge multiplet itself at the very least) in the supersymmetric sector. Depending on the τ\tau-dependence of the gauge couplings such mixings would give the axion a mass that could be as large as Mp/τ10M_{p}/\tau\sim 10 eV or as low as Mp/τ3/21018M_{p}/\tau^{3/2}\sim 10^{-18} eV.

2.3.2 Dark fermions

The supersymmetric sector necessarily also involves very light fermions: both the gravitino – c.f. eq. (34) – and the dilaton’s light partner ξ\xi (the dilatino), in addition to the usual Standard Model neutrinos. For later use we here summarize the leading features of this fermionic sector.

For simplicity we assume the gravity-dilaton sector not to break lepton number, with lepton number broken only by the neutrino masses themselves. In practice this allows us to ignore mixing between the gravitino/dilatino sector and the neutrinos, though there is clearly much interest in exploring the phenomenology allowed by more general assumptions.

Under these circumstances the couplings in the dilatino/gravitino sector are the usual ones predicted by supergravity, and so have the form

fermiong\displaystyle\frac{{\cal L}_{\rm fermion}}{\sqrt{-g}} =\displaystyle= i2ϵμνλρψ¯μγ5γνDλψρ12m3/2ψ¯μγμνψν\displaystyle-\frac{i}{2}\,\epsilon^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{5}\gamma_{\nu}D_{\lambda}\psi_{\rho}-\frac{1}{2}\,m_{3/2}\,{\overline{\psi}}_{\mu}\gamma^{\mu\nu}\psi_{\nu}
[12KTT¯ξ¯γR / Dξ12𝔪ξξ¯γLξ+𝔪ξgMpψ¯μγRγμξ+h.c.]+\displaystyle\qquad-\left[\frac{1}{2}K_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}{\overline{\xi}}\gamma_{\scriptscriptstyle R}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\xi-\frac{1}{2}{\mathfrak{m}}_{\xi}\,{\overline{\xi}}\gamma_{\scriptscriptstyle L}\xi+\frac{{\mathfrak{m}}_{\xi g}}{M_{p}}\,{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle R}\gamma^{\mu}\xi+\hbox{h.c.}\right]+\cdots

where m3/2m_{3/2} is given in (34) and the ellipses represent other terms (like 4-fermi interactions) whose form is not required in what follows. The supergravity expression for 𝔪ξg{\mathfrak{m}}_{\xi g} (evaluating at X=0X=0) is given by [46]

𝔪ξg=12Mp2eK/(2Mp2)DTW12𝒫3/2Mp2(3w0𝒫)w0τ5/2Mp2,{\mathfrak{m}}_{\xi g}=\frac{1}{\sqrt{2}\;M_{p}^{2}}\,e^{K/(2M_{p}^{2})}D_{\scriptscriptstyle T}W\simeq\frac{1}{\sqrt{2}\;{\cal P}^{3/2}M_{p}^{2}}\left(-\frac{3w_{0}}{{\cal P}}\right)\sim-\frac{w_{0}}{\tau^{5/2}M_{p}^{2}}\,, (59)

while that for 𝔪ξ{\mathfrak{m}}_{\xi} (using WT=0W_{\scriptscriptstyle T}=0) is

𝔪ξ=mTT\displaystyle{\mathfrak{m}}_{\xi}=m_{{\scriptscriptstyle T}{\scriptscriptstyle T}} =\displaystyle= 1Mp2eK/(2Mp2)DTDTW1Mp2eK/(2Mp2)[KTMp2DTWΓTTADAW]\displaystyle\frac{1}{M_{p}^{2}}e^{K/(2M_{p}^{2})}D_{\scriptscriptstyle T}D_{\scriptscriptstyle T}W\simeq\frac{1}{M_{p}^{2}}e^{K/(2M_{p}^{2})}\left[\frac{K_{\scriptscriptstyle T}}{M_{p}^{2}}\,D_{\scriptscriptstyle T}W-\Gamma^{\scriptscriptstyle A}_{{\scriptscriptstyle T}{\scriptscriptstyle T}}D_{\scriptscriptstyle A}W\right]
\displaystyle\simeq 1Mp2eK/(2Mp2)[(KTMp2KTA¯KTTA¯)DTWKXA¯KTTA¯DXW]3w0τ7/2Mp2,\displaystyle\frac{1}{M_{p}^{2}}e^{K/(2M_{p}^{2})}\left[\left(\frac{K_{\scriptscriptstyle T}}{M_{p}^{2}}-K^{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle A}}}}K_{{\scriptscriptstyle T}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle A}}}}\right)D_{\scriptscriptstyle T}W-K^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle A}}}}K_{{\scriptscriptstyle T}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle A}}}}\,D_{\scriptscriptstyle X}W\right]\simeq\frac{3w_{0}}{\tau^{7/2}M_{p}^{2}}\,,

which uses KT3Mp2/𝒫K_{\scriptscriptstyle T}\simeq-3M_{p}^{2}/{\cal P} while KTA¯KTTA¯2/𝒫K^{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle A}}}}K_{{\scriptscriptstyle T}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle A}}}}\simeq-2/{\cal P} and KXA¯KTTA¯𝒪(1/𝒫3)K^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle A}}}}K_{{\scriptscriptstyle T}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle A}}}}\simeq{\cal O}(1/{\cal P}^{3}).

These expressions show that ξ\xi acquires a mass partially by mixing with the gravitino (because the supersymmetry breaking contribution DTW0D_{\scriptscriptstyle T}W\neq 0 causes it to contribute to the Goldstino) and partially through a direct mass term. Keeping in mind that the ξ\xi kinetic term is proportional to KTT¯Mp2/𝒫2K_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\sim M_{p}^{2}/{\cal P}^{2} shows that both these contributions are of the same order as the gravitino mass.

3 Naturalness issues

Knowing the forms for the couplings between ordinary particles and the dilaton multiplet allows a more explicit treatment of technical naturalness. Our exploration of this comes in three parts, each of which is considered in turn. First §3.1 estimates whether loops of heavy particles preserve the choices that have been made to achieve a small scalar potential. Then, §3.2 provides an explicit stabilization mechanism that produces exponentially large value for τ\tau without introducing unusually large parameters into the scalar potential. An explicit understanding of stabilization is necessary because it is the vacuum value of τ\tau that controls both electroweak and cosmological-constant hierarchies. Finally §3.3 explores the extent to which our choices are compatible with the properties of known stringy UV completions.

3.1 Loops

We start by asking whether loops of Standard Model particles preserve the main choices made to this point: (ii) use of the nilpotent supergravity form of the lagrangian; (iiii) stability of the small vacuum energy within this supergravity formulation; (iiiiii) stability of the small scalar masses for the ϕ\phi and 𝒯{\cal T} fields; and (iviv) possible origins of the assumed lnτ\ln\tau dependence in kk (that play a role once we ask how τ\tau becomes stabilized at the large values).

3.1.1 Supergravity form

The entire discussion presupposes that the lagrangian has a supergravity form specified at the two-derivative level by a Kähler function KK, a superpotential WW and a gauge kinetic function 𝔣αβ{\mathfrak{f}}_{\alpha\beta}. In particular this is what ensures terms like |wX|2/τ2|w_{\scriptscriptstyle X}|^{2}/\tau^{2} arise in the scalar potential as a perfect square; an important precondition for this term becoming small as the relaxon field seeks its minimum. This structure is ultimately dictated by supersymmetry and its requirements for how auxiliary fields appear in the lagrangian, since it is the elimination of these that give the scalar potential its special form. The key role of auxiliary fields underlines the potential importance of non-propagating auxiliary fields for EFTs in general and for naturalness arguments in particular [25, 26, 17].

Are these features robust to quantum loops? Ref. [15] argues that they are, and does so by investigating how the effective couplings in the functions KK and WW evolve as heavy nonsupersymmetric particles are integrated out. The remainder of this section argues why this robustness also can be seen on more general symmetry grounds.

The physical assumption that justifies coupling to supergravity is that the mass splitting Δmg\Delta m_{g} within the graviton multiplet (and the dilaton multiplet TT) is much smaller than the corresonding splittings ΔmSM\Delta m_{\scriptscriptstyle SM} within multiplets containing Standard Model particles. This hierarchy seems likely to be natural given that supermultiplet splittings are of order Δm2gF\Delta m^{2}\sim gF where FF is the supersymmetry-breaking vev and gg is a measure of the multiplet’s coupling to it. Maintaining ΔmgΔmSM\Delta m_{g}\ll\Delta m_{\scriptscriptstyle SM} should only require Standard Model fields to couple more strongly to the supersymmetry breaking sector than does the weakest force of all: gravity.

It is the because ΔmgΔmSM\Delta m_{g}\ll\Delta m_{\scriptscriptstyle SM} that we can make our Wilsonian UV/IR split somewhere in between: ΔmgΛΔmSM\Delta m_{g}\ll\Lambda\ll\Delta m_{\scriptscriptstyle SM}. Since we are only interested in Standard Model scales at energies below Λ\Lambda, we are free to integrate out the SM superpartners to obtain a nonsupersymmetric matter sector coupled to supergravity.

Consider first how this theory looks in the global-supersymmetry limit MpM_{p}\to\infty. In this limit the low-energy sector contains the Standard model coupled to the Goldstone fermion GG [48] – and like for any Goldstone particle these couplings are dictated by the supersymmetry algebra itself. An arbitrary non-supersymmetric theory can be made globally supersymmetric (for free) by appropriately coupling a Goldstone fermion to it. As is always true when global symmetries break spontaneously, the only symmetry information that survives well below the symmetry breaking scale is encoded in the couplings of the appropriate Goldstone fields [49, 50, 51] (for a textbook description see [24]).

The basic claim of ref. [27] is that there is no loss of generality in describing these low-energy goldstino couplings in terms of the supersymmetric interactions of constrained superfields coupled to a nilpotent goldstino multiplet XX, and this ultimately is what guarantees that the Wilsonian EFT (for global supersymmetry) at any scale can be captured for some choice of the functions KK, WW and 𝔣αβ{\mathfrak{f}}_{\alpha\beta}. There is no loss of generality because an arbitrary nonsupersymmetric theory can be made supersymmetric ‘for free’ in this way. Because this framework is so general, it in particular must remain valid for the Wilsonian action as successive nonsupersymmetric Standard Model particles are integrated out.

Once the Standard Model sector is coupled to the goldstino in this way its couplings to the graviton multiplet are dictated by symmetry through the usual rules for gauging supersymmetry [47, 46].

3.1.2 Vacuum energy

Consider next the size of the vacuum energy within this supergravity framework, since this is the quantity that is normally never naturally small. The key assumptions in §2.1 are (a)(a) that all functions like 𝔎{\mathfrak{K}} and w0w_{0} depend on the generic UV scale MM simply as they should on dimensional grounds; and (b)(b) that the lagrangian can be organized into a series of powers of 1/τ1/\tau, with the scalar potential starting off at order 1/τ21/\tau^{2}. Neither of these properties are changed by contributions to the lagrangian due to loops of Standard Model particles.

To this end consider, for instance, a loop contribution to VV obtained by integrating out a Standard Model particle of mass mm, given that formulae like (50) show that this mass is given by m𝔪/τm\simeq{\mathfrak{m}}/\sqrt{\tau} where 𝔪yM{\mathfrak{m}}\sim yM for some Yukawa coupling yy. The dangerous part of this loop is generically given by

δVm416π2𝔪416π2τ2y4M416π2τ2.\delta V\sim\frac{m^{4}}{16\pi^{2}}\sim\frac{{\mathfrak{m}}^{4}}{16\pi^{2}\tau^{2}}\sim\frac{y^{4}M^{4}}{16\pi^{2}\tau^{2}}\,. (61)

This has precisely the 1/τ21/\tau^{2} dependence required to be interpreted as a contribution to VV coming from a correction to wXw_{{\scriptscriptstyle X}}. Furthermore, the size of this correction is of order δwX𝔪2/4πy2M2/4π\delta w_{\scriptscriptstyle X}\sim{\mathfrak{m}}^{2}/4\pi\sim y^{2}M^{2}/4\pi, consistent with assuming wXM2w_{\scriptscriptstyle X}\sim M^{2} (and with the results of [15]).

Contributions such as these to wXw_{\scriptscriptstyle X} are irrelevant to the value of VminV_{\rm min} to the extent that they do not remove the property that a zero of wXw_{{\scriptscriptstyle X}} exists for some choice of ϕ\phi. They only change the precise value of the field, ϕ0\phi_{0}, for which this minimum exists. It is for this reason that VminV_{\rm min} can be stable against integrating out Standard Model particles. Central to this stability is the scale-invariant form of the τ\tau-dependence of Standard Model masses.

The one exception to the general assignment of MM as a UV scale is the even larger value chosen for μW\mu_{\scriptscriptstyle W}. Such large values for μW\mu_{\scriptscriptstyle W} are known to be consistent with string compactifications, 202020The precise bound is W0𝒱1/3W_{0}\leq{\cal V}^{1/3} with 𝒱{\cal V} the overall volume. Although generically for flux superpotentials W0/Mp3W_{0}/M_{p}^{3} tends to be of order 1-100, but higher and smaller values are also allowed. Notice that the bound is saturated when the gravitino mass m3/2=MpW0/𝒱m_{3/2}=M_{p}W_{0}/{\cal V} is of the same order as the overall Kaluza-Klein scale MKKMp/𝒱2/3M_{{\scriptscriptstyle KK}}\sim M_{p}/{\cal V}^{2/3}, at which point the 4D EFT ceases to be valid. and generically arise whenever the gravitino mass is close to the Kaluza Klein scale [52]. Once chosen, the value of W0W_{0} remains unchanged as particles are integrated out. It is not affected by heavy supersymmetry-breaking effects because these always involve XX in the low-energy theory. Standard non-renormalization arguments [53] protect W0W_{0} when integrating out heavy supersymmetric physics.

3.1.3 Scalar masses

Besides the small value of VminV_{\rm min} there are two types of light scalar fields, whose masses must also be protected from loops if the model is to be technically natural.

Dilaton mass

The lightest scalar in the problem is the dilaton τ\tau itself, whose mass is shown in §2.3.1 to be of order the present-day Hubble scale. Can such a small scalar mass be stable against integrating out UV physics?

We argue here that it is, through the mechanism identified some time ago in [37]. The main point can be seen from eq. (54) which shows that the derivatives of the scalar potential are proportional to the value of the potential itself when evaluated at the minimum. This is an automatic consequence for the exponential potential that is dictated at leading order by the nonlinearly realized accidental scale invariance that underlies the model’s construction.

Approximate scale invariance links the small dilaton mass to the small value of VminV_{\rm min} and so the dilaton mass is guaranteed to be naturally small once the cosmological constant itself is. The generality of this argument makes it likely that any gravitationally coupled scalar appearing in VV should acquire a similar mass, pointing to a world where multiple scalars might be equally light.

Relaxon mass

A second relatively light scalar is the relaxon field ϕ\phi, which must be light enough to remain in the low-energy EFT defined below the electron mass. If ϕ\phi were not this light it would not be present to remove the dangerous |wX|2|w_{\scriptscriptstyle X}|^{2} contribution from the scalar potential generated once the electron is integrated out. §2.1 arranges ϕ\phi to be this light through two choices. First it is assumed that WW does not contain a term like MXΦWMX\Phi\in W that contributes a UV-sensitive linear term in ϕ\phi to wXw_{\scriptscriptstyle X}. It is only because of this that the ϕ\phi mass is controlled by the dimensionless coupling gg. This choice is natural since it can be enforced through a symmetry like ϕϕ\phi\to-\phi (or more generally by a continuous rephasing symmetry, provided that the corresponding massless Goldstone boson causes no trouble once ϕ0\langle\phi\rangle\neq 0).

The second choice required to keep mϕm_{\phi} small asks for the hierarchy gvMgv\ll M amongst the model parameters. We argue that this is also technically natural, and it is key for this argument that the ϕ\phi mass again comes from the leading supersymmetry-breaking piece of the superpotential: wXw_{\scriptscriptstyle X}. If wXw_{\scriptscriptstyle X} has a zero for some ϕ=v\phi=v then because it is a function only of ϕ2\phi^{2} its dependence near this zero very generically has the form wX=g(ϕ2v2)+w_{\scriptscriptstyle X}=g(\phi^{2}-v^{2})+\cdots where gg is dimensionless and we have seen that large ϕ\phi-independent Standard Model contributions δwXM2\delta w_{\scriptscriptstyle X}\sim M^{2} imply gv2M2gv^{2}\sim M^{2}. This corresponds to the superpotential used in earlier sections having the form

W=w0+X[g(Φ2v2)+].W=w_{0}+X\Bigl{[}g(\Phi^{2}-v^{2})+\cdots\Bigr{]}\,. (62)

Having gv2M2gv^{2}\sim M^{2} is completely consistent with mϕ2(gv)2gM2M2m_{\phi}^{2}\sim(gv)^{2}\sim gM^{2}\ll M^{2} provided that g1g\ll 1, showing that the hierarchy mϕgvMm_{\phi}\sim gv\ll M relies solely on the dimensionless coupling gg being small. But small dimensionless couplings can be completely natural given that loop corrections to a marginal term like gXΦ2WgX\Phi^{2}\in W depend only logarithmically on the mass of the particle in the loop.

How does one see in components that a quadratic term in ϕ\phi is has a marginal (dimensionless) coupling rather than a relevant (positive power of mass) one? This occurs because the component part of XΦ2WX\Phi^{2}\in W is FXϕ2VF^{\scriptscriptstyle X}\phi^{2}\in V and so is dimension-four (rather than dimension-two) due to the presence of the auxiliary field FXF^{\scriptscriptstyle X}.

3.1.4 Logarithmic τ\tau-dependence

Although logarithmic mass dependence arising from quantum loops do not threaten naturalness arguments, they do play an important role in the next section’s stabilization mechanism and do so because heavy-particle masses in this model are generically τ\tau-dependent. For instance the one-loop contribution to the scalar potential obtained when integrating out a particle of mass mm is strictly speaking not simply δVm4\delta V\sim m^{4}, but more accurately is given by

δV±𝔠m416π2ln(m2μ2)=±𝔠𝔪416π2τ2ln(𝔪2τμ2),\delta V\sim\pm{\mathfrak{c}}\,\frac{m^{4}}{16\pi^{2}}\ln\left(\frac{m^{2}}{\mu^{2}}\right)=\pm\frac{{\mathfrak{c}}\,{\mathfrak{m}}^{4}}{16\pi^{2}\tau^{2}}\ln\left(\frac{{\mathfrak{m}}^{2}}{\tau\mu^{2}}\right)\,, (63)

rather than simply being proportional to m4m^{4}. Here 𝔠{\mathfrak{c}} is a calculable number, μ\mu is an arbitrary renormalization scale, the upper (lower) sign applies for bosons (fermions) and the last equality assumes m2=𝔪2/τm^{2}={\mathfrak{m}}^{2}/\tau, as found above for Standard Model particles.

Similar logarithms appear quite generally for other effective operators once loop corrections are included, such as corrections that change expressions like (45) to

kinψ\displaystyle{\cal L}_{{\rm kin}\,\psi} \displaystyle\simeq 3τ{1+𝔠1αg4πln(m2μ2)+𝔠2[αg4πln(m2μ2)]2+}ψ¯ / Dψ\displaystyle-\frac{3}{\tau}\left\{1+{\mathfrak{c}}_{1}\,\frac{\alpha_{g}}{4\pi}\ln\left(\frac{m^{2}}{\mu^{2}}\right)+{\mathfrak{c}}_{2}\left[\frac{\alpha_{g}}{4\pi}\ln\left(\frac{m^{2}}{\mu^{2}}\right)\right]^{2}+\cdots\right\}{\overline{\psi}}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\psi
\displaystyle\simeq 3τ{1+𝔠1αg4πln(𝔪2τμ2)+𝔠2[αg4πln(𝔪2τμ2)]2+}ψ¯ / Dψ,\displaystyle-\frac{3}{\tau}\left\{1+{\mathfrak{c}}_{1}\,\frac{\alpha_{g}}{4\pi}\ln\left(\frac{{\mathfrak{m}}^{2}}{\tau\mu^{2}}\right)+{\mathfrak{c}}_{2}\left[\frac{\alpha_{g}}{4\pi}\ln\left(\frac{{\mathfrak{m}}^{2}}{\tau\mu^{2}}\right)\right]^{2}+\cdots\right\}{\overline{\psi}}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\psi\,,

and so on for other effective operators. Here αg\alpha_{g} is a dimensionless coupling constant and the 𝔠i{\mathfrak{c}}_{i} are again calculable numbers. Because the logarithmic dependence on τ\tau is related to logarithms of mass scales, standard renormalization-group arguments allow them to be resummed to all orders in αglnτ\alpha_{g}\ln\tau, such as is done when writing the running of dimensionless couplings in the form

4πα(μ)=4πα(μ0)+𝔟ln(μ2μ02).\frac{4\pi}{\alpha(\mu)}=\frac{4\pi}{\alpha(\mu_{0})}+{\mathfrak{b}}\,\ln\left(\frac{\mu^{2}}{\mu_{0}^{2}}\right)\,. (65)

τ\tau-dependence enters into expressions like these once these couplings are evaluated at physical masses μ02=m2=𝔪2/τ\mu^{2}_{0}=m^{2}={\mathfrak{m}}^{2}/\tau, such as when matching across particle thresholds.

The logarithmic τ\tau-dependence hidden within the running of gauge couplings shown in (65) actually plays a crucial role in later phenomenology because it also predicts the τ\tau-dependence expected for dynamical scales like the QCD scale

ΛQCD=μ0exp[12𝔟αs(μ0)]=𝔪Zτexp[12𝔟αs(MZ)].\Lambda_{\scriptscriptstyle QCD}=\mu_{0}\,\exp\left[-\frac{1}{2{\mathfrak{b}}\,\alpha_{s}(\mu_{0})}\right]=\frac{{\mathfrak{m}}_{\scriptscriptstyle Z}}{\sqrt{\tau}}\,\exp\left[-\frac{1}{2{\mathfrak{b}}\,\alpha_{s}(M_{\scriptscriptstyle Z})}\right]\,. (66)

Phenomenological success (such as tests of the equivalence princple) relies on this sharing (at leading order in 1/τ1/\tau) the same τ\tau-dependence as do other Standard Model masses, since at a microscopic level it is the QCD scale that determines most of the mass of macroscopic objects through its contribution to the nucleon mass.

Although the above considerations make it sound like all effective interactions must depend on lnτ\ln\tau, this need not actually be the case. One way to see why is to notice that τ\tau appears inside the logarithm in expressions like (3.1.4) multiplied by μ\mu, but μ\mu always cancels out of physical observables. This happens in detail because effective couplings are matched across different thresholds, leaving physical results depending only on the logarithm of physical mass ratios

ln(m12m22)=ln(𝔪12𝔪22),\ln\left(\frac{m^{2}_{1}}{m^{2}_{2}}\right)=\ln\left(\frac{{\mathfrak{m}}_{1}^{2}}{{\mathfrak{m}}^{2}_{2}}\right)\,, (67)

from which all powers of τ\tau cancel provided that m1m_{1} and m2m_{2} both share the same τ\tau-dependence (as is in particular true for all Standard Model particles to leading order in 1/τ1/\tau).

What can introduce lnτ\ln\tau dependence into the effective lagrangian is the presence within loops of particles with masses that depend differently on τ\tau. The gravitino and the axion and/or dilatino fields in the TT multiplet are examples of particle whose masses depend on τ\tau differently than for Standard Model particles, and there is no reason why the same should not occur for particles in the UV sector as well. If two species of particles have masses mi2τpim_{i}^{2}\propto\tau^{-p_{i}} with p1p2p_{1}\neq p_{2}, and if these particles can appear together in loops then lnτ\ln\tau dependence in couplings can arise through factors like

ln(m12m22)=A0(p1p2)A1lnτ,\ln\left(\frac{m_{1}^{2}}{m_{2}^{2}}\right)=A_{0}-(p_{1}-p_{2})A_{1}\ln\tau\,, (68)

for appropriate constants A0A_{0} and A1A_{1}. As is argued in §3.3 below, the existence of UV particles with different τ\tau-dependence in their masses is in fact very plausible in candidate UV completions. It is this observation that motivates our including lnτ\ln\tau dependence in the effective coupling functions considered in §2.1.

3.2 Dilaton stabilization

The story to this point describes a natural hierarchy, but only does so if the field τ\tau takes acceptably large values. We now argue that the potential for τ\tau actually does have minima at such large values, and that this can be achieved without losing control of the underlying approximations. We describe two mechanisms for doing so, starting first with the mechanism assumed when making the estimates in §2.1.4 and then sketching an alternative that (at face value) shows promise for providing additional suppression of the vacuum energy (but which we have so far been unable to exploit).

3.2.1 Logarithmic stabilization

We start with an example of dilaton stabilization that exploits the dependence U=U(lnτ)U=U(\ln\tau) in the potential (25), following the ideas in [37]. The main attraction of this mechanism is its ability to produce exponentially large values of τ\tau using only a mild hierarchy among the input parameters, whose ratios need not be smaller than 1/60\sim 1/60.

To make things concrete suppose the function kk appearing in (25) is given by

k=𝒦+(X+X¯)κ+XX¯𝒵k={\cal K}+(X+{\overline{X}})\kappa+X{\overline{X}}{\cal Z} (69)

where κ\kappa and 𝒵{\cal Z} are τ\tau-independent212121This assumption is just to simplify expressions, the general case with both κ\kappa and 𝒵{\cal Z} depending on lnτ\ln\tau works in the same way as below but with more cumbersome expressions. but 𝒦{\cal K} acquires a dependence on lnτ\ln\tau through the running of some UV-sector dimensionless coupling αg\alpha_{g}. In this case the potential (25) becomes

VF+3|w0|2τ4[𝒦𝒦′′1+2κ2/𝒵]=:𝒞τ4(𝒦𝒦′′),V_{\scriptscriptstyle F}\simeq+\frac{3|w_{0}|^{2}}{\tau^{4}}\left[\frac{{\cal K}^{\prime}-{\cal K}^{\prime\prime}}{1+2\kappa^{2}/{\cal Z}}\right]=:\frac{{\cal C}}{\tau^{4}}\Bigl{(}{\cal K}^{\prime}-{\cal K}^{\prime\prime}\Bigr{)}\,, (70)

where 𝒞:=3|w0|2/(1+2κ2/𝒵){\cal C}:=3|w_{0}|^{2}/(1+2\kappa^{2}/{\cal Z}) and primes denote differentiation with respect to lnτ\ln\tau.

To evaluate these derivatives write the perturbative expansion of 𝒦{\cal K} in the form

𝒦𝒦0+𝒦1(αg4π)+𝒦22(αg4π)2+{\cal K}\simeq{\cal K}_{0}+{\cal K}_{1}\left(\frac{\alpha_{g}}{4\pi}\right)+\frac{{\cal K}_{2}}{2}\,\left(\frac{\alpha_{g}}{4\pi}\right)^{2}+\cdots (71)

with

τddτ(αg4π)=:β(αg)=b1(αg4π)2+b2(αg4π)3+.\tau\frac{{\rm d}}{{\rm d}\tau}\left(\frac{\alpha_{g}}{4\pi}\right)=:\beta(\alpha_{g})=b_{1}\left(\frac{\alpha_{g}}{4\pi}\right)^{2}+b_{2}\left(\frac{\alpha_{g}}{4\pi}\right)^{3}+\cdots\,. (72)

The solution for the τ\tau-dependence of αg\alpha_{g} to leading order in αg\alpha_{g} becomes

4παg=b0b1lnτ,\frac{4\pi}{\alpha_{g}}=b_{0}-b_{1}\ln\tau\,, (73)

for some integration constant b0b_{0}. This solution neglects αg1\alpha_{g}\ll 1 while working to all orders in αglnτ\alpha_{g}\ln\tau, which is valuable if minimization occurs in the regime lnτ1/αg\ln\tau\sim 1/\alpha_{g} (as it will). For example, with the couplings normalized as above the constant b1b_{1} appropriate to NN charged fermions would be

b1=4N3.b_{1}=\frac{4N}{3}\,. (74)

Using (71) and (72) to evaluate the derivatives in (70) gives

VF\displaystyle V_{\scriptscriptstyle F} \displaystyle\simeq 𝒞τ4[𝒦1b1(αg4π)2+(𝒦1b2+𝒦2b12𝒦1b12)(αg4π)3\displaystyle\frac{{\cal C}}{\tau^{4}}\Bigl{[}{\cal K}_{1}b_{1}\left(\frac{\alpha_{g}}{4\pi}\right)^{2}+\Bigl{(}{\cal K}_{1}b_{2}+{\cal K}_{2}b_{1}-2{\cal K}_{1}b_{1}^{2}\Bigr{)}\left(\frac{\alpha_{g}}{4\pi}\right)^{3}
+(𝒦1b3+𝒦2b2+𝒦3b15𝒦1b1b23𝒦2b12)(αg4π)4+].\displaystyle\qquad\qquad\qquad+\Bigl{(}{\cal K}_{1}b_{3}+{\cal K}_{2}b_{2}+{\cal K}_{3}b_{1}-5{\cal K}_{1}b_{1}b_{2}-3{\cal K}_{2}b_{1}^{2}\Bigr{)}\left(\frac{\alpha_{g}}{4\pi}\right)^{4}+\cdots\Bigr{]}\,.

This potential can have a minimum at τ=τ0\tau=\tau_{0} for α0=αg(τ0)\alpha_{0}=\alpha_{g}(\tau_{0}) consistent with using perturbative methods provided there is a mild hierarchy amongst the coefficients 𝒦i{\cal K}_{i}. In particular, if |𝒦2/𝒦3|𝒪(ϵ)|{\cal K}_{2}/{\cal K}_{3}|\sim{\cal O}(\epsilon) and |𝒦1/𝒦3|𝒪(ϵ2)|{\cal K}_{1}/{\cal K}_{3}|\sim{\cal O}(\epsilon^{2}) for some smallish ϵ1/601\epsilon\sim 1/60\ll 1, then

VFτ=𝒞τ5(4𝒦+5𝒦′′𝒦′′′)4b1[𝒦1(α04π)2+𝒦2(α04π)3+𝒦3(α04π)4],\frac{\partial V_{\scriptscriptstyle F}}{\partial\tau}=\frac{{\cal C}}{\tau^{5}}\Bigl{(}-4{\cal K}^{\prime}+5{\cal K}^{\prime\prime}-{\cal K}^{\prime\prime\prime}\Bigr{)}\simeq-4b_{1}\left[{\cal K}_{1}\left(\frac{\alpha_{0}}{4\pi}\right)^{2}+{\cal K}_{2}\left(\frac{\alpha_{0}}{4\pi}\right)^{3}+{\cal K}_{3}\left(\frac{\alpha_{0}}{4\pi}\right)^{4}\right]\,, (76)

where the last equality drops the coefficients of α0n\alpha_{0}^{n} that are subleading in ϵ\epsilon.

Refer to caption
Figure 2: A sketch of the potential U(α)U(\alpha) vs α\alpha where VF=U(α)/τ4V_{\scriptscriptstyle F}=U(\alpha)/\tau^{4}. The plots are obtained from (3.2.1) using the representative values 𝒦1/𝒦3=0.01{\cal K}_{1}/{\cal K}_{3}=0.01 and 𝒦2/𝒦3=0.133{\cal K}_{2}/{\cal K}_{3}=-0.133 (arbitrary scale).

The solutions to VF=0V_{\scriptscriptstyle F}^{\prime}=0 at leading order in ϵ\epsilon therefore are

α0±12[𝒦2𝒦3±(𝒦2𝒦3)24𝒦1𝒦3]𝒪(ϵ).\alpha_{0\pm}\simeq\frac{1}{2}\left[-\frac{{\cal K}_{2}}{{\cal K}_{3}}\pm\sqrt{\left(\frac{{\cal K}_{2}}{{\cal K}_{3}}\right)^{2}-\frac{4{\cal K}_{1}}{{\cal K}_{3}}}\right]\sim{\cal O}(\epsilon)\,. (77)

For 𝒦1{\cal K}_{1} and 𝒦3{\cal K}_{3} positive and 𝒦2{\cal K}_{2} negative with 𝒦22>𝒦1𝒦3{\cal K}_{2}^{2}>{\cal K}_{1}{\cal K}_{3} there are two real roots for which both α0\alpha_{0} and VFV_{\scriptscriptstyle F} can be positive, with the minimum (maximum) being the root α0+\alpha_{0+} (or α0\alpha_{0-}). Because α0𝒪(ϵ)\alpha_{0}\simeq{\cal O}(\epsilon) eq. (73) implies lnτ01/α0𝒪(1/ϵ)\ln\tau_{0}\sim 1/\alpha_{0}\sim{\cal O}(1/\epsilon) at the minimum, provided the constants b0b_{0} and b1b_{1} are order unity. In principle the values of α0\alpha_{0} and τ0\tau_{0} can be adjusted independently by choosing b1b_{1} appropriately (such as by choosing NN in (74)).

The values of the potential and its second derivative

2VFτ2𝒞τ6(20𝒦29𝒦′′+10𝒦′′′𝒦′′′′).\frac{\partial^{2}V_{\scriptscriptstyle F}}{\partial\tau^{2}}\simeq\frac{{\cal C}}{\tau^{6}}\Bigl{(}20{\cal K}^{\prime}-29{\cal K}^{\prime\prime}+10{\cal K}^{\prime\prime\prime}-{\cal K}^{\prime\prime\prime\prime}\Bigr{)}\,. (78)

at these stationary points also turn out to be proportional to [𝒦1(α0/4π)2+𝒦2(α0/4π)3+𝒦3(α0/4π)4]\Bigl{[}{\cal K}_{1}(\alpha_{0}/4\pi)^{2}+{\cal K}_{2}(\alpha_{0}/4\pi)^{3}+{\cal K}_{3}(\alpha_{0}/4\pi)^{4}\Bigr{]} at leading order in ϵ\epsilon, showing that both VFV_{\scriptscriptstyle F} and 2VF/τ2\partial^{2}V_{\scriptscriptstyle F}/\partial\tau^{2} are of order ϵ5/τ04\epsilon^{5}/\tau_{0}^{4} when evaluated at the minimum, rather than the naive ϵ4/τ04\epsilon^{4}/\tau_{0}^{4}. All of these features are visible in the illustrative plot shown in Fig. 2, which uses a parameter set for which ϵ0.1\epsilon\sim 0.1 to plot U(α)U(\alpha) against α\alpha, where VF=U(α)/τ4V_{\scriptscriptstyle F}=U(\alpha)/\tau^{4}. This is the origin of the factors of ϵ\epsilon seen in (27), which in turn lead to the numerical estimates of (32) in §2.1.4.

3.2.2 An alternative stabilization scenario

The dependence of kk on lnτ\ln\tau described in the previous section leads to a potential whose minimum can easily occur at the extremely large values of τ\tau that appear in the benchmark values (32). For these choices the potential can be very small at its minimum but even so it is only as small as the observed Dark Energy density for extremely tiny values of the parameter ϵ105\epsilon\sim 10^{-5}. But if τ1030\tau\sim 10^{30} and ϵ105\epsilon\sim 10^{-5} then the leading ϵ5w02/τ4\epsilon^{5}w_{0}^{2}/\tau^{4} contribution to the potential is so small that the next-to-leading w02/τ5w_{0}^{2}/\tau^{5} contribution becomes competitive.

This observation suggests exploring situations where this naively subdominant 1/τ51/\tau^{5} contribution might actually dominate. This is actually what happens if kk does not depend on lnτ\ln\tau, since in this case K=3Mp2ln(τk)K=-3M_{p}^{2}\ln(\tau-k) is a no-scale model whose scalar potential vanishes for all τ\tau. Although we have argued that a lnτ\ln\tau-dependence to kk is plausible, it is also not compulsory and in its absence222222A natural way to have lnτ\ln\tau-dependence first arise in hh rather than in kk would be if the relevant coupling αg\alpha_{g} were itself proportional to 1/τ1/\tau (as happens if it is the coupling of a gauge field whose kinetic function is 𝔣abT{\mathfrak{f}}_{ab}\propto T). In this case hh might be expected to be only linear in lnτ\ln\tau, for which RG resummation is not needed. it is the h/τh/\tau contribution of (9) that dominates the potential and generate Vminw02/τ5V_{\rm min}\propto w_{0}^{2}/\tau^{5}. We therefore recompute the action without neglecting hh, which – recalling (10) – we take to have the form

h=1Mp2{(Φ,Φ¯,lnτ)+[XX(Φ,Φ¯,lnτ)+h.c.]+X¯XXX¯(Φ,Φ¯,lnτ)}.h=\frac{1}{M_{p}^{2}}\Bigl{\{}{\mathfrak{H}}(\Phi,{\overline{\Phi}},\ln\tau)+\Bigl{[}X{\mathfrak{H}}_{\scriptscriptstyle X}(\Phi,{\overline{\Phi}},\ln\tau)+\hbox{h.c.}\Bigr{]}+{\overline{X}}X{\mathfrak{H}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}(\Phi,{\overline{\Phi}},\ln\tau)\Bigr{\}}\,. (79)

The calculation is relatively easy because hh contributes to formulae like (13) or (2.1.2) only at through negligible terms that are subdominant in 1/τ1/\tau. The only place where nonzero hh actually matters is in the potential VFV_{\scriptscriptstyle F} because when kT=0k_{\scriptscriptstyle T}=0 the no-scale cancellation makes the hh-dependent term the dominant piece. The result for VFV_{\scriptscriptstyle F} is also simple to read off because it has the same form as (16) but with the substitutions

𝔎TX¯X¯τ2TX¯τand𝔎TT¯2τ3+T+T¯τ2TT¯τ,{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}\to\frac{{\mathfrak{H}}_{{\overline{{\scriptscriptstyle X}}}}}{\tau^{2}}-\frac{{\mathfrak{H}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}}{\tau}\quad\hbox{and}\quad{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\to-\frac{2{\mathfrak{H}}}{\tau^{3}}+\frac{{\mathfrak{H}}_{\scriptscriptstyle T}+{\mathfrak{H}}_{{\overline{{\scriptscriptstyle T}}}}}{\tau^{2}}-\frac{{\mathfrak{H}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}}{\tau}\,, (80)

while (to leading order) 𝔎X{\mathfrak{K}}_{\scriptscriptstyle X} and 𝔎XX¯{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}} remain unchanged. Since this substitution makes the 𝔎X¯X𝔎TX¯𝔎XT¯{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}} terms subdominant to the 𝔎TT¯{\mathfrak{K}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} term, we are led to the following form for the potential

VF\displaystyle V_{\scriptscriptstyle F} \displaystyle\simeq 1𝒫2{13𝔎X¯XwX¯wX+[𝔎X¯XwXw0¯Mp2(X¯τ2TX¯τ)+h.c.]\displaystyle\frac{1}{{\cal P}^{2}}\left\{\frac{1}{3}\,{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\overline{w_{{\scriptscriptstyle X}}}}w_{{\scriptscriptstyle X}}+\left[\frac{{\mathfrak{K}}^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}w_{{\scriptscriptstyle X}}{\overline{w_{0}}}}{M_{p}^{2}}\left(\frac{{\mathfrak{H}}_{{\overline{{\scriptscriptstyle X}}}}}{\tau^{2}}-\frac{{\mathfrak{H}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}}{\tau}\right)+\hbox{h.c.}\right]\right.
+3|w0|2/Mp41+2𝔎XX¯𝔎X𝔎X¯/Mp2(2τ3T+T¯τ2+TT¯τ)}.\displaystyle\qquad\qquad\left.+\frac{3{|w_{0}|^{2}}/{M_{p}^{4}}}{1+2{\mathfrak{K}}^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{\scriptscriptstyle X}{\mathfrak{K}}_{{\overline{{\scriptscriptstyle X}}}}/M_{p}^{2}}\left(\frac{2{\mathfrak{H}}}{\tau^{3}}-\frac{{\mathfrak{H}}_{\scriptscriptstyle T}+{\mathfrak{H}}_{{\overline{{\scriptscriptstyle T}}}}}{\tau^{2}}+\frac{{\mathfrak{H}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}}{\tau}\right)\right\}\,.

If the relaxon is now eliminated using its equation of motion it drives wXw_{\scriptscriptstyle X} to become order w0/τ2w_{0}/\tau^{2}, and this means all of the wXw_{\scriptscriptstyle X}-dependent terms now only contribute to VFV_{\scriptscriptstyle F} at order w02/τ6w_{0}^{2}/\tau^{6}. This makes them subdominant to the last line of (3.2.2), implying the leading potential for τ\tau has the advertised w02/τ5w_{0}^{2}/\tau^{5} form:

VFU(lnτ)τ5withU3|w0|2Mp4[2τ(T+T¯)+τ2TT¯1+2𝔎XX¯𝔎X𝔎X¯/Mp2].V_{\scriptscriptstyle F}\simeq\frac{U(\ln\tau)}{\tau^{5}}\quad\hbox{with}\quad U\simeq\frac{3|w_{0}|^{2}}{M_{p}^{4}}\left[\frac{2{\mathfrak{H}}-\tau\Bigl{(}{\mathfrak{H}}_{\scriptscriptstyle T}+{\mathfrak{H}}_{{\overline{{\scriptscriptstyle T}}}}\Bigr{)}+\tau^{2}{\mathfrak{H}}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}}{1+2{\mathfrak{K}}^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}{\mathfrak{K}}_{\scriptscriptstyle X}{\mathfrak{K}}_{{\overline{{\scriptscriptstyle X}}}}/M_{p}^{2}}\right]\,. (82)

This emphasizes that {\mathfrak{H}} could well (but need not) depend on lnτ\ln\tau through radiative corrections, in the same way as was true for 𝔎{\mathfrak{K}}.

Stabilization of τ\tau can now proceed precisely as above, if {\mathfrak{H}} depends on lnτ\ln\tau in the appropriate way. An important difference between this case and the one considered in §3.2.1 is how suppressed the potential’s minimum value, VminV_{\rm min}, is by powers of α0/4πϵ\alpha_{0}/4\pi\simeq\epsilon. Because UU had to vanish if 𝔎{\mathfrak{K}} were TT-independent it is proportional to derivatives of 𝔎{\mathfrak{K}}, making its expansion in powers of αg\alpha_{g} start at order αg2\alpha_{g}^{2}. The same is not true for (82), which does not vanish even if {\mathfrak{H}} is TT-indepenent. Consequently although U=𝒪(ϵ5)U={\cal O}(\epsilon^{5}) when it is constructed from 𝔎(lnτ){\mathfrak{K}}(\ln\tau) and evaluated at the minimum, the potential (82) need only be232323If αgϵ/τ\alpha_{g}\propto\epsilon/\tau then instead U𝒪(ϵ)U\sim{\cal O}(\epsilon). 𝒪(ϵ3){\cal O}(\epsilon^{3}).

Although the extra factor of 1/τ1/\tau in the potential seems promising, it is compensated by the fact that having wX1/τ2w_{\scriptscriptstyle X}\propto 1/\tau^{2} also implies FXKX¯XwXF^{\scriptscriptstyle X}\propto K^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}w_{\scriptscriptstyle X} is smaller by a factor of τ\tau and so is now τ5/2\propto\tau^{-5/2}. This means that the ratio Vmin/(FX)2V_{\rm min}/(F^{\scriptscriptstyle X})^{2} is now τ\tau-independent. The upshot is the additional powers of τ\tau suppress FXF^{\scriptscriptstyle X} relative to Mp2M_{p}^{2} but do not suppress VminV_{\rm min} relative to the supersymmetry breaking scale.

3.3 Scales and UV constraints

The point of view taken so far in this paper is to work within an EFT treatment of supergravity coupled to ordinary particles in four dimensions at and below electroweak energies. Our goal was to ensure that the prediction of small vacuum energies can remain stable as ordinary particles are integrated out. There are nonetheless at least five practical reasons to ask how this picture might arise from a UV completion (which we in practice take to be string theory, since this is sufficiently well-developed that questions can be sharply posed):

  • Even if extremely large values like τ1026\tau\sim 10^{26} are self-consistent within the low-energy EFT UV physics relates τ\tau to other observables in a way that can bring new constraints on its size. (We argue that the most obvious stringy provenance for τ\tau relates it to the extra-dimensional volume, Ω6\Omega_{6}, in string units 𝒱:=Ω6Ms6{\cal V}:=\Omega_{6}M_{s}^{6} through the relation 𝒱=τ3/2{\cal V}=\tau^{3/2}. If so, then extra-dimensosional constraints on 𝒱{\cal V} impose a new limit not obtainable purely within the low-energy EFT: τ<1018\tau\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{18}.)

  • Even if UV completions should preclude τ\tau being as large 102610^{26}, they also provide new suppression parameters that would be hard to guess purely from within the EFT. (We explore whether extra-dimensional warping might provide an example of such a suppression, with only mixed results.)

  • Although §3.1 argues that choices for parameters – e.g. values for w0w_{0} very different than Mp3M_{p}^{3} – remain stable as particles are integrated out, one can still ask whether and why the original values chosen in the UV make sense once embedded into a broader framework. (We argue that they do.)

  • The required EFT couplings (e.g. the axion decay constant) can – but need not (see below) – require the EFT to break down below TeV energies. If so, UV completions are important at experimentally accessible energies, and are needed to understand why this new physics does not undermine inferences obtained thinking about SM particles within the EFT. (We argue supersymmetric extra dimensions [45] plausibly unitarize the cases where new physics intervenes at sub-TeV energies, in which case SM physics is localized to a brane and remains 4-dimensional.)

  • Specific UV completions make predictions with varying reliability. Robust predictions usually rely only on symmetry properties, while more fragile ones depend more sensitively on UV details. Knowing which is which is useful but often beyond the reach of the low-energy EFT. (We argue our central three mechanisms are robust in this way, while other predictions – such as the value of w0w_{0} – are likely more model-specific.)

For these reasons we make preliminary UV connections here, while leaving a fuller study of UV completions for the future.

3.3.1 A stringy pedigree for τ\tau

The possibility there might be a connection to extra-dimensional models (including strings) is clear from the form for the dilaton-metric equations whose low-energy implications we discuss in §5, since these coincide precisely with the 4D effective description used in ref. [37] motivated by models like [45]. Such a convergence of dynamics is not a fluke: it reflects the underlying accidental scale invariance in higher-dimensional supergravity [18, 19, 20, 21, 22] that in turn follows from the automatic scale invariances of low-energy string vacua [17, 54].

The extended no-scale property relied on here also suggests a string connection, since it was first identified using explicit loop calculations in string compactifications in [34] and has been used in concrete mechanisms for moduli stabilisation and inflation in [55, 56]. It also fits with the general treatments of no-scale supergravity discussed in [36, 17].

UV constraints on τ\tau

The connection to string vacua is the most concrete for Type IIB flux compactifications on Calabi-Yau orientifolds since for these issues of modulus stabilization have been thought through in some detail [57], leading to a rich class of phenomenological constructions [58, 59, 60]. Moduli are central to these applications because they are naturally light (and so naturally appear in the low-energy 4D EFT), and modulus stabilization is what allows a quantitative calculation of the form of their low-energy scalar potential.

The number of moduli appearing in any UV completion depends on details of the extra dimensions, but in IIB models there is always at least one modulus associated with the extra-dimensional volume, Ω6\Omega_{6}, evaluated in string units: 𝒱=Ms6Ω6{\cal V}=M_{s}^{6}\,\Omega_{6}. 𝒱{\cal V} is a natural UV completion for τ\tau because it is both universal and large: the consistency of describing low-energy physics using a field theory requires 𝒱1{\cal V}\gg 1, making expansions in inverse powers of 𝒱{\cal V} useful and ubiquitous.

Among other things, the volume modulus determines the relative size of the string and 4D Planck scales through

MsMp𝒱1/2.M_{s}\sim\frac{M_{p}}{{\cal V}^{1/2}}\,. (83)

In some geometries all extra-dimensional length scales are similar in size, allowing the volume to be written Ω6L6\Omega_{6}\sim L^{6}. For these the volume also indicates the energy threshold above which 4D effective descriptions fail,

MKK=1LMs𝒱1/6Mp𝒱2/3.M_{\scriptscriptstyle KK}=\frac{1}{L}\sim\frac{M_{s}}{{\cal V}^{1/6}}\sim\frac{M_{p}}{{\cal V}^{2/3}}\,. (84)

When the EFT below MKKM_{\scriptscriptstyle KK} is a 4D supergravity then the dynamics of a potentially large class of 4D fields – the Kähler moduli – is described by a supergravity Kähler potential that at leading order at large 𝒱{\cal V} has the form (in Planck units)

K2ln𝒱,K\simeq-2\ln{\cal V}\,, (85)

where 𝒱{\cal V} is regarded as being an implicit function of the complex scalars that represent the Kähler moduli and transform in the standard way under 4D supersymmetry. Comparing this to K3lnτK\simeq-3\ln\tau suggests the identification

𝒱τ3/2,{\cal V}\sim\tau^{3/2}\,, (86)

in which case the string and KK scales are related to τ\tau by

Ms=Mpτ3/4andMKK=Mpτ.M_{s}=\frac{M_{p}}{\tau^{3/4}}\quad\hbox{and}\quad M_{\scriptscriptstyle KK}=\frac{M_{p}}{\tau}\,. (87)

Both MsM_{s} and MKKM_{\scriptscriptstyle KK} are scales associated with physics not present in the low-energy 4D EFT, but knowing their τ\tau dependence allows new constraints to be derived for τ\tau, since both should be UV scales from the point of view of the 4D theory. For instance, the constraint that MsM_{s} be a UV scale implies Ms>104M_{s}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{4} GeV and so

𝒱<1028orτ𝒱2/3<4×1018.{\cal V}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{28}\quad\hbox{or}\quad\tau\sim{\cal V}^{2/3}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}4\times 10^{18}\,. (88)

The stronger constraint MKK>104M_{\scriptscriptstyle KK}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{4} GeV would instead require 𝒱<1021{\cal V}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{21} and so τ<1014\tau\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{14} (though, as argued below, is less robust). At face value both of these constraints appear to rule out having τ\tau as large as 102610^{26} whenever (86) holds.

Asymmetric compactifications

The constraint from MKKM_{\scriptscriptstyle KK} can be evaded if all extra dimensions are not similar in size, but (88) holds more robustly. This is because the lower bound on MKKM_{\scriptscriptstyle KK} can be much smaller than 10410^{4} GeV [61] if these extra dimensions can only be probed only using gravitational strength interactions. When this is so two extra dimensions could be large enough to put MKKM_{\scriptscriptstyle KK} at sub-eV scales. Having extra dimensions only probed by gravitational interactions requires all SM fields to be trapped on a lower-dimensional surface (such as happens if they are open-string modes localized on a D-brane). Because not all dimensions can be this large any such a compactification must be ‘asymmetric’ in that it involves dimensions with dramatically different sizes.

The constraint (88) is insensitive to this type of asymmetry. Taking two dimensions much larger than the other four leads to a total volume 𝒱=𝒱4𝒱2{\cal V}={\cal V}_{4}{\cal V}_{2} where 𝒱4=Ms44{\cal V}_{4}=M_{s}^{4}\ell^{4} and 𝒱2=Ms2L2{\cal V}_{2}=M_{s}^{2}L^{2} with L\ell\ll L. In terms of these the two kinds of KK scale are

MKL=1=Ms𝒱41/4=Mp𝒱43/4𝒱21/2andMKS=1L=Ms𝒱21/2=Mp𝒱41/2𝒱2.M_{\scriptscriptstyle KL}=\frac{1}{\ell}=\frac{M_{s}}{{\cal V}_{4}^{1/4}}=\frac{M_{p}}{{\cal V}_{4}^{3/4}{\cal V}_{2}^{1/2}}\quad\hbox{and}\quad M_{\scriptscriptstyle KS}=\frac{1}{L}=\frac{M_{s}}{{\cal V}_{2}^{1/2}}=\frac{M_{p}}{{\cal V}_{4}^{1/2}{\cal V}_{2}}\,. (89)

If (for example) MKS10M_{\scriptscriptstyle KS}\sim 10 eV and MKL10M_{\scriptscriptstyle KL}\sim 10 TeV then these imply 𝒱21025{\cal V}_{2}\sim 10^{25} and 𝒱4100{\cal V}_{4}\sim 100 and so 𝒱=𝒱2𝒱41027{\cal V}={\cal V}_{2}{\cal V}_{4}\sim 10^{27}. The string scale remains MsMp/𝒱1/23×1014Mp60M_{s}\sim M_{p}/{\cal V}^{1/2}\sim 3\times 10^{-14}M_{p}\sim 60 TeV, which is acceptably above the weak scale (and a bit larger than MKLM_{\scriptscriptstyle KL}). But because the low-energy Kähler potential remains K=2ln𝒱K=-2\ln{\cal V} it again suggests the identification 𝒱=τ3/2{\cal V}=\tau^{3/2}, and so τ𝒱2/31018\tau\sim{\cal V}^{2/3}\sim 10^{18} which does not evade the bound (88).

In the absence of other suppression τ\tau is too small on its own to make VminV_{\rm min} in §2.1.4 tiny enough to describe Dark Energy. For instance, with τ1018\tau\sim 10^{18} the estimate (27) becomes

Vvacϵ5w02τ4Mp2ϵ5Mp4τ3ϵ51054Mp4{(104GeV)4ifϵ𝒪(1)(1GeV)4ifϵ𝒪(103)(104GeV)4ifϵ𝒪(106),V_{\rm vac}\sim\frac{\epsilon^{5}w_{0}^{2}}{\tau^{4}M_{p}^{2}}\sim\frac{\epsilon^{5}M_{p}^{4}}{\tau^{3}}\sim\epsilon^{5}10^{-54}M_{p}^{4}\sim\begin{cases}(10^{4}\;\hbox{GeV})^{4}&\hbox{if}\quad\epsilon\sim{\cal O}(1)\cr(1\;\hbox{GeV})^{4}&\hbox{if}\quad\epsilon\sim{\cal O}(10^{-3})\cr(10^{-4}\;\hbox{GeV})^{4}&\hbox{if}\quad\epsilon\sim{\cal O}(10^{-6})\,,\end{cases} (90)

and so on. At face value, asymmetric compactifications in themselves do not avoid the constraint (88) – provided τ\tau is identified with the volume modulus – and so seem unable to achieve the observed dark-energy density (although the vacuum energy is substantially reduced relative to standard constructions).

Warping

UV completions also suggest other sources of hierarchical suppression, and we now ask whether these can help reduce the value of VminV_{\rm min}. The most promising of these involves warping: the phenomenon where the 4D metric has a normalization that depends on an observer’s position within the extra dimensions: gμν(x,y)=e2A(y)gμν(x)g_{\mu\nu}(x,y)=e^{2A(y)}g_{\mu\nu}(x). String compactifications are known to produce such warping [57], which (depending on the kinds of fluxes present) can generate strongly warped throats within which eAe^{A} can be very small [62].

Schematically we regard a strongly warped geometry to be one whose warp-factor satisfies

eA𝒱1/61,e^{A}\ll{\cal V}^{-1/6}\ll 1\,, (91)

and within strongly warped geometries physical scales are suppressed by warping in their immediate vicinity. For instance the string scale as measured by the tension of a space-filling 3-brane sitting at a position yby_{b} in the extra dimensions is given by

Msw=MsΞ(ϖ,𝒱)=Mp𝒱1/2Ξ(ϖ,𝒱){Mp/𝒱1/2=Mp/τ3/4(unwarped)ϖMp/𝒱1/3=ϖMp/τ1/2(strongly warped),M^{w}_{s}=M_{s}\,\Xi(\varpi,{\cal V})=\frac{M_{p}}{{\cal V}^{1/2}}\;\Xi(\varpi,{\cal V})\simeq\begin{cases}M_{p}/{\cal V}^{1/2}=M_{p}/\tau^{3/4}&\hbox{(unwarped)}\cr\varpi M_{p}/{\cal V}^{1/3}={\varpi M_{p}}/{\tau^{1/2}}&\hbox{(strongly warped)}\,,\end{cases} (92)

where ϖ:=eA(yb)\varpi:=e^{A(y_{b})} and the calculable function Ξϖ𝒱1/61\Xi\sim\varpi{\cal V}^{1/6}\ll 1 when ϖ𝒱1/6\varpi\ll{\cal V}^{-1/6} but is 𝒪(1){\cal O}(1) otherwise. This means that a 3-brane tension in a strongly warped throat, T3(Msw)4T_{3}\sim(M_{s}^{w})^{4}, has the volume dependence

T3ϖ4Mp4𝒱4/3ϖ4Mp4τ2.T_{3}\sim\frac{\varpi^{4}M_{p}^{4}}{{\cal V}^{4/3}}\sim\frac{\varpi^{4}M_{p}^{4}}{\tau^{2}}\,. (93)

In particular, UV sources of supersymmetry breaking (such as anti-D3 branes) are energetically attracted to such regions, when they exist [58, 63]. Indeed, the observation that the energy density (93) shares the same τ\tau-dependence as does the leading |wX|2/τ2|w_{\scriptscriptstyle X}|^{2}/\tau^{2} term of the potential (16) shows how the nilpotent field formulation provides a natural supergravity description of antibranes situated in warped throats [64, 65, 66, 67, 68].

Comparing (92) to the KK scale also gives a lower bound that ϖ\varpi can take in a strongly warped region, since consistency of an extra-dimensional field-theory description requires Msw>MKKM^{w}_{s}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}M_{\scriptscriptstyle KK} and so ϖ>𝒱1/3\varpi\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}{\cal V}^{-1/3}. Field theoretic descriptions (usually the only ones available) for strongly warped throats therefore only allow

1𝒱1/6>ϖ>1𝒱1/3and so1τ1/4>ϖ>1τ1/2.\frac{1}{{\cal V}^{1/6}}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\varpi\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\frac{1}{{\cal V}^{1/3}}\quad\hbox{and so}\quad\frac{1}{\tau^{1/4}}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\varpi\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\frac{1}{\tau^{1/2}}\,. (94)

An obvious candidate UV completion for the Yoga models therefore has the SM localized on supersymmetry-breaking antibranes deep within a warped throat. In such a picture Standard Model fields are naturally described in the 4D EFT by constrained superfields coupled to the nilpotent goldstino field XX that captures the low-energy limit of a very unsupersymmetric place; the position of the anti-D3 brane attracted to the tip of a warped throat.242424For a recent discussion on how to adapt these tools to a concrete statistical proposal to explain the small cosmological constant see [11]. Concrete string scenarios with the Standard Model at an anti-brane are studied in [72]. Ref. [73, 74] obtains explicit string constructions for which the goldstino superfield XX is unequivocally identified.

Because the most general effective field theory including warping and a nilpotent superfield has not fully been studied, it is not clear how the different components of the XX-dependent Kähler potential depend on the warp factor. For the purposes of estimates we assume the following form:

kX:=𝔎XMpϖa,kXT¯:=𝔎XT¯Mpϖa,kXX¯:=𝔎XX¯Mp2ϖbandwXϖ(4+b)/2,k_{\scriptscriptstyle X}:=\frac{{\mathfrak{K}}_{\scriptscriptstyle X}}{M_{p}}\propto\varpi^{a}\,,\quad k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}:=\frac{{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}}{M_{p}}\propto\varpi^{a}\,,\quad k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}:=\frac{{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}}{M_{p}^{2}}\propto\varpi^{b}\quad\hbox{and}\quad w_{\scriptscriptstyle X}\propto\varpi^{(4+b)/2}\,, (95)

with arbitrary powers a,ba,b. The warping dependence of wXw_{\scriptscriptstyle X} is fixed by obtaining the known ϖ4\varpi^{4} result for an anti-brane tension. The choice a=ba=b would correspond to all XX-dependent terms in kk sharing the same warping dependence while b=2ab=2a corresponds to the situation where the warping can be removed from kk (but not also from WW) by rescaling XX appropriately.

Eq. (95) predicts the warping dependence of FXF^{\scriptscriptstyle X} to be

F¯X¯eK/2KX¯XwXwXτ1/2ϖbandwX3𝔎XT¯w0Mp2ϖaw0τMp{\overline{F}}^{{\overline{{\scriptscriptstyle X}}}}\simeq e^{K/2}K^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}w_{\scriptscriptstyle X}\sim\frac{w_{\scriptscriptstyle X}}{\tau^{1/2}\varpi^{b}}\quad\hbox{and}\quad w_{\scriptscriptstyle X}\simeq\frac{3{\mathfrak{K}}_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}\,w_{0}}{M_{p}^{2}}\sim\frac{\varpi^{a}\,w_{0}}{\tau\,M_{p}} (96)

and so

FXϖab(w0τ3/2Mp).F^{\scriptscriptstyle X}\simeq\varpi^{a-b}\left(\frac{w_{0}}{\tau^{3/2}\,M_{p}}\right)\,. (97)

When b>a+1b>a+1 this leads to an enhancement of FXF^{\scriptscriptstyle X} and therefore a suppression of VminV_{\rm min} relative to (FX)2(F^{\scriptscriptstyle X})^{2}. To see why, notice that in this case the vacuum energy is:

Vminϵ5w02τ4Mp2ϵ5τϖ2(ba)(FX)2>ϵ5τϖ2(ba)(1015Mp)4>ϵ5ϖ2(ba1)(1022Mp)4,V_{\rm min}\sim\frac{\epsilon^{5}w_{0}^{2}}{\tau^{4}M_{p}^{2}}\sim\frac{\epsilon^{5}}{\tau}\;\varpi^{2(b-a)}\Bigl{(}F^{\scriptscriptstyle X}\Bigr{)}^{2}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\frac{\epsilon^{5}}{\tau}\;\varpi^{2(b-a)}\Bigl{(}10^{-15}\,M_{p}\Bigr{)}^{4}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\epsilon^{5}\varpi^{2(b-a-1)}\Bigl{(}10^{-22}\,M_{p}\Bigr{)}^{4}\,, (98)

where the first inequality imposes FX>(103GeV)2F^{\scriptscriptstyle X}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}(10^{3}\;\hbox{GeV})^{2} and the last inequality uses ϖ2/τ>1028\varpi^{2}/\tau\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{-28} as required once (92) is used in Msw>104M_{s}^{w}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{4} GeV. For instance, choosing b=a+2b=a+2 and τ1014\tau\sim 10^{14} with ϖ107\varpi\sim 10^{-7} gives

Vmin>ϵ5ϖ4τ(1015Mp)410102ϵ5Mp4V_{\rm min}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\frac{\epsilon^{5}\varpi^{4}}{\tau}\Bigl{(}10^{-15}M_{p}\Bigr{)}^{4}\simeq 10^{-102}\epsilon^{5}M_{p}^{4} (99)

which can be tantalizingly small for reasonable values of ϵ\epsilon like ϵ103\epsilon\sim 10^{-3}. Note that these values for the warp factor ϖ\varpi and the modulus τ\tau satisfy the conditions (94) above.

We remark in passing that these choices for warping also change the relation between the gravitino mass and FXF^{\scriptscriptstyle X}, leading to

m3/2=eK/2|W|Mp2w0τ3/2Mp2ϖbaFXMp,m_{3/2}=e^{K/2}\frac{|W|}{M_{p}^{2}}\sim\frac{w_{0}}{\tau^{3/2}M_{p}^{2}}\sim\varpi^{b-a}\frac{F^{\scriptscriptstyle X}}{M_{p}}\,, (100)

a result that can be very small. For instance, choosing as above b=a+2b=a+2 with ϖ107\varpi\sim 10^{-7} and FX(103GeV)2F^{\scriptscriptstyle X}\sim(10^{3}\;\hbox{GeV})^{2} implies m3/21017m_{3/2}\sim 10^{-17} eV: at face value well below the experimental bound of 10510^{-5} eV quoted in [75, 76]. We argue in §4.3 why the calculations on which these bounds are based need not be valid for the model described here.

What should we take for the powers aa and bb? Although a full answer to this requires a better understanding of warped compactifications, some insight comes from the more detailed description of how strong warping enters into a 4D EFT [69, 70, 71]. It does so through a complex structure modulus, YY, whose expectation value determines the shape of the throat, and so warping dependence can be computed given how YY couples to the nilpotent superfield. It can be consistent to incude this particular modulus in the 4D description because warping makes it lighter than the other complex structure moduli. When included it is represented using an ordinary unconstrained superfield (similar to TT) and its inclusion in the 4D EFT reproduces the known calculations of warping with nilpotent superfields.

Integrating in this new modulus YY in the 4D effective field theory introduces the following new terms in the superpotential:

W=W0+U(Y)+σY2XwithU(Y)=Y3(𝔞lnY+𝔟),W=W_{0}+U(Y)+\sigma Y^{2}X\quad\hbox{with}\quad U(Y)=Y^{3}({\mathfrak{a}}\ln Y+{\mathfrak{b}})\,, (101)

where 𝔞,𝔟{\mathfrak{a}},{\mathfrak{b}} and σ\sigma are constants, of which 𝔞{\mathfrak{a}} and 𝔟{\mathfrak{b}} capture the presence of the three-form fluxes that fix the corresponding three-cycle. Solving WY=0W_{\scriptscriptstyle Y}=0 for YY gives the warp factor ϖ2:=Ye𝔟/𝔞1\varpi^{2}:=\langle Y\rangle\simeq e^{-{\mathfrak{b}}/{\mathfrak{a}}}\ll 1, and so in this picture how this appears in the rest of the action is controlled by the action’s YY-dependence. For example, choosing the Kähler potential

K=3ln𝒫,𝒫=τkαYY¯β(XY¯+X¯Y)γYY¯XX¯K=-3\ln{\cal P},\qquad{\cal P}=\tau-k-\alpha Y{\overline{Y}}-\beta(X{\overline{Y}}+{\overline{X}}Y)-\gamma Y{\overline{Y}}X{\overline{X}} (102)

with α,β,γ\alpha,\beta,\gamma order-one coefficients gives, after substituting for Y\langle Y\rangle, the warping dependence assumed in (95) with a=4a=4 and b=2b=2. Even though this appears to be a promising source of hierarchies, it remains to be shown that this effective field theory actually emerges from the physics of the anti-brane EFT.252525In particular the potential presence of an term of the form (Y+Y¯)XX¯(Y+{\overline{Y}})X{\overline{X}} would change the results to a=b=2a=b=2 in which case the predictions for warping would not differ from the unwarped case.

3.3.2 Further UV information: fluxes and more moduli

With the above estimates of scales in mind, we briefly summarize some of the other implications that a UV completion using string theory can have for some of the other choices made in the Yoga scenario.

Supersymmetry and fluxes

Several types of three-form fluxes, FMNPF_{{\scriptscriptstyle M}{\scriptscriptstyle N}{\scriptscriptstyle P}} and GMNPG_{{\scriptscriptstyle M}{\scriptscriptstyle N}{\scriptscriptstyle P}}, play a central role in Type IIB vacua, where they help stabilise the complex-structure moduli and the string dilaton.262626Not to be confused with the dilaton field of this paper. The fluxes do not fix the Kähler moduli, whose potential remains flat at tree level because it has a no-scale form [57]. This no-scale structure arises along the lines described in §4.1.3 and has its roots in the compactification’s underlying accidental scale invariance.

Fluxes also contribute to the way that compactifications break supersymmetry, and because of this it is fluxes that determine the UV value of the parameter w0w_{0}. The contributions of fluxes to the low energy superpotential turns out to be given by [77]

W=X(F3iSG3)Ω,W=\int_{X}\Bigl{(}F_{3}-iSG_{3}\Bigr{)}\wedge\Omega\,, (103)

where SS is the string dilaton and Ω\Omega is the holomorphic harmonic (3,0)-form that exists on any Calabi-Yau space XX.

For the present purposes notice that because (103) gives WW as an integral over the extra dimensions its size need not be set purely by the string scale and so can grow with extra-dimensional size. (Having WW be larger than string scale in string frame means it is larger than Planck scale in 4D Einstein frame.) Ref. [52] examines precisely how big this allows the parameter w0w_{0} to be within the 4D EFT and argues it is bounded above by

w0<Mp3𝒱1/3Mp3τ1/2.w_{0}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}M_{p}^{3}\,{\cal V}^{1/3}\sim M_{p}^{3}\,\tau^{1/2}\,. (104)

They further argue that the upper limit is saturated, w0Mp3τ1/2w_{0}\sim M_{p}^{3}\tau^{1/2}, when the gravitino mass, m3/2w0/(𝒱Mp2)w0/(τ3/2Mp2)m_{3/2}\sim w_{0}/({\cal V}M_{p}^{2})\sim w_{0}/(\tau^{3/2}M_{p}^{2}) is of order the KK scale MKKMp/𝒱2/3=Mp/τM_{\scriptscriptstyle KK}\sim M_{p}/{\cal V}^{2/3}=M_{p}/\tau (as considered by our earlier 4D estimates).

But this does not mean w0w_{0} must be large; systematic searching [78] has also found many solutions where w0w_{0} can be as small as 109010^{-90} in Planck units. Small w0w_{0} turns out to be of interest when FX(Msw)2F^{\scriptscriptstyle X}\sim(M_{s}^{w})^{2}, as is often true for warped string compactifications.

Logarithmic potentials

Generating exponentially large field values, as we have found here, embeds well into the ‘large-volume scenario’ (LVS) string vacua [59, 60] (if τ\tau is the volume modulus), for which similar arguments are used to show that the volume of the extra dimensions can be exponentially large. Logarithmic dependence of moduli within the Kähler potential has also been considered within a string context [79] where its effects on KKLT [58] or LVS moduli stabilisation was investigated. More recently this log-dependence was considered in [80, 81] to provide a new mechanism for moduli stabilisation. In particular [81] uses it to realise large volumes even in the case of one Kähler modulus.

The RG-based stabilisation mechanism used here goes back to the earlier extra-dimensional examples of [37] has not been used in a string context, but may be useful there for modulus stabilisation in string theory independent of our current scenario. If so, it may provide a new approach in which the volume is naturally exponentially large with a built-in de Sitter uplift that builds on a controlled approximation in powers of a small coupling αg\alpha_{g}, for which RG resummation allows the exploration of the regime lnτ1/αg\ln\tau\sim 1/\alpha_{g} without losing control of the underlying expansion (as was done here).

Stringy provenance for ϕ\phi

The relaxon field ϕ\phi is the one ingredient to this picture whose UV provenance is the least obvious (although the related general phenomenon of charged scalars adjusting to minimize DD-term potentials at zero is very common). One way to think about ϕ\phi is that it is a field that interpolates between regions that turn off and turn on a large positive contribution in the scalar potential. If the positive energy were to dominate in the early universe then any such a field that can turn it off through slow evolution would naturally play the role of an inflaton.

From this point of view a UV provenance for ϕ\phi is equivalent to finding stringy origins for a nonsupersymmetric inflaton. Among the many fields that present themselves, the most obvious candidate is the modulus describing brane-antibrane separation within the extra dimensions. This naturally arises in a sector that badly breaks supersymmetry (the antibrane) and also naturally interpolates between regions where a large positive energy (the antibrane tension) can be present or not. In this picture the region where the positive energy turns off would be places where brane-antibrane annihilation removes this underlying energy. This suggests a natural embedding of our framework into brane-antibrane inflationary models.272727Note added: further exploration confirms this connection, with the stabilization mechanism explored here improving them by removing the η\eta problem these models usually have [82].

Other moduli

The constraint (88) rules out values as large as τ1026\tau\sim 10^{26}. But it could also be that τ\tau should not be identified with the volume modulus. In general τ\tau could also correspond to a combination of the many other moduli that are generically present in Calabi-Yau orientifolds, including blow-up or fibre moduli. (The warped geometries described in §3.3 are a special case of additional suppression coming from other – in that case, complex-structure – moduli.) The generic accidental scale invariance generally ensures other Kähler moduli can also be included into the low-energy EFT while preserving the no-scale condition [17], along the lines described in §4.1.3 below.

Although it is generic to have many moduli, it is also true that their masses generically depend on the overall volume 𝒱{\cal V} and the flux superpotential W0W_{0}. Even if τ𝒱2/3\tau\neq{\cal V}^{2/3}, having τ\tau so large would imply the volume is also large even if τ\tau is some other modulus. If so, the cosmological moduli problem [83, 84, 85] has to be addressed (see also [86]). This demands that light and long-lived fields not overclose the universe. Roughly speaking generic 𝒪(Mp){\cal O}(M_{p}) initial amplitudes for these fields can become dangerous once they begin to oscilate around their minima because these oscillations can dominate the energy density of the universe today (ρχρc\rho_{\chi}\geq\rho_{c}) unless their mass satisfies mχ1026m_{\chi}\leq 10^{-26} eV. If this bound is satisfied they may contribute to dark matter. Any heavier unstable modulus that decays into Standard Model daughters must do so quickly in order not to ruin nucleosynthesis and therefore needs to be heavier than 3030 TeV. This problem does not arise if the potential at its minimum can be suppressed down to Dark Energy scales, because then our earlier bounds imply mχ1032m_{\chi}\sim 10^{-32} eV, but must be faced otherwise.

τ\tau-dependent masses

Another feature playing an important role in the EFT was the τ\tau-dependence of masses, and the possibility that different species of particles could depend differently on τ\tau (since this would allow logarithmic dependence on mass ratios to turn into a dependence on lnτ\ln\tau). String compactifications very generically contain a suite of states that have a hierarchy of masses parametrised by differing dependence on the volume 𝒱\cal V:

MpMsMp𝒱1/2MkkMp𝒱2/3m3/2Mp|W|𝒱M_{p}\geq M_{s}\propto\frac{M_{p}}{{\cal V}^{1/2}}\geq M_{kk}\propto\frac{M_{p}}{{\cal V}^{2/3}}\geq m_{3/2}\propto\frac{M_{p}|W|}{\cal V}\cdots (105)

and so can naturally lead to logs of τ\tau for τ𝒱2/3\tau\sim{\cal V}^{2/3}. Moreover, the τ\tau-independence of physical Yukawa couplings (like those involving the Higgs field) argued here also generically happens for the Yukawa couplings of particles localized on a (anti) D3 brane [87].

It is clear that many ingredients of our scenario have natural counterparts in concrete UV completions from string compactifications, but the general bound (88) that follows from the simplest identification 𝒱=τ3/2{\cal V}=\tau^{3/2} implies that some non-trivial engineering is required to fully achieve Dark Energy scales from string constructions. If this should turn out to be possible, the string landscape might then provide a welcome alternative to anthropic arguments to explain the Higgs mass and dark energy scales. The anisotropic compactification scenario presented in [88] already includes some of the required ingredients and may provide a natural place to start. We leave further considerations on how to realize this scenario in string theory to future work.

4 Phenomenological issues

The next two sections take the point of view that values as large as τ>τ26\tau\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\tau^{26} can be reconciled with a UV completion along the lines of [45] at eV scales and above (where the axion coupling tells us that the EFT for τ\tau this large must break down). We do so because the main phenomenological challenges and opportunities provided by the relaxation/supersymmetry mechanism described above arise at energies lower than this, and can be examined independent of any search for a UV embedding. All we assume about SM fields is that their masses are mτ1/2m\propto\tau^{-1/2} as found above; a result we rederive in this section on more robust grounds that are likely to apply to the brane-world framework suggested by string models and [45].

We identify the main arenas of concern and/or opportunity to be astrophysical. In particular, this section describes general constraints, focussing in particular tests of gravity such as in the solar system. Cosmology is the focus of §5. Our aim here is not to be exhaustive, but instead to identify some of the model-building issues that are likely necessary to accommodate our approach to the cosmological constant problem. The core message is that although the models considered here are potentially constrained by many observations, it is not clear that these constraints need be fatal (and some offer tantalizing opportunities).

4.1 The EFT relevant to astrophysics

Because the main arena is astronomy and cosmology, for phenomenological purposes it is useful to strip away all UV details and examine the low-energy effective lagrangian relevant at scales well below the electron and ϕ\phi-scalar mass, dropping all subleading powers of 1/τ1/\tau. This lagrangian has the form

=Mp22gR+dil+SM+dark+int,{\cal L}=-\frac{M_{p}^{2}}{2}\,\sqrt{-g}\;R+{\cal L}_{{\rm dil}}+{\cal L}_{{\scriptscriptstyle SM}}+{\cal L}_{{\rm dark}}+{\cal L}_{\rm int}\,, (106)

where the first term is the usual Einstein-Hilbert action, whose canonical form shows the remainder of the lagrangian is given in Einstein frame. dil{\cal L}_{\rm dil} describes the leading interactions of the dilaton supermultiplet, both with itself and with the graviton supermultiplet. The next two terms respectively describe the couplings of the graviton and dilaton-axion to Standard Model degrees of freedom (i.e. ordinary matter), and to any other non-dilaton dark low-energy degrees of freedom, while int{\cal L}_{\rm int} contains all other mutual interactions (such as 4-fermi interactions) of these fields.

We next collect the expressions for each of these in turn, to leading order in 1/τ1/\tau.

4.1.1 Standard Model sector

The above discussion determines the Standard Model contribution SM{\cal L}_{{\scriptscriptstyle SM}} in (106) to leading order in 1/τ1/\tau. Keeping only terms out to dimension 4 we have

SMg=14FμνFμνaψ¯a[ / 𝒟+mab(τ)]ψb,\frac{{\cal L}_{{\scriptscriptstyle SM}}}{\sqrt{-g}}=-\frac{1}{4}\,F_{\mu\nu}F^{\mu\nu}-\sum_{a}{\overline{\psi}}_{a}\Bigl{[}\hbox to0.0pt{\hbox to7.6389pt{\hfil/\hfil}\hss}{\cal D}+m_{ab}(\tau)\Bigr{]}\psi_{b}\,, (107)

where Fμν=μAννAμF_{\mu\nu}=\partial_{\mu}A_{\nu}-\partial_{\nu}A_{\mu} is the standard electromagnetic field strength, and the sum over aa runs over any spin-half particles, ψa\psi_{a}, relevant (which for cosmology or solar-system applications might be neutrinos or heavier but stable everyday particles like electrons and baryons). The derivative 𝒟μ{\cal D}_{\mu} is the spin-half version of the derivative defined in (117) below.

Although mabδabτ1/2m_{ab}\propto\delta_{ab}\tau^{-1/2} for stable everyday particles, there are two natural options to assume for the τ\tau-dependence of neutrino masses. One is to assume that they also have masses proportional to τ1/2\tau^{-1/2}. But the presence of supersymmetry in the dark sector also predicts there to be other sterile fermions present that could mix with Standard Model neutrinos. Some of these must be light (such as the superpartners of the graviton or the dilaton), but others could be very heavy. Those that are heavy would have to have masses unsuppressed by τ\tau, and so once they mix with left-handed neutrinos through a mixing term that scales like other SM masses – i.e. like τ1/2{\tau}^{-1/2} – the see-saw mechanism would predict physical light-neutrino masses to be order mντ1m_{\nu}\propto\tau^{-1}; providing an attractive explanation of the otherwise coincidental similarity between neutrino masses and the cosmological constant scale: Vminmν4V_{\rm min}\sim m_{\nu}^{4}.

4.1.2 Dilaton sector

The supergravity framework is unusually specific about some parts of the dark sector. The most constrained of these new dark particles are the members of the dilaton multiplet itself, which we divide into its bosonic and fermionic parts: dil=dilb+dilf{\cal L}_{\rm dil}={\cal L}_{{\rm dil}\,b}+{\cal L}_{{\rm dil}\,f}.

The bosonic part contains the kinetic terms (51) for the axion and dilaton together with the scalar potential computed in §2:

dilbg=12(μχμχ+Mp2e2ζχ/MpDμ𝔞Dμ𝔞)+V(χ),-\frac{{\cal L}_{{\rm dil}\,b}}{\sqrt{-g}}=\frac{1}{2}\Bigl{(}\partial^{\mu}\chi\,\partial_{\mu}\chi+M_{p}^{2}\,e^{-2\zeta\chi/M_{p}}\,D^{\mu}{\mathfrak{a}}\,D_{\mu}{\mathfrak{a}}\Bigr{)}+V(\chi)\,, (108)

with (c.f. eq. (52)) ζ=23\zeta=\sqrt{\frac{2}{3}}. Unbroken axion shift symmetry prevents VV from depending on 𝔞{\mathfrak{a}}. The stabilization mechanism of §3.2 leads to a potential like (25)

V(χ,𝔞)=Ue4ζχ/Mp,V(\chi,{\mathfrak{a}})=U\,e^{-4\zeta\chi/M_{p}}\,, (109)

where U=U(lnτ)=U(χ)U=U(\ln\tau)=U(\chi) is what remains after ϕ\phi has relaxed to suppress 𝔴X{\mathfrak{w}}_{{\scriptscriptstyle X}}, and might be a rational function of lnτ\ln\tau. In the examples of §3.2, the dependence of UU on lnτ\ln\tau gives VV a minimum at ττ0\tau\sim\tau_{0} where lnτ01/αg1/ϵ60\ln\tau_{0}\sim 1/\alpha_{g}\sim 1/\epsilon\sim 60, and when evaluated at its minimum UU is of order ϵ5\epsilon^{5} and so Vminϵ5mTeV8/Mp4V_{\rm min}\sim\epsilon^{5}m_{\scriptscriptstyle TeV}^{8}/M_{p}^{4} as in (27).

As discussed in §2.3.1 the axion shift symmetry might be gauged. If not then Dμ𝔞=μ𝔞D_{\mu}{\mathfrak{a}}=\partial_{\mu}{\mathfrak{a}} in (108), and if so then Dμ𝔞=μ𝔞AμD_{\mu}{\mathfrak{a}}=\partial_{\mu}{\mathfrak{a}}-A_{\mu} for some dark-sector gauge boson. This implies the axion could, but need not, be massless, depending on whether or not gauging allows it to be eaten via the Higgs mechanism. If gauged its mass could be either of order Mp/τ3/2>MTeV3/Mp21014M_{p}/\tau^{3/2}\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}M_{\scriptscriptstyle TeV}^{3}/M_{p}^{2}\sim 10^{-14} eV or Mp/τ>MTeV2/Mp10M_{p}/\tau\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}M_{\scriptscriptstyle TeV}^{2}/M_{p}\sim 10 eV, depending on how the dark photon kinetic term depends on τ\tau.

A potential axion mass strongly affects how the axio-dilaton responds to gravitating objects. If its mass is around 10 eV then its Compton wavelength is comparable to the size of an atom and it can be integrated out and cannot mediate macroscopic forces relevant to tests of gravity. If its mass is 101210^{-12} eV then its Compton wavelength is of order 10410^{4} km, or about a tenth of a light-second, and although its implications for solar system tests are less clear it is also unlikely to be relevant for testing gravity on longer scales than this. In either case we shall see that the dilaton-matter coupling likely poses a problem because it is too large to have escaped detection. If the axion were massless, however, then it can mediate long-range forces and its presence can be exploited as in [42] to help hide the axio-dilaton from tests of gravity.

The fermionic contribution to dil{\cal L}_{\rm dil} consists of the leading parts of the gravitino and the dilatino actions,

dilfg\displaystyle\frac{{\cal L}_{{\rm dil}\,f}}{\sqrt{-g}} =\displaystyle= i2ϵμνλρψ¯μγ5γνDλψρ12m3/2ψ¯μγμνψν\displaystyle-\frac{i}{2}\,\epsilon^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{5}\gamma_{\nu}D_{\lambda}\psi_{\rho}-\frac{1}{2}\,m_{3/2}\,{\overline{\psi}}_{\mu}\gamma^{\mu\nu}\psi_{\nu}
12ξ¯ / Dξ12[𝔪ξξ¯γLξ+𝔪ξgMpψ¯μγLγμξ+h.c.]\displaystyle\qquad\qquad-\frac{1}{2}\,{\overline{\xi}}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\xi-\frac{1}{2}\left[{\mathfrak{m}}_{\xi}\,{\overline{\xi}}\gamma_{\scriptscriptstyle L}\xi+\frac{{\mathfrak{m}}_{\xi g}}{M_{p}}\,{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\gamma^{\mu}\xi+\hbox{h.c.}\right]

where the gravitino mass, m3/2m_{3/2}, is given by (34) and so is proportional to w02τ3/2w_{0}^{2}\,\tau^{-3/2} and of present-day numerical size mTeV3/Mp21014m_{\scriptscriptstyle TeV}^{3}/M_{p}^{2}\sim 10^{-14} eV. The functions 𝔪ξ{\mathfrak{m}}_{\xi} and 𝔪ξg{\mathfrak{m}}_{\xi g} are similarly given by eqs. (59) and (2.3.2), and generically also give ξ\xi a mass that is of order m3/2m_{3/2}. Notice that the gravitino-dilatino mixing parameter 𝔪ξg{\mathfrak{m}}_{\xi g} vanishes when DTW=0D_{\scriptscriptstyle T}W=0, but the direct mass term 𝔪ξ{\mathfrak{m}}_{\xi} need not also do so because of the target-space curvature terms in (2.3.2).

4.1.3 Other dark ingredients

The remaining term dark{\cal L}_{\rm dark} of (106) describes any other of the more model-dependent light supersymmetric degrees of freedom that might be present, of which there are two types of natural candidates.

Dark gauge sector

The first type of additional dark sector could consist of a dark-sector gauge boson, which could (but need not) include one that gauges the axion shift symmetry and so eats the axion to get a mass. As discussed in §2.3.1 the presence of such a gauge boson often also implies the existence of other dark degrees of freedom, such as the corresponding gaugino as well as possible charged supermultiplets whose presence can be required to cancel anomalies and/or to allow the corresponding gauge auxiliary fields, D, to be minimized at zero (see e.g. [89]).

Dark moduli

A second natural sector to have in the supersymmetric world are new superfields SiS^{i} corresponding to extensions of the no-scale sector. These kinds of extensions are well-motivated by specific UV examples – for which they can arise as compactification moduli in addition to the scaling field TT (see e.g. the discussion in §3.3). When present, they generically acquire a Hubble-scale mass for the same reasons that the dilaton τ\tau does.

The single field TT can be extended to a more general sector ZA={T,Si}Z^{\scriptscriptstyle A}=\{T,S^{i}\} without ruining the no-scale cancellations that lead to an acceptable Dark Energy density because all that is needed is for the TT sector to be scale invariant at leading order and to remain a no-scale model at subleading order in 1/τ1/\tau. Both of these properties also hold for more complicated sectors. For example imagine if our starting Kähler potential were to replace (9) by

K=3Mp2ln𝒫with𝒫=(τ,σi)k+hτ+,K=-3M_{p}^{2}\ln{\cal P}\quad\hbox{with}\quad{\cal P}={\cal F}(\tau,\sigma^{i})-k+\frac{h}{\tau}+\cdots\,, (111)

where σi=Si+S¯i\sigma^{i}=S^{i}+{\overline{S}}^{i} and scale invariance is built in by requiring {\cal F} to be a homogeneous degree-one function:

(λτ,λσi)λ(τ,σi){\cal F}(\lambda\tau,\lambda\sigma^{i})\equiv\lambda{\cal F}(\tau,\sigma^{i}) (112)

for all τ\tau and σi\sigma^{i}. This property ensures {\cal F} can always be written with an overall factor of τ\tau scaled out: (τ,σi)=τF(xi){\cal F}(\tau,\sigma^{i})=\tau\,F(x^{i}) where xi:=σi/τx^{i}:=\sigma^{i}/\tau.

As is easy to check, eq. (112) implies that KK defined by (111) satisfies the no-scale identity KA¯BKA¯KB3Mp2K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}\equiv 3M_{p}^{2} provided we drop all terms involving hh (and higher orders in 1/τ1/\tau) and that kk is independent of τ\tau and σi\sigma^{i}. This means that the scalar potential VFV_{\scriptscriptstyle F} vanishes identically for all TT and SiS^{i} so long as these fields do not appear in kk, and when they do appear the potential must be proportional to derivatives of kk, similar to (23) in the single-modulus case. These properties ensure that the arguments for the low-energy potential being order w02/τ4w_{0}^{2}/\tau^{4} remain true in this more complicated setting.

For phenomenological purposes, what is important is that the additional scalars σi\sigma^{i} generically have masses at the present-day Hubble scale, as argued above. This makes these fields also relevant for tests of gravity, and allows them to be active during cosmology. Their axion counterparts can be massive or massless depending on the details, much like for the axion partner of τ\tau discussed above.

4.2 Axio-dilaton couplings and tests of gravity

At first sight, the extremely small dilaton mass and its nominally gravitational-strength couplings to matter makes the dilaton’s competition with gravity likely the most constraining part of our phenomenological story. Because of its importance we start by rederiving the dilaton-matter couplings in a more transparent way than in §2.2, showing these couplings follow very generally from the accidental scale invariance.

The good news is that scale invariance forces the dilaton-matter couplings to have a ‘quasi-Brans Dicke’ form [90], and this is good because it means that it automatically satisfies all of the very stringent tests of the Equivalence Principle [91]. We then show that this same scale invariance actually predicts the dilaton behaves as an honest-to-God Brans-Dicke scalar [40], whose coupling is predicted quite generally to be282828The traditional Brans-Dicke parameter ω\omega is related to the coupling as defined here by 2𝔤2=1/(3+2ω)2{\mathfrak{g}}^{2}=1/(3+2\omega). 𝔤=1/60.408{\mathfrak{g}}=-{1}/{\sqrt{6}}\simeq-0.408, even in the general multi-modulus case.

The bad news is that couplings this large are naively ruled out by solar system tests of GR. But all is not lost: we summarize how having the axion couple to matter in addition to the dilaton provides a way to evade this constraint, even for extremely small axion couplings, using the mechanism proposed in [42].

4.2.1 Quasi-Brans Dicke scalar

For the present purposes we define a quasi-Brans Dicke scalar to be a scalar-tensor theory for which the scalar field χ\chi couples through a lagrangian density of the form

=g[Mp22+12(χ)2]+m(g~,ψ~,h~).{\cal L}=-\sqrt{-g}\left[\frac{M_{p}^{2}}{2}\,{\cal R}+\frac{1}{2}\,(\partial\chi)^{2}\right]+{\cal L}_{m}(\tilde{g},\tilde{\psi},\tilde{h})\,. (113)

Here m{\cal L}_{m} is the lagrangian density for representative spin-half and spin-zero matter fields (ψ~\tilde{\psi} and h~\tilde{h}) whose defining feature is that the Einstein-frame metric and the scalar χ\chi only enter into m{\cal L}_{m} through the Jordan-frame metric

g~μν=A2(χ)gμν,\tilde{g}_{\mu\nu}=A^{2}(\chi)\;g_{\mu\nu}\,, (114)

for some function A(χ)A(\chi). The Brans Dicke special case [40] is the choice

A(χ)=e𝔤χ/MpA(\chi)=e^{{\mathfrak{g}}\chi/M_{p}} (115)

where 𝔤{\mathfrak{g}} is one of the ways to write the dimensionless Brans-Dicke coupling parameter.

Because particle kinetic and mass terms are by assumption χ\chi-independent in Jordan frame, their χ\chi-dependence in Einstein frame is predicted to be universal (i.e. depend only on the matter-particle spin); for instance for spinless and spin-half fields

g~[g~μνμh~νh~+e~aψ~¯μγaDμψ~]=g[gμνA2(χ)μh~νh~+eaA3μ(χ)ψ~¯γaDμψ~].\sqrt{-\tilde{g}}\;\Bigl{[}\tilde{g}^{\mu\nu}\partial_{\mu}\tilde{h}^{*}\,\partial_{\nu}\tilde{h}+{\tilde{e}_{a}}{}^{\mu}\,{\overline{\tilde{\psi}}}\gamma^{a}D_{\mu}\tilde{\psi}\Bigr{]}=\sqrt{-g}\Bigl{[}g^{\mu\nu}A^{2}(\chi)\,\partial_{\mu}\tilde{h}^{*}\,\partial_{\nu}\tilde{h}+e_{a}{}^{\mu}A^{3}(\chi)\,{\overline{\tilde{\psi}}}\gamma^{a}D_{\mu}\tilde{\psi}\Bigr{]}\,. (116)

The kinetic terms are put into canonical form by redefining h~=A1h\tilde{h}=A^{-1}h and ψ~=A3/2ψ\tilde{\psi}=A^{-3/2}\psi, at the expense of modifying the covariant derivatives: μh~=A1𝒟μh\partial_{\mu}\tilde{h}=A^{-1}{\cal D}_{\mu}h and Dμψ~=A3/2𝒟μψD_{\mu}\tilde{\psi}=A^{-3/2}{\cal D}_{\mu}\psi, with

𝒟μh=μh(μAA)hand𝒟μψ=Dμψ32(μAA)ψ.{\cal D}_{\mu}h=\partial_{\mu}h-\left(\frac{\partial_{\mu}A}{A}\right)h\quad\hbox{and}\quad{\cal D}_{\mu}\psi=D_{\mu}\psi-\frac{3}{2}\,\left(\frac{\partial_{\mu}A}{A}\right)\psi\,. (117)

In both cases Einstein-frame particle masses depend universally on χ\chi with

mEF=mA(χ),m_{{\scriptscriptstyle EF}}=m\,A(\chi)\,, (118)

and so all mass ratios are χ\chi-independent, although their ratio with the Planck mass is not.292929Since these are physical conclusions they are frame-independent and so can be seen equally well in either Jordan or Einstein frame. The Brans-Dicke coupling function η(χ)\eta(\chi) is defined by the matter contribution to the χ\chi field equation,

χ(x)+η(χ)MpgμνTμν=0whereη(χ)Mp:=χlnA(χ),\Box\chi(x)+\frac{\eta(\chi)}{M_{p}}\,g_{\mu\nu}T^{\mu\nu}=0\quad\hbox{where}\quad\frac{\eta(\chi)}{M_{p}}:=\frac{\partial}{\partial\chi}\ln A(\chi)\,, (119)

and TμνT^{\mu\nu} is the usual matter stress energy tensor as defined by varying the Einstein-frame metric. Notice η=𝔤\eta={\mathfrak{g}} is a constant in the pure Brans Dicke special case defined by (115).

4.2.2 The dilaton as a Brans-Dicke scalar

The above definitions precisely capture the matter couplings of the dilaton τ\tau, provided we omit the scalar potential303030For cosmological purposes the scalar potential must be included, but its contributions are negligible for this section’s focus: scalar-tensor constraints in non-cosmological settings. and restrict to leading order in 1/τ1/\tau. To see why, recall that the supergravity action can be derived in superspace starting from an expression

=d2Θ 2[38(𝒟¯28)eK/3+W]+h.c.,{\cal L}=\int{\rm d}^{2}\Theta\,2{\cal E}\left[\frac{3}{8}\Bigl{(}{\overline{{\cal D}}}^{2}-8{\cal R}\Bigr{)}\,e^{-K/3}+W\right]+\hbox{h.c.}\,, (120)

where WW and KK are the superpotential and Kähler potential. The gravitational lagrangian in this frame313131For aficianados: we imagine fixing superconformal invariance here using a KK-independent compensator, and so the Einstein-Hilbert action is not in canonical form. turns out to be proportional to Mp2eK/(3Mp2)M_{p}^{2}\,e^{-K/(3M_{p}^{2})}, and so the Weyl rescaling (114) required to reach Einstein frame is given by

A(χ)=eK/(6Mp2)=1𝒫1/2,A(\chi)=e^{K/(6M_{p}^{2})}=\frac{1}{{\cal P}^{1/2}}\,, (121)

which uses K=3Mp2ln𝒫K=-3M_{p}^{2}\ln{\cal P} with 𝒫τk+𝒪(1/τ){\cal P}\simeq\tau-k+{\cal O}(1/\tau).

Furthermore when WW is independent of TT then g~μν\tilde{g}_{\mu\nu} also agrees with the Jordan-frame as defined in (113) because it is the frame for which the particle masses found in §2.2 for ordinary matter fields like Y±Y_{\pm} are τ\tau-independent.323232τ\tau-independent in that they depend at most logarithmically on τ\tau through kk. Using (121) in (118) shows more directly why the masses of all ordinary particles turned out proportional to 𝒫1/2{\cal P}^{-1/2} at leading order (as was restated in (107)).

The dilaton therefore couples like a quasi-Brans Dicke scalar, and using eq. (121) in (119) gives the Brans-Dicke coupling function

η(χ)=ζlnAlnτζτKT6Mp2ζ2=160.408,\eta(\chi)=\zeta\,\frac{\partial\ln A}{\partial\ln\tau}\simeq\frac{\zeta\,\tau K_{\scriptscriptstyle T}}{6M_{p}^{2}}\simeq-\frac{\zeta}{2}=-\frac{1}{\sqrt{6}}\simeq-0.408\,, (122)

showing that τ\tau is a pure Brans Dicke scalar (for which η\eta is χ\chi-independent) to leading order in 1/τ1/\tau, with coupling 𝔤=ζ/20.408{\mathfrak{g}}=-\zeta/2\simeq-0.408.

Multiple moduli

We now pause to show that the same conclusion also holds very generally when the no-scale sector has multiple moduli, as in (111), subject to the scaling condition (112).

To this end suppose we have several scaling moduli, ZA={T,Si}Z^{\scriptscriptstyle A}=\{T,S^{i}\}, with kinetic term

=g[KAB¯(z)μZAμZ¯B¯+V]+m,{\cal L}=-\sqrt{-g}\Bigl{[}K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}(z)\,\partial_{\mu}Z^{\scriptscriptstyle A}\,\partial^{\mu}{\overline{Z}}^{{\overline{{\scriptscriptstyle B}}}}+V\Bigr{]}+{\cal L}_{m}\,, (123)

where KAB¯=AB¯KK_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}=\partial_{\scriptscriptstyle A}\partial_{{\overline{{\scriptscriptstyle B}}}}K is the Kähler metric built using (111), and zA:=ZA+Z¯A¯z^{\scriptscriptstyle A}:=Z^{\scriptscriptstyle A}+{\overline{Z}}^{{\overline{{\scriptscriptstyle A}}}}. In this case the scalar field equation (119) generalizes to

ZA+ΓBCAμZBμZCKB¯AB¯V+ηAgμνTμν=0\Box Z^{\scriptscriptstyle A}+\Gamma^{\scriptscriptstyle A}_{{\scriptscriptstyle B}{\scriptscriptstyle C}}\partial_{\mu}Z^{\scriptscriptstyle B}\,\partial^{\mu}Z^{\scriptscriptstyle C}-K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}\,\partial_{{\overline{{\scriptscriptstyle B}}}}V+\eta^{{\scriptscriptstyle A}}\;g_{\mu\nu}T^{\mu\nu}=0 (124)

where (as usual) KB¯AK^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}} and ΓBCA:=KD¯AKBCD¯\Gamma^{\scriptscriptstyle A}_{{\scriptscriptstyle B}{\scriptscriptstyle C}}:=K^{{\overline{{\scriptscriptstyle D}}}{\scriptscriptstyle A}}K_{{\scriptscriptstyle B}{\scriptscriptstyle C}{\overline{{\scriptscriptstyle D}}}} are respectively the inverse metric and the Christoffel symbol built from the target-space Kähler metric KAB¯K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}. The matter-coupling function appearing in (124) is defined just as for the single-field case, with

ηB¯=B¯AA=KB¯6Mp2and soηA=KB¯AηB¯=KB¯AKB¯6Mp2.\eta_{{\overline{{\scriptscriptstyle B}}}}=\frac{\partial_{{\overline{{\scriptscriptstyle B}}}}A}{A}=\frac{K_{{\overline{{\scriptscriptstyle B}}}}}{6M_{p}^{2}}\quad\hbox{and so}\quad\eta^{\scriptscriptstyle A}=K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}\eta_{{\overline{{\scriptscriptstyle B}}}}=\frac{K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}K_{{\overline{{\scriptscriptstyle B}}}}}{6M_{p}^{2}}\,. (125)

The same arguments as given above also show that multiple-modulus models are QBD scalars, though in this case the Weyl rescaling factor, AA, appearing in (114) becomes

A=eK/(6Mp2)=11/2,A=e^{K/(6M_{p}^{2})}=\frac{1}{{\cal F}^{1/2}}\,, (126)

once (111) is used with 𝒫{\cal P} truncated to leading order: 𝒫=(z+z¯){\cal P}={\cal F}(z+\bar{z}). The scale-invariance condition (112) satisfied by {\cal F} turns out to make the coupling ηA\eta^{\scriptscriptstyle A} surprisingly simple, since it implies the identity

KB¯AKB¯=zA,K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle A}}K_{{\overline{{\scriptscriptstyle B}}}}=-z^{\scriptscriptstyle A}\,, (127)

that follows by multiply differentiating the scaling identity K(λz)K(z)3Mp2lnλK(\lambda z)\equiv K(z)-3M_{p}^{2}\ln\lambda. This last result shows that the dilaton coupling function ηA=zA/(6Mp2)\eta^{\scriptscriptstyle A}=-z^{\scriptscriptstyle A}/(6M_{p}^{2}) is completely insensitive to the details of the choice of the Kähler potential for the moduli fields, assuming only that it is scale invariant (in the sense of (112)).

In the absence of an axion mass or gauge-boson/axion mixing the axio-dilaton corresponds to the special case of a single field ZATZ^{\scriptscriptstyle A}\to T with K=3Mp2lnτK=-3M_{p}^{2}\ln\tau. For this choice eq. (124) reduces to

T2τμTμTτ23Mp2VTτ6Mp2gμνTμν=0,\Box T-\frac{2}{\tau}\,\partial_{\mu}T\,\partial^{\mu}T-\frac{\tau^{2}}{3M_{p}^{2}}\;V_{\scriptscriptstyle T}-\frac{\tau}{6M_{p}^{2}}\;g_{\mu\nu}T^{\mu\nu}=0\,, (128)

whose imaginary part gives the axion evolution equation333333This neglects any axion mass or gauge mixing, should this exist.

𝔞2τμτμ𝔞=0\Box{\mathfrak{a}}-\frac{2}{\tau}\,\partial_{\mu}\tau\,\partial^{\mu}{\mathfrak{a}}=0 (129)

and whose real part agrees with (119) once expressed in terms of the canonical field χ\chi.

4.2.3 Solar system tests of gravity

Modern precision tests of gravity – be it within the solar system, using binary pulsars or from cosmology – pose serious challenges for any theory with very light scalars (for a review see e.g. [41]). Many of the strongest constraints, such as tests of the equivalence principle [91] or on variations of the fine-structure constant [92], are automatically evaded by theories in the QBD class,343434Having all Standard Model masses and the QCD scale depend on the same power of τ\tau is important for this statement, which is another feature generically satisfied by our model. so the above demonstration that our model falls into this category (to leading order in 1/τ1/\tau) makes these not worrisome (at least at leading orders in the PPN expansion).

With these limits evaded, the most restrictive bounds on light dilaton-like fields come from current-epoch tests of General Relativity within the Solar System [93] and binary pulsars353535Gravitational wave emission by compact objects might eventually also provide competitive bounds, but do not do so yet [95]. [94]. (We return to the challenges and opportunities such a picture raises for concordance cosmology in §5 below.) Such constraints arise because eq. (119) – or, for multiple-field models, eqs. (124) and (127) – ensures that matter acts as a source to the dilaton field and so gμνg_{\mu\nu} and g~μν\tilde{g}_{\mu\nu} differ from one another in a predictable way, even in the vacuum away from the gravitating masses themselves. Since matter particles move (in the eikonal limit) along geodesics of g~μν\tilde{g}_{\mu\nu} rather than gμνg_{\mu\nu}, their motion reveals both the discrepancies between these two metrics and the change in gμνg_{\mu\nu} due to the dilaton stress energy (see [42] for more details on how this works in this particular model).

Current measurements – e.g. using the Cassini probe363636Although solar system tests are currently the most restrictive, pulsar measurements are likely to become competitive in the near future [96]. [93] – agree with the predictions of general relativity, and this leads to the following constraint on the parameterized post-Newtonian (PPN) parameter γPPN\gamma_{\scriptscriptstyle PPN} [97, 41]:

γPPN1<1.5×105.\gamma_{\scriptscriptstyle PPN}-1<1.5\times 10^{-5}\,. (130)

PPN effects also cause gravitational and inertial masses to differ from one another, leading to a slightly weaker lunar-laser-ranging constraint on the Nordvedt parameter

|ηN|<5×105withηN=4βPPNγPPN3.|\eta_{\scriptscriptstyle N}|\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}5\times 10^{-5}\quad\hbox{with}\quad\eta_{\scriptscriptstyle N}=4\beta_{\scriptscriptstyle PPN}-\gamma_{\scriptscriptstyle PPN}-3\,. (131)

Comparing these with the Brans-Dicke result

γPPN=12𝔤21+2𝔤2=ω+1ω+2andβPPN=1\gamma_{\scriptscriptstyle PPN}=\frac{1-2{\mathfrak{g}}^{2}}{1+2{\mathfrak{g}}^{2}}=\frac{\omega+1}{\omega+2}\quad\hbox{and}\quad\beta_{\scriptscriptstyle PPN}=1 (132)

shows that (130) is inconsistent (by several orders of magnitude) with the prediction (122).

How might these bounds be avoided? Evidently not by changing the size of the predicted coupling, (122), since the above arguments show this only relies on the no-scale structure that underpins the suppression of the cosmological constant itself. A different approach to evading solar-system bounds does not try to suppress all of the scalar couplings to matter, but instead uses the nonlinearities of the scalar self-interactions – that is, the ΓBCA\Gamma^{\scriptscriptstyle A}_{{\scriptscriptstyle B}{\scriptscriptstyle C}} term in the equations of motion (124) (along the lines used in other contexts in [98]) – to channel any scalar fields that are produced into directions that do not contribute much to the Weyl factor AA and so are suppressed in their influence on the motion of test particles. One seeks to evade the constraint (130) by changing the prediction (132) instead of the prediction (122).

A new screening mechanism

The couplings of the axion to the dilaton turn out to provide (for free) a simple and novel example of this type [42]. The Kähler metric KTT¯=3Mp2/τ2K_{T{\overline{T}}}=-3M_{p}^{2}/\tau^{2} implies a target-space curvature whose effects can channel field development away from the dilaton and towards the axion despite the dilaton having the dominant coupling to matter. This changes the prediction (132) because test-particle motion can be insensitive to the axion because it does not appear at all in AA.

The details of how this works can be found in [42], which solves the coupled axion-dilaton equations (128) supplemented by a small direct axion-matter coupling

g𝒥=2(δSmδ𝔞)0.\sqrt{-g}\;{\cal J}=2\left(\frac{\delta S_{m}}{\delta{\mathfrak{a}}}\right)\neq 0\,. (133)

It is useful when making estimates to assume the axion source profile to be proportional to the energy density, 𝒥(x)=εaρ(x){\cal J}(x)=\varepsilon_{a}\,\rho(x), with εa\varepsilon_{a} constant and small. There are several reasons to expect εa1\varepsilon_{a}\ll 1 to be small. The microscopic axion-matter coupling might itself be small. Or it might be small because the parity properties of the axion cause it to couple to parity-odd quantities – such as local spin density – that do not add up coherently for macroscopic objects (unlike for energy). Or both these reasons could apply at the same time.

In weak gravitational fields373737For solutions in strong fields see [99]. the equations exterior to a spherically symmetric source turn out to have the following simple general solutions:

𝔞(r)=αβtanhXwithX(r):=βγr+δ{\mathfrak{a}}(r)=\alpha-\beta\tanh X\quad\hbox{with}\quad X(r):=\frac{\beta\gamma}{r}+\delta (134)

and

τ(r)=βcoshX(r),\tau(r)=\frac{\beta}{\cosh X(r)}\,, (135)

with four integration constants α\alpha, β\beta, γ\gamma and δ\delta. These solutions descibe motion along geodesics of the target-space metric that turn out to be semicircles in the τ\tau𝔞{\mathfrak{a}} plane since they imply

τ2+(𝔞α)2=β2.\tau^{2}+({\mathfrak{a}}-\alpha)^{2}=\beta^{2}\,. (136)

The four integration constants are determined by the values of the two fields at spatial infinity,

𝔞=αβtanhδandτ=βcoshδ,{\mathfrak{a}}_{\infty}=\alpha-\beta\tanh\delta\quad\hbox{and}\quad\tau_{\infty}=\frac{\beta}{\cosh\delta}\,, (137)

and two first integrals of the field equations that relate the derivatives of the solutions to integrals of the source energy density ρ\rho and axion source density 𝒥{\cal J} (defined in (133)):

γ=R2(𝔞τ2)r=R=13Mp20Rdrr2𝒥(r)23εaGM,\gamma=R^{2}\left(\frac{{\mathfrak{a}}^{\prime}}{\tau^{2}}\right)_{r=R}=-\frac{1}{3M_{p}^{2}}\int_{0}^{R}{\rm d}r\;r^{2}{\cal J}(r)\simeq-\frac{2}{3}\varepsilon_{a}GM\,, (138)

and

γα=R2(ττ+𝔞𝔞τ2)r=R=13Mp20Rdrr2[ρ(r)+𝔞(r)𝒥(r)]23GM,\gamma\alpha=R^{2}\left(\frac{\tau^{\prime}}{\tau}+\frac{{\mathfrak{a}}\,{\mathfrak{a}}^{\prime}}{\tau^{2}}\right)_{r=R}=-\frac{1}{3M_{p}^{2}}\int_{0}^{R}{\rm d}r\;r^{2}\Bigl{[}\rho(r)+{\mathfrak{a}}(r)\,{\cal J}(r)\Bigr{]}\simeq\frac{2}{3}\,GM\,, (139)

where RR is the radius of the gravitating source and we identify its mass as M=4π0Rdrr2ρM=4\pi\int_{0}^{R}{\rm d}r\,r^{2}\rho.

These boundary conditions show that β\beta and δ\delta are dictated by the values of the fields at infinity, while γ\gamma and α\alpha are dominantly controlled by the properties of the source. Clearly γ\gamma in particular can be made as small as desired by making the axion-matter couplings smaller, but the product γα\gamma\alpha is held fixed if ρ\rho remains unchanged. This limit γ0\gamma\to 0 corresonds to the semicircle degenerating into a vertical line in the 𝔞{\mathfrak{a}}τ\tau plane. The screening mechanism is established by showing that e.g. solar-system observables are all suppressed by γ\gamma (and sometimes by δ\delta) without compensating factors of α\alpha.

Post-Newtonian analysis

To compute the sizes of the PPN parameters the key observation is that test particles built from ordinary matter move along geodesics of the Jordan-frame metric (at least in the absence of other – e.g. electromagnetic – forces). They do so because it is this metric that appears in their kinetic terms and so is the one to which standard arguments apply for quantum motion in the eikonal limit.

Consequently PPN parameters measure how the Jordan-frame metric, g~μν=A2gμν\tilde{g}_{\mu\nu}=A^{2}g_{\mu\nu}, deviates from the metric predicted by GR. There are two sources for this deviation: (ii) the position dependence within the Weyl factor AA, and (iiii) deviations of the Einstein-frame metric gμνg_{\mu\nu} from the predictions of GR (that arise becaues of the stress energy of the light scalar fields). Both of these quantities are suppressed by the axion coupling.

To see why consider first the Weyl factor,

A2=eK/(3Mp2)=1τ1βcosh(2βεaGM3r+δ)1τ2εasinhδ3(GMr)+,A^{2}=e^{K/(3M_{p}^{2})}=\frac{1}{\tau}\simeq\frac{1}{\beta}\cosh\left(-\frac{2\beta\varepsilon_{a}GM}{3r}+\delta\right)\simeq\frac{1}{\tau_{\infty}}-\frac{2\varepsilon_{a}\sinh\delta}{3}\left(\frac{GM}{r}\right)+\cdots\,, (140)

which shows that its position dependence is suppressed by ϵa\epsilon_{a} (and by δ\delta when this is small). The deviation of gμνg_{\mu\nu} from the predictions of GR are controlled by the scalar stress energy, which for the above solutions contribute only to the (rr)(rr)-component of the Einstein-frame Ricci curvature, evaluating to [42]

rr=34[(τ)2+(a)2τ2]=3γ2β24r4=εa2β23(GMr2)2.{\cal R}_{rr}=-\frac{3}{4}\left[\frac{(\tau^{\prime})^{2}+(a^{\prime})^{2}}{\tau^{2}}\right]=-\frac{3\gamma^{2}\beta^{2}}{4r^{4}}=-\frac{\varepsilon_{a}^{2}\beta^{2}}{3}\left(\frac{GM}{r^{2}}\right)^{2}\,. (141)

Notice that this expression uses (136) but does not expand in powers of GM/rGM/r. Eq. (141) shows that the scalar field stress energy is also suppressed by the axion coupling εa\varepsilon_{a}.

Because both kinds of deviations from GR come suppressed by axion couplings there is no surprise that the PPN parameters that result are also suppressed. For instance the values for the PPN parameters γPPN\gamma_{\scriptscriptstyle PPN} and βPPN\beta_{\scriptscriptstyle PPN} found in [42] are

γPPN1=2εaβtanhδ3+εaβtanhδandβPPN1=εa2β29(coshδ+13εaβsinhδ)2.\gamma_{\scriptscriptstyle PPN}-1=-\frac{2\varepsilon_{a}\beta\tanh\delta}{3+\varepsilon_{a}\beta\tanh\delta}\quad\hbox{and}\quad\beta_{\scriptscriptstyle PPN}-1=\frac{\varepsilon_{a}^{2}\beta^{2}}{9(\cosh\delta+\frac{1}{3}\varepsilon_{a}\beta\sinh\delta)^{2}}\,. (142)

The main point is that εa\varepsilon_{a} (and so γ\gamma) is controlled by axion-source strength 𝒥{\cal J} through (138) and so can be made arbitrarily small simply by coupling the axion more and more weakly to matter (see [42] for details). Any direct non-metric effects of the axion on particle motion also vanish in this limit.

One might worry about other, non PPN, effects that could constrain these models. These can be explored by integrating out the structure of the source (be it the Earth, Moon or the Sun) and examining the EFT for its centre-of-mass motion. At lowest order in derivatives this becomes:

Spp=dsg~μνx˙μx˙ν(m~+)=dsgμνx˙μx˙ν(Am~+),S_{\rm pp}=-\int{\rm d}s\sqrt{\tilde{g}_{\mu\nu}\dot{x}^{\mu}\dot{x}^{\nu}}\Bigl{(}\tilde{m}+\cdots\Bigr{)}=-\int{\rm d}s\sqrt{g_{\mu\nu}\dot{x}^{\mu}\dot{x}^{\nu}}\Bigl{(}A\tilde{m}+\cdots\Bigr{)}\,, (143)

where m~\tilde{m} is the Jordan frame inertial mass, in terms of which the Einstein-frame mass is m=m~Am~/τm=\tilde{m}A\simeq\tilde{m}/\sqrt{\tau}. To very good approximation the Jordan-frame mass, m~\tilde{m}, is independent of the scalar fields τ\tau and 𝔞{\mathfrak{a}}, and using this when varying SppS_{\rm pp} with respect to the source position xμx^{\mu} is one way to see why the source moves along geodesics of g~μν\tilde{g}_{\mu\nu}.

But m~\tilde{m} does acquire scalar-field dependence once gravitational self-energy is included, because this contributes δm~/m~GM/R\delta\tilde{m}/\tilde{m}\propto GM/R. Although the ratio of different particle masses, m~1/m~2=m1/m2\tilde{m}_{1}/\tilde{m}_{2}=m_{1}/m_{2} remains field-independent (and this is why e.g. electromagnetic binding energies do not introduce scalar-field dependence), the ratio M/MpM/M_{p} does depend on scalar fields (in both Jordan frame and Einstein frame). This might be expected to cause objects to deviate from Jordan-frame geodesic motion, and one might worry that this effect applied to the Earth and the Moon could cause effects in the very precise lunar-laser-ranging experiments at the GM/R109GM_{\oplus}/R_{\oplus}\sim 10^{-9} level (which would be ruled out).

The effects of having field-dependence in m~\tilde{m} is to alter the geodesic equation to include a new term δ(x¨μ)αm~g~μννϕα\delta(\ddot{x}^{\mu})\propto\partial_{\alpha}\tilde{m}\,\tilde{g}^{\mu\nu}\partial_{\nu}\phi^{\alpha}, which is therefore proportional to local gradients of the scalar fields. However eqs. (134) (applied to the field of the Sun) show that all scalar gradients are proportional to

Xr=βγr2=2βεa3(GMr2)\frac{\partial X}{\partial r}=\frac{\beta\gamma}{r^{2}}=-\frac{2\beta\varepsilon_{a}}{3}\left(\frac{GM}{r^{2}}\right) (144)

and so is again suppressed by the strength of the axion-matter coupling.

We regard evading constraints on Brans-Dicke scalars as one of the main model-building challenges for our proposal, and the fact that an evasion mechanism comes pre-paid is a welcome surprise, particularly given that the other known screening mechanisms – such as those in [100] (for a review see also [101]) – do not seem to work for our set-up.383838They do not do so because they typically seek matter-dependent and vacuum contributions to the scalar dependence that push the light field in opposite directions, thereby creating a new matter-dependent minimum about which long-range forces are suppressed. We have been unable to get these to work for Yoga models – and string models more generally (see however [102, 103]) – because the underlying scale invariance makes both matter and vacuum contributions always consistent with the runaway to the scale-invariant limit τ\tau\to\infty. Moving forward we seek to know how robust and widespread are such mechanisms, but their presence makes constraints coming from cosmology all the more interesting. See also [104, 105, 106] for related discussions.

4.3 Other issues

This section closes with a brief summary of some of the other phenomenological issues for this model.

Axion constraints

The phenomenology of the axion is largely shaped by the prediction (55) of an unusually small decay constant faMp/τ10eVf_{a}\sim{M_{p}}/{\tau}\sim 10\,\hbox{eV}. Depending on the model it may be one of: massless; essentially massless (if axion shift-symmetry anomalies allow WebTW\propto e^{-bT} so that maebτm_{a}\sim e^{-b\tau}); very light (with maMp/τ3/21012m_{a}\sim M_{p}/\tau^{3/2}\leq 10^{-12} eV); or of order maMp/τ10m_{a}\sim M_{p}/\tau\sim 10 eV.

We take the point of view that the new physics that intervenes at eV scales is extra-dimensional, in which case the physics above these scales is the physics of an extra-dimensional Kalb-Ramond field, BMNB_{{\scriptscriptstyle M}{\scriptscriptstyle N}}, of which the dual two-form gauge potential bμνb_{\mu\nu} given by μ𝔞ϵμνλρνbλρ\partial_{\mu}{\mathfrak{a}}\propto\epsilon_{\mu\nu\lambda\rho}\partial^{\nu}b^{\lambda\rho} is a single KK mode. In this case the phenomenology above eV energies is the phenomenology of supersymmetric large extra dimensions, and is dominated by missing energy constraints from astrophysical and accelerator observations [107, 108].

At energies below eV scales the phenomenology is more along the lines for a traditional axion. In any case, the observational limits are very mild for the axion-mass options considered here (for recent reviews on axions, including a comprehensive discussion of bounds, see for instance [112, 113, 114]). A massless (or essentially massless) axion can contribute to the Dark Energy story in an interesting way by complicating how the dilaton evolves at very late times in cosmology (see §5 for a preliminary discussion). In the very light case it might fit into the black hole superradiance regime for some types of black holes. Axion-matter couplings seem important to help the dilaton escape solar-system tests [42], and play a similar role for the viability of dilaton cosmology. But if axion-matter couplings lie in a particular range (not to strong – to avoid thermalization – and not too weak) they can also cause problems by providing too efficient an energy-loss channel for red-giant stars and supernovae. But none of these constraints seem particularly dangerous for this model.

Gravitino and dilatino constraints

Similar remarks apply to the gravitino/dilatino system. Eq. (35) shows that the gravitino in this scenario is very light: m3/2mTeV2/Mp105m_{3/2}\sim m_{\scriptscriptstyle TeV}^{2}/M_{p}\sim 10^{-5} eV. §2.3.2 argues that the dilatino has a similar mass. These are too light for these particles to be proper dark matter candidates, even though they can be considered essentially as stable (since gravitational-strength couplings imply a decay time of order Mp2/m3/23M_{p}^{2}/m_{3/2}^{3}). But, precisely because they are so light (with masses 1\ll 1 eV) they are also cosmologically harmless since there is no danger for them to over-close the universe (see for instance [115, 116] and references therein).

Collider bounds provide a general constraint on gravitino/goldstino production [75, 76], since these are constrained by searches for producing invisible final states in coincidence with a photon (say). At face value the non-observation of these gravitino-production reactions at colliders preclude a gravitino with mass smaller than 10510^{-5} eV (see also [117]). It is the equivalence theorem that makes these such strong bounds, because although the gravitino is only gravitationally coupled, at energies high compared with m3/2m_{3/2} longitudinal polarizations dominate gravitino production and the production of these is well-approximated by the production of goldstinos as predicted in the MpM_{p}\to\infty limit. We sketch this argument in Appendix B.

The unwarped models described here evade these bounds, precisely because FXF^{\scriptscriptstyle X} as defined in (26) has been designed to be large – see for example eq. (29). However, these bounds are at first sight a concern for the warped models described in §3.3, for which the gravitino mass prediction (100) can be much smaller (if b>a)b>a). Because this bound is really a constraint on the goldstino emission using the 4-dimensional goldstino emission rate as described in Appendix B, it is really a constraint on the scale 𝔉:=m3/2Mp{\mathfrak{F}}:=m_{3/2}M_{p} that controls goldstino emission at low energies. But the prediction for the emission rate as a function of 𝔉{\mathfrak{F}} breaks down for energies E>𝔉E\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}\sqrt{\mathfrak{F}}, and 𝔉{\mathfrak{F}} can be as low as (10eV)2(10\;\hbox{eV})^{2} in the scenarios described in §3.3. In the warped models the 4D goldstino EFT description breaks down at these low energies, for much the same reasons as for the axion-production rate at the same scales. The production rate must be computed within the UV completion that kicks in at these scales (such as is done in [107, 108] if the UV completion consists of supersymmetric extra dimensions) in order to be compared with collider experiments whose energies are at TeV scales.

It has recently been argued that models with a light gravitino can undergo catastrophic gravitino production [109], although subsequent studies argue that when supersymmetry is linearly realized such production does not arise once goldstino alignment within the light fermion spectrum is carefully tracked. But this leaves open the possibility that it can arise393939Even if it were to occur it would signal the breakdown of the nonlinearly realized EFT rather than physical gravitino emission. in nonlinearly realized models (like the framework used here) for which constrained fields like ϕ\phi exist and evolve nontrivially with time [110, 111]. Although this can be an issue for cosmological applications it is not an issue here because the field ϕ\phi simply sits at its local minimum while other fields evolve. It is also typically not expected to be an issue for inflationary applications for which fields like ϕ\phi only roll very slowly.

Relaxon constraints

The relaxon also raises some issues, though we have found nothing fatal. A key property of the relaxon is that its nonzero vev shifts to cancel out part of the vacuum energy. But it was also true that ϕ\phi had to carry a conserved charge (either for the discrete symmetry ϕϕ\phi\to-\phi for the real ϕ\phi considered in the main text, or a rephasing symmetry ϕeiθϕ\phi\to e^{i\theta}\phi if ϕ\phi were complex) in order to forbid the appearance of a term MΦXWM\Phi X\in W where MMpM\sim M_{p} is the UV scale. Although such a term also gives a mass to ϕ\phi that is of order mϕM/τm_{\phi}\sim M/\sqrt{\tau}, in this case radiative corrections to the mass are not controlled by corrections to a dimensionless coupling gg and so might destabilize the choices leading to mϕ<mem_{\phi}<m_{e}. Naturalness of the small ϕ\phi mass seems to require such a symmetry.

Either type of symmetry can cause phenomenological trouble once spontaneously broken, with ϕϕ\phi\to-\phi potentially leading to the formation of domain walls in the early universe, and ϕeiθϕ\phi\to e^{i\theta}\phi leading to a massless Goldstone boson in the continuous case. Neither of these need be deadly, since (for example) domain walls can be inflated away and Goldstone bosons need not couple significantly to observable matter. The side-effects of radiative stability of ϕ\phi properties seem to be among the model’s less attractive features.

Since the ϕ\phi mass is not too far below the electron mass it is high enough that the effective range of any ϕ\phi-mediated force acts only over submicron scales, making macroscopic forces due to ϕ\phi-exchange not a worry for tests of gravity. It could have particle physics implications if coupled to Standard Model particles, though nothing requires these to be a level that would have been detected.

The most model-independent coupling between ϕ\phi and Standard Model particles is to the ϕ2\phi^{2}\,{\cal H}^{\dagger}{\cal H} coupling to the Higgs in the |wX|2/τ2|w_{\scriptscriptstyle X}|^{2}/\tau^{2} part of the scalar potential. This would be generic if wXw_{\scriptscriptstyle X} were a linear combination of both wXg(Φ2v2)w_{\scriptscriptstyle X}\ni g(\Phi^{2}-v^{2}) and wXgH(HHvh2)w_{\scriptscriptstyle X}\ni g_{\scriptscriptstyle H}(H^{\dagger}H-v_{h}^{2}) terms, and if present would contribute to the decay width of the Higgs boson into unseen decay daughters (for which the branching ratio cannot be larger than 24% without conflicting with observations [118]).

Besides being generic, a ϕ2HH\phi^{2}H^{\dagger}H coupling arising in wXw_{\scriptscriptstyle X} is also likely to be unsuppressed by powers of 1/τ1/\tau given that both the ϕ\phi and Higgs fluctuation fields have kinetic terms that are themselves suppressed by 1/τ1/\tau. This is precisely the argument used in §2.2 (see also [15]) for the quartic Higgs self-interactions themselves. The ϕ2HH\phi^{2}H^{\dagger}H coupling is, however, suppressed by ggHgg_{\scriptscriptstyle H}, where gg is the small ϕ\phi self-coupling that is argued above – see the discussion below eq. (37) – to be of size gye2g\sim y_{e}^{2}, where yey_{e} is the small electron Yukawa coupling.

5 Axio-dilaton cosmology

We turn now to a preliminary discussion of cosmological issues, and work through the good exercise of comparing the gross features of yoga models with Λ\LambdaCDM cosmology. (A full discussion of cosmology, including fluctuations, goes beyond the scope of this paper.) An important difference between this and previous sections is that the scalar potential cannot be neglected (as opposed, say, to solar-system tests of gravity). Indeed, the framework we propose predicts that dilaton evolution in the presence of this potential provides a specific type of quintessence model for Dark Energy, but (unusually) does so with the small masses and potential energies involved seeming to arise in a natural way.

The field equations describing cosmology obtained by varying the action built from dilb+SM{\cal L}_{{\rm dil}\,b}+{\cal L}_{\scriptscriptstyle SM} as given in (108) are

μν+e2ζχ^μ𝔞ν𝔞+μχ^νχ^+1Mp2[V(χ^)gμν+Tμν12gλρTλρgμν]=0,{\cal R}_{\mu\nu}+e^{-2\zeta\hat{\chi}}\,\partial_{\mu}{\mathfrak{a}}\,\partial_{\nu}{\mathfrak{a}}+\partial_{\mu}\hat{\chi}\,\partial_{\nu}\hat{\chi}+\frac{1}{M_{p}^{2}}\left[V(\hat{\chi})\,g_{\mu\nu}+T_{\mu\nu}-\frac{1}{2}\,g^{\lambda\rho}T_{\lambda\rho}\,g_{\mu\nu}\right]=0\,, (145)
χ^+ζe2ζχ^μ𝔞μ𝔞+1Mp2(Vχ^+𝔤gμνTμν)=0\Box\hat{\chi}+\zeta\,e^{-2\zeta\hat{\chi}}\,\partial_{\mu}{\mathfrak{a}}\,\partial^{\mu}{\mathfrak{a}}+\frac{1}{M_{p}^{2}}\left(-\frac{\partial V}{\partial\hat{\chi}}+{\mathfrak{g}}\,g_{\mu\nu}T^{\mu\nu}\right)=0 (146)

and

μ[ge2ζχ^μ𝔞]=g𝒥3Mp2\partial_{\mu}\Bigl{[}\sqrt{-g}\;e^{-2\zeta\hat{\chi}}\,\partial^{\mu}{\mathfrak{a}}\Bigr{]}=-\frac{\sqrt{-g}\,{\cal J}}{3M_{p}^{2}} (147)

where χ^:=χ/Mp\hat{\chi}:=\chi/M_{p} and 𝔤=12ζ0.41{\mathfrak{g}}=-\frac{1}{2}\,\zeta\simeq-0.41 and we assume 𝔞{\mathfrak{a}} is massless.

For cosmological applications we assume the axion source 𝒥{\cal J} — defined in (133) — is proportional to the number density of baryonic matter,404040More precisely, we assume for simplicity here that a3𝒥a^{3}{\cal J} is independent of both χ\chi and cosmic time, where a(t)a(t) is the cosmic scale factor. and so satisfies 𝒥=𝔤anB{\cal J}={\mathfrak{g}}_{a}n_{\scriptscriptstyle B} with a constant effective axion-matter coupling 𝔤a{\mathfrak{g}}_{a} that is small in the sense that |𝒥|ρm|{\cal J}|\ll\rho_{m}. Having 𝒥{\cal J} smaller than ρm\rho_{m} seems natural for two reasons: the density of baryons can be smaller than the dark matter density, and the parity-odd nature of the axion usually keeps microscopic contributions to 𝒥{\cal J} from summing coherently within macroscopic bodies (unlike the energy density).

For homogeneous solutions in a spatially flat FRW spacetime, gμνdxμdxν=dt2+a2(t)d𝐱2g_{\mu\nu}{\rm d}x^{\mu}{\rm d}x^{\nu}=-{\rm d}t^{2}+a^{2}(t)\,{\rm d}{\bf x}^{2}, eq. (147) becomes

(𝔞¨+3H𝔞˙2ζMp2χ^˙𝔞˙)e2ζχ^𝔤anB3Mp2=0.\left(\ddot{\mathfrak{a}}+3H\dot{\mathfrak{a}}-\frac{2\zeta}{M_{p}^{2}}\dot{\hat{\chi}}\dot{\mathfrak{a}}\right)e^{-2\zeta\hat{\chi}}-\frac{{\mathfrak{g}}_{a}n_{\scriptscriptstyle B}}{3M_{p}^{2}}=0\,. (148)

Eq. (145) similarly boils down to the Friedmann equation

H2=ρ3Mp2=13Mp2{ρf+ρa+χ˙22+V},H^{2}=\frac{\rho}{3M_{p}^{2}}=\frac{1}{3M_{p}^{2}}\left\{\rho_{f}+\rho_{a}+\frac{\dot{\chi}^{2}}{2}+V\right\}\,, (149)

Here the subscript ‘ff’ denotes the contribution of the cosmological fluid, whose energy density and pressure

ρf=ρm(χ)+ρradandpf=prad=13ρrad,\rho_{f}=\rho_{m}(\chi)+\rho_{\rm rad}\quad\hbox{and}\quad p_{f}=p_{\rm rad}=\frac{1}{3}\,\rho_{\rm rad}\,, (150)

contain the contributions from of ordinary radiation and nonrelativistic (ordinary and dark) matter. Finally, the dilaton equation (146) reduces to

χ^¨+3Hχ^˙+ζe2ζχ/Mp𝔞˙2+1Mp2[V(χ^)+𝔤ρm(χ^)]=0\ddot{\hat{\chi}}+3H\dot{\hat{\chi}}+\zeta\,e^{-2\zeta\chi/M_{p}}\,\dot{\mathfrak{a}}^{2}+\frac{1}{M_{p}^{2}}\Bigl{[}V^{\prime}(\hat{\chi})+{\mathfrak{g}}\,\rho_{m}(\hat{\chi})\Bigr{]}=0 (151)

with prime denoting differentiation with respect to χ^\hat{\chi} and the scalar potential given by (109)

V=Ueλχ^,V=U\,e^{-\lambda\hat{\chi}}\,, (152)

where λ=4ζ=4233.3\lambda=4\zeta=4\sqrt{\frac{2}{3}}\simeq 3.3 is a known dimensionless parameter and UU is the polynomial or rational function of χ^\hat{\chi} used to stabilize τ\tau in §3.2.

Eq. (150) deliberately emphasizes how the Einstein-frame matter density implicitly depends on χ\chi, since this is important when computing how it evolves. To identify how ρm\rho_{m} depends on χ\chi, start from the observation that the total number of nonrelativistic particles is given in Einstein frame by gn\sqrt{-g}\;n, implying that the Einstein frame particle density na3n\propto a^{-3} falls inversely proportional to the Einstein-frame volume as the universe expands. Writing ρm=m(χ^)n\rho_{m}=m(\hat{\chi})\,n with particle mass m(χ^)=𝔪A(χ^)τ1/2=e12ζχ^m(\hat{\chi})={\mathfrak{m}}\,A(\hat{\chi})\propto\tau^{-1/2}=e^{-\frac{1}{2}\,\zeta\,\hat{\chi}}, then implies that the Einstein-frame energy density for nonrelativistic matter evolves as ρmA(χ^)/a3\rho_{m}\propto A(\hat{\chi})/a^{3} as the universe expands. The factor of A(χ^)A(\hat{\chi}) can be important during epochs where χ^\hat{\chi} evolves.

5.1 Dilaton dark energy

Can the above system describe Dark Energy as revealed to us by observations? We here first explore ‘dilaton-only’ cosmologies for which the axion-matter coupling vanishes: 𝒥=0{\cal J}=0, in which case the trivial solution with constant 𝔞{\mathfrak{a}} satisfies the axion equation (148). We return to more complicated axionic evolution below.

Viable dilaton cosmologies require the present-day scalar energy density to be positive and to make up around 75% of the total, with an equation of state w=pχ/ρχ<0.9w=p_{\chi}/\rho_{\chi}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}-0.9 [6]. It also must not ruin the other successes of Λ\LambdaCDM, and so (for example) ρχ\rho_{\chi} cannot be more than a few percent of the total energy density at nucleosynthesis [119]. Fluctuations about such configurations must also be consistent with CMB measurements and what is known about its consistency with large-scale structure (LSS).

There are also constraints on how far the field χ\chi itself can roll because changes to A(χ)A(\chi) correspond to changes in particle masses (relative to the Planck mass, which in Einstein frame is fixed). Particle masses cannot vary by more than a few percent between nucleosynthesis and now, without ruining otherwise successful standard predictions [120]. They also cannot vary too much in the relatively recent universe (redshifts z<7z\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}7) without unacceptably changing the properties of spectral lines observed once stars and quasars turn on [6, 121]. In principle particle masses are also constrained not to be too different at recombination, though (intriguingly) a few percent change in particles masses at that time changes the precise epoch of recombination in a way that can help resolve [44] current tensions [43] between CMB and local measurements of the present-day Hubble scale.

We are able to satisfy these constraints, but only if the factor UU appearing in the scalar potential allows the existence of a local minimum, at a position we denote χ=χ0\chi=\chi_{0}. (§3.2 shows that this is relatively easy to arrange and can be done for acceptably large τ0=eζχ^0\tau_{0}=e^{\zeta\hat{\chi}_{0}} using only parameters that are no larger than around 60.) Successful cosmology requires some structure in UU because there are several things that go wrong when U=U0U=U_{0} is constant. First, if the scalar energy density were ever to dominate (as Dark Energy now does) then it must be slowing rolling, with w+1χ˙2/V<0.1w+1\simeq\dot{\chi}^{2}/V\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}0.1 in order to be sufficiently close to w=1w=-1. But slow-roll solutions with V=U0eλχ^V=U_{0}e^{-\lambda\hat{\chi}} satisfy 3Hχ^˙V/Mp2λV/Mp23H\dot{\hat{\chi}}\simeq-V^{\prime}/M_{p}^{2}\simeq\lambda V/M_{p}^{2} and 3Mp2H2V3M_{p}^{2}H^{2}\simeq V and so would predict

χ˙2V=χ^˙2Mp2V(V/3H)2Mp2Vλ233.6,\frac{\dot{\chi}^{2}}{V}=\frac{\dot{\hat{\chi}}^{2}M_{p}^{2}}{V}\simeq\frac{(V^{\prime}/3H)^{2}}{M_{p}^{2}V}\simeq\frac{\lambda^{2}}{3}\simeq 3.6\,, (153)

which is clearly too large.

Alternatively, if the scalar field does not initially dominate (as must have been true in the past, such as during the recombination or nucleosynthesis epochs) then with V=U0eλχ^V=U_{0}e^{-\lambda\hat{\chi}} it turns out never to dominate at later times. It does not do so (at least when λ3.3\lambda\simeq 3.3) because it robustly gets drawn into a late-time attractor scaling solution of the form

χ=χ0+nλln(aa0),\chi=\chi_{0}+\frac{n}{\lambda}\ln\left(\frac{a}{a_{0}}\right)\,, (154)

where the dominant energy density is assumed to scale with universal expansion like ρan\rho\propto a^{-n} (such as n=4n=4 for radiation or n=3n=3 for matter domination). In this solution the scalar energy density scales as a fixed fraction of the dominant energy density [122]. More precisely, if λ2>n\lambda^{2}>n the tracker solution very robustly predicts the late-time scalar energy fraction to take a time-independent value: ρχ/ρ=n/λ2\rho_{\chi}/\rho=n/\lambda^{2} (as we have also verified numerically). Since for the case of interest λ210.9\lambda^{2}\simeq 10.9 this applies in both radiation and matter dominated universes, implying in these cases ρχ/ρrad0.37\rho_{\chi}/\rho_{\rm rad}\simeq 0.37 or ρχ/ρm0.28\rho_{\chi}/\rho_{m}\simeq 0.28. Both of these are much too small to describe Dark Energy.

Dilaton cosmologies with more potential

Refer to caption
Figure 3: Log (base 10) of the scalar potential vs the canonical field χ\chi in Planck units. This plot uses u1=0.027027u_{1}=0.027027, u2=0.00036523u_{2}=0.00036523 and U0=ϵ5U_{0}=\epsilon^{5} with ϵ=1/30\epsilon=1/30.

Things work much better when U(χ)U(\chi) allows a minimum at χ=χ0\chi=\chi_{0} provided the late-time evolution becomes dominated by the fixed vacuum energy V(χ^0)V(\hat{\chi}_{0}), because in this case the scalar sector can behave at late times like a cosmological constant. For concreteness’ sake we use the potential V=Ueλχ^V=U\,e^{-\lambda\hat{\chi}} with

UU0[1u1χ^+u22χ^2],U\simeq U_{0}\left[1-u_{1}\hat{\chi}+\frac{u_{2}}{2}\,\hat{\chi}^{2}\right]\,, (155)

where earlier sections tell us λ=4ζ3.3\lambda=4\zeta\simeq 3.3 and U0ϵ5Mp4U_{0}\sim\epsilon^{5}M_{p}^{4} for ϵ1/400\epsilon\simeq 1/400 implying present-day cosmology should occur for χ^75\hat{\chi}\sim 75 since then τ=eζχ^e601026\tau=e^{\zeta\hat{\chi}}\sim e^{60}\sim 10^{26} (as required in earlier sections) and ϵ5eλχ^ϵ5e2406×10105ϵ5\epsilon^{5}e^{-\lambda\hat{\chi}}\sim\epsilon^{5}e^{-240}\sim 6\times 10^{-105}\epsilon^{5}. These choices are the minimal ones that allow VV to be positive for all χ^\hat{\chi} (which is true provided 2u2u122u_{2}\geq u_{1}^{2}) and have a local minimum: V(χ^0)=0V^{\prime}(\hat{\chi}_{0})=0 with V′′(χ^0)V^{\prime\prime}(\hat{\chi}_{0}) positive (see Fig. 3), with

χ^0=1λ[1+λu1u21+λ2u12u222λ2u2]\hat{\chi}_{0}=\frac{1}{\lambda}\left[1+\frac{\lambda u_{1}}{u_{2}}-\sqrt{1+\frac{\lambda^{2}u_{1}^{2}}{u_{2}^{2}}-\frac{2\lambda^{2}}{u_{2}}}\right] (156)

showing that χ^0\hat{\chi}_{0} can be order 75 simply by requiring a mild hierarchy amongst the positive parameters u1u_{1} and u2u_{2}. The root not shown in (156) is a local maximum, χ^max>χ^0\hat{\chi}_{\rm max}>\hat{\chi}_{0}, that separates the minimum from the runaway as χ^\hat{\chi}\to\infty.

In the absence of the axion and for sufficiently small 𝔤{\mathfrak{g}} these choices give evolution that resembles the cosmic histories described in [37]. Small 𝔤{\mathfrak{g}} is required because when 𝔤<0{\mathfrak{g}}<0 (as in our case) the matter coupling in (151) relentlessly pushes the dilaton to larger values of χ\chi. When 𝔞˙=0\dot{\mathfrak{a}}=0 the only force (besides Hubble friction) pushing in the opposite direction is the scalar potential VV^{\prime} in the range χ^0<χ^<χ^max\hat{\chi}_{0}<\hat{\chi}<\hat{\chi}_{\rm max} for which V(χ^)>0V^{\prime}(\hat{\chi})>0. Trapping in the local minimum only happens when VV^{\prime} successfully competes with the matter force, and so requires |𝔤|ρm<|V||{\mathfrak{g}}|\rho_{m}\mathrel{\raise 1.29167pt\hbox{$<$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}|V^{\prime}|, but because VV(χ^0)V^{\prime}\sim V(\hat{\chi}_{0}) in the range where V>0V^{\prime}>0 this can only occur once |𝔤|ρm|{\mathfrak{g}}|\rho_{m} falls below the present-day Dark Energy density, which only happens at very late times for 𝔤{\mathfrak{g}} order unity.

Refer to caption
Figure 4: A log-log plot (base 10) of the energy density in matter (dashed), radiation (dotted) and the dilaton’s total (solid) and potential (dot-dashed) energy densities as a function of universal scale factor, with a=1a=1 representing the present, with dilaton initially chosen moving near the potential’s minimum. For illustrative purposes we choose no axion evolution and a small matter-dilaton coupling |𝔤|105|{\mathfrak{g}}|\sim 10^{-5} (colour online).

Fig. 4 plots the energies in radiation, matter and the scalar field for a sample evolution in which |𝔤|105|{\mathfrak{g}}|\sim 10^{-5}, showing how in these circumstances the dilaton becomes trapped at late times despite having initially fairly large kinetic energy. Fig. 5 shows how the evolution instead proceeds if 𝔤=12ζ0.408{\mathfrak{g}}=-\frac{1}{2}\zeta\simeq-0.408 as predicted for this model in earlier sections. In this case the stronger matter coupling draws the dilaton evolution into an attractor scaling solution early in the matter-dominated epoch (though driven by the matter coupling rather than the potential, as described in [37]) and the matter-dilaton coupling sweeps the dilaton over the barrier and out to the runaway at χ\chi\to\infty. This is a serious problem when constructing potentially viable cosmologies, but (similar to [42]) a small axion-matter coupling can save the day in an interesting way.

Refer to caption
Figure 5: A log-log plot (base 10) of the energy density in matter (dashed), radiation (dotted) and the dilaton’s total (solid) and potential (dash-dotted) energies as a function of universal scale factor, with a=1a=1 representing the present, with initial kinetic energy near BBN not small. We choose no axion evolution but use 𝔤=0.408{\mathfrak{g}}=-0.408 as predicted by our model. In this case the large dilaton-matter coupling sweeps the dilaton out to a runaway past the potential’s minimum.

5.2 Including axion evolution

Keeping in mind the above experience with the dilaton, we now revisit the possibility that 𝒥0{\cal J}\neq 0, and so assume nonzero coupling 𝔤a{\mathfrak{g}}_{a}. In this case the axion-matter interaction prevents the axion from remaining constant cosmologically, and (similar to what happened in §4.2.3) this can ameliorate the implications of large dilaton coupling.

When 𝔤a=0{\mathfrak{g}}_{a}=0 the axion evolution equation (148) describes a conservation law stating

L:=a3e2ζχ^𝔞˙L:=a^{3}\;e^{-2\zeta\hat{\chi}}\,\dot{\mathfrak{a}} (157)

is time independent. In the absence of dramatic χ\chi evolution constant LL describes the draining of any initial axion kinetic energy

ρa=Mp22e2ζχ^𝔞˙2=L2Mp22a6e+2ζχ^\rho_{a}=\frac{M_{p}^{2}}{2}\,e^{-2\zeta\hat{\chi}}\dot{\mathfrak{a}}^{2}=\frac{L^{2}M_{p}^{2}}{2a^{6}}\,e^{+2\zeta\hat{\chi}} (158)

due to Hubble friction. Initially rolling axion fields stop and thereafter remain constant.

By contrast, when 𝔤a0{\mathfrak{g}}_{a}\neq 0 baryonic matter acts as a source for axion ‘charge’ and (148) describes how this winds the axion up by increasing LL with constant rate

L˙=𝔤a3Mp2a3(t)nB(t)=𝔤anB03Mp2,\dot{L}=\frac{{\mathfrak{g}}_{a}}{3M_{p}^{2}}\;a^{3}(t)n_{\scriptscriptstyle B}(t)=\frac{{\mathfrak{g}}_{a}n_{{\scriptscriptstyle B}0}}{3M_{p}^{2}}\,, (159)

where the combination a3nBa^{3}n_{\scriptscriptstyle B} is time-independent (and is conveniently given by nB0=nB(tnow)n_{{\scriptscriptstyle B}0}=n_{\scriptscriptstyle B}(t_{\rm now}) where tnowt_{\rm now} is the present-day time, for which a(tnow)=1a(t_{\rm now})=1). It is natural to assume L=0L=0 when t=0t=0 and if so this integrates to give:

L(a)=𝔤anB03Mp2t(a),L(a)=\frac{{\mathfrak{g}}_{a}n_{{\scriptscriptstyle B}0}}{3M_{p}^{2}}\,t(a)\,, (160)

where t(a)t(a) is the cosmic time as a function of scale factor.

For a universe sequentially dominated by radiation and matter the Hubble scale is given by H2(a)=12Heq2[(aeq/a)3+(aeq/a)4]H^{2}(a)=\frac{1}{2}H_{\rm eq}^{2}[(a_{\rm eq}/a)^{3}+(a_{\rm eq}/a)^{4}] where the radiation and matter densities are equal when a=aeqa=a_{\rm eq} and H(aeq)=HeqH(a_{\rm eq})=H_{\rm eq}. This implies the expression for t(a)t(a) to be used in (160) is

t(a)=223Heq[2+(aaeq+1)1/2(aaeq2)].t(a)=\frac{2\sqrt{2}}{3H_{\rm eq}}\left[2+\left(\frac{a}{a_{\rm eq}}+1\right)^{1/2}\left(\frac{a}{a_{\rm eq}}-2\right)\right]\,. (161)

Notice that for aaeqa\ll a_{\rm eq} this predicts that Lt(a)a2L\propto t(a)\propto a^{2} during radiation domination but when aaeqa\gg a_{\rm eq} it instead predicts Lt(a)a3/2L\propto t(a)\propto a^{3/2} during matter domination.

Using these scaling rules in eq. (158) predicts (if χ\chi does not roll dramatically) the axion kinetic energy scales like

ρa{a2(radiation dominated)a3(matter dominated)\rho_{a}\propto\begin{cases}a^{-2}&\hbox{(radiation dominated)}\\ a^{-3}&\hbox{(matter dominated)}\end{cases} (162)

showing how the baryon-driven wind-up of the axion makes the axion energy density grow relative to other components during radiation domination, though it remains a fixed fraction of the total density once the universe is matter dominated.

Cosmological evolution is governed by the Friedmann equation (149) together with the dilaton evolution equation (151), which after using (157) becomes

χ^¨+3Hχ^˙+1Mp2[V(χ^)+ζ(L2Mp2a6e+2ζχ^ρm02a3e12ζ(χ^χ^0))]=0\ddot{\hat{\chi}}+3H\dot{\hat{\chi}}+\frac{1}{M_{p}^{2}}\left[V^{\prime}(\hat{\chi})+\zeta\left(\frac{L^{2}M_{p}^{2}}{a^{6}}\,e^{+2\zeta\hat{\chi}}-\frac{\rho_{m0}}{2a^{3}}\,e^{-\frac{1}{2}\zeta(\hat{\chi}-\hat{\chi}_{0})}\right)\right]=0 (163)

where (as before) primes denote differentiation with respect to χ^=χ/Mp\hat{\chi}=\chi/M_{p} and we use the prediction 𝔤=12ζ0.408{\mathfrak{g}}=-\frac{1}{2}\zeta\simeq-0.408 with ζ=230.8165\zeta=\sqrt{\frac{2}{3}}\simeq 0.8165. Notice that the χ\chi-dependence of the axion and matter couplings make the χ\chi evolution appear as if it were moving in the presence of a time-dependent potential

Veff(χ^,a):=V+L2(a)Mp22a6e+2ζχ^+ρm0a3e12ζ(χ^χ^0).V_{\rm eff}(\hat{\chi},a):=V+\frac{L^{2}(a)M_{p}^{2}}{2a^{6}}\,e^{+2\zeta\hat{\chi}}+\frac{\rho_{m0}}{a^{3}}\,e^{-\frac{1}{2}\zeta(\hat{\chi}-\hat{\chi}_{0})}\,. (164)

At late times this always approaches VV but the other two terms can dominate at early times when the matter and axion energy densities are larger than VV.

Viable axio-dilaton cosmologies

For viable cosmologies we again seek solutions whose dilaton becomes trapped at late times at the minimum of the scalar potential, with χ=χ0\chi=\chi_{0} and χ˙0\dot{\chi}_{0} small enough to preclude escaping over the local maximum and running off to infinity. Such solutions ensure that Dark Energy at late times resembles a cosmological constant.

The possibilities for finding such solutions that can trap the dilaton into the potential’s local minimum are much greater once the axion is included, because the axion term in (164) provides a force that always pushes χ^\hat{\chi} towards smaller values. Intuition for why it does so can be found by solving the axio-dilaton evolution in the limit where the dilaton-matter and axion-matter couplings (𝔤{\mathfrak{g}} and 𝔤a{\mathfrak{g}}_{a}) are negligible. In this case LL is to a good approximation conserved and the coupled axio-dilaton equations can be explicitly solved under the assumption that the Hubble scale is dominated by another energy source for which ρ(a)an\rho(a)\propto a^{-n} (such as radiation, say, for which n=4n=4). The solutions are derived very much along the lines pursued in [42], leading to

𝔞=ABtanhX(t)andτ=BcoshX(t){\mathfrak{a}}=A-B\tanh X(t)\quad\hbox{and}\quad\tau=\frac{B}{\cosh X(t)} (165)

where AA and BB are integration constants and (as before) τ=eζχ^\tau=e^{\zeta\hat{\chi}} while

X˙(t)=BL(a0a)3.\dot{X}(t)=-BL\left(\frac{a_{0}}{a}\right)^{3}\,. (166)

This last equation integrates to give

X(t)=X0BLa03t0tdua3(u)=X0+BLt03s1[1(t0t)3s1]X(t)=X_{0}-BLa^{3}_{0}\int_{t_{0}}^{t}\frac{{\rm d}u}{a^{3}(u)}=X_{0}+\frac{BLt_{0}}{3s-1}\left[1-\left(\frac{t_{0}}{t}\right)^{3s-1}\right] (167)

where a(t)=a0(t/t0)sa(t)=a_{0}(t/t_{0})^{s} with s=2/ns=2/n. In particular, eqs. (165) imply these solutions trace out a semicircle in the upper-half τ\tau𝔞{\mathfrak{a}} plane, given by

τ2+(𝔞A)2=B2\tau^{2}+\left({\mathfrak{a}}-A\right)^{2}=B^{2}\, (168)

and it is the LL-dependent force pushing χ\chi to smaller values that causes χ\chi to accelerate as needed to remain on this semicircle.

Returning to viable cosmologies with nonzero 𝔤{\mathfrak{g}} and 𝔤a{\mathfrak{g}}_{a}: as an existence proof for solutions with χ=χ0\chi=\chi_{0} and χ˙0\dot{\chi}\simeq 0 at late times we ask when χ=χ0\chi=\chi_{0} can be a time-independent solution to (163). Since V(χ0)=0V^{\prime}(\chi_{0})=0 such solutions exist when the other terms in VeffV^{\prime}_{\rm eff} obtained from (164) cancel:

L2(a)a6e+2ζχ^0ρm02a3Mp2=0.\frac{L^{2}(a)}{a^{6}}\,e^{+2\zeta\hat{\chi}_{0}}-\frac{\rho_{m0}}{2a^{3}M_{p}^{2}}=0\,. (169)

Remarkably, this is a time-independent condition during a matter-dominated universe because in this case (160) and (161) imply L(a)a3/2L(a)\propto a^{3/2}. Eq. (169) requires that the present-day value L(t0)=L0L(t_{0})=L_{0} should satisfy

L02=ρm02Mp2e2ζχ^0=ρm02Mp2τ02,L_{0}^{2}=\frac{\rho_{m0}}{2M_{p}^{2}}\,e^{-2\zeta\hat{\chi}_{0}}=\frac{\rho_{m0}}{2M_{p}^{2}\tau_{0}^{2}}\,, (170)

and so L02ρm0/Mp2L_{0}^{2}\lll\rho_{m0}/M_{p}^{2} and ρa(t0)ρm0\rho_{a}(t_{0})\lll\rho_{m0}.

With these choices eqs. (160) and (161) imply Leq=L(aeq)L_{\rm eq}=L(a_{\rm eq}) at radiation-matter equality is of order

LeqL0(aeqa0)3/2and soρa(aeq)ρm(aeq)ρa(a0)ρm01L_{\rm eq}\simeq L_{0}\left(\frac{a_{\rm eq}}{a_{0}}\right)^{3/2}\quad\hbox{and so}\quad\frac{\rho_{a}(a_{\rm eq})}{\rho_{m}(a_{\rm eq})}\simeq\frac{\rho_{a}(a_{0})}{\rho_{m0}}\lll 1 (171)

because the evolution of χ\chi plays no role in the evolution of ρm\rho_{m} when χ\chi is fixed at χ0\chi_{0}. Pushing back further, into the radiation dominated regime, the field χ\chi can no longer remain anchored at χ0\chi_{0} because L2/a61/a2L^{2}/a^{6}\propto 1/a^{2} implies (169) can no longer be satisfied for all times. Furthermore, assuming χ\chi does not evolve dramatically, the evolution of L(a)L(a) during radiation domination also implies that the axion term becomes less and less important relative to the matter term (so far as dilaton evolution is concerned) the further back in time one goes, given that they were equal at a=aeqa=a_{\rm eq}.

In this regime the dilaton evolution during radiation domination proceeds as if the axion were not present, and so closely resembles the evolution discussed earlier (and found in [37]) for dilaton-matter evolution. In particular, Hubble damping indeed suppresses χ\chi evolution even when the dilaton kinetic energy is much greater than its potential energy.

We have explored how sensitive successful cosmology is to initial conditions, and find that those stated above can be relaxed (see for example the plot of Fig. 6) provided that there is sufficient axion energy density present. This implies a lower bound on the axion-matter coupling, but this lower bound turns out to be very small (the plot in Fig. 6 corresponds to 𝒥/ρm1.05×1025{\cal J}/\rho_{m}\simeq 1.05\times 10^{-25}). Couplings this small are also large enough to evade solar-system bounds for the dilaton through the ‘homeopathy’ mechanism of [42], yet are easily small enough to keep the axion itself undetected in solar-system tests.

Refer to caption
Figure 6: A log-log plot (base 10) of the energy density in matter (dashed), radiation (dotted) and the dilaton’s total (solid) and potential (orange dash-dotted) energies and axion energy (purple dash-dotted) as a function of universal scale factor, with a=1a=1 representing the present, with initial axion and dilaton kinetic energies nonzero and 𝔤=0.408{\mathfrak{g}}=-0.408 as predicted by our model. The axion energy executes a crossover from ρaa2\rho_{a}\propto a^{-2} to ρaa3\rho_{a}\propto a^{-3} when passing from radiation to matter domination, as described in the text, and the competition between axion-dilaton and matter-dilaton couplings keeps the dilaton near its present value long enough for the scalar potential to take over.
Refer to caption
Figure 7: The evolution of the scalar field, χ^=χ/Mp\hat{\chi}=\chi/M_{p}, as a function of 𝔷=log10a{\mathfrak{z}}=\log_{10}a where aa is the universal scale factor and a=1a=1 represents the present, using the same parameters used in Fig. 6. This shows that the field χ\chi does not evolve unacceptably between nucleosynthesis (a1010a\simeq 10^{-10}) and now, though the transition in axion behaviour at radiation-matter equality (a3×104a\simeq 3\times 10^{-4}) naturally produces variations of in χ\chi near recombination (a103a\simeq 10^{-3}) at the several percent level (and somewhat larger just before and after).
ωr\omega_{r} ωm\omega_{m} ωχ\omega_{\chi} ωa\omega_{a} ωc=ωm+ωa\omega_{c}=\omega_{m}+\omega_{a} ww
5×1055\times 10^{-5} 0.180.18 0.760.76 0.0800.080 0.260.26 0.99-0.99
Table 1: The present-day fraction of the total energy density in the various cosmic fluids, with ωi=ρi/ρtot\omega_{i}=\rho_{i}/\rho_{\rm tot} for radiation (rr), dark matter (mm), the dilaton (χ\chi) and axion (aa), for the cosmology shown in Figs. 6 and 7. Notice that the effective total dark matter density is ωc=ωm+ωa\omega_{c}=\omega_{m}+\omega_{a} in this model. The last column gives the equation of state parameter, w=pχ/ρχw=p_{\chi}/\rho_{\chi}, for the dilaton fluid.

The cosmology shown in Fig. 6 satisfies the main phenomenological smell-checks one would require before launching more detailed studies. For instance, the scalar fields’ energy densities are less than a few percent of the total at nucleosynthesis (indeed, in the scenario shown they are well below this bound). The predictions for the present-day fraction of the total energy budget in the various ingredients are given in Table 1, as is the dilaton’s equation of state parameter w=pχ/ρχw=p_{\chi}/\rho_{\chi}. These are also broadly consistent with standard Λ\LambdaCDM cosmology, though with a few interesting wrinkles.

The wrinkles are revealed once the time-dependence of the energy densities is included. In the scenario shown, for example, 44% of the total apparent Dark Matter at present consists of axions, though the precise mix changes with time as the universe evolves. Similarly, the oscillations of the dilaton around its minimum also imply possibly detectable oscillations in the Dark Energy equation of state parameter in the relatively recent past.

More stringent constraints come from the change in particle masses (relative to MpM_{p}) between now and past epochs, that occur because of their dependence on the dilaton: m(χ)=𝔪e12ζχ^m(\chi)={\mathfrak{m}}\,e^{-\frac{1}{2}\,\zeta\hat{\chi}}. These can be seen from the evolution of χ^(t)\hat{\chi}(t) shown in Fig. 7 (for the same parameters as used in Fig. 6). For instance, particle masses cannot differ from their current values by more than a few percent at nucleosynthesis without altering otherwise-successful Λ\LambdaCDM predictions [120]. Similar bounds coming from CMB observations preclude significant mass changes near recombination (more about which below). Measured spectral lines also strongly constrain mass changes after stars form at redshifts around 7.

These conditions require the dilaton profile not to differ too much from its current value in these windows where these masses are well-measured, and the profile shown in Fig. 7 is chosen to try to minimize these. As the figure shows, the constraint at BBN is fairly easy to satisfy. Although one might think that not allowing the dilaton to move far over cosmological times should also strongly constrain the amount of kinetic energy it can initially have, the ruthless efficiency of Hubble friction turns out to make this not so; cosmic evolution usually drains the scalar kinetic energy before it rolls very far.

We find it to be fairly generic to have excursions in τ\tau shortly after radiation-matter crossover because this changes the nature of the underlying scaling solutions towards which evolution is attracted. Late-time oscillations of the dilaton as it settles into the potential’s local minimum are also common features of most scenarios we have so far found, though the details of their amplitude and timing depend somewhat on the precise initial conditions. Such oscillations in particle masses are likely to be strongly constrained by observations of spectral lines unless they are damped out before the end of the Dark Ages.

A more detailed exploration of these and other cosmological consequences goes beyond the scope of this paper, but is clearly of great interest to further explore.

Axion as Dark Matter

As mentioned above, because the axion energy density falls like 1/a31/a^{3} during matter domination it contributes to the total observed Dark Matter abundance at late times. Indeed in the specific scenario described above it contributed just shy of half of the dark matter at the present epoch.

This suggests it is logically possible that the axion itself could provide all of the dark matter. This possibility might seem surprising at first sight, because normally an axion with the couplings and masses entertained here would not be considered a dark-matter candidate. It would normally be excluded to the extent that these properties are inconsistent with standard production mechanisms, such as when axions arise from coherent oscillations about the minimum of a symmetry-breaking potential.

Inapplicability of such production mechanisms need not be a problem in the present instance, however, because the winding up of the axion due to a small axion-matter coupling – as predicted by (160) and (161) – is a production mechanism in itself. We have explored this numerically and have verified that we can generate a cosmologically significant axion energy density that when dominant falls as 1/a31/a^{3} (as would be required of dark matter). This can be done despite such a cosmology being unable to profit from the balancing of dilaton-axion and dilaton-matter forces, however, and we have verified that trapping of the dilaton can be done in this case under certain circumstances.

The possibility of uniting Dark Matter and Dark Energy into a unified axio-dilaton picture is a beautiful direction that is well worth more detailed exploration, though we leave this for future work.

Potential relevance to the H0H_{0} puzzle

A suggestive consequence of the scenario proposed here is the dilaton evolution that is predicted at relatively late epochs, particularly immediately after radiation-matter equality. This is a fairly generic phenomenon: particle masses can evolve coherently relative to the Planck scale as the dilaton evolves (both in space and in time) although mass ratios do not change. Such changes in particle masses are detectable, and we have tried to minimize this motion when seeking cosmological solutions to avoid many of the bounds that exclude gross changes to particle masses.

It is tantalizing in this context that having particle masses change by a few percent around the epoch of recombination — which conveniently occurs shortly after radiation-matter equality — has been identified [44] as being one of the few viable options for resolving the current Hubble tension [43] through changes to fundamental physics (see also [123, 124] for related ideas involving scale invariance and the Hubble tension).

In particular ref. [44] studies how changes to cosmological parameters can avoid ruining the great success of CMB observations. They find that a correlated change at recombination to the electron mass, mem_{e}, baryon density, ωb\omega_{b} and dark-matter density, ωc\omega_{c}, can drop out of the gross features of the CMB (though not from those more detailed features that depend on non-equilibrium effects). Defining for any variable XX the quantity ΔX=ln(X/X0)\Delta_{\scriptscriptstyle X}=\ln(X/X_{0}) where X0X_{0} is the ‘baseline’ local value, ref. [44] finds CMB observations remain largely unchanged if

Δme=Δωb=Δωc.\Delta_{m_{e}}=\Delta_{\omega_{b}}=\Delta_{\omega_{c}}\,. (172)

Remarkably, nonzero changes of this type can actually improve the description of CMB physics because they alter the epoch of recombination by an amount Δa=Δme\Delta_{a_{\star}}=-\Delta_{m_{e}}. This in turn alters the value of H0H_{0} inferred from CMB measurements in a way that can bring it into line with non-CMB measurements of H0H_{0}. Although the change in H0H_{0} implied by (172) would also alter BAO observations, ref. [44] argues these changes can be compensated for by a small nonzero spatial curvature of order ωk0.125Δme\omega_{k}\simeq-0.125\Delta_{m_{e}}, leading to a change to the Hubble scale of Δh=1.5Δme\Delta_{h}=1.5\Delta_{m_{e}}. In these circumstances a 5% increase in the electron mass at recombination would account for the 8% discrepancy between measured values for H0H_{0}.

Since our model predicts the electron mass could indeed be different at recombination, it is natural to ask whether the conditions (172) would also follow if τ\tau were to differ at recombination from its present-day value. We believe that they do, because this would change all masses by a common factor. For instance, to the extent that the number density of baryons is tied to the CMB photon density, changes to the baryon density at recombination dominantly come from changes to the nucleon mass: Δωb=ΔmN=Δme=12Δτ=𝔤(χ^χ^0)\Delta_{\omega_{b}}=\Delta_{m_{\scriptscriptstyle N}}=\Delta_{m_{e}}=-\frac{1}{2}\Delta_{\tau}=-{\mathfrak{g}}(\hat{\chi}-\hat{\chi}_{0}). A similar argument also plausibly applies to dark matter, provided its mass scales like τ1/2\tau^{-1/2}.

The prediction 𝔤0.41{\mathfrak{g}}\simeq-0.41 implies a 12% decrease in χ^\hat{\chi} (i.e. a 10% decrease in τ\tau) at recombination produces a 5% increase in particle masses, and as plots like Fig. 7 show, 10% excursions in δτ\delta\tau are easily obtainable, and naturally happen shortly after radiation-matter equality. Fig. 7 also shows that in principle this can be done without also requiring unacceptably large mass changes at BBN.

We regard these preliminary cosmological scenarios to be promising enough to warrant more careful exploration of their phenomenological properties, including in particular a more systematic treatment of their fluctuations near recombination and in the universe nearer by, and short of this cannot yet claim that all constraints can be satisfied. We regard the challenge of cosmological model building around these constraints (and the opportunities for new signals) to be a welcome alternative to the head-banging ordeal of seeking a cosmological constant that is technically natural.

6 Concluding Remarks

To summarize: we propose here a framework for understanding the Dark Energy density in a technically natural way that concentrates on the cancelation of the contributions to the vacuum energy coming from Standard Model particles. The framework relies on the following core ingredients:

  1. (i)

    A very supersymmetric gravity sector coupled to a Standard Model sector in which supersymmetry is badly broken and so non-linearly realised using the formalism of constrained superfields. The low-energy presence of supergravity and goldstino auxiliary fields plays an important role by imposing a supersymmetric form on the low-energy scalar potential.

  2. (ii)

    A relaxation mechanism wherein a relaxon field ϕ\phi adjusts to suppress the leading vacuum energy. Here ϕ\phi is a new light scalar field that also realizes supersymmetry nonlinearly (as does the Standard Model sector).

  3. (iii)

    Approximate accidental scale invariance under which a dilaton field τ\tau and the metric scale by constant factors, whose breaking is captured by an expansion of the action in powers of 1/τ1/\tau. The field τ\tau belongs to the gravitationally coupled supersymmetric sector and so comes with axion and dilatino partners.

In the explicit realisation presented here, a minor tuning of lagrangian parameters of order 1/601/60 allows the dilaton field to be stabilised at an exponentially large value τ1026\tau\sim 10^{26}, and this large value explains the size of the electroweak hierarchy inasmuch as Standard Model particles acquire masses of order MTeVMp/τM_{\scriptscriptstyle TeV}\sim M_{p}/\sqrt{\tau}. This exponentially large value for τ\tau is obtained naturally because the potential arises as a rational function of lnτ\ln\tau due to the generic presence of logarithms of mass ratios amongst the UV particles that are integrated out above the weak scale (along the lines proposed in a different context some time ago [37]).

The low-energy scalar potential is calculable as a series in 1/τ1/\tau of the form

V=V2τ2+V3τ3+V4τ4+V=\frac{V_{2}}{\tau^{2}}+\frac{V_{3}}{\tau^{3}}+\frac{V_{4}}{\tau^{4}}+\cdots (173)

and because Standard Model particle masses are mτ1/2m\propto\tau^{-1/2} their loops contribute δVm41/τ2\delta V\sim m^{4}\propto 1/\tau^{2} and so contribute to V2V_{2}. Strictly speaking these contributions actually arise as order m2m^{2} corrections to wXw_{\scriptscriptstyle X} [15].

The supersymmetry of the gravity sector implies V2|wX|2V_{2}\propto|w_{\scriptscriptstyle X}|^{2} is positive since it must turn off in the hypothetical limit where global supersymmetry is unbroken. The relaxation field appears in V2V_{2} and it prefers to minimize wX0w_{\scriptscriptstyle X}\to 0 in the large-τ\tau limit. In global supersymmetry it would do so by seeking V2=wX=0V_{2}=w_{\scriptscriptstyle X}=0, but in supergravity this combination instead becomes Planck suppressed. This suppression does not mean superpartners cannot remain heavy, however, because supersymmetry-breaking fields like FXW0/MpF^{\scriptscriptstyle X}\propto W_{0}/M_{p} remain at the weak scale despite being Planck suppressed.

The interplay between scale invariance and supersymmetry (as manifested in ‘extended no-scale structure’) then leads to V3V_{3} also vanishing, leaving in V4/τ4V_{4}/\tau^{4} a naturally small, positive, cosmological constant414141Note that the dilaton potential V4V_{4} must both dominate its kinetic energy and be positive so that Dark Energy dominates the energy density of the universe at late times with an acceptable equation of state. that is order (MTeV2/Mp)4(M_{\scriptscriptstyle TeV}^{2}/M_{p})^{4} (and so is the right size). The fact that supersymmetry is so mildly broken in the gravity sector allows it to protect the series form of VV given in (173), as well as the cancellation of the V3V_{3} terms.

The large value of τ\tau determines both the small cosmological-constant scale, mvac1/τm_{\rm vac}\propto 1/\tau, and the electroweak scale, mTeVMp/τm_{\scriptscriptstyle TeV}\propto M_{p}/\sqrt{\tau}, fixing their ratio to be mvac/mTeV1/τm_{\rm vac}/m_{\scriptscriptstyle TeV}\propto 1/\sqrt{\tau}. This contains the seeds of the oft-made observation that the weak scale is the geometric mean of the Planck and cosmological-constant scales.

Weinberg’s no-go argument applies (as it must since the underlying mechanism relies on scale invariance [1, 2, 3]) in the sense that quantum corrections to the scalar potential (such as subdominant powers of 1/τ1/\tau) are present. But Weinberg’s argument does not say how large these corrections must be and supersymmetry – through the cancellation of V3V_{3} – is what keeps them small.

The presence of auxiliary fields in the scalar potential required by supersymmetry in the gravity sector allows supersymmetry also to have implications for the naturalness of the relaxon ϕ\phi. This is because its mass term arises from a dimension-three operator gΦ2XWg\,\Phi^{2}X\in W rather than from a dimension-two operator in VV; involving XX because it is a strong supersymmetry-breaking effect. Having its mass come from WXW_{\scriptscriptstyle X} both ensures that the ϕ\phi mass is proportional to τ1/2\tau^{-1/2} (and so at most lies at TeV energies) and makes radiative corrections enter through gg, which is dimensionless.

Similar arguments may also apply to the Higgs boson itself if its scalar potential also arises as a contribution λH(HHvEW2)WX\lambda_{\scriptscriptstyle H}(H^{\dagger}H-v_{\scriptscriptstyle EW}^{2})\in W_{\scriptscriptstyle X}, although in the Higgs case having a mass τ1/2\propto\tau^{-1/2} is more important (and more generic). The same physics underlies both the electroweak and cosmological constant scales at a fundamental level (without need for anthropic arguments).

There are several prices we pay (plus a few opportunities we reap) for this suppression. First, the small size of the vacuum energy in the low-energy theory requires τ>1026\tau\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{26} and then this drives the axion decay constant so low that the low-energy EFT must fail at eV energies. Although plausible UV physics (such as supersymmetric large extra dimensions [45]) could plausibly intervene at these scales, it is not yet known how it does so. In particular, the UV pedigree for τ\tau is not known and so we do not know whether such large values are allowed. This is not an empty worry because if τ𝒱2/3\tau\sim{\cal V}^{2/3}, as suggested by the simplest string models, then extra-dimensional constraints preclude it from being larger than of order 102010^{20}.

A second price we pay is the large Brans-Dicke-like coupling of the dilaton to ordinary matter, which flirts with inconsistency with current tests of gravity. Although the model seems to bring its own evasion mechanism – wherein the SL(2,R)SL(2,R)-invariant axion-dilaton self-couplings divert dilaton couplings into generating harder-to-detect axionic response [42] – there are also numerous potential signals for their presence in tests of gravity and cosmology.

Some of the low-energy axio-dilaton signals might even solve problems, such as the current puzzle over apparent inconsistencies in measurements of the Hubble scale H0H_{0}. Whether these eventually kill or verify the model, we regard it as progress to trade the cosmological constant problem for exercises in late-epoch model-building, and leave a more detailed treatment of this mechanism for future research.

The remainder of this section puts our scenario into the context of earlier approaches using similar ingredients and briefly summarizes several open issues.

6.1 Relation to other approaches

All three of our ingredients have been separately used previously in related contexts, so it is useful to clarify what differs from these in our particular framework.

Low-energy supergravity

Approaches to quintessence and Dark Energy that use supergravity equations of motion424242Examples where this happens within an underlying string construction are discussed in §3.3. go back more than 20 years [125]. Even if supersymmetry is valid at high energies, there is a basic question that these constructions do not address: why should it survive down to the extremely low energies required to be relevant to quintessence, given the apparent absence of supersymmetry at the intervening energies containing the observed Standard Model particles? (For recent evaluations of quintessence models within a string framework see [126, 127].)

This question is special case of a larger problem faced even by nonsupersymmetric quintessence models: why is it legitimate to compute and use a carefully designed quintessence potential entirely within the classical approximation? This is often phrased as the statement that vacuum energies and very small scalar masses are not technically natural (see eg [2]): quantum effects cause scalar masses and potentials to change dramatically once heavier particles are integrated out, making them particularly sensitive to a system’s UV sector. This sensitivity can also undermine [128, 24] the usual low-energy arguments that justify using the classical approximation in gravitating systems.

By contrast, addressing these issues is the main motivation of the model presented here. Our tool for doing so is the explicit coupling of low-energy supergravity to a non-supersymmetric Standard Model permitted by the nonlinear realization of ref. [27] and its coupling to supergravity [28, 29, 30], since this allows one to trace the low-energy effects of integrating out non-supersymmetric sectors [15].

Relaxation mechanisms

Relaxation mechanisms that use a field’s dynamical evolution to suppress the apparent cosmological constant also have a long history [7], and more recently have been applied to the electroweak hierarchy problem [16]. We feel that none have been entirely convincing as solutions to these problems on their own, and the same would have been true for us if we had not combined relaxation with the other two ingredients. See [8] for a recent interesting proposal in this direction.

Scale invariance

Scale invariance has an equally long history, with both early applications to the cosmological constant problem and elsewhere. Early proposals assumed a scale-invariant action with scaling broken only by quantum anomalies [39], leaving open whether such examples exist. They very early ran up against no-go results [1] that identified why even completely unbroken scale invariance cannot prevent the lifting of classically flat dilaton directions.

Our approach evades some of these issues because scale invariance is only approximate, even at the classical level. The arguments of the no-go theorems broadly apply, inasmuch as the central issue is to quantify the lifting of flat directions by quantum effects. But the no-go arguments do not forbid the use of supersymmetry to suppress (though not completely eliminate) this lifting, as we do here using the low-energy distillation [17] of the extended no-scale structure found in string models [34, 35].

Approximate scale invariance has also been conjectured to be related to small cosmological constants and the existence of cosmologically light dilatons [129] and within an anthropic context [130] within string constructions. These papers (and we) both use the robust link between potentials that predict a small cosmological constant and Hubble-scale dilaton masses described in [37], though in the present paper we provide an explicit mechanism for building a potential with a naturally small vacuum energy that exploits this connection (without the need to resort to anthropic arguments).

Indeed, scale invariance is more broadly suggestive as an ingredient for solving naturalness problems because the essence of these problems is that Nature seems to be closer to scale invariance (i.e. some masses or energies are smaller) than we would normally expect. This has led to its exploration for other purposes, such as to higher-derivative theories of gravity (which are more broadly scale invariant in the UV [131], but bring associated difficulties with ghosts [132] – see however [133]). Scale invariance has also been applied to inflationary models [134] or motivating the choices required [135] to obtain inflation using only the Standard Model Higgs as the inflaton [136].

This history teaches that scale invariance, though attractive, seems to come with undesirable extras (such as ghosts in higher-derivative gravity or dangerous dilatons in the models described here). We regard the dilaton to be amongst the more benign of these options, in that its existence need not point towards a fundamental instability, though viability of the model requires threading a minefield of potential observational tests.

6.2 General implications and future directions

In the picture we paint the small observed size of the dark-energy density points towards a very rich low-energy ‘dark’ sector consisting of supersymmetric gravity, a cosmologically active axio-dilaton multiplet, a somewhat heavier gravitationally coupled relaxon plus possibly other dark ingredients. This picture leaves open a great many interesting directions worth further exploring.

  • Naturalness: The main conceptual issue is to better verify through explicit calculations that the general arguments about nonlinear realization of supersymmetry do indeed preserve the supergravity structure of the scalar potential, along the lines of [15] but more explicitly in the regime of small WXW_{\scriptscriptstyle X}.

  • Super-eV completions: τ>1026\tau\mathrel{\raise 1.29167pt\hbox{$>$\kern-8.50006pt\lower 4.30554pt\hbox{$\sim$}}}10^{26} plays an important role suppressing the vacuum energy in the low-energy 4D theory and this drives a breakdown of EFT methods at eV energies (due to the small axion decay constant). Can extra dimensions provide the UV completion that unitarizes the theory up to the electroweak scale? If so, how does τ\tau arise in this completion and can it take the large values that are needed?

  • Axio-dilaton phenomenology: The biggest phenomenological issue is to see whether the required dilaton properties could have evaded contemporary precision tests of gravity, and can be consistent with what we know about cosmology. How robust is the mechanism of [42] to dynamical issues and how sensitive is it to the detailed axion couplings? How does it stand up to more detailed studies of cosmology, including issues of structure formation? Can the baryonic wind-up mechanism of §5.2 allow the axion 𝔞{\mathfrak{a}} to contribute to (or to replace) Dark Matter? Are the predicted variations of masses over cosmological times consistent with observations? Can they help solve the current Hubble tension?

  • Dark matter Our discussion leaves open what plays the role of dark matter, which could be included in the simplest scenarios by supplementing the Standard Model sector by another non-supersymmetric particle whose mass varies as mτ1/2m\propto\tau^{-1/2}. More economical options might also be worth exploring, however, including those for which the dark matter mass varies differently with τ\tau or the option where the axion 𝔞{\mathfrak{a}} itself plays the role of dark matter, as sketched in §5.2. Could the relaxon be the dark matter? (Although the ϕ2\phi^{2}{\cal H}^{\dagger}{\cal H} coupling resembles scalar-portal dark-matter models [137] its small coupling strength is too small to allow the ϕ\phi field to be thermally produced, unlike in the minimal case.)

  • Neutrino physics Our picture potentially populates the low-energy world with a rich spectrum of weakly coupled very light fermions, and explains why they are there (some are superpartners to known light bosons, like the graviton). Their presence is a required consequence of the supersymmetry of the gravity sector (and resembles the low-energy sector of [108] in this way). Should these mix with Standard Model neutrinos they would provide natural candidates for light sterile neutrinos (along the lines of [138, 139]), and could open up new ways to express lepton-number violation at low energies, with possible implications for lepto- and baryogenesis. If super-heavy sterile fermions mix with SM neutrinos (as in the see-saw mechanism) with a Dirac mass mDτ1/2m_{\scriptscriptstyle D}\propto\tau^{-1/2} that scales like other SM masses then physical neutrino masses are order mν1/τm_{\nu}\sim 1/\tau, and so would explain the coincidence between neutrino masses and the cosmological constant scale: Vminmν4V_{\rm min}\sim m_{\nu}^{4}. What observable features do the resulting neutrino/dark-sector interactions imply?

  • Pre-BBN cosmology The picture also likely modifies pre-nucleosynthesis cosmology in a variety of ways that are worth exploring. Among these are the interplay between ϕ\phi and the Higgs {\cal H} in the scalar potential, since if the potential for these both lie within |wX|2|w_{\scriptscriptstyle X}|^{2} then only the vev of a linear combination gets fixed by the single condition wX=0w_{\scriptscriptstyle X}=0. What would this mean for late-time cosmology and/or the epoch of the electroweak phase transition?

    The field ϕ\phi also seems designed to be a good inflaton candidate [82]. After all, nonlinearly realized supersymmetry naturally provides large positive (de Sitter-like) vacuum energies (the potential’s |wX|2|w_{\scriptscriptstyle X}|^{2} term) for any value of ϕ\phi away from its minimum and ϕ\phi has been designed as a field whose evolution parameterizes changes between large nonzero wXw_{\scriptscriptstyle X} and vanishing wXw_{\scriptscriptstyle X}. The early-universe evolution of ϕ\phi while positive energies dominate therefore provides an attractive picture of inflation in which the inflaton is not completely divorced from Standard Model physics, and changes to τ\tau become correlated with changes to the size of the observable universe.434343Notice that when wX0w_{\scriptscriptstyle X}\neq 0 eq. (23), which schematically can be written as V(AwX2τ2BwXτ+C)/τ4V\sim(Aw_{\scriptscriptstyle X}^{2}\tau^{2}-Bw_{\scriptscriptstyle X}\tau+C)/\tau^{4}, allows a new type of stabilization at smaller values of τw0/wX\tau\sim w_{0}/w_{\scriptscriptstyle X}, showing that inflation can drive nontrivial evolution for the τ\tau field as well, providing a realization of large scale inflation. Once inflation ends and wX0w_{\scriptscriptstyle X}\sim 0, τ\tau can roll towards its large global minimum τ1026\tau\sim 10^{26}, correlating the early and late-time hierarchies to the inflationary growth of the volume of the observable universe (as in [141, 142]). The inflationary models found in this way come with scale invariance baked in (much like for the possible UV embeddings described in §3.3), in a way that is known to help such models agree with observations, along the lines described in [140]. We report the details of this scenario, and other ‘yoga breathing’ exercises (relaxed inflation) in a future publication.

Yoga Dark Energy ties together many of the scales of physics and so its implications are legion; further investigations are underway into several of these directions.

Acknowledgements

We thank Aizhan Akhmetzhanova, Clare Burrage, Michele Cicoli, Ed Copeland, Shanta de Alwis, Emilian Dudas, Nemanja Kaloper, Justin Khoury, Lloyd Knox, Francesco Muia, José de Jesús Padua Argüelles and Henry Tye for many helpful conversations. CB’s research was partially supported by funds from the Natural Sciences and Engineering Research Council (NSERC) of Canada. Research at the Perimeter Institute is supported in part by the Government of Canada through NSERC and by the Province of Ontario through MRI. The work of FQ has been partially supported by STFC consolidated grants ST/P000681/1, ST/T000694/1.

Appendix A Useful supergravity formulae

This appendix collects some useful formulae encountered in the main text when working with supergravity models.

One of the cumbersome steps is inverting the Kähler metric. Consider then a Kähler function of the form K=3ln𝒫K=-3\ln{\cal P}, where 𝒫=𝒫(ZA){\cal P}={\cal P}(Z^{\scriptscriptstyle A}). Then

KA=3𝒫A𝒫,KAB¯=3𝒫AB¯𝒫+3𝒫A𝒫B¯𝒫2K_{\scriptscriptstyle A}=-\frac{3{\cal P}_{\scriptscriptstyle A}}{{\cal P}}\,,\quad K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}=-\frac{3{\cal P}_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}}{{\cal P}}+\frac{3{\cal P}_{\scriptscriptstyle A}{\cal P}_{{\overline{{\scriptscriptstyle B}}}}}{{\cal P}^{2}} (174)

and so (as is easy to check) the inverse is

KA¯B=𝒫3[𝒫A¯B+𝒫A¯𝒫B𝒫P2]K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}=-\frac{{\cal P}}{3}\left[{\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}+\frac{{\cal P}^{{\overline{{\scriptscriptstyle A}}}}{\cal P}^{{\scriptscriptstyle B}}}{{\cal P}-P^{2}}\right] (175)

where P2:=𝒫A¯B𝒫A¯𝒫BP^{2}:={\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\cal P}_{{\overline{{\scriptscriptstyle A}}}}{\cal P}_{\scriptscriptstyle B} and 𝒫A¯B𝒫BC¯:=δA¯C¯{\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\cal P}_{{\scriptscriptstyle B}{\overline{{\scriptscriptstyle C}}}}:={\delta^{{\overline{{\scriptscriptstyle A}}}}}_{{\overline{{\scriptscriptstyle C}}}}. With these definitions notice that

KA¯BKB=𝒫𝒫A¯𝒫P2andKA¯BKA¯KB=3P2P2𝒫.K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{\scriptscriptstyle B}=\frac{{\cal P}{\cal P}^{{\overline{{\scriptscriptstyle A}}}}}{{\cal P}-P^{2}}\quad\hbox{and}\quad K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}=\frac{3P^{2}}{P^{2}-{\cal P}}\,. (176)

Two sectors coupled through TT

Next suppose that the fields divide up into three types: {ZA}={T,Si,za}\{Z^{\scriptscriptstyle A}\}=\{T,S^{i},z^{a}\} and 𝒫=A(T,S)+B(T,z){\cal P}=A(T,S)+B(T,z). Then the first derivatives become

𝒫T=AT+BT,𝒫i=Ai,𝒫a=Ba,{\cal P}_{\scriptscriptstyle T}=A_{\scriptscriptstyle T}+B_{\scriptscriptstyle T}\,,\quad{\cal P}_{i}=A_{i}\,,\quad{\cal P}_{a}=B_{a}\,, (177)

and the second derivatives are

𝒫TT¯=ATT¯+BTT¯,𝒫Tȷ¯=ATȷ¯,𝒫Tc¯=BTc¯,𝒫iȷ¯=Aiȷ¯,𝒫ac¯=Bac¯,{\cal P}_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}=A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\,,\quad{\cal P}_{{\scriptscriptstyle T}\bar{\jmath}}=A_{{\scriptscriptstyle T}\bar{\jmath}}\,,\quad{\cal P}_{{\scriptscriptstyle T}\bar{c}}=B_{{\scriptscriptstyle T}\bar{c}}\,,\quad{\cal P}_{i\bar{\jmath}}=A_{i\bar{\jmath}}\,,\quad{\cal P}_{a\bar{c}}=B_{a\bar{c}}\,, (178)

and 𝒫ic¯=0{\cal P}_{i\bar{c}}=0. The vector of 1st derivatives and the matrix of 2nd derivatives therefore are

𝒫A=(AT+BTAiBa)and𝒫AB¯=(ATT¯+BTT¯ATȷ¯BTc¯AiT¯Aiȷ¯0BaT¯0Bac¯).{\cal P}_{\scriptscriptstyle A}=\left(\begin{array}[]{c}A_{\scriptscriptstyle T}+B_{\scriptscriptstyle T}\\ A_{i}\\ B_{a}\end{array}\right)\quad\hbox{and}\quad{\cal P}_{{\scriptscriptstyle A}\bar{\scriptscriptstyle B}}=\left(\begin{array}[]{ccccc}A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}&&A_{{\scriptscriptstyle T}\bar{\jmath}}&&B_{{\scriptscriptstyle T}\bar{c}}\\ A_{i{\overline{{\scriptscriptstyle T}}}}&&A_{i\bar{\jmath}}&&0\\ B_{a{\overline{{\scriptscriptstyle T}}}}&&0&&B_{a\bar{c}}\end{array}\right)\,. (179)

The inverse of the matrix of second derivatives is then

𝒫A¯B=(ααAjαBcαAı¯Aı¯j+αAı¯AjαAı¯BcαBa¯αAjBa¯Ba¯c+αBa¯Bc),{\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}=\left(\begin{array}[]{ccccc}\alpha&&-\alpha A^{j}&&-\alpha B^{c}\\ -\alpha A^{\bar{\imath}}&&A^{\bar{\imath}j}+\alpha A^{\bar{\imath}}A^{j}&&\alpha A^{\bar{\imath}}B^{c}\\ -\alpha B^{\bar{a}}&&\alpha A^{j}B^{\bar{a}}&&B^{\bar{a}c}+\alpha B^{\bar{a}}B^{c}\end{array}\right)\,, (180)

where Aj:=Aı¯jATı¯A^{j}:=A^{\bar{\imath}j}A_{{\scriptscriptstyle T}\bar{\imath}}, Ai:=Aȷ¯iAiT¯A^{i}:=A^{\bar{\jmath}i}A_{i{\overline{{\scriptscriptstyle T}}}}, Ba:=Bc¯aBTc¯B^{a}:=B^{\bar{c}a}B_{{\scriptscriptstyle T}\bar{c}}, Ba:=Bc¯aAaT¯B^{a}:=B^{\bar{c}a}A_{a{\overline{{\scriptscriptstyle T}}}} and α\alpha is given by

1α:=ATT¯+BTT¯Aı¯jATı¯AjT¯Ba¯cBTa¯BcT¯.\frac{1}{\alpha}:=A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-A^{\bar{\imath}j}A_{{\scriptscriptstyle T}\bar{\imath}}A_{j{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}}\,. (181)

In this case

𝒫A¯=𝒫A¯B𝒫B=α(AT+BTAiAiBaBaAı¯(ATT¯+BTT¯ATBT)Ba¯(ATT¯+BTT¯ATBT)),{\cal P}^{{\overline{{\scriptscriptstyle A}}}}={\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\cal P}_{\scriptscriptstyle B}=\alpha\left(\begin{array}[]{c}A_{\scriptscriptstyle T}+B_{\scriptscriptstyle T}-A^{i}A_{i}-B^{a}B_{a}\\ A^{\bar{\imath}}(A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-A_{\scriptscriptstyle T}-B_{\scriptscriptstyle T})\\ B^{\bar{a}}(A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-A_{\scriptscriptstyle T}-B_{\scriptscriptstyle T})\end{array}\right)\,, (182)

which uses (181) when simplifying the result (and in particular to extract an overall factor of α\alpha). Finally

P2=𝒫A¯𝒫A¯=α[(AT¯+BT¯)(AT+BTAiAiBaBa)+(AiAi+BaBa)(ATT¯+BTT¯ATBT)],P^{2}={\cal P}^{{\overline{{\scriptscriptstyle A}}}}{\cal P}_{{\overline{{\scriptscriptstyle A}}}}=\alpha\Bigl{[}(A_{{\overline{{\scriptscriptstyle T}}}}+B_{{\overline{{\scriptscriptstyle T}}}})(A_{\scriptscriptstyle T}+B_{\scriptscriptstyle T}-A^{i}A_{i}-B^{a}B_{a})+(A^{i}A_{i}+B^{a}B_{a})(A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-A_{\scriptscriptstyle T}-B_{\scriptscriptstyle T})\Bigr{]}\,, (183)

One no-scale sector

With no-scale models in mind, suppose now that AA is independent of Im TT and Im SiS^{i} and that AA is a homogeneous degree-one function, so A(λSi,λT)=λA(Si,T)A(\lambda S^{i},\lambda T)=\lambda A(S^{i},T). When these are true we can write

A(S,T)=:τF(xi)withxi:=σiτ,A(S,T)=:\tau\,F(x^{i})\quad\hbox{with}\quad x^{i}:=\frac{\sigma^{i}}{\tau}\,, (184)

with τ:=T+T¯\tau:=T+{\overline{T}} and σi:=Si+S¯i\sigma^{i}:=S^{i}+{{\overline{S}}}^{i} so that

AT=FσiFiτ,Ai=FiA_{\scriptscriptstyle T}=F-\frac{\sigma^{i}F_{i}}{\tau}\,,\quad A_{i}=F_{i} (185)

where Fi:=F/xiF_{i}:=\partial F/\partial x^{i} and so on. The second derivatives then become

ATT¯=σiσjFijτ3,ATȷ¯=σiFijτ2,Aiȷ¯=Fijτand soAi¯j=τFij,A_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}=\frac{\sigma^{i}\sigma^{j}F_{ij}}{\tau^{3}}\,,\quad A_{{\scriptscriptstyle T}\bar{\jmath}}=-\frac{\sigma^{i}F_{ij}}{\tau^{2}}\,,\quad A_{i\bar{\jmath}}=\frac{F_{ij}}{\tau}\quad\hbox{and so}\quad A^{\bar{i}j}=\tau F^{ij}\,, (186)

where FijFjk=δikF^{ij}F_{jk}={\delta^{i}}_{k}. It follows that

Aj=Aı¯jATı¯=σjτ,Aı¯=Aı¯jAjT¯=σiτ,Aı¯jATı¯AjT¯=Fijσiσjτ3A^{j}=A^{\bar{\imath}j}A_{{\scriptscriptstyle T}\bar{\imath}}=-\frac{\sigma^{j}}{\tau}\,,\quad A^{\bar{\imath}}=A^{\bar{\imath}j}A_{j{\overline{{\scriptscriptstyle T}}}}=-\frac{\sigma^{i}}{\tau}\,,\quad A^{\bar{\imath}j}A_{{\scriptscriptstyle T}\bar{\imath}}A_{j{\overline{{\scriptscriptstyle T}}}}=\frac{F_{ij}\sigma^{i}\sigma^{j}}{\tau^{3}} (187)

so that

𝒫A=(FσiFiτ+BTFiBa)and𝒫AB¯=(Fijσiσjτ3+BTT¯σiFijτ2BTc¯σjFijτ2Fijτ0BaT¯0Bac¯).{\cal P}_{\scriptscriptstyle A}=\left(\begin{array}[]{c}F-\frac{\sigma^{i}F_{i}}{\tau}+B_{\scriptscriptstyle T}\\ F_{i}\\ B_{a}\end{array}\right)\quad\hbox{and}\quad{\cal P}_{{\scriptscriptstyle A}\bar{\scriptscriptstyle B}}=\left(\begin{array}[]{ccccc}\frac{F_{ij}\sigma^{i}\sigma^{j}}{\tau^{3}}+B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}&&-\frac{\sigma^{i}F_{ij}}{\tau^{2}}&&B_{{\scriptscriptstyle T}\bar{c}}\\ -\frac{\sigma^{j}F_{ij}}{\tau^{2}}&&\frac{F_{ij}}{\tau}&&0\\ B_{a{\overline{{\scriptscriptstyle T}}}}&&0&&B_{a\bar{c}}\end{array}\right)\,. (188)

and so

𝒫A¯C=(ααAjαBcαAı¯Aı¯j+αAı¯AjαAı¯BcαBa¯αAjBa¯Ba¯c+αBa¯Bc)=(αασj/ταBcασi/ττFij+α(σiσj/τ2)ασiBc/ταBa¯ασjBa¯/τBa¯c+αBa¯Bc),{\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle C}}=\left(\begin{array}[]{ccccc}\alpha&&-\alpha A^{j}&&-\alpha B^{c}\\ -\alpha A_{\bar{\imath}}&&A^{\bar{\imath}j}+\alpha A^{\bar{\imath}}A^{j}&&\alpha A^{\bar{\imath}}B^{c}\\ -\alpha B^{\bar{a}}&&\alpha A^{j}B^{\bar{a}}&&B^{\bar{a}c}+\alpha B^{\bar{a}}B^{c}\end{array}\right)=\left(\begin{array}[]{ccccc}\alpha&&\alpha\sigma^{j}/\tau&&-\alpha B^{c}\\ \alpha\sigma^{i}/\tau&&\tau F^{ij}+\alpha(\sigma^{i}\sigma^{j}/\tau^{2})&&-\alpha\sigma^{i}B^{c}/\tau\\ -\alpha B^{\bar{a}}&&-\alpha\sigma^{j}B^{\bar{a}}/\tau&&B^{\bar{a}c}+\alpha B^{\bar{a}}B^{c}\end{array}\right)\,, (189)

where

1α=BTT¯Ba¯cBTa¯BcT¯\frac{1}{\alpha}=B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}} (190)

Therefore

𝒫A¯=𝒫A¯B𝒫B=(α(F+BTBaBa)τFijFj+ατσi(F+BTBaBa)Ba¯αBa¯(F+BTBaBa)),{\cal P}^{{\overline{{\scriptscriptstyle A}}}}={\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\cal P}_{\scriptscriptstyle B}=\left(\begin{array}[]{c}\alpha(F+B_{\scriptscriptstyle T}-B^{a}B_{a})\\ \tau F^{ij}F_{j}+\frac{\alpha}{\tau}\,\sigma^{i}(F+B_{\scriptscriptstyle T}-B^{a}B_{a})\\ B^{\bar{a}}-\alpha B^{\bar{a}}(F+B_{\scriptscriptstyle T}-B^{a}B_{a})\end{array}\right)\,, (191)

and so

P2\displaystyle P^{2} =\displaystyle= τFijFiFj+BaBa+α|F+BTBaBa|2\displaystyle\tau F^{ij}F_{i}F_{j}+B^{a}B_{a}+\alpha|F+B_{\scriptscriptstyle T}-B^{a}B_{a}|^{2} (192)
=\displaystyle= τFijFiFj+α[|F+BT|2+(BTT¯2F2BT)BaBa].\displaystyle\tau F^{ij}F_{i}F_{j}+\alpha\Bigl{[}|F+B_{\scriptscriptstyle T}|^{2}+(B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-2F-2B_{\scriptscriptstyle T})B^{a}B_{a}\Bigr{]}\,.

Limiting cases

We can check the above against known examples.

No zaz^{a} sector

As a check, this should become a no-scale model in the absence of the zaz^{a} sector. Setting B=0B=0 and dropping the zaz^{a} fields, we have 𝒫=A{\cal P}=A and so

𝒫A=(ATAi)=(F(σiFi/τ)Fi)and𝒫A¯B=(αασj/τασi/ττFij+α(σiσj/τ2)),{\cal P}_{\scriptscriptstyle A}=\left(\begin{array}[]{c}A_{\scriptscriptstyle T}\\ A_{i}\end{array}\right)=\left(\begin{array}[]{c}F-(\sigma^{i}F_{i}/\tau)\\ F_{i}\end{array}\right)\quad\hbox{and}\quad{\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}=\left(\begin{array}[]{ccc}\alpha&&\alpha\sigma^{j}/\tau\\ \alpha\sigma^{i}/\tau&&\tau F^{ij}+\alpha(\sigma^{i}\sigma^{j}/\tau^{2})\end{array}\right)\,, (193)

Ignoring the fact that α\alpha\to\infty temporarily, it follows that

𝒫A¯=𝒫A¯B𝒫B=(αFαFσiτ+τFijFj)and soP2=αF2+τFijFiFj.{\cal P}^{{\overline{{\scriptscriptstyle A}}}}={\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\cal P}_{\scriptscriptstyle B}=\left(\begin{array}[]{c}\alpha F\\ \frac{\alpha F\sigma^{i}}{\tau}+\tau F^{ij}F_{j}\end{array}\right)\quad\hbox{and so}\quad P^{2}=\alpha F^{2}+\tau F^{ij}F_{i}F_{j}\,. (194)

Taking now α\alpha\to\infty means that P2P^{2}\to\infty as well and so (176) becomes

KA¯BKA¯KB=3P2P2𝒫3,K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}=\frac{3P^{2}}{P^{2}-{\cal P}}\to 3\,, (195)

as is required for a no-scale model.

No SiS^{i} fields

As a second check, imagine there are no SiS^{i} fields, and that A=τA=\tau and so 𝒫=τ+B(τ,za,z¯b){\cal P}=\tau+B(\tau,z^{a},\bar{z}^{b}), in which case the above should reduce to the model of the main text (without modulus stabilization). In this case F=1F=1 and so AT=1A_{\scriptscriptstyle T}=1 and Ai=0A_{i}=0 are to be used in the above formulae, implying in particular that all second derivatives of AA vanish. As a consequence

𝒫A=(1+BTBa)and𝒫AB¯=(BTT¯BTc¯BaT¯Bac¯).{\cal P}_{\scriptscriptstyle A}=\left(\begin{array}[]{c}1+B_{\scriptscriptstyle T}\\ B_{a}\end{array}\right)\quad\hbox{and}\quad{\cal P}_{{\scriptscriptstyle A}\bar{\scriptscriptstyle B}}=\left(\begin{array}[]{ccc}B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}&&B_{{\scriptscriptstyle T}\bar{c}}\\ B_{a{\overline{{\scriptscriptstyle T}}}}&&B_{a\bar{c}}\end{array}\right)\,. (196)

and so

𝒫A¯B=(ααBcαBa¯Ba¯c+αBa¯Bc)where1α=BTT¯Ba¯cBTa¯BcT¯.{\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}=\left(\begin{array}[]{ccc}\alpha&&-\alpha B^{c}\\ -\alpha B^{\bar{a}}&&B^{\bar{a}c}+\alpha B^{\bar{a}}B^{c}\end{array}\right)\quad\hbox{where}\quad\frac{1}{\alpha}=B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}}\,. (197)

Therefore

𝒫A¯=𝒫A¯B𝒫B=(α(1+BTBaBa)Ba¯αBa¯(1+BTBaBa)),{\cal P}^{{\overline{{\scriptscriptstyle A}}}}={\cal P}^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\cal P}_{\scriptscriptstyle B}=\left(\begin{array}[]{c}\alpha(1+B_{\scriptscriptstyle T}-B^{a}B_{a})\\ B^{\bar{a}}-\alpha B^{\bar{a}}(1+B_{\scriptscriptstyle T}-B^{a}B_{a})\end{array}\right)\,, (198)

and so (183) becomes

P2=α[(1+BT¯)(1+BTBaBa)+BaBa(BTT¯1BT)].P^{2}=\alpha\Bigl{[}(1+B_{{\overline{{\scriptscriptstyle T}}}})(1+B_{\scriptscriptstyle T}-B^{a}B_{a})+B^{a}B_{a}(B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-1-B_{\scriptscriptstyle T})\Bigr{]}\,. (199)

These expressions imply

KA¯BKA¯KB3=3𝒫P2𝒫\displaystyle K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}-3=\frac{3{\cal P}}{P^{2}-{\cal P}}
=3(τ+B)[BTT¯Ba¯cBTa¯BcT¯](1+BT¯)(1+BTBaBa)+BaBa(BTT¯1BT)(τ+B)[BTT¯Ba¯cBTa¯BcT¯],\displaystyle\qquad=\frac{3(\tau+B)[B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}}]}{(1+B_{{\overline{{\scriptscriptstyle T}}}})(1+B_{\scriptscriptstyle T}-B^{a}B_{a})+B^{a}B_{a}(B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-1-B_{\scriptscriptstyle T})-(\tau+B)[B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}}]}\,,

which vanishes if BB is independent of τ\tau, since then P2=α(12BaBa)P^{2}=\alpha(1-2B^{a}B_{a}) and α\alpha\to\infty and so again KA¯BKA¯KB=3P2/(P2𝒫)3K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}={3P^{2}}/{(P^{2}-{\cal P})}\to 3.

For the rest of the lagrangian we also require KAK_{\scriptscriptstyle A}, KAB¯K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}} and KA¯BK^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}} separately, though only to leading order in 1/τ1/\tau since for these the no-scale cancellations do not occur. For these purposes we can use BT𝒪(1/τ)B_{\scriptscriptstyle T}\sim{\cal O}(1/\tau) and BTT¯𝒪(1/τ2)B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\sim{\cal O}(1/\tau^{2}). For instance, eq. (174) becomes

KA=(KTKa)=3𝒫(1+BTBa)3𝒫(1Ba).K_{\scriptscriptstyle A}=\left(\begin{array}[]{c}K_{\scriptscriptstyle T}\\ K_{a}\end{array}\right)=-\frac{3}{{\cal P}}\left(\begin{array}[]{c}1+B_{\scriptscriptstyle T}\\ B_{a}\end{array}\right)\simeq-\frac{3}{{\cal P}}\left(\begin{array}[]{c}1\\ B_{a}\end{array}\right)\,. (201)

Eq. (175) similarly becomes

KAB¯\displaystyle K_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}} =\displaystyle= 3𝒫AB¯𝒫+3𝒫A𝒫B¯𝒫2\displaystyle-\frac{3{\cal P}_{{\scriptscriptstyle A}{\overline{{\scriptscriptstyle B}}}}}{{\cal P}}+\frac{3{\cal P}_{\scriptscriptstyle A}{\cal P}_{{\overline{{\scriptscriptstyle B}}}}}{{\cal P}^{2}} (206)
=\displaystyle= 3𝒫(BTT¯BTc¯BaT¯Bac¯)+3𝒫2(|1+BT|2(1+BT)Bc¯(1+BT¯)BaBaBc¯)\displaystyle-\frac{3}{{\cal P}}\left(\begin{array}[]{ccc}B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}&&B_{{\scriptscriptstyle T}\bar{c}}\\ B_{a{\overline{{\scriptscriptstyle T}}}}&&B_{a\bar{c}}\end{array}\right)+\frac{3}{{\cal P}^{2}}\left(\begin{array}[]{ccc}|1+B_{\scriptscriptstyle T}|^{2}&&(1+B_{\scriptscriptstyle T})B_{\bar{c}}\\ (1+B_{{\overline{{\scriptscriptstyle T}}}})B_{a}&&B_{a}B_{\bar{c}}\end{array}\right)
\displaystyle\simeq 3𝒫(1/𝒫BTc¯Bc¯/𝒫BaT¯Ba/𝒫Bac¯)=3𝒫(1/𝒫Xc¯XaBac¯).\displaystyle-\frac{3}{{\cal P}}\left(\begin{array}[]{ccc}-1/{\cal P}&&B_{{\scriptscriptstyle T}\bar{c}}-B_{\bar{c}}/{\cal P}\\ B_{a{\overline{{\scriptscriptstyle T}}}}-B_{a}/{\cal P}&&B_{a\bar{c}}\end{array}\right)=-\frac{3}{{\cal P}}\left(\begin{array}[]{ccc}-1/{\cal P}&&X_{\bar{c}}\\ X_{a}&&B_{a\bar{c}}\end{array}\right)\,. (211)

where the last equality defines the quantities XaX_{a} and Xc¯X_{\bar{c}}. As is easy to check the inverse of this last matrix is

KA¯B𝒫3(𝒫X2+1)(𝒫𝒫Xb𝒫Xa¯(𝒫X2+1)Ba¯b𝒫Xa¯Xb)K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}\simeq-\frac{{\cal P}}{3({\cal P}X^{2}+1)}\left(\begin{array}[]{ccc}-{\cal P}&&{\cal P}X^{b}\\ {\cal P}X^{\bar{a}}&&({\cal P}X^{2}+1)B^{\bar{a}b}-{\cal P}X^{\bar{a}}X^{b}\end{array}\right) (212)

where Ba¯bBbc¯=δc¯a¯B^{\bar{a}b}B_{b\bar{c}}=\delta^{\bar{a}}_{\bar{c}} and Xa¯:=Ba¯bXbX^{\bar{a}}:=B^{\bar{a}b}X_{b} while Xb=Ba¯bXa¯X^{b}=B^{\bar{a}b}X_{\bar{a}} and X2=XaXa=Xa¯Xa¯=Ba¯bXa¯XbX^{2}=X_{a}X^{a}=X_{\bar{a}}X^{\bar{a}}=B^{\bar{a}b}X_{\bar{a}}X_{b} and so on. Keeping only the leading powers of 1/𝒫1/{\cal P} and taking XaX_{a} and Xc¯X_{\bar{c}} to be order 1/τ1/\tau and Bac¯B_{a\bar{c}} to be 𝒪(1){\cal O}(1) we have X2=𝒪(1/τ2)X^{2}={\cal O}(1/\tau^{2}) and so 𝒫X2+11{\cal P}X^{2}+1\simeq 1. The inverse therefore becomes approximately

KA¯B𝒫3(𝒫𝒫Xb𝒫Xa¯Ba¯b)K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}\simeq-\frac{{\cal P}}{3}\left(\begin{array}[]{ccc}-{\cal P}&&{\cal P}X^{b}\\ {\cal P}X^{\bar{a}}&&B^{\bar{a}b}\end{array}\right) (213)

and so

KA¯BKB\displaystyle K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{\scriptscriptstyle B} \displaystyle\simeq (𝒫𝒫Xb𝒫Xa¯Ba¯b)(1Bb)=(𝒫(1+XbBb)𝒫Xa¯+Ba¯bBb)\displaystyle\left(\begin{array}[]{ccc}-{\cal P}&&{\cal P}X^{b}\\ {\cal P}X^{\bar{a}}&&B^{\bar{a}b}\end{array}\right)\left(\begin{array}[]{c}1\\ B_{b}\end{array}\right)=\left(\begin{array}[]{c}{\cal P}(-1+X^{b}B_{b})\\ {\cal P}X^{\bar{a}}+B^{\bar{a}b}B_{b}\end{array}\right) (220)
=\displaystyle= (𝒫(1+Ba¯bBTa¯Bb)Ba¯bBa¯Bb𝒫Ba¯bBbT¯)𝒫(1Ba¯bBbT¯).\displaystyle\left(\begin{array}[]{c}{\cal P}(-1+B^{\bar{a}b}B_{{\scriptscriptstyle T}\bar{a}}B_{b})-B^{\bar{a}b}B_{\bar{a}}B_{b}\\ {\cal P}B^{\bar{a}b}B_{b{\overline{{\scriptscriptstyle T}}}}\end{array}\right)\simeq{\cal P}\left(\begin{array}[]{c}-1\\ B^{\bar{a}b}B_{b{\overline{{\scriptscriptstyle T}}}}\end{array}\right)\,. (225)

With these same approximations eq. (A) becomes

KA¯BKA¯KB33𝒫(BTT¯Ba¯cBTa¯BcT¯)12Ba¯bBa¯Bb𝒪(τ1),K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}-3\simeq\frac{3{\cal P}(B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}})}{1-2B^{\bar{a}b}B_{\bar{a}}B_{b}}\sim{\cal O}(\tau^{-1})\,,

All that matters for the scalar potential is the superpotential and its first derivative, and so we take the superpotential to have the form

W=w0+wazaW=w_{0}+w_{a}z^{a} (226)

and evaluate the result at za=0z^{a}=0 (this is particularly appropriate for X{za}X\in\{z^{a}\}). Both w0w_{0} and waw_{a} are imagined to depend on TT only as functions of lnT\ln T. The ordinary derivatives of WW then are WT=(w0+waza)/TW_{\scriptscriptstyle T}=(w^{\prime}_{0}+w^{\prime}_{a}z^{a})/T and Wa=waW_{a}=w_{a} (where primes denote differentiation with respect to lnT\ln T). The Kähler covariant derivatives are DAW:=WA+KAWD_{\scriptscriptstyle A}W:=W_{\scriptscriptstyle A}+K_{\scriptscriptstyle A}W, and so (evaluated at za=0z^{a}=0)

DaW\displaystyle D_{a}W =\displaystyle= Wa+KaW=wa3Baw0𝒫\displaystyle W_{a}+K_{a}W=w_{a}-\frac{3B_{a}w_{0}}{{\cal P}}
andDTW\displaystyle\hbox{and}\quad D_{\scriptscriptstyle T}W =\displaystyle= WT+KTW=w0T3w0𝒫(1+BT)2w0τ3w0τ+𝒪(1/τ2).\displaystyle W_{\scriptscriptstyle T}+K_{\scriptscriptstyle T}W=\frac{w_{0}^{\prime}}{T}-\frac{3w_{0}}{{\cal P}}(1+B_{\scriptscriptstyle T})\simeq\frac{2w_{0}^{\prime}}{\tau}-\frac{3w_{0}}{\tau}+{\cal O}(1/\tau^{2})\,. (227)

The combination appearing in the FF-term potential then is

KA¯BDAW¯DBW3|W|2\displaystyle K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\overline{D_{\scriptscriptstyle A}W}}D_{\scriptscriptstyle B}W-3|W|^{2} =\displaystyle= KA¯B[WA¯WB+KA¯W¯WB+KBWA¯W]+(KA¯BKA¯KB3)|W|2\displaystyle K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}\Bigl{[}{\overline{W_{\scriptscriptstyle A}}}W_{\scriptscriptstyle B}+K_{{\overline{{\scriptscriptstyle A}}}}{\overline{W}}W_{\scriptscriptstyle B}+K_{\scriptscriptstyle B}{\overline{W_{\scriptscriptstyle A}}}W\Bigr{]}+(K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}-3)|W|^{2}
=\displaystyle= KT¯TWT¯WT+KT¯aWaW¯T+Ka¯TW¯aWT+KT¯BKBWWT¯+KB¯TKB¯W¯WT\displaystyle K^{{\overline{{\scriptscriptstyle T}}}{\scriptscriptstyle T}}{\overline{W_{\scriptscriptstyle T}}}W_{\scriptscriptstyle T}+K^{{\overline{{\scriptscriptstyle T}}}a}W_{a}{\overline{W}}_{\scriptscriptstyle T}+K^{\bar{a}{\scriptscriptstyle T}}{\overline{W}}_{a}W_{\scriptscriptstyle T}+K^{{\overline{{\scriptscriptstyle T}}}{\scriptscriptstyle B}}K_{\scriptscriptstyle B}W{\overline{W_{\scriptscriptstyle T}}}+K^{{\overline{{\scriptscriptstyle B}}}{\scriptscriptstyle T}}K_{{\overline{{\scriptscriptstyle B}}}}{\overline{W}}W_{\scriptscriptstyle T}
+Ka¯bWa¯Wb+Ka¯BKBWWa¯+KB¯aKB¯W¯Wa+(KA¯BKA¯KB3)|W|2\displaystyle\quad+K^{\bar{a}b}{\overline{W_{a}}}W_{b}+K^{\bar{a}{\scriptscriptstyle B}}K_{\scriptscriptstyle B}W{\overline{W_{a}}}+K^{{\overline{{\scriptscriptstyle B}}}a}K_{{\overline{{\scriptscriptstyle B}}}}{\overline{W}}W_{a}+(K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}-3)|W|^{2}
\displaystyle\simeq 𝒫23|w0T|2𝒫23[Ba¯b(BbT¯Bb/𝒫)wa¯w0T+c.c.]𝒫(w0w0¯T+c.c.)\displaystyle\frac{{\cal P}^{2}}{3}\left|\frac{w_{0}^{\prime}}{T}\right|^{2}-\frac{{\cal P}^{2}}{3}\left[\frac{B^{\bar{a}b}(B_{b{\overline{{\scriptscriptstyle T}}}}-B_{b}/{\cal P}){\overline{w_{a}}}w_{0}^{\prime}}{T}+\hbox{c.c.}\right]-{\cal P}\left(\frac{w_{0}^{\prime}\,{\overline{w_{0}}}}{T}+\hbox{c.c.}\right)
𝒫3Ba¯bwa¯wb+𝒫Ba¯bBbT¯w0wa¯+𝒫Ba¯bBTa¯wbw0¯\displaystyle\qquad\qquad-\frac{{\cal P}}{3}B^{\bar{a}b}{\overline{w_{a}}}w_{b}+{\cal P}B^{\bar{a}b}B_{b{\overline{{\scriptscriptstyle T}}}}w_{0}{\overline{w_{a}}}+{\cal P}B^{\bar{a}b}B_{{\scriptscriptstyle T}\bar{a}}w_{b}{\overline{w_{0}}}
+BTT¯Ba¯cBTa¯BcT¯12Ba¯bBa¯Bb(3𝒫|w0|2),\displaystyle\qquad\qquad\qquad\qquad+\frac{B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}}}{1-2B^{\bar{a}b}B_{\bar{a}}B_{b}}\Bigl{(}3{\cal P}|w_{0}|^{2}\Bigr{)}\,,

and so VFV_{\scriptscriptstyle F} is given by

VF=eK[KA¯BDAW¯DBW3|W|2]=1𝒫3[KA¯BDAW¯DBW3|W|2]=V2+V3+V4+,V_{\scriptscriptstyle F}=e^{K}\Bigl{[}K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\overline{D_{\scriptscriptstyle A}W}}D_{\scriptscriptstyle B}W-3|W|^{2}\Bigr{]}=\frac{1}{{\cal P}^{3}}\Bigl{[}K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}{\overline{D_{\scriptscriptstyle A}W}}D_{\scriptscriptstyle B}W-3|W|^{2}\Bigr{]}=V_{2}+V_{3}+V_{4}+\cdots\,, (229)

where VnV_{n} is of order τn\tau^{-n} and explicitly

V2\displaystyle V_{2} \displaystyle\simeq 1𝒫2[13Ba¯bwa¯wb]\displaystyle\frac{1}{{\cal P}^{2}}\left[-\frac{1}{3}\,B^{\bar{a}b}{\overline{w_{a}}}w_{b}\right]
V3\displaystyle V_{3} \displaystyle\simeq 1𝒫2[Ba¯bBbT¯w0wa¯+Ba¯bBTa¯wbw0¯\displaystyle\frac{1}{{\cal P}^{2}}\left[B^{\bar{a}b}B_{b{\overline{{\scriptscriptstyle T}}}}w_{0}{\overline{w_{a}}}+B^{\bar{a}b}B_{{\scriptscriptstyle T}\bar{a}}w_{b}{\overline{w_{0}}}\phantom{\frac{1}{2}}\right.
+𝒫3|w0T|2𝒫3[Ba¯b(BbT¯Bb/𝒫)wa¯w0T+c.c.](w0w0¯T+c.c.)]\displaystyle\qquad\qquad\left.+\frac{{\cal P}}{3}\left|\frac{w_{0}^{\prime}}{T}\right|^{2}-\frac{{\cal P}}{3}\left[\frac{B^{\bar{a}b}(B_{b{\overline{{\scriptscriptstyle T}}}}-B_{b}/{\cal P}){\overline{w_{a}}}w_{0}^{\prime}}{T}+\hbox{c.c.}\right]-\left(\frac{w_{0}^{\prime}\,{\overline{w_{0}}}}{T}+\hbox{c.c.}\right)\right]
V4\displaystyle V_{4} \displaystyle\simeq 1𝒫2[BTT¯Ba¯cBTa¯BcT¯12Ba¯bBa¯Bb(3|w0|2)],\displaystyle\frac{1}{{\cal P}^{2}}\left[\frac{B_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-B^{\bar{a}c}B_{{\scriptscriptstyle T}\bar{a}}B_{c{\overline{{\scriptscriptstyle T}}}}}{1-2B^{\bar{a}b}B_{\bar{a}}B_{b}}\Bigl{(}3|w_{0}|^{2}\Bigr{)}\right]\,,

and so on. These show that the wa¯wb{\overline{w_{a}}}w_{b} term arises at order τ2\tau^{-2} while the w0¯wb{\overline{w_{0}}}w_{b} terms are at order τ3\tau^{-3} and the |w0|2|w_{0}|^{2} term is order τ4\tau^{-4}.

Specializing to the case where {za}=X\{z^{a}\}=X and writing B=kB=-k so that 𝒫=τk{\cal P}=\tau-k the above potential becomes

VF1𝒫2[13kX¯XwX¯wX+kX¯XkXT¯w0wX¯+kX¯XkTX¯wXw0¯kTT¯kX¯XkTX¯kXT¯1+2kXX¯kXkX¯(3|w0|2)],V_{\scriptscriptstyle F}\simeq\frac{1}{{\cal P}^{2}}\left[\frac{1}{3}\,k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}{\overline{w_{\scriptscriptstyle X}}}w_{\scriptscriptstyle X}+k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}w_{0}{\overline{w_{\scriptscriptstyle X}}}+k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}w_{\scriptscriptstyle X}{\overline{w_{0}}}-\frac{k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}}{1+2k^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}k_{\scriptscriptstyle X}k_{{\overline{{\scriptscriptstyle X}}}}}\Bigl{(}3|w_{0}|^{2}\Bigr{)}\right]\,, (230)

which has the no-scale form that ensures TT is a flat direction when kT=0k_{\scriptscriptstyle T}=0. This potential also has no mixing between w0w_{0} and wXw_{\scriptscriptstyle X} whenever kXT¯=0k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}=0, and this is why in our simple models stabilization with respect to variations of wXw_{\scriptscriptstyle X} always led to wX=0w_{\scriptscriptstyle X}=0.

In a scenario where wXw_{\scriptscriptstyle X} is a function of Φ\Phi and so can be minimized, we see (as mentioned above) that the minimization occurs at wX=0w_{\scriptscriptstyle X}=0 when kTX¯=0k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}=0. More generally extremizing with respect to wX¯{\overline{w_{\scriptscriptstyle X}}} gives the saddle point

wX=3kXT¯w0,w_{\scriptscriptstyle X}=-3k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}w_{0}\,, (231)

and evaluated at this point the potential becomes

VF3|w0|2τ2[kX¯XkXT¯kTX¯+kTT¯kX¯XkTX¯kXT¯1+2kXX¯kXkX¯],V_{\scriptscriptstyle F}\simeq-\frac{3|w_{0}|^{2}}{\tau^{2}}\left[k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}+\frac{k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}-k^{{\overline{{\scriptscriptstyle X}}}{\scriptscriptstyle X}}k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle X}}}}k_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle T}}}}}{1+2k^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}k_{\scriptscriptstyle X}k_{{\overline{{\scriptscriptstyle X}}}}}\right]\,, (232)

where both terms are order 1/τ41/\tau^{4}. Notice that the kinetic term for TT is controlled by KTT¯K_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} and this is independent of kTT¯k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} once corrections in powers of 1/τ1/\tau are dropped, and as a result kTT¯k_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} can have either sign, allowing the potential to be positive.

No-scale Kähler geometry

In this appendix we compute some of the geometrical quantities that arise in the main text for a general two-field no-scale geometry. To this end consider a general no-scale system with two fields SS and TT and Kähler potential

K=3Mp2ln[τ(σ/τ)]=3Mp2[lnτ+F(σ/τ)]K=-3M_{p}^{2}\ln\Big{[}\tau\,{\cal F}(\sigma/\tau)\Bigr{]}=-3M_{p}^{2}\Bigl{[}\ln\tau+F(\sigma/\tau)\Bigr{]} (233)

where τ=T+T¯\tau=T+{\overline{T}} and σ=S+S¯\sigma=S+{\overline{S}} and =eF{\cal F}=e^{F} is otherwise arbitrary. For such a choice we have

A2=eK/(3Mp2)=1τ(σ/τ)A^{2}=e^{K/(3M_{p}^{2})}=\frac{1}{\tau\,{\cal F}(\sigma/\tau)} (234)

and the first derivatives of KK are

KT=3Mp2[1τ+FT]=3Mp2τ[1xF(x)]andKS=3Mp2FS=3Mp2F(x)τ,K_{\scriptscriptstyle T}=-3M_{p}^{2}\left[\frac{1}{\tau}+F_{\scriptscriptstyle T}\right]=-\frac{3M_{p}^{2}}{\tau}\Bigl{[}1-x\,F^{\prime}(x)\Bigr{]}\quad\hbox{and}\quad K_{\scriptscriptstyle S}=-3M_{p}^{2}F_{\scriptscriptstyle S}=-\frac{3M_{p}^{2}F^{\prime}(x)}{\tau}\,, (235)

where x:=σ/τx:=\sigma/\tau.

The components of the Kähler metric then are

KSS¯\displaystyle K_{{\scriptscriptstyle S}{\overline{{\scriptscriptstyle S}}}} =\displaystyle= 3Mp2FSS¯=3Mp2F′′(x)τ2\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle S}{\overline{{\scriptscriptstyle S}}}}=-\frac{3M_{p}^{2}F^{\prime\prime}(x)}{\tau^{2}}
KST¯\displaystyle K_{{\scriptscriptstyle S}{\overline{{\scriptscriptstyle T}}}} =\displaystyle= 3Mp2FST¯=3Mp2τ2[F(x)+xF′′(x)]\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle S}{\overline{{\scriptscriptstyle T}}}}=\frac{3M_{p}^{2}}{\tau^{2}}\Bigl{[}F^{\prime}(x)+xF^{\prime\prime}(x)\Bigr{]} (236)
KTT¯\displaystyle K_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} =\displaystyle= 3Mp2[1τ2FTT¯]=3Mp2τ2[12xF(x)x2F′′(x)]\displaystyle 3M_{p}^{2}\left[\frac{1}{\tau^{2}}-F_{{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\right]=\frac{3M_{p}^{2}}{\tau^{2}}\Bigl{[}1-2xF^{\prime}(x)-x^{2}F^{\prime\prime}(x)\Bigr{]}

and so the inverse metric has components

KS¯S\displaystyle K^{{\overline{{\scriptscriptstyle S}}}{\scriptscriptstyle S}} =\displaystyle= τ23Mp2[F′′+(F)2][1+2xF+x2F′′]\displaystyle\frac{\tau^{2}}{3M_{p}^{2}[F^{\prime\prime}+(F^{\prime})^{2}]}\Bigl{[}-1+2xF^{\prime}+x^{2}F^{\prime\prime}\Bigr{]}
KT¯S\displaystyle K^{{\overline{{\scriptscriptstyle T}}}{\scriptscriptstyle S}} =\displaystyle= τ23Mp2[F′′+(F)2][F+xF′′]\displaystyle\frac{\tau^{2}}{3M_{p}^{2}[F^{\prime\prime}+(F^{\prime})^{2}]}\Bigl{[}F^{\prime}+xF^{\prime\prime}\Bigr{]} (237)
KT¯T\displaystyle K^{{\overline{{\scriptscriptstyle T}}}{\scriptscriptstyle T}} =\displaystyle= τ2F′′3Mp2[F′′+(F)2].\displaystyle\frac{\tau^{2}F^{\prime\prime}}{3M_{p}^{2}[F^{\prime\prime}+(F^{\prime})^{2}]}\,.

Notice that these imply that the combination ηa\eta^{a} that controls the coupling to matter is

KA¯TKA¯=τandKA¯SKA¯=σ,K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle T}}K_{{\overline{{\scriptscriptstyle A}}}}=-\tau\quad\hbox{and}\quad K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle S}}K_{{\overline{{\scriptscriptstyle A}}}}=-\sigma\,, (238)

completely independent of the function F(x)F(x) (in agreement with the general scaling proof given for more general models in [32, 17]). As a check notice that (235) and (238) together imply KA¯BKA¯KB=3Mp2K^{{\overline{{\scriptscriptstyle A}}}{\scriptscriptstyle B}}K_{{\overline{{\scriptscriptstyle A}}}}K_{\scriptscriptstyle B}=3M_{p}^{2} as required for a no-scale model.

The target-space Christoffel symbols that appear in the scalar field equations require the third derivatives:

KSSS¯\displaystyle K_{{\scriptscriptstyle S}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle S}}}} =\displaystyle= 3Mp2FSSS¯=3Mp2F′′′(x)τ3\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle S}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle S}}}}=-\frac{3M_{p}^{2}F^{\prime\prime\prime}(x)}{\tau^{3}}
KSST¯\displaystyle K_{{\scriptscriptstyle S}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle T}}}} =\displaystyle= 3Mp2FSST¯=3Mp2τ3[2F′′(x)+xF′′′(x)]\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle S}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle T}}}}=\frac{3M_{p}^{2}}{\tau^{3}}\Bigl{[}2F^{\prime\prime}(x)+xF^{\prime\prime\prime}(x)\Bigr{]} (239)
KSTT¯\displaystyle K_{{\scriptscriptstyle S}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} =\displaystyle= 3Mp2FSTT¯=3Mp2τ3[2F(x)+4xF′′(x)+x2F′′′(x)]\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle S}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}=-\frac{3M_{p}^{2}}{\tau^{3}}\Bigl{[}2F^{\prime}(x)+4xF^{\prime\prime}(x)+x^{2}F^{\prime\prime\prime}(x)\Bigr{]}

and

KTSS¯\displaystyle K_{{\scriptscriptstyle T}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle S}}}} =\displaystyle= 3Mp2FTSS¯=3Mp2τ3[2F′′(x)+xF′′′(x)]\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle T}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle S}}}}=\frac{3M_{p}^{2}}{\tau^{3}}\Bigl{[}2F^{\prime\prime}(x)+xF^{\prime\prime\prime}(x)\Bigr{]}
KTST¯\displaystyle K_{{\scriptscriptstyle T}{\scriptscriptstyle S}{\overline{{\scriptscriptstyle T}}}} =\displaystyle= 3Mp2FSTT¯=3Mp2τ3[2F(x)+4xF′′(x)+x2F′′′(x)]\displaystyle-3M_{p}^{2}F_{{\scriptscriptstyle S}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}=-\frac{3M_{p}^{2}}{\tau^{3}}\Bigl{[}2F^{\prime}(x)+4xF^{\prime\prime}(x)+x^{2}F^{\prime\prime\prime}(x)\Bigr{]} (240)
KTTT¯\displaystyle K_{{\scriptscriptstyle T}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}} =\displaystyle= 3Mp2[2τ3+FTTT¯]=3Mp2τ3[26xF(x)6x2F′′(x)x3F′′′(x)]\displaystyle-3M_{p}^{2}\left[\frac{2}{\tau^{3}}+F_{{\scriptscriptstyle T}{\scriptscriptstyle T}{\overline{{\scriptscriptstyle T}}}}\right]=-\frac{3M_{p}^{2}}{\tau^{3}}\Bigl{[}2-6xF^{\prime}(x)-6x^{2}F^{\prime\prime}(x)-x^{3}F^{\prime\prime\prime}(x)\Bigr{]}

and so

ΓSTT\displaystyle\Gamma^{\scriptscriptstyle T}_{{\scriptscriptstyle S}{\scriptscriptstyle T}} =\displaystyle= 2x(F′′)2+xFF′′′τ[F′′+(F)2]\displaystyle\frac{-2x(F^{\prime\prime})^{2}+xF^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]}
ΓSST\displaystyle\Gamma^{\scriptscriptstyle T}_{{\scriptscriptstyle S}{\scriptscriptstyle S}} =\displaystyle= 2(F′′)2FF′′′τ[F′′+(F)2]\displaystyle\frac{2(F^{\prime\prime})^{2}-F^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]} (241)
ΓSTS\displaystyle\Gamma^{\scriptscriptstyle S}_{{\scriptscriptstyle S}{\scriptscriptstyle T}} =\displaystyle= 2(F)22F′′2xFF′′xF′′′2x2(F′′)2+x2FF′′′τ[F′′+(F)2]\displaystyle\frac{-2(F^{\prime})^{2}-2F^{\prime\prime}-2xF^{\prime}F^{\prime\prime}-xF^{\prime\prime\prime}-2x^{2}(F^{\prime\prime})^{2}+x^{2}F^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]}
ΓSSS\displaystyle\Gamma^{\scriptscriptstyle S}_{{\scriptscriptstyle S}{\scriptscriptstyle S}} =\displaystyle= 2FF′′+F′′′+2x(F′′)2xFF′′′τ[F′′+(F)2]\displaystyle\frac{2F^{\prime}F^{\prime\prime}+F^{\prime\prime\prime}+2x(F^{\prime\prime})^{2}-xF^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]}

and

ΓTTT\displaystyle\Gamma^{\scriptscriptstyle T}_{{\scriptscriptstyle T}{\scriptscriptstyle T}} =\displaystyle= 2(F)22F′′+2x2(F′′)2x2FF′′′τ[F′′+(F)2]\displaystyle\frac{-2(F^{\prime})^{2}-2F^{\prime\prime}+2x^{2}(F^{\prime\prime})^{2}-x^{2}F^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]}
ΓTST\displaystyle\Gamma^{\scriptscriptstyle T}_{{\scriptscriptstyle T}{\scriptscriptstyle S}} =\displaystyle= 2x(F′′)2+xFF′′′τ[F′′+(F)2]\displaystyle\frac{-2x(F^{\prime\prime})^{2}+xF^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]} (242)
ΓTTS\displaystyle\Gamma^{\scriptscriptstyle S}_{{\scriptscriptstyle T}{\scriptscriptstyle T}} =\displaystyle= 2x(F)2+2xF′′+2x2FF′′+x2F′′′+2x3(F′′)2x3FF′′′τ[F′′+(F)2]\displaystyle\frac{2x(F^{\prime})^{2}+2xF^{\prime\prime}+2x^{2}F^{\prime}F^{\prime\prime}+x^{2}F^{\prime\prime\prime}+2x^{3}(F^{\prime\prime})^{2}-x^{3}F^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]}
ΓTSS\displaystyle\Gamma^{\scriptscriptstyle S}_{{\scriptscriptstyle T}{\scriptscriptstyle S}} =\displaystyle= 2(F)22F′′2xFF′′xF′′′2x2(F′′)2+x2FF′′′τ[F′′+(F)2].\displaystyle\frac{-2(F^{\prime})^{2}-2F^{\prime\prime}-2xF^{\prime}F^{\prime\prime}-xF^{\prime\prime\prime}-2x^{2}(F^{\prime\prime})^{2}+x^{2}F^{\prime}F^{\prime\prime\prime}}{\tau[F^{\prime\prime}+(F^{\prime})^{2}]}\,.

Appendix B Goldstino production

These notes estimate the size of gravitino production and show how the dilaton couplings appear in the gravitino/goldstino equivalence theorem. To this end we use the component lagrangian describing the coupings of the gravitino to the goldstino GXG\in X and a proxy for a SM fermion χY\chi\in Y, using constrained fields satisfying X2=XY=0X^{2}=XY=0.

The Kähler potential K=3ln(τk)K=-3\ln(\tau-k) where kk is

k\displaystyle k =\displaystyle= X¯X+Y¯Y+𝔟2(YY¯2+Y¯Y2)+𝔠4Y2Y¯2+𝔢2(XY¯2+X¯Y2)\displaystyle{\overline{X}}X+{\overline{Y}}Y+\frac{{\mathfrak{b}}}{2}\Bigl{(}Y{\overline{Y}}^{2}+{\overline{Y}}Y^{2}\Bigr{)}+\frac{{\mathfrak{c}}}{4}\,Y^{2}{\overline{Y}}^{2}+\frac{{\mathfrak{e}}}{2}\Bigl{(}X{\overline{Y}}^{2}+{\overline{X}}Y^{2}\Bigr{)} (243)
+μ(X+X¯)+𝔨Y+𝔪2Y2,\displaystyle\qquad\qquad\qquad\qquad\qquad\qquad+\mu(X+{\overline{X}})+{\mathfrak{k}}Y+\frac{{\mathfrak{m}}}{2}\,Y^{2}\,,

while the superpotential is as before:

W=W0+𝔣X+𝔤Y+𝔥2(Y2+Y¯2).W=W_{0}+{\mathfrak{f}}\,X+{\mathfrak{g}}\,Y+\frac{{\mathfrak{h}}}{2}\,(Y^{2}+{\overline{Y}}^{2})\,. (244)

The arguments of the main text show that the gravitino mass is given by

m3/2=eK/2|W|Mp2=𝔉3Mp,m_{3/2}=e^{K/2}\frac{|W|}{M_{p}^{2}}=\frac{{\mathfrak{F}}}{\sqrt{3}\,M_{p}}\,, (245)

where

𝔉=[𝒢JI¯FJF¯I]1/2=[eKKJI¯DJWDIW¯]1/2|W0|τ3/2MpMp2τ.{\mathfrak{F}}=\Bigl{[}{\cal G}_{{\scriptscriptstyle J}{\overline{{\scriptscriptstyle I}}}}F^{\scriptscriptstyle J}{\overline{F}}^{\scriptscriptstyle I}\Bigr{]}^{1/2}=\Bigl{[}e^{K}K^{{\scriptscriptstyle J}{\overline{{\scriptscriptstyle I}}}}D_{\scriptscriptstyle J}W{\overline{D_{\scriptscriptstyle I}W}}\Bigr{]}^{1/2}\sim\frac{|W_{0}|}{\tau^{3/2}M_{p}}\sim\frac{M_{p}^{2}}{\tau}\,. (246)

Jordan frame component action

The component form of the lagrangian is again given by expressions similar to those given in the gauge G=0G=0 by [29]. The terms that come from the FF term include the masses

Fg~12[𝔥χ¯γLχ+W0Mg2ψ¯μγ~μνγLψν+h.c.]\frac{{\cal L}_{\scriptscriptstyle F}}{\sqrt{-\tilde{g}}}\ni-\frac{1}{2}\left[{\mathfrak{h}}\,{\overline{\chi}}\gamma_{\scriptscriptstyle L}\chi+\frac{W_{0}}{M_{g}^{2}}\,{\overline{\psi}}_{\mu}\tilde{\gamma}^{\mu\nu}\gamma_{\scriptscriptstyle L}\psi_{\nu}+\hbox{h.c.}\right] (247)

where the tilde on g~\sqrt{-\tilde{g}} and γ~μν=γabe~aμe~bν\tilde{\gamma}^{\mu\nu}=\gamma^{ab}{\tilde{e}_{a}}^{\mu}\,{\tilde{e}_{b}}^{\nu} emphasize their dependence on the Jordan-frame metric g~μν\tilde{g}_{\mu\nu}. The scalar potential vanishes, V=0V=0, when kk is independent of τ\tau, because the system is a no-scale model.

The kinetic and 4-fermi terms similarly arise from the DD-terms and so come with the prefactor eK/3=τk+e^{-K/3}=\tau-k+\cdots:

Dg~\displaystyle\frac{{\cal L}_{\scriptscriptstyle D}}{\sqrt{-\tilde{g}}} \displaystyle\ni eK/3[Mg22R~^+i2ϵ~μνλρψ¯μγ5γ~νDλψρ+12KYY¯χ¯~ D /χ]+(4-fermi terms)\displaystyle-e^{-K/3}\left[\frac{M_{g}^{2}}{2}\,\widehat{\tilde{R}}+\frac{i}{2}\,\tilde{\epsilon}^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{5}\tilde{\gamma}_{\nu}D_{\lambda}\psi_{\rho}+\frac{1}{2}\,K_{{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}}{\overline{\chi}}\tilde{}\hbox to0.0pt{\hbox to5.00002pt{\hfil$D$\hfil}\hss}/\chi\right]+\hbox{(4-fermi terms)} (248)
\displaystyle\simeq τ[Mg22R~^+i2ϵ~μνλρψ¯μγ5γ~νDλψρ+32τχ¯~ D /χ]+(4-fermi terms)\displaystyle-\tau\left[\frac{M_{g}^{2}}{2}\,\widehat{\tilde{R}}+\frac{i}{2}\,\tilde{\epsilon}^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{5}\tilde{\gamma}_{\nu}D_{\lambda}\psi_{\rho}+\frac{3}{2\tau}\,{\overline{\chi}}\tilde{}\hbox to0.0pt{\hbox to5.00002pt{\hfil$D$\hfil}\hss}/\chi\right]+\hbox{(4-fermi terms)}

which shows that the canonically normalized gravitino is given in Jordan frame by ψ^μτψμ\hat{\psi}_{\mu}\simeq\sqrt{\tau}\;\psi_{\mu}. The 4-fermion interactions similarly have the schematic form

4fermig~\displaystyle\frac{{\cal L}_{\rm 4-fermi}}{\sqrt{-\tilde{g}}} =\displaystyle= eK/3[(KYY¯KYY¯8Mg2+KYYY¯Y¯KYY¯KYY¯Y¯KYYY¯KXX¯KXY¯Y¯KYYX¯)(χ¯γLχ)(χ¯γRχ)\displaystyle e^{-K/3}\left[\left(-\frac{K_{{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}}K_{{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}}}{8M_{g}^{2}}+K_{{\scriptscriptstyle Y}{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}{\overline{{\scriptscriptstyle Y}}}}-K^{{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}}K_{{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}{\overline{{\scriptscriptstyle Y}}}}K_{{\scriptscriptstyle Y}{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}}-K^{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle X}}}}K_{{\scriptscriptstyle X}{\overline{{\scriptscriptstyle Y}}}{\overline{{\scriptscriptstyle Y}}}}K_{{\scriptscriptstyle Y}{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle X}}}}\right)({\overline{\chi}}\gamma_{\scriptscriptstyle L}\chi)({\overline{\chi}}\gamma_{\scriptscriptstyle R}\chi)\right.
+KYY¯4Mg2(ϵ~μνλρψ¯μγLγ~νψλ+ψ¯μγLγ~ρψμ)(χ¯γLγ~ρχ)]\displaystyle\qquad\qquad\left.+\frac{K_{{\scriptscriptstyle Y}{\overline{{\scriptscriptstyle Y}}}}}{4M_{g}^{2}}\Bigl{(}\tilde{\epsilon}^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\tilde{\gamma}_{\nu}\psi_{\lambda}+{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\tilde{\gamma}^{\rho}\psi^{\mu}\Bigr{)}({\overline{\chi}}\gamma_{\scriptscriptstyle L}\tilde{\gamma}_{\rho}\chi)\right]
\displaystyle\simeq τ[(98Mg2τ2+3𝔠τ3(|𝔢|2+|𝔟|2)τ)(χ¯γLχ)(χ¯γRχ)\displaystyle\tau\left[\left(-\frac{9}{8M_{g}^{2}\tau^{2}}+\frac{3{\mathfrak{c}}}{\tau}-\frac{3(|{\mathfrak{e}}|^{2}+|{\mathfrak{b}}|^{2})}{\tau}\right)({\overline{\chi}}\gamma_{\scriptscriptstyle L}\chi)({\overline{\chi}}\gamma_{\scriptscriptstyle R}\chi)\right.
+34Mg2τ(ϵ~μνλρψ¯μγLγ~νψλ+ψ¯μγLγ~ρψμ)(χ¯γLγ~ρχ)],\displaystyle\qquad\qquad\left.+\frac{3}{4M_{g}^{2}\tau}\Bigl{(}\tilde{\epsilon}^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\tilde{\gamma}_{\nu}\psi_{\lambda}+{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\tilde{\gamma}^{\rho}\psi^{\mu}\Bigr{)}({\overline{\chi}}\gamma_{\scriptscriptstyle L}\tilde{\gamma}_{\rho}\chi)\right]\,,

and all we need use is that the ψ2χ2\psi^{2}\chi^{2} terms have a τ\tau-independent coupling proportional to 1/Mg21/M_{g}^{2}. Notice that both of the 1/Mg21/M_{g}^{2} couplings end up proportional to 1/(Mg2τ)1/Mp21/(M_{g}^{2}\tau)\simeq 1/M_{p}^{2} when written in terms of the canonically normalized fermions, where the coefficient of the Einstein-Hilbert term is taken to define the Planck mass, Mg2τ=Mp2M_{g}^{2}\tau=M_{p}^{2}, at least at present day where τ=τ0\tau=\tau_{0}.

Einstein frame component action

Next we repeat the above estimate directly in Einstein frame, being careful to not change the units of the present-day metric:

g~μν=e(KK0)/3gμν=τ0τgμν.\tilde{g}_{\mu\nu}=e^{(K-K_{0})/3}g_{\mu\nu}=\frac{\tau_{0}}{\tau}\,g_{\mu\nu}\,. (250)

The component form of the lagrangian is then obtained by making this metric substitution, which also implies

g~=ge2(KK0)/3=g(τ0τ)2ande~aμ=eaμττ0,\sqrt{-\tilde{g}}=\sqrt{-g}\;e^{2(K-K_{0})/3}=\sqrt{-g}\;\left(\frac{\tau_{0}}{\tau}\right)^{2}\quad\hbox{and}\quad{\tilde{e}_{a}}^{\mu}={e_{a}}^{\mu}\sqrt{\frac{\tau}{\tau_{0}}}\,, (251)

and so

Fg12(τ0τ)2[(𝔥𝔢𝔣)χ¯γLχ+W0Mg2(ττ0)ψ¯μγμνγLψν+h.c.].\frac{{\cal L}_{\scriptscriptstyle F}}{\sqrt{-g}}\ni-\frac{1}{2}\left(\frac{\tau_{0}}{\tau}\right)^{2}\left[\left({\mathfrak{h}}-{\mathfrak{e}}{\mathfrak{f}}\right)\,{\overline{\chi}}\gamma_{\scriptscriptstyle L}\chi+\frac{W_{0}}{M_{g}^{2}}\left(\frac{\tau}{\tau_{0}}\right){\overline{\psi}}_{\mu}\gamma^{\mu\nu}\gamma_{\scriptscriptstyle L}\psi_{\nu}+\hbox{h.c.}\right]\,. (252)

Similarly

Dgτ0[Mg22R^+i2(ττ0)1/2ϵμνλρψ¯μγ5γνDλψρ]32(τ0τ)3/2χ¯ / Dχ+(4-fermi terms),\frac{{\cal L}_{\scriptscriptstyle D}}{\sqrt{-g}}\ni-\tau_{0}\left[\frac{M_{g}^{2}}{2}\,\widehat{R}+\frac{i}{2}\left(\frac{\tau}{\tau_{0}}\right)^{1/2}\epsilon^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{5}\gamma_{\nu}D_{\lambda}\psi_{\rho}\right]-\frac{3}{2}\left(\frac{\tau_{0}}{\tau}\right)^{3/2}{\overline{\chi}}\hbox to0.0pt{\hbox to8.55695pt{\hfil/\hfil}\hss}D\chi+\hbox{(4-fermi terms)}\,, (253)

which shows that Mp2=Mg2τ0M_{p}^{2}=M_{g}^{2}\tau_{0} is the physical Planck scale (as expected).

The canonically normalized gravitino is given in Einstein frame by ψ^μ(ττ0)1/4ψμ\hat{\psi}_{\mu}\simeq(\tau\tau_{0})^{1/4}\;\psi_{\mu} and the canonically normalized spin-half particle is χ^(τ0/τ)3/4χ\hat{\chi}\sim(\tau_{0}/\tau)^{3/4}\chi. Using these in (252) verifies that the Einstein-frame fermion masses are mχ𝔥/τm_{\chi}\propto{\mathfrak{h}}/\sqrt{\tau} and m3/2W0/τ3/2m_{3/2}\propto W_{0}/\tau^{3/2}, as expected. The 4-fermion interactions of (B) similarly become

4fermig~\displaystyle\frac{{\cal L}_{\rm 4-fermi}}{\sqrt{-\tilde{g}}} \displaystyle\simeq (τ0τ)2[(98Mg2τ+3𝔠3(|𝔢|2+|𝔟|2))(χ¯γLχ)(χ¯γRχ)\displaystyle\left(\frac{\tau_{0}}{\tau}\right)^{2}\left[\left(-\frac{9}{8M_{g}^{2}\tau}+3{\mathfrak{c}}-3(|{\mathfrak{e}}|^{2}+|{\mathfrak{b}}|^{2})\right)({\overline{\chi}}\gamma_{\scriptscriptstyle L}\chi)({\overline{\chi}}\gamma_{\scriptscriptstyle R}\chi)\right.
+34Mg2(ττ0)(ϵ~μνλρψ¯μγLγ~νψλ+ψ¯μγLγ~ρψμ)(χ¯γLγ~ρχ)],\displaystyle\qquad\qquad\left.+\frac{3}{4M_{g}^{2}}\left(\frac{\tau}{\tau_{0}}\right)\Bigl{(}\tilde{\epsilon}^{\mu\nu\lambda\rho}{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\tilde{\gamma}_{\nu}\psi_{\lambda}+{\overline{\psi}}_{\mu}\gamma_{\scriptscriptstyle L}\tilde{\gamma}^{\rho}\psi^{\mu}\Bigr{)}({\overline{\chi}}\gamma_{\scriptscriptstyle L}\tilde{\gamma}_{\rho}\chi)\right]\,,

and so once written in terms of canonically normalized fermions the ψ2χ2\psi^{2}\chi^{2} terms have a τ\tau-dependent coupling proportional to

1Mg2(τ0τ)1ττ0(ττ0)3/2=1Mg2τ(ττ0)=1Mp2(ττ0)\frac{1}{M_{g}^{2}}\left(\frac{\tau_{0}}{\tau}\right)\frac{1}{\sqrt{\tau\tau_{0}}}\left(\frac{\tau}{\tau_{0}}\right)^{3/2}=\frac{1}{M_{g}^{2}\tau}\left(\frac{\tau}{\tau_{0}}\right)=\frac{1}{M_{p}^{2}}\left(\frac{\tau}{\tau_{0}}\right) (255)

and so is Planck-suppressed (as expected).

Gravitino/goldstino production

The estimate for the cross section for gravitino production, χχψψ\chi\chi\to\psi\psi, using the above Einstein-frame action is given in the regime Em3/2E\gg m_{3/2} by the result

dσdΩE2(1Mp2)2(E2m3/22)2E6(Mpm3/2)4=E6𝔉4,\frac{{\rm d}\sigma}{{\rm d}\Omega}\sim E^{2}\left(\frac{1}{M_{p}^{2}}\right)^{2}\left(\frac{E^{2}}{m_{3/2}^{2}}\right)^{2}\sim\frac{E^{6}}{(M_{p}m_{3/2})^{4}}=\frac{E^{6}}{{\mathfrak{F}}^{4}}\,, (256)

where the factors (E/m3/2)2(E/m_{3/2})^{2} come from the gravitino spin sums 𝒮μν(k)=λuμ(k)v¯ν(k){\cal S}_{\mu\nu}(k)=\sum_{\lambda}u_{\mu}(k)\bar{v}_{\nu}(k). This shows how the equivalence theorem gives the production rate of longitudinal graviniti in terms of the effective goldstino decay constant 𝔉Mp/τ{\mathfrak{F}}\sim M_{p}/\sqrt{\tau}. The production rate is acceptably small because this is at the TeV scale.

References

  • [1] S. Weinberg, “The Cosmological Constant Problem,” Rev. Mod. Phys. 61 (1989), 1-23
  • [2] C. P. Burgess, “The Cosmological Constant Problem: Why it’s hard to get Dark Energy from Micro-physics,” [arXiv:1309.4133 [hep-th]].
  • [3] P. J. E. Peebles and B. Ratra, “The Cosmological Constant and Dark Energy,” Rev. Mod. Phys. 75 (2003), 559-606 [arXiv:astro-ph/0207347 [astro-ph]].
  • [4] E. J. Copeland, M. Sami and S. Tsujikawa, “Dynamics of dark energy,” Int. J. Mod. Phys. D 15 (2006), 1753-1936 [arXiv:hep-th/0603057 [hep-th]].
  • [5] S. Perlmutter et al. [Supernova Cosmology Project], “Measurements of Ω\Omega and Λ\Lambda from 42 high redshift supernovae,” Astrophys. J. 517 (1999), 565-586 [arXiv:astro-ph/9812133 [astro-ph]];
    A. G. Riess et al. [Supernova Search Team], “Observational evidence from supernovae for an accelerating universe and a cosmological constant,” Astron. J. 116 (1998), 1009-1038 [arXiv:astro-ph/9805201 [astro-ph]].
  • [6] P. A. R. Ade et al. [Planck], “Planck 2015 results. XIV. Dark energy and modified gravity,” Astron. Astrophys. 594 (2016), A14 [arXiv:1502.01590 [astro-ph.CO]].
  • [7] L. F. Abbott, “A Mechanism for Reducing the Value of the Cosmological Constant,” Phys. Lett. B 150 (1985), 427-430;
    J. D. Brown and C. Teitelboim, “Neutralization of the Cosmological Constant by Membrane Creation,” Nucl. Phys. B 297 (1988), 787-836; “Dynamical Neutralization of the Cosmological Constant,” Phys. Lett. B 195 (1987), 177-182;
  • [8] P. W. Graham, D. E. Kaplan and S. Rajendran, “Relaxation of the Cosmological Constant,” Phys. Rev. D 100 (2019) no.1, 015048 [arXiv:1902.06793 [hep-ph]].
  • [9] R. Bousso and J. Polchinski, “Quantization of four form fluxes and dynamical neutralization of the cosmological constant,” JHEP 06 (2000), 006 [arXiv:hep-th/0004134 [hep-th]];
    J. Polchinski, “The Cosmological Constant and the String Landscape,” [arXiv:hep-th/0603249 [hep-th]].
  • [10] J. Khoury and O. Parrikar, “Search Optimization, Funnel Topography, and Dynamical Criticality on the String Landscape,” JCAP 12 (2019), 014 [arXiv:1907.07693 [hep-th]];
    G. Kartvelishvili, J. Khoury and A. Sharma, “The Self-Organized Critical Multiverse,” JCAP 02 (2021), 028 [arXiv:2003.12594 [hep-th]];
    G. F. Giudice, M. McCullough and T. You, “Self-Organised Localisation,” [arXiv:2105.08617 [hep-ph]].
  • [11] S. Y. Li, Y. C. Qiu and S. H. H. Tye, “Standard Model from A Supergravity Model with a Naturally Small Cosmological Constant,” JHEP 05 (2021), 181 [arXiv:2010.10089 [hep-th]].
  • [12] N. Kaloper and A. Padilla, “Sequestering the Standard Model Vacuum Energy,” Phys. Rev. Lett. 112 (2014) no.9, 091304 doi:[arXiv:1309.6562 [hep-th]].
  • [13] N. Kaloper, A. Padilla, D. Stefanyszyn and G. Zahariade, “Manifestly Local Theory of Vacuum Energy Sequestering,” Phys. Rev. Lett. 116 (2016) no.5, 051302 doi:[arXiv:1505.01492 [hep-th]].
  • [14] E. J. Copeland, S. Ghataore, F. Niedermann and A. Padilla, “Generalised scalar-tensor theories and self-tuning,” [arXiv:2111.11448 [gr-qc]].
  • [15] C.P. Burgess and F. Quevedo, “Who’s Afraid of the (Supersymmetric) Dark? The Standard Model vs Low-Energy Supergravity” [arXiv:2110.13275 [hep-th]].
  • [16] P. W. Graham, D. E. Kaplan and S. Rajendran, “Cosmological Relaxation of the Electroweak Scale,” Phys. Rev. Lett. 115 (2015) no.22, 221801 [arXiv:1504.07551 [hep-ph]].
  • [17] C. P. Burgess, M. Cicoli, D. Ciupke, S. Krippendorf and F. Quevedo, “UV Shadows in EFTs: Accidental Symmetries, Robustness and No-Scale Supergravity,” Fortsch. Phys. 68 (2020) no.10, 2000076 [arXiv:2006.06694 [hep-th]].
  • [18] A. Salam and E. Sezgin, “Supergravities In Diverse Dimensions. Vol. 1, 2,” Amsterdam, Netherlands: North-Holland (1989) 1499 p. Singapore, Singapore: World Scientific (1989) 1499 p
  • [19] Y. Aghababaie, C. P. Burgess, J. M. Cline, H. Firouzjahi, S. L. Parameswaran, F. Quevedo, G. Tasinato and I. Zavala, “Warped brane worlds in six-dimensional supergravity,” JHEP 0309 (2003) 037 [hep-th/0308064];
  • [20] A. J. Tolley, C. P. Burgess, C. de Rham and D. Hoover, “Scaling solutions to 6D gauged chiral supergravity,” New J. Phys. 8 (2006), 324 [arXiv:hep-th/0608083 [hep-th]].
  • [21] C. P. Burgess, A. Maharana, L. van Nierop, A. A. Nizami and F. Quevedo, “On Brane Back-Reaction and de Sitter Solutions in Higher-Dimensional Supergravity,” JHEP 1204 (2012) 018 [arXiv:1109.0532 [hep-th]];
  • [22] F. F. Gautason, D. Junghans and M. Zagermann, “Cosmological Constant, Near Brane Behavior and Singularities,” JHEP 1309 (2013) 123 [arXiv:1301.5647 [hep-th]].
  • [23] E. Witten, “Three lectures on topological phases of matter,” Riv. Nuovo Cim. 39 (2016) no.7, 313-370 [arXiv:1510.07698 [cond-mat.mes-hall]];
    D. Tong, “Lectures on the Quantum Hall Effect,” [arXiv:1606.06687 [hep-th]],//
  • [24] C. P. Burgess, “Introduction to Effective Field Theory,” Cambridge University Press (December 2020).
  • [25] S. Bielleman, L. E. Ibanez and I. Valenzuela, “Minkowski 3-forms, Flux String Vacua, Axion Stability and Naturalness,” JHEP 12 (2015), 119 [arXiv:1507.06793 [hep-th]]..
  • [26] C. P. Burgess, R. Diener and M. Williams, “Self-Tuning at Large (Distances): 4D Description of Runaway Dilaton Capture,” JHEP 10 (2015), 177 [arXiv:1509.04209 [hep-th]].
  • [27] Z. Komargodski and N. Seiberg, “From Linear SUSY to Constrained Superfields,” JHEP 09 (2009), 066 [arXiv:0907.2441 [hep-th]].
  • [28] E. A. Bergshoeff, D. Z. Freedman, R. Kallosh and A. Van Proeyen, “Pure de Sitter Supergravity,” Phys. Rev. D 92 (2015) no.8, 085040 [erratum: Phys. Rev. D 93 (2016) no.6, 069901] [arXiv:1507.08264 [hep-th]].
  • [29] G. Dall’Agata and F. Farakos, “Constrained superfields in Supergravity,” JHEP 02 (2016), 101 [arXiv:1512.02158 [hep-th]].
  • [30] M. Schillo, E. van der Woerd and T. Wrase, “The general de Sitter supergravity component action,” Fortsch. Phys. 64 (2016), 292-302 [arXiv:1511.01542 [hep-th]].
  • [31] Y. Aghababaie, C. P. Burgess, S. L. Parameswaran and F. Quevedo, “SUSY breaking and moduli stabilization from fluxes in gauged 6-D supergravity,” JHEP 03 (2003), 032 [arXiv:hep-th/0212091 [hep-th]].
  • [32] C. P. Burgess, J. M. Cline and M. Postma, “Axionic D3-D7 Inflation,” JHEP 03 (2009), 058 [arXiv:0811.1503 [hep-th]].
  • [33] E. Cremmer, S. Ferrara, C. Kounnas and D. V. Nanopoulos, “Naturally Vanishing Cosmological Constant in N=1 Supergravity,” Phys. Lett. B 133 (1983), 61
  • [34] M. Berg, M. Haack and B. Kors, “String loop corrections to Kahler potentials in orientifolds,” JHEP 11 (2005), 030 [arXiv:hep-th/0508043 [hep-th]].
  • [35] M. Cicoli, J. P. Conlon and F. Quevedo, “Systematics of String Loop Corrections in Type IIB Calabi-Yau Flux Compactifications,” JHEP 01 (2008), 052 [arXiv:0708.1873 [hep-th]];
  • [36] R. Barbieri, E. Cremmer and S. Ferrara, “Flat and Positive Potentials in N=1N=1 Supergravity,” Phys. Lett. B 163 (1985), 143
  • [37] A. Albrecht, C. P. Burgess, F. Ravndal and C. Skordis, “Natural quintessence and large extra dimensions,” Phys. Rev. D 65 (2002), 123507 [arXiv:astro-ph/0107573 [astro-ph]].
  • [38] B. Ratra and P. J. E. Peebles, “Cosmological Consequences of a Rolling Homogeneous Scalar Field,” Phys. Rev. D 37 (1988), 3406.
  • [39] R. D. Peccei, J. Sola and C. Wetterich, “Adjusting the Cosmological Constant Dynamically: Cosmons and a New Force Weaker Than Gravity,” Phys. Lett. B 195 (1987), 183-190;
    C. Wetterich, “Cosmology and the Fate of Dilatation Symmetry,” Nucl. Phys. B 302 (1988), 668-696 [arXiv:1711.03844 [hep-th]].
  • [40] P. Jordan, “Schwerkraft und Weltall,” Vieweg und Sohn (1955);
    C.H. Brans and R.H. Dicke, R. H., “Mach’s Principle and a Relativistic Theory of Gravitation,” Phys. Rev. 124 (1961) 925–935;
    C.H. Brans, “Jordan-Brans-Dicke Theory,” Scholarpedia 9(4) (2014) 31358.
  • [41] C. M. Will, “The Confrontation between general relativity and experiment,” Living Rev. Rel. 9 (2006), 3 [arXiv:gr-qc/0510072 [gr-qc]];
    C. M. Will, “Putting General Relativity to the Test: Twentieth-Century Highlights and Twenty-First-Century Prospects,” Einstein Stud. 14 (2018), 81-96
  • [42] C.P. Burgess and F. Quevedo, “Axion Homeopathy: How Small-but-Nonzero Matter-Axion Couplings Cure Large Dilaton Interactions,” [arXiv:2110.10352 [hep-th]].
  • [43] W. L. Freedman, “Cosmology at a Crossroads,” Nature Astron. 1 (2017), 0121 [arXiv:1706.02739 [astro-ph.CO]];
    E. Di Valentino, O. Mena, S. Pan, L. Visinelli, W. Yang, A. Melchiorri, D. F. Mota, A. G. Riess and J. Silk, “In the realm of the Hubble tension—a review of solutions,” Class. Quant. Grav. 38 (2021) no.15, 153001 [arXiv:2103.01183 [astro-ph.CO]]; N. Schöneberg, G. Franco Abellán, A. Pérez Sánchez, S. J. Witte, V. Poulin and J. Lesgourgues, “The H0H_{0} Olympics: A fair ranking of proposed models,” [arXiv:2107.10291 [astro-ph.CO]].
  • [44] T. Sekiguchi and T. Takahashi, “Early recombination as a solution to the H0H_{0} tension,” Phys. Rev. D 103 (2021) no.8, 083507 [arXiv:2007.03381 [astro-ph.CO]].
  • [45] Y. Aghababaie, C. P. Burgess, S. L. Parameswaran and F. Quevedo, “Towards a naturally small cosmological constant from branes in 6-D supergravity,” Nucl. Phys. B 680 (2004), 389-414 [arXiv:hep-th/0304256 [hep-th]].
  • [46] Supergravity by Daniel Z. Freedman and Antoine Van Proeyen, Cambridge University Press (2012)
  • [47] J. Bagger and J. Wess, “Supersymmetry and supergravity,” JHU-TIPAC-9009; Princeton Series in Physics (1992)
  • [48] D. V. Volkov and V. P. Akulov, “Is the Neutrino a Goldstone Particle?,” Phys. Lett. B 46 (1973), 109-110
  • [49] S. Weinberg, “Nonlinear realizations of chiral symmetry,” Phys. Rev. 166 (1968), 1568-1577
  • [50] S. R. Coleman, J. Wess and B. Zumino, “Structure of phenomenological Lagrangians. 1.,” Phys. Rev. 177 (1969), 2239-2247
  • [51] C. G. Callan, Jr., S. R. Coleman, J. Wess and B. Zumino, “Structure of phenomenological Lagrangians. 2.,” Phys. Rev. 177 (1969), 2247-2250
  • [52] M. Cicoli, J. P. Conlon, A. Maharana and F. Quevedo, “A Note on the Magnitude of the Flux Superpotential,” JHEP 01 (2014), 027 [arXiv:1310.6694 [hep-th]].
  • [53] M.T. Grisaru, W. Siegel and M. Rocek, “Improved Methods For Supergraphs,” Nucl. Phys. B159 (1979) 429; E. Witten, “New Issues In Manifolds Of SU(3) Holonomy,” Nucl. Phys. B268 (1986) 79; M. Dine, N. Seiberg, “Nonrenormalization Theorems in Superstring Theory,” Phys. Rev. Lett. 57 (1986) 21; C.P. Burgess, A. Font and F. Quevedo, “Low-Energy Effective Action For The Superstring,” Nucl. Phys. B272 (1986) 661; N. Seiberg, “Naturalness versus supersymmetric nonrenormalization theorems,” Phys. Lett. B318 (1993) 469 [hep-ph/9309335]; C. P. Burgess, C. Escoda and F. Quevedo, “Nonrenormalization of flux superpotentials in string theory,” JHEP 06 (2006), 044 [arXiv:hep-th/0510213 [hep-th]].
  • [54] M. Cicoli, F. Quevedo, R. Savelli, A. Schachner and R. Valandro, “Systematics of the α\alpha^{\prime} Expansion in F-Theory,” doi:[arXiv:2106.04592 [hep-th]].
  • [55] M. Cicoli, J. P. Conlon and F. Quevedo, “General Analysis of LARGE Volume Scenarios with String Loop Moduli Stabilisation,” JHEP 10 (2008), 105 [arXiv:0805.1029 [hep-th]].
  • [56] M. Cicoli, C. P. Burgess and F. Quevedo, “Fibre Inflation: Observable Gravity Waves from IIB String Compactifications,” JCAP 03 (2009), 013 [arXiv:0808.0691 [hep-th]].
  • [57] S. B. Giddings, S. Kachru and J. Polchinski, “Hierarchies from fluxes in string compactifications,” Phys. Rev. D 66 (2002), 106006 [arXiv:hep-th/0105097 [hep-th]].
  • [58] S. Kachru, R. Kallosh, A. D. Linde and S. P. Trivedi, “De Sitter vacua in string theory,” Phys. Rev. D 68 (2003), 046005 [arXiv:hep-th/0301240 [hep-th]].
  • [59] V. Balasubramanian, P. Berglund, J. P. Conlon and F. Quevedo, “Systematics of moduli stabilisation in Calabi-Yau flux compactifications,” JHEP 03 (2005), 007 [arXiv:hep-th/0502058 [hep-th]].
  • [60] J. P. Conlon, F. Quevedo and K. Suruliz, “Large-volume flux compactifications: Moduli spectrum and D3/D7 soft supersymmetry breaking,” JHEP 08 (2005), 007 [arXiv:hep-th/0505076 [hep-th]].
  • [61] N. Arkani-Hamed, S. Dimopoulos and G. R. Dvali, “The Hierarchy problem and new dimensions at a millimeter,” Phys. Lett. B 429 (1998), 263-272 [arXiv:hep-ph/9803315 [hep-ph]].
  • [62] I. R. Klebanov and M. J. Strassler, “Supergravity and a confining gauge theory: Duality cascades and chi SB resolution of naked singularities,” JHEP 08 (2000), 052 [arXiv:hep-th/0007191 [hep-th]].
  • [63] S. Kachru, R. Kallosh, A. D. Linde, J. M. Maldacena, L. P. McAllister and S. P. Trivedi, “Towards inflation in string theory,” JCAP 10 (2003), 013 [arXiv:hep-th/0308055 [hep-th]].
  • [64] S. Ferrara, R. Kallosh and A. Linde, “Cosmology with Nilpotent Superfields,” JHEP 10 (2014), 143 [arXiv:1408.4096 [hep-th]].
  • [65] R. Kallosh and T. Wrase, “Emergence of Spontaneously Broken Supersymmetry on an Anti-D3-Brane in KKLT dS Vacua,” JHEP 12 (2014), 117 [arXiv:1411.1121 [hep-th]].
  • [66] I. Antoniadis, E. Dudas, S. Ferrara and A. Sagnotti, “The Volkov–Akulov–Starobinsky supergravity,” Phys. Lett. B 733 (2014), 32-35 [arXiv:1403.3269 [hep-th]].
  • [67] L. Aparicio, F. Quevedo and R. Valandro, “Moduli Stabilisation with Nilpotent Goldstino: Vacuum Structure and SUSY Breaking,” JHEP 03 (2016), 036 [arXiv:1511.08105 [hep-th]].
  • [68] M. P. Garcia del Moral, S. Parameswaran, N. Quiroz and I. Zavala, “Anti-D3 branes and moduli in non-linear supergravity,” JHEP 10 (2017), 185 doi:10.1007/JHEP10(2017)185 [arXiv:1707.07059 [hep-th]]
  • [69] I. Bena, E. Dudas, M. Graña and S. Lüst, “Uplifting Runaways,” Fortsch. Phys. 67 (2019) no.1-2, 1800100 doi:[arXiv:1809.06861 [hep-th]].
  • [70] E. Dudas and S. Lüst, “An update on moduli stabilization with antibrane uplift,” JHEP 03 (2021), 107 doi:[arXiv:1912.09948 [hep-th]].
  • [71] C. Crinò, F. Quevedo and R. Valandro, “On de Sitter String Vacua from Anti-D3-Branes in the Large Volume Scenario,” JHEP 03 (2021), 258 doi:[arXiv:2010.15903 [hep-th]].
  • [72] J. F. G. Cascales, M. P. Garcia del Moral, F. Quevedo and A. M. Uranga, “Realistic D-brane models on warped throats: Fluxes, hierarchies and moduli stabilization,” JHEP 02 (2004), 031 [arXiv:hep-th/0312051 [hep-th]].
  • [73] R. Kallosh, F. Quevedo and A. M. Uranga, “String Theory Realizations of the Nilpotent Goldstino,” JHEP 12 (2015), 039 [arXiv:1507.07556 [hep-th]].
  • [74] I. García-Etxebarria, F. Quevedo and R. Valandro, “Global String Embeddings for the Nilpotent Goldstino,” JHEP 02 (2016), 148 [arXiv:1512.06926 [hep-th]].
  • [75] A. Brignole, F. Feruglio, M. L. Mangano and F. Zwirner, “Signals of a superlight gravitino at hadron colliders when the other superparticles are heavy,” Nucl. Phys. B 526 (1998), 136-152 [erratum: Nucl. Phys. B 582 (2000), 759-761] [arXiv:hep-ph/9801329 [hep-ph]].
  • [76] A. Brignole, F. Feruglio and F. Zwirner, “Signals of a superlight gravitino at e+ee^{+}e^{-} colliders when the other superparticles are heavy,” Nucl. Phys. B 516 (1998), 13-28 [erratum: Nucl. Phys. B 555 (1999), 653-655] [arXiv:hep-ph/9711516 [hep-ph]].
  • [77] S. Gukov, C. Vafa and E. Witten, “CFT’s from Calabi-Yau four folds,” Nucl. Phys. B 584 (2000), 69-108 [erratum: Nucl. Phys. B 608 (2001), 477-478] [arXiv:hep-th/9906070 [hep-th]].
  • [78] M. Demirtas, M. Kim, L. McAllister, J. Moritz and A. Rios-Tascon, “Small Cosmological Constants in String Theory,” [arXiv:2107.09064 [hep-th]].
  • [79] J. P. Conlon and F. G. Pedro, “Moduli Redefinitions and Moduli Stabilisation,” JHEP 06 (2010), 082 [arXiv:1003.0388 [hep-th]].
  • [80] M. Weissenbacher, “α?\alpha^{?} -Corrections and de Sitter vacua - A mirage?,” Phys. Lett. B 792 (2019), 269-275 [arXiv:1901.09626 [hep-th]].
  • [81] I. Antoniadis, Y. Chen and G. K. Leontaris, “Logarithmic loop corrections, moduli stabilisation and de Sitter vacua in string theory,” JHEP 01 (2020), 149 [arXiv:1909.10525 [hep-th]].
  • [82] C. P. Burgess and F. Quevedo, “RG-Induced Modulus Stabilization: Perturbative de Sitter Vacua and Improved D3-D3¯\overline{\hbox{D3}} Inflation,” [arXiv:2202.05344 [hep-th]].
  • [83] G. D. Coughlan, W. Fischler, E. W. Kolb, S. Raby and G. G. Ross, “Cosmological Problems for the Polonyi Potential,” Phys. Lett. B 131 (1983), 59-64
  • [84] T. Banks, D. B. Kaplan and A. E. Nelson, “Cosmological implications of dynamical supersymmetry breaking,” Phys. Rev. D 49 (1994), 779-787 [arXiv:hep-ph/9308292 [hep-ph]].
  • [85] B. de Carlos, J. A. Casas, F. Quevedo and E. Roulet, “Model independent properties and cosmological implications of the dilaton and moduli sectors of 4-d strings,” Phys. Lett. B 318 (1993), 447-456 [arXiv:hep-ph/9308325 [hep-ph]].
  • [86] J. P. Conlon and F. Quevedo, “Astrophysical and cosmological implications of large volume string compactifications,” JCAP 08 (2007), 019 [arXiv:0705.3460 [hep-ph]].
  • [87] J. P. Conlon, D. Cremades and F. Quevedo, “Kahler potentials of chiral matter fields for Calabi-Yau string compactifications,” JHEP 01 (2007), 022 [arXiv:hep-th/0609180 [hep-th]].
  • [88] M. Cicoli, C. P. Burgess and F. Quevedo, “Anisotropic Modulus Stabilisation: Strings at LHC Scales with Micron-sized Extra Dimensions,” JHEP 10 (2011), 119 [arXiv:1105.2107 [hep-th]].
  • [89] M. Williams, C. P. Burgess, A. Maharana and F. Quevedo, “New Constraints (and Motivations) for Abelian Gauge Bosons in the MeV-TeV Mass Range,” JHEP 08 (2011), 106 [arXiv:1103.4556 [hep-ph]].
  • [90] T. Damour and G. Esposito-Farèse, “Tensor-multi-scalar theories of gravitation,” Class. Quantum Grav. 9 (1992) 2093;
    T. Damour and K. Nordtvedt, “Tensor-scalar Cosmological Models and their Relaxation Toward General Relativity,” Phys. Rev. D 48 (1993) 3436;
    T. Damour and A. M. Polyakov, “The String dilaton and a least coupling principle,” Nucl. Phys. B 423 (1994), 532-558 [arXiv:hep-th/9401069 [hep-th]].
  • [91] T. Damour and D. Vokrouhlicky, “The Equivalence principle and the moon,” Phys. Rev. D 53 (1996), 4177-4201 [arXiv:gr-qc/9507016 [gr-qc]];
    T. Damour and J. F. Donoghue, “Equivalence Principle Violations and Couplings of a Light Dilaton,” Phys. Rev. D 82 (2010), 084033 [arXiv:1007.2792 [gr-qc]].
  • [92] T. Dent and M. Fairbairn, “Time varying coupling strengths, nuclear forces and unification,” Nucl. Phys. B 653 (2003), 256-278 [arXiv:hep-ph/0112279 [hep-ph]];
    K. A. Olive, M. Pospelov, Y. Z. Qian, A. Coc, M. Casse and E. Vangioni-Flam, “Constraints on the variations of the fundamental couplings,” Phys. Rev. D 66 (2002), 045022 [arXiv:hep-ph/0205269 [hep-ph]];
    G. R. Dvali and M. Zaldarriaga, “Changing alpha with time: Implications for fifth force type experiments and quintessence,” Phys. Rev. Lett. 88 (2002), 091303 [arXiv:hep-ph/0108217 [hep-ph]]. S. Lee, K. A. Olive and M. Pospelov, “Quintessence models and the cosmological evolution of alpha,” Phys. Rev. D 70 (2004), 083503 [arXiv:astro-ph/0406039 [astro-ph]];
    J. P. Uzan, “Varying Constants, Gravitation and Cosmology,” Living Rev. Rel. 14 (2011), 2 [arXiv:1009.5514 [astro-ph.CO]].
  • [93] B. Bertotti, L. Iess and P. Tortora, “A test of general relativity using radio links with the Cassini spacecraft,” Nature 425 (2003) 374.
  • [94] T. Damour and J. H. Taylor, “Strong field tests of relativistic gravity and binary pulsars,” Phys. Rev. D 45 (1992), 1840-1868;
    T. Damour and G. Esposito-Farese, “Tensor - scalar gravity and binary pulsar experiments,” Phys. Rev. D 54 (1996), 1474-1491 [arXiv:gr-qc/9602056 [gr-qc]];
    I. H. Stairs, “Testing general relativity with pulsar timing,” Living Rev. Rel. 6 (2003), 5 [arXiv:astro-ph/0307536 [astro-ph]];
    M. W. Horbatsch and C. P. Burgess, “Semi-Analytic Stellar Structure in Scalar-Tensor Gravity,” JCAP 08 (2011), 027 [arXiv:1006.4411 [gr-qc]].
  • [95] N. Yunes, F. Pretorius and D. Spergel, “Constraining the evolutionary history of Newton’s constant with gravitational wave observations,” Phys. Rev. D 81 (2010), 064018 [arXiv:0912.2724 [gr-qc]].
  • [96] M. Kramer, I. H. Stairs, R. N. Manchester, M. A. McLaughlin, A. G. Lyne, R. D. Ferdman, M. Burgay, D. R. Lorimer, A. Possenti and N. D’Amico, et al. “Tests of general relativity from timing the double pulsar,” Science 314 (2006), 97-102 [arXiv:astro-ph/0609417 [astro-ph]].
  • [97] A.S. Eddington, The Mathematical Theory of Relativity, Cambridge University Press (1924);
    H.P. Robertson in Space and Astronomy, edited by A.J. Deutsch and W.B. Klemperer, Academic Press (1962).
  • [98] L. A. Boyle, R. R. Caldwell and M. Kamionkowski, “Spintessence! New models for dark matter and dark energy,” Phys. Lett. B 545 (2002), 17-22 [arXiv:astro-ph/0105318 [astro-ph]];
    A. Achucarro, J. O. Gong, S. Hardeman, G. A. Palma and S. P. Patil, “Effective theories of single field inflation when heavy fields matter,” JHEP 05 (2012), 066 [arXiv:1201.6342 [hep-th]];
    C. P. Burgess, M. W. Horbatsch and S. P. Patil, “Inflating in a Trough: Single-Field Effective Theory from Multiple-Field Curved Valleys,” JHEP 01 (2013), 133 [arXiv:1209.5701 [hep-th]];
    S. Renaux-Petel and K. Turzyński, “Geometrical Destabilization of Inflation,” Phys. Rev. Lett. 117 (2016) no.14, 141301 [arXiv:1510.01281 [astro-ph.CO]];
    P. Christodoulidis, D. Roest and E. I. Sfakianakis, “Angular inflation in multi-field α\alpha-attractors,” JCAP 11 (2019), 002 [arXiv:1803.09841 [hep-th]];
    M. Cicoli, V. Guidetti and F. G. Pedro, JCAP 05 (2019), 046 [arXiv:1903.01497 [hep-th]];
    M. Cicoli, V. Guidetti, F. Muia, F. G. Pedro and G. P. Vacca, [arXiv:2107.12392 [hep-th]].
  • [99] C. P. Burgess, R. C. Myers and F. Quevedo, “On spherically symmetric string solutions in four-dimensions,” Nucl. Phys. B 442 (1995), 75-96 [arXiv:hep-th/9410142 [hep-th]];
    C. P. Burgess, R. C. Myers and F. Quevedo, “Duality and four-dimensional black holes,” Nucl. Phys. B 442 (1995), 97-125 [arXiv:hep-th/9411195 [hep-th]].
  • [100] J. Khoury and A. Weltman, “Chameleon fields: Awaiting surprises for tests of gravity in space,” Phys. Rev. Lett. 93 (2004), 171104 [arXiv:astro-ph/0309300 [astro-ph]].
  • [101] C. Burrage and J. Sakstein, “Tests of Chameleon Gravity,” Living Rev. Rel. 21 (2018) no.1, 1 [arXiv:1709.09071 [astro-ph.CO]].
  • [102] K. Hinterbichler, J. Khoury and H. Nastase, “Towards a UV Completion for Chameleon Scalar Theories,” JHEP 03 (2011), 061 [erratum: JHEP 06 (2011), 072] [arXiv:1012.4462 [hep-th]].
  • [103] H. Nastase and A. Weltman, “Chameleons on the Racetrack,” JHEP 08 (2013), 059 [arXiv:1301.7120 [hep-th]].
  • [104] M. Cicoli, F. G. Pedro and G. Tasinato, “Natural Quintessence in String Theory,” JCAP 07 (2012), 044 [arXiv:1203.6655 [hep-th]].
  • [105] B. S. Acharya, M. Dhuria, D. Ghosh, A. Maharana and F. Muia, “Cosmology in the presence of multiple light moduli,” JCAP 11 (2019), 035 [arXiv:1906.03025 [hep-th]].
  • [106] M. Cicoli, G. Dibitetto and F. G. Pedro, “Out of the Swampland with Multifield Quintessence?,” JHEP 10 (2020), 035 [arXiv:2007.11011 [hep-th]].
  • [107] D. Atwood, C. P. Burgess, E. Filotas, F. Leblond, D. London and I. Maksymyk, “Supersymmetric large extra dimensions are small and/or numerous,” Phys. Rev. D 63 (2001), 025007 [arXiv:hep-ph/0007178 [hep-ph]].
  • [108] C. P. Burgess, J. Matias and F. Quevedo, “MSLED: A Minimal supersymmetric large extra dimensions scenario,” Nucl. Phys. B 706 (2005), 71-99 [arXiv:hep-ph/0404135 [hep-ph]].
  • [109] E. W. Kolb, A. J. Long and E. McDonough, “Catastrophic production of slow gravitinos,” Phys. Rev. D 104 (2021) no.7, 075015 [arXiv:2102.10113 [hep-th]].
  • [110] E. Dudas, M. A. G. Garcia, Y. Mambrini, K. A. Olive, M. Peloso and S. Verner, “Slow and Safe Gravitinos,” Phys. Rev. D 103 (2021), 123519 [arXiv:2104.03749 [hep-th]].
  • [111] I. Antoniadis, K. Benakli and W. Ke, “Salvage of too slow gravitinos,” JHEP 11 (2021), 063 [arXiv:2105.03784 [hep-th]].
  • [112] D. J. E. Marsh, “Axion Cosmology,” Phys. Rept. 643 (2016), 1-79 [arXiv:1510.07633 [astro-ph.CO]].
  • [113] A. Hook, “TASI Lectures on the Strong CP Problem and Axions,” PoS TASI2018 (2019), 004 [arXiv:1812.02669 [hep-ph]].
  • [114] K. Choi, S. H. Im and C. S. Shin, “Recent progress in physics of axions or axion-like particles,” [arXiv:2012.05029 [hep-ph]].
  • [115] J. L. Feng, “Dark Matter Candidates from Particle Physics and Methods of Detection,” Ann. Rev. Astron. Astrophys. 48 (2010), 495-545 [arXiv:1003.0904 [astro-ph.CO]].
  • [116] M. Kawasaki, K. Kohri, T. Moroi and A. Yotsuyanagi, “Big-Bang Nucleosynthesis and Gravitino,” Phys. Rev. D 78 (2008), 065011 [arXiv:0804.3745 [hep-ph]].
  • [117] A. Hebecker, T. Skrzypek and M. Wittner, “The FF-term Problem and other Challenges of Stringy Quintessence,” JHEP 11 (2019), 134 [arXiv:1909.08625 [hep-th]].
  • [118] P.A. Zyla et al. [Particle Data Group], “Review of Particle Physics,” PTEP 2020 (2020) no.8, 083C01
  • [119] F. Iocco, G. Mangano, G. Miele, O. Pisanti and P. D. Serpico, “Primordial Nucleosynthesis: from precision cosmology to fundamental physics,” Phys. Rept. 472 (2009), 1-76 [arXiv:0809.0631 [astro-ph]];
    M. Pospelov and J. Pradler, “Big Bang Nucleosynthesis as a Probe of New Physics,” Ann. Rev. Nucl. Part. Sci. 60 (2010), 539-568 [arXiv:1011.1054 [hep-ph]];
    G. Mangano and P.D. Serpico, “A robust upper limit on NeffN_{eff} from BBN, circa 2011,” Phys. Lett. B701 (2011) 296 [arXiv:1103.1261 [astro-ph]].
  • [120] J. Alvey, N. Sabti, M. Escudero and M. Fairbairn, “Improved BBN Constraints on the Variation of the Gravitational Constant,” Eur. Phys. J. C 80 (2020) no.2, 148 [arXiv:1910.10730 [astro-ph.CO]].
  • [121] J. Mould and S. Uddin, “Constraining a possible variation of G with Type Ia supernovae,” Publ. Astron. Soc. Austral. 31 (2014), 15 [arXiv:1402.1534 [astro-ph.CO]].
  • [122] P. J. Steinhardt, L. M. Wang and I. Zlatev, “Cosmological tracking solutions,” Phys. Rev. D 59 (1999), 123504 [arXiv:astro-ph/9812313 [astro-ph]];
    P. G. Ferreira and M. Joyce, “Cosmology with a primordial scaling field,” Phys. Rev. D 58 (1998), 023503 [arXiv:astro-ph/9711102 [astro-ph]];
    E. J. Copeland, A. R. Liddle and D. Wands, “Exponential potentials and cosmological scaling solutions,” Phys. Rev. D 57 (1998), 4686-4690 [arXiv:gr-qc/9711068 [gr-qc]];
    T. Barreiro, E. J. Copeland and N. J. Nunes, “Quintessence arising from exponential potentials,” Phys. Rev. D 61 (2000), 127301 [arXiv:astro-ph/9910214 [astro-ph]].
  • [123] F. Y. Cyr-Racine, F. Ge and L. Knox, “A Symmetry of Cosmological Observables, and a High Hubble Constant as an Indicator of a Mirror World Dark Sector,” [arXiv:2107.13000 [astro-ph.CO]].
  • [124] O. Zahn and M. Zaldarriaga, “Probing the Friedmann equation during recombination with future CMB experiments,” Phys. Rev. D 67 (2003), 063002 [arXiv:astro-ph/0212360 [astro-ph]].
  • [125] P. Binetruy, “Models of dynamical supersymmetry breaking and quintessence,” Phys. Rev. D 60 (1999), 063502 [arXiv:hep-ph/9810553 [hep-ph]];
    A. Masiero, M. Pietroni and F. Rosati, “SUSY QCD and quintessence,” Phys. Rev. D 61 (2000), 023504 [arXiv:hep-ph/9905346 [hep-ph]];
    P. Brax and J. Martin, “Quintessence and supergravity,” Phys. Lett. B 468 (1999), 40-45 [arXiv:astro-ph/9905040 [astro-ph]];
    E. J. Copeland, N. J. Nunes and F. Rosati, “Quintessence models in supergravity,” Phys. Rev. D 62 (2000), 123503 [arXiv:hep-ph/0005222 [hep-ph]].
  • [126] M. Cicoli, S. De Alwis, A. Maharana, F. Muia and F. Quevedo, “De Sitter vs Quintessence in String Theory,” Fortsch. Phys. 67 (2019) no.1-2, 1800079 doi:10.1002/prop.201800079 [arXiv:1808.08967 [hep-th]].
  • [127] M. Cicoli, F. Cunillera, A. Padilla and F. G. Pedro, “Quintessence and the Swampland: The numerically controlled regime of moduli space,” [arXiv:2112.10783 [hep-th]]; “Quintessence and the Swampland: The parametrically controlled regime of moduli space,” [arXiv:2112.10779 [hep-th]].
  • [128] C. P. Burgess, H. M. Lee and M. Trott, “Power-counting and the Validity of the Classical Approximation During Inflation,” JHEP 09 (2009), 103 [arXiv:0902.4465 [hep-ph]];
    P. Adshead, C. P. Burgess, R. Holman and S. Shandera, “Power-counting during single-field slow-roll inflation,” JCAP 02 (2018), 016 [arXiv:1708.07443 [hep-th];
    I. Babic, C. P. Burgess and G. Geshnizjani, “Keeping an eye on DBI: power-counting for small-csc_{s} cosmology,” JCAP 05 (2020), 023 [arXiv:1910.05277 [gr-qc]].
  • [129] S. H. H. Tye and S. S. C. Wong, “Linking Light Scalar Modes with A Small Positive Cosmological Constant in String Theory,” JHEP 06 (2017), 094 [arXiv:1611.05786 [hep-th]].
  • [130] S. Andriolo, S. Y. Li and S. H. H. Tye, “The Cosmological Constant and the Electroweak Scale,” JHEP 10 (2019), 212 [arXiv:1812.04873 [hep-th]].
  • [131] E. S. Fradkin and A. A. Tseytlin, “Renormalizable Asymptotically Free Quantum Theory of Gravity,” Phys. Lett. B 104 (1981), 377-381; “Renormalizable asymptotically free quantum theory of gravity,” Nucl. Phys. B 201 (1982), 469-491;
    S. D. Odintsov and I. L. Shapiro, “General relativity as the low-energy limit in higher derivative quantum gravity,” Class. Quant. Grav. 9 (1992), 873-882;
    P. D. Mannheim, “Making the Case for Conformal Gravity,” Found. Phys. 42 (2012), 388-420 [arXiv:1101.2186 [hep-th]];
    R. Kallosh and A. Linde, “Superconformal generalizations of the Starobinsky model,” JCAP 06 (2013), 028 [arXiv:1306.3214 [hep-th]];
    A. Salvio and A. Strumia, “Agravity,” JHEP 06 (2014), 080 [arXiv:1403.4226 [hep-ph]];
    M. B. Einhorn and D. R. T. Jones, “Naturalness and Dimensional Transmutation in Classically Scale-Invariant Gravity,” JHEP 03 (2015), 047 [arXiv:1410.8513 [hep-th]];
    B. Holdom and J. Ren, “QCD analogy for quantum gravity,” Phys. Rev. D 93 (2016) no.12, 124030 [arXiv:1512.05305 [hep-th]];
    J. F. Donoghue and G. Menezes, “Inducing the Einstein action in QCD-like theories,” Phys. Rev. D 97 (2018) no.5, 056022 [arXiv:1712.04468 [hep-ph]].
  • [132] I. Antoniadis and E. T. Tomboulis, “Gauge Invariance and Unitarity in Higher Derivative Quantum Gravity,” Phys. Rev. D 33 (1986), 2756;
    R. P. Woodard, “Ostrogradsky’s theorem on Hamiltonian instability,” Scholarpedia 10 (2015) no.8, 32243 [arXiv:1506.02210 [hep-th]].
  • [133] M. Raidal and H. Veermäe, “On the Quantisation of Complex Higher Derivative Theories and Avoiding the Ostrogradsky Ghost,” Nucl. Phys. B 916 (2017), 607-626 [arXiv:1611.03498 [hep-th]];
    G. Narain, “Signs and Stability in Higher-Derivative Gravity,” Int. J. Mod. Phys. A 33 (2018) no.04, 1850031 [arXiv:1704.05031 [hep-th]];
    J. F. Donoghue and G. Menezes, “Ostrogradsky instability can be overcome by quantum physics,” Phys. Rev. D 104 (2021) no.4, 045010 [arXiv:2105.00898 [hep-th]]. B. Holdom and J. Ren, “Quadratic gravity: from weak to strong,” Int. J. Mod. Phys. D 25 (2016) no.12, 1643004 [arXiv:1605.05006 [hep-th]].
  • [134] R. Kallosh and A. Linde, “Universality Class in Conformal Inflation,” JCAP 07 (2013), 002 [arXiv:1306.5220 [hep-th]];
    R. Kallosh, A. Linde and D. Roest, “Superconformal Inflationary α\alpha-Attractors,” JHEP 11 (2013), 198 [arXiv:1311.0472 [hep-th]].
  • [135] R. Kallosh and A. Linde, “Superconformal generalization of the chaotic inflation model λ4ϕ4ξ2ϕ2R\frac{\lambda}{4}\phi^{4}-\frac{\xi}{2}\phi^{2}R,” JCAP 06 (2013), 027 [arXiv:1306.3211 [hep-th]].
  • [136] F. L. Bezrukov and M. Shaposhnikov, “The Standard Model Higgs boson as the inflaton,” Phys. Lett. B 659 (2008), 703-706 [arXiv:0710.3755 [hep-th]].
  • [137] M. Veltman and F. Yndurain, Nucl. Phys. B325 (1989) 1;
    V. Silveira and A. Zee, Phys. Lett. B161 (1985) 136;
    J. McDonald, “Gauge singlet scalars as cold dark matter,” Phys. Rev. D 50 (1994), 3637-3649 [arXiv:hep-ph/0702143 [hep-ph]];
    C. P. Burgess, M. Pospelov and T. ter Veldhuis, “The Minimal model of nonbaryonic dark matter: A Singlet scalar,” Nucl. Phys. B 619 (2001), 709-728 [arXiv:hep-ph/0011335 [hep-ph]].
  • [138] K. R. Dienes, E. Dudas and T. Gherghetta, “Neutrino oscillations without neutrino masses or heavy mass scales: A Higher dimensional seesaw mechanism,” Nucl. Phys. B 557 (1999), 25 [arXiv:hep-ph/9811428 [hep-ph]].
  • [139] J. Matias and C. P. Burgess, “MSLED, neutrino oscillations and the cosmological constant,” JHEP 09 (2005), 052 [arXiv:hep-ph/0508156 [hep-ph]].
  • [140] C. P. Burgess, M. Cicoli, S. de Alwis and F. Quevedo, “Robust Inflation from Fibrous Strings,” JCAP 05 (2016), 032 [arXiv:1603.06789 [hep-th]].
  • [141] J. P. Conlon, R. Kallosh, A. D. Linde and F. Quevedo, “Volume Modulus Inflation and the Gravitino Mass Problem,” JCAP 09 (2008), 011 [arXiv:0806.0809 [hep-th]].
  • [142] C. P. Burgess, J. J. H. Enns, P. Hayman and S. P. Patil, “Goldilocks Models of Higher-Dimensional Inflation (including modulus stabilization),” JCAP 08 (2016), 045 [arXiv:1605.03297 [gr-qc]].