This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Nonlinear definition of the shadowy mode in higher-order scalar-tensor theories

Antonio De Felice Center for Gravitational Physics, Yukawa Institute for Theoretical Physics, Kyoto University, Kyoto 606-8502, Japan    Shinji Mukohyama Center for Gravitational Physics, Yukawa Institute for Theoretical Physics, Kyoto University, Kyoto 606-8502, Japan Kavli Institute for the Physics and Mathematics of the Universe (WPI), The University of Tokyo, 277-8583, Chiba, Japan    Kazufumi Takahashi Center for Gravitational Physics, Yukawa Institute for Theoretical Physics, Kyoto University, Kyoto 606-8502, Japan
Abstract

We study U-DHOST theories, i.e., higher-order scalar-tensor theories which are degenerate only in the unitary gauge and yield an apparently unstable extra mode in a generic coordinate system. We show that the extra mode satisfies a three-dimensional elliptic differential equation on a spacelike hypersurface, and hence it does not propagate. We clarify how to treat this “shadowy” mode at both the linear and the nonlinear levels.

preprint: YITP-21-104, IPMU21-0061

I Introduction

Scalar-tensor theories provide a useful description of modified gravity theories. When constructing a covariant action containing a metric and a scalar field, one requires the absence of Ostrogradsky ghosts [1, 2], which are associated with higher-order equations of motion (EOMs). This requirement poses the so-called degeneracy conditions [3, 4, 5, 6, 7, 8], which severely constrain the form of the higher-derivative interactions. Scalar-tensor theories with higher derivatives satisfying the degeneracy conditions are called degenerate higher-order scalar-tensor (DHOST) theories. Many classes of DHOST theories have been constructed so far [4, 9, 10, 11, 12], which include the Horndeski [13, 14, 15] and Gleyzes-Langlois-Piazza-Vernizzi theories [16] as limiting cases (see [17, 18] for reviews).

In the context of cosmology, it is natural to assume that the scalar field has a timelike gradient, and hence it is useful to take the so-called unitary gauge where the scalar field is fixed to be a given function of time. In such a particular choice of coordinate system, one obtains a weaker degeneracy condition, which defines a broader class of theories than the DHOST class. In particular, theories that are degenerate only in the unitary gauge are called U-DHOST theories [19].*1*1*1Under the unitary gauge, the U-DHOST theories reduce to spatially covariant gravity [20, 21, 22]. Conversely, restoring the time diffeomorphism invariance by introducing a Stückelberg scalar field, models of spatially covariant gravity are mapped into the union of the DHOST class and the U-DHOST class. As such, the U-DHOST theories seem to contain an extra mode originating from higher-order EOMs in a generic coordinate system where the scalar field has an inhomogeneous profile, which apparently signals the Ostrogradsky instability. However, it was pointed out that the extra mode actually does not propagate, and hence the U-DHOST theories are free of the apparent instability.*2*2*2This is not the case when the gradient of the scalar field is not timelike. When the scalar field has a spacelike gradient at least in a part of the spacetime domain of our interest, the Ostrogradsky instability may occur and/or the theory can no longer be trusted as a valid effective field theory, i.e., the knowledge of its ultra-violet continuation is necessary. Hence, the U-DHOST theories are valid only when they are used to describe a spacetime accompanied by the scalar field with a timelike gradient. Due to this nature, the extra mode was called a “shadowy” mode in [19]. Note that the shadowy mode is defined in a coordinate-independent manner as a mode living on a spacelike hypersurface (exactly like the shadows produced by the people on the beach). Such a non-propagating extra mode was found earlier in the context of massive gravity [23] and the khronometric theories [24, 25]. In these works, the extra mode was said to be “instantaneous” as it can be interpreted to propagate with an infinite speed. It is interesting to note that the U-DHOST theories include the khronometric theories, and hence the shadowy mode is understood as generalization of the instantaneous mode.

In order to elucidate the nature of the shadowy mode, the authors of [19] studied the following simple toy model in a flat two-dimensional spacetime:

L=12X+LU,LUξ4[XμXμX(μϕμX)2],L=-\frac{1}{2}X+L_{\rm U},\qquad L_{\rm U}\coloneqq-\frac{\xi}{4}\left[X\partial^{\mu}X\partial_{\mu}X-(\partial^{\mu}\phi\partial_{\mu}X)^{2}\right], (1)

where XμϕμϕX\coloneqq\partial_{\mu}\phi\partial^{\mu}\phi and ξ\xi is a constant. Here, the first term in the Lagrangian is nothing but the canonical kinetic term of the scalar field and note that the second term LUL_{\rm U} is vanishing when the scalar field is homogeneous. The authors of [19] investigated perturbations about some background solution in two different coordinate systems. The first one is such that the background field is homogeneous, while in the second one the background field has an inhomogeneous profile but still has a timelike gradient. For perturbations about a homogeneous background, no higher time derivatives appear in the EOM. On the other hand, once we consider perturbations about an inhomogeneous background, the EOM contains a fourth time derivative, which apparently signals the existence of an unstable extra mode. This situation is similar to what happens in the U-DHOST theories. The point is that, for this toy model, the dangerous extra mode satisfies a one-dimensional differential equation on a spacelike line. This means that the configuration of the extra mode is completely determined once an appropriate boundary condition is imposed on a spatial boundary, and hence the apparent instability is actually not a problem.

Although the above discussion based on the toy model (1) helps us to infer the nature of the shadowy mode in the U-DHOST theories, it is still nontrivial to predict how the situation changes when the dynamics of the metric is taken into account. Also, it remains unclear how the shadowy mode shows up at the nonlinear level. Actually, the instantaneous mode in the khronometric theories was studied only at the level of linear perturbations [24, 25]. The aim of the present paper is to clarify how to understand and deal with the shadowy mode in the presence of gravity even at the nonlinear level. For demonstration purposes, we focus on one of the simplest examples of the U-DHOST theories and study the effects of the shadowy mode at both the linear and nonlinear levels. We argue that the existence of the shadowy mode indeed affects the structure of the EOMs, where in particular the Hamiltonian constraint takes the form of a (quasi-)linear partial differential equation, whose principal part has the Laplacian operator defined on a three-dimensional hypersurface. This is in sharp contrast to the case of DHOST theories where the Hamiltonian constraint fixes the lapse function algebraically in the unitary gauge. The appearance of such higher spatial derivatives in the U-DHOST theories is indeed due to the existence of the shadowy mode.

The rest of this paper is organized as follows. We first introduce our model in §II. In §III, we investigate cosmological perturbations in the U-DHOST theories. In §IV, we perform a nonlinear Hamiltonian analysis for the U-DHOST theories. The results of these two sections clarify how the shadowy mode appears at the linear and nonlinear levels, respectively. Then, we discuss how to treat the shadowy mode in §V. Finally, we draw our conclusions in §VI.

II The model

We study higher-order scalar-tensor theories described by the following action:

S=d4xg[P(X)+Q(X)ϕ+F(X)R+I=15AI(X)LI(2)],S=\int{\rm{d}}^{4}x\sqrt{-g}\left[P(X)+Q(X)\Box\phi+F(X)R+\sum_{I=1}^{5}A_{I}(X)L_{I}^{(2)}\right], (2)

where PP, QQ, FF, and AIA_{I} (I=1,,5I=1,\cdots,5) are functions of XϕμϕμX\coloneqq\phi_{\mu}\phi^{\mu} and

L1(2)ϕμνϕμν,L2(2)(ϕ)2,L3(2)ϕμϕμνϕνϕ,L4(2)ϕμϕμνϕνλϕλ,L5(2)(ϕμϕμνϕν)2,L_{1}^{(2)}\coloneqq\phi^{{\mu\nu}}\phi_{{\mu\nu}},\quad L_{2}^{(2)}\coloneqq(\Box\phi)^{2},\quad L_{3}^{(2)}\coloneqq\phi^{\mu}\phi_{{\mu\nu}}\phi^{\nu}\Box\phi,\quad L_{4}^{(2)}\coloneqq\phi^{\mu}\phi_{{\mu\nu}}\phi^{\nu\lambda}\phi_{\lambda},\quad L_{5}^{(2)}\coloneqq(\phi^{\mu}\phi_{{\mu\nu}}\phi^{\nu})^{2}, (3)

with ϕμμϕ\phi_{\mu}\coloneqq\nabla_{\mu}\phi and ϕμνμνϕ\phi_{{\mu\nu}}\coloneqq\nabla_{\mu}\nabla_{\nu}\phi. Equation (2) is the most general action up to quadratic order in the second derivative of ϕ\phi. For a generic choice of the functions FF and AIA_{I}, we have higher-order EOMs and, as such, there exists an associated Ostrogradsky ghost. In order to avoid this problem, it is in general necessary to impose the degeneracy conditions on these functions. DHOST theories are those satisfying the degeneracy requirement in an arbitrary coordinate system, where the coefficient functions satisfy the following set of conditions [4]:

A2=A1,A4=18(FXA1)2{4F[3(A12FX)22A3F]A3X2(16A1FX+A3F)+4X(3A1A3F+16A12FX16A1FX24A13+2A3FFX)},A5=18(FXA1)2(2A1XA34FX)[A1(2A1+3XA34FX)4A3F].\begin{split}A_{2}&=-A_{1},\\ A_{4}&=\frac{1}{8(F-XA_{1})^{2}}\bigl{\{}4F\left[3(A_{1}-2F_{X})^{2}-2A_{3}F\right]-A_{3}X^{2}(16A_{1}F_{X}+A_{3}F)\\ &~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}+4X\left(3A_{1}A_{3}F+16A_{1}^{2}F_{X}-16A_{1}F_{X}^{2}-4A_{1}^{3}+2A_{3}FF_{X}\right)\bigr{\}},\\ A_{5}&=\frac{1}{8(F-XA_{1})^{2}}(2A_{1}-XA_{3}-4F_{X})\left[A_{1}(2A_{1}+3XA_{3}-4F_{X})-4A_{3}F\right].\end{split} (4)

If one requires the degeneracy in a specific coordinate system, a weaker condition is obtained. In particular, the degeneracy condition in the unitary gauge was derived in [4, 19], which reads

4[X(A1+3A2)+2F][A1+A2+X(A3+A4)+X2A5]=3X(2A2+XA3+4FX)2.4\left[X(A_{1}+3A_{2})+2F\right]\left[A_{1}+A_{2}+X(A_{3}+A_{4})+X^{2}A_{5}\right]=3X(2A_{2}+XA_{3}+4F_{X})^{2}. (5)

Indeed, one can check that (5) is satisfied if the set of degeneracy conditions (4) for the DHOST theories is satisfied. As mentioned earlier, the U-DHOST theories are higher-order scalar-tensor theories that satisfy the degeneracy conditions only in the unitary gauge. Namely, within the class of theories described by the action (2), the U-DHOST theories are defined as those satisfying (5) but not (4). The DHOST and U-DHOST theories can be generalized to actions with arbitrary powers of ϕμν\phi_{\mu\nu} [19], though we restrict ourselves to the action (2) for simplicity.

In the present paper, for demonstration purposes, we study one of the simplest examples of the U-DHOST theories, the one defined by

F(X)=MPl22,A1(X)=A2(X)=A3(X)=0,A4(X)=XA5(X)=ξX,F(X)=\frac{M_{\rm Pl}^{2}}{2},\qquad A_{1}(X)=A_{2}(X)=A_{3}(X)=0,\qquad A_{4}(X)=-XA_{5}(X)=-\xi X, (6)

with ξ\xi being a constant, namely,

S=d4xg{MPl22R+P(X)+Q(X)ϕξ[Xϕμϕμνϕνλϕλ(ϕμϕμνϕν)2]}.S=\int{\rm{d}}^{4}x\sqrt{-g}\left\{\frac{M_{\rm Pl}^{2}}{2}R+P(X)+Q(X)\Box\phi-\xi\left[X\phi^{\mu}\phi_{{\mu\nu}}\phi^{\nu\lambda}\phi_{\lambda}-(\phi^{\mu}\phi_{{\mu\nu}}\phi^{\nu})^{2}\right]\right\}. (7)

The term with ξ\xi here is nothing but the four-dimensional covariant version of LUL_{\rm U} in equation (1) and it characterizes the deviation from the DHOST theories. Also, we are interested in generic cases where there exists one propagating scalar mode in addition to the two tensor modes in the gravity sector. Namely, we do not discuss special cases with only two propagating degrees of freedom (plus a non-propagating shadowy mode), e.g., the cuscuton model [26] (and its extension [27, 28]) or minimally modified gravity [29, 30, 31, 32, 33, 34], for which the discussion below should be modified. Nevertheless, we expect that our analysis can be generalized to include such cases.

In what follows, we study how the shadowy mode shows up at both the linear and nonlinear levels based on the above simple model of the U-DHOST theories. We first investigate how the shadowy mode affects cosmological perturbations in the next section.

III Shadowy mode in cosmological perturbations

In this section, we study perturbations about homogeneous and isotropic solutions in the U-DHOST theories, which helps us understand the effects of the shadowy mode in the presence of gravity.

III.1 Cosmological scalar perturbations

Let us study homogeneous and isotropic solutions where the metric and the scalar field have the following form:

gμν(0)dxμdxν=N2(t)dt2+a2(t)δijdxidxj,ϕ(0)=ϕ(t).g_{\mu\nu}^{(0)}{\rm{d}}x^{\mu}{\rm{d}}x^{\nu}=-N^{2}(t){\rm{d}}t^{2}+a^{2}(t)\delta_{ij}{\rm{d}}x^{i}{\rm{d}}x^{j},\qquad\phi^{(0)}=\phi(t). (8)

We substitute this ansatz into equation (2) and derive the action written in terms of NN, aa, and ϕ\phi, from which the background EOMs are obtained. Note that the EOM for ϕ\phi is redundant due to the Noether identity associated with the time reparametrization symmetry [35]. Hence, we need only the EOMs for NN and aa, which are summarized as follows:

3MPl2H2+P2XPX+6Hϕ˙NXQX=0,MPl2(3H2+2H˙N)+P+ϕ˙X˙N2QX=0,\begin{split}3M_{\rm Pl}^{2}H^{2}+P-2XP_{X}+6H\frac{\dot{\phi}}{N}XQ_{X}&=0,\\ M_{\rm Pl}^{2}\left(3H^{2}+2\frac{\dot{H}}{N}\right)+P+\frac{\dot{\phi}\dot{X}}{N^{2}}Q_{X}&=0,\end{split} (9)

where a dot denotes the time derivative and Ha˙/(Na)H\coloneqq\dot{a}/(Na). Note also that we have X=ϕ˙2/N2X=-\dot{\phi}^{2}/N^{2} on the background (8). These EOMs will be used to simplify the coefficients in the quadratic Lagrangian for cosmological perturbations. Note that the term with ξ\xi in the Lagrangian (7) is vanishing at the background level.*3*3*3If we choose a coordinate system where the scalar field has an inhomogeneous profile, the effects of ξ\xi should appear even at the background level. However, in this section, we focus on linear perturbations about the homogeneous background and the effects of ξ\xi on them.

Let us now study scalar perturbations about the above background, by writing down the metric tensor as follows:

gμνdxμdxν=N2(1+2α)dt2+2Niχdtdxi+[a2(1+2ζ)δij+(ijδij32)E]dxidxj,ϕ=ϕ(0)+δϕ,\begin{split}g_{\mu\nu}{\rm{d}}x^{\mu}{\rm{d}}x^{\nu}&=-N^{2}(1+2\alpha){\rm{d}}t^{2}+2N\partial_{i}\chi{\rm{d}}t{\rm{d}}x^{i}+\left[a^{2}(1+2\zeta)\delta_{ij}+\left(\partial_{i}\partial_{j}-\frac{\delta_{ij}}{3}\partial^{2}\right)E\right]{\rm{d}}x^{i}{\rm{d}}x^{j},\\ \phi&=\phi^{(0)}+\delta\phi,\end{split} (10)

with 2δijij\partial^{2}\coloneqq\delta^{ij}\partial_{i}\partial_{j} and the perturbation variables being denoted by α\alpha, χ\chi, ζ\zeta, EE, and δϕ\delta\phi. We fix the gauge degrees of freedom by setting E=δϕ=0E=\delta\phi=0, which is a complete gauge fixing and hence can be imposed at the Lagrangian level [35]. Then, the quadratic Lagrangian takes the form,

=Na3[3MPl2ζ˙2N2MPl2a2ζ2ζ+α(Σξϕ˙6N62a2)α2Θa2α2χ+6Θαζ˙N2MPl2a2α2ζ+2MPl2a2ζ˙N2χ].\mathcal{L}=Na^{3}\left[-3M_{\rm Pl}^{2}\frac{\dot{\zeta}^{2}}{N^{2}}-\frac{M_{\rm Pl}^{2}}{a^{2}}\zeta\partial^{2}\zeta+\alpha\left(\Sigma-\xi\frac{\dot{\phi}^{6}}{N^{6}}\frac{\partial^{2}}{a^{2}}\right)\alpha-\frac{2\Theta}{a^{2}}\alpha\partial^{2}\chi+6\Theta\alpha\frac{\dot{\zeta}}{N}-\frac{2M_{\rm Pl}^{2}}{a^{2}}\alpha\partial^{2}\zeta+\frac{2M_{\rm Pl}^{2}}{a^{2}}\frac{\dot{\zeta}}{N}\partial^{2}\chi\right]. (11)

Here, we have defined the following quantities:

Σ3MPl2H2+XPX+2X2PXX6Hϕ˙N(2XQX+X2QXX),ΘMPl2Hϕ˙3N3QX.\Sigma\coloneqq-3M_{\rm Pl}^{2}H^{2}+XP_{X}+2X^{2}P_{XX}-6H\frac{\dot{\phi}}{N}\left(2XQ_{X}+X^{2}Q_{XX}\right),\qquad\Theta\coloneqq M_{\rm Pl}^{2}H-\frac{\dot{\phi}^{3}}{N^{3}}Q_{X}. (12)

As we shall see below, the effects of the shadowy mode can be better understood in the Hamiltonian language. The Hamiltonian analysis is also useful to define the shadowy mode in a nonlinear manner in §IV.

III.2 Hamiltonian for scalar perturbations

From the quadratic Lagrangian (11), one can construct the canonical momenta for each perturbation variable as

pαα˙=0,pχχ˙=0,pζζ˙=a3(6MPl2ζ˙N+6Θα+2MPl2a22χ).p_{\alpha}\coloneqq\frac{\partial\mathcal{L}}{\partial\dot{\alpha}}=0,\qquad p_{\chi}\coloneqq\frac{\partial\mathcal{L}}{\partial\dot{\chi}}=0,\qquad p_{\zeta}\coloneqq\frac{\partial\mathcal{L}}{\partial\dot{\zeta}}=a^{3}\left(-6M_{\rm Pl}^{2}\frac{\dot{\zeta}}{N}+6\Theta\alpha+\frac{2M_{\rm Pl}^{2}}{a^{2}}\partial^{2}\chi\right). (13)

Hence, we have

ζ˙=N(ΘMPl2α+2χ3a2pζ6a3MPl2),\dot{\zeta}=N\left(\frac{\Theta}{M_{\rm Pl}^{2}}\alpha+\frac{\partial^{2}\chi}{3a^{2}}-\frac{p_{\zeta}}{6a^{3}M_{\rm Pl}^{2}}\right), (14)

whereas pαp_{\alpha} and pχp_{\chi} are subject to the primary constraints pα0p_{\alpha}\approx 0 and pχ0p_{\chi}\approx 0. Then, performing the needed Legendre transformation, we find the total Hamiltonian, in which both the primary constraints are imposed by use of Lagrange multipliers [36, 37], as

HT=\displaystyle H_{T}= d3x(+λ1pα+λ2pχ),\displaystyle\;\int{\rm{d}}^{3}x\left(\mathcal{H}+\lambda_{1}p_{\alpha}+\lambda_{2}p_{\chi}\right), (15)
\displaystyle\mathcal{H}\coloneqq MPl2Naζ2ζNa3α(Σ+3Θ2MPl2ξϕ˙6N62a2)α+2MPl2Naα2ζMPl23Na(2χ)2N12MPl2a3pζ2\displaystyle\;M_{\rm Pl}^{2}Na\zeta\partial^{2}\zeta-Na^{3}\alpha\left(\Sigma+\frac{3\Theta^{2}}{M_{\rm Pl}^{2}}-\xi\frac{\dot{\phi}^{6}}{N^{6}}\frac{\partial^{2}}{a^{2}}\right)\alpha+2M_{\rm Pl}^{2}Na\alpha\partial^{2}\zeta-\frac{M_{\rm Pl}^{2}}{3}\frac{N}{a}(\partial^{2}\chi)^{2}-\frac{N}{12M_{\rm Pl}^{2}a^{3}}p_{\zeta}^{2}
+NΘMPl2αpζ+N3a2pζ2χ.\displaystyle\;+\frac{N\Theta}{M_{\rm Pl}^{2}}\alpha p_{\zeta}+\frac{N}{3a^{2}}p_{\zeta}\partial^{2}\chi. (16)

Next, we require that the primary constraints are conserved under time evolution, which poses secondary constraints. In doing so, it is useful to define smeared constraints by use of a test function φ\varphi and compute their time evolution. The results are summarized as

{d3xφpα,HT}Pd3xφ𝒞α0,{d3xφpχ,HT}Pd3xφ𝒞χ0,\begin{split}\left\{\int{\rm{d}}^{3}x\,\varphi p_{\alpha},H_{T}\right\}_{\rm P}\approx\int{\rm{d}}^{3}x\,\varphi\mathcal{C}_{\alpha}\approx 0,\qquad\left\{\int{\rm{d}}^{3}x\,\varphi p_{\chi},H_{T}\right\}_{\rm P}\approx\int{\rm{d}}^{3}x\,\varphi\mathcal{C}_{\chi}\approx 0,\end{split} (17)

where

𝒞αN[2a3(Σ+3Θ2MPl2ξϕ˙6N62a2)α2MPl2a2ζΘMPl2pζ],𝒞χN2(2MPl232χpζ3a).\mathcal{C}_{\alpha}\coloneqq N\left[2a^{3}\left(\Sigma+\frac{3\Theta^{2}}{M_{\rm Pl}^{2}}-\xi\frac{\dot{\phi}^{6}}{N^{6}}\frac{\partial^{2}}{a^{2}}\right)\alpha-2M_{\rm Pl}^{2}a\partial^{2}\zeta-\frac{\Theta}{M_{\rm Pl}^{2}}p_{\zeta}\right],\qquad\mathcal{C}_{\chi}\coloneqq N\partial^{2}\left(\frac{2M_{\rm Pl}^{2}}{3}\partial^{2}\chi-\frac{p_{\zeta}}{3a}\right). (18)

One can verify that there is no tertiary constraint and all the four constraints (pαp_{\alpha}, pχp_{\chi}, 𝒞α\mathcal{C}_{\alpha}, 𝒞χ\mathcal{C}_{\chi}) obtained so far are of second class. Hence, the number of degrees of freedom of the system in the scalar sector can be computed as follows:

12[(# of phase-space variables)2×(# of first-class constraints)(# of second-class constraints)]\displaystyle\frac{1}{2}\left[\left(\text{$\#$ of phase-space variables}\right)-2\times\left(\text{$\#$ of first-class constraints}\right)-\left(\text{$\#$ of second-class constraints}\right)\right]
=12(62×04)=1,\displaystyle\qquad=\frac{1}{2}\left(6-2\times 0-4\right)=1, (19)

which is the same number as in the case of the DHOST theories, as expected.

Having specified all the constraints, let us now study the structure of them, in particular, the secondary constraint 𝒞α0\mathcal{C}_{\alpha}\approx 0. This constraint can be rewritten in the form,

L^α2aξϕ˙6N62α2a3(Σ+3Θ2MPl2)α2MPl2a2ζΘMPl2pζ.\hat{L}\alpha\coloneqq 2a\xi\frac{\dot{\phi}^{6}}{N^{6}}\partial^{2}\alpha-2a^{3}\left(\Sigma+\frac{3\Theta^{2}}{M_{\rm Pl}^{2}}\right)\alpha\approx-2M_{\rm Pl}^{2}a\partial^{2}\zeta-\frac{\Theta}{M_{\rm Pl}^{2}}p_{\zeta}. (20)

It should be noted that this constraint amounts to the Hamiltonian constraint. Indeed, in the Lagrangian formalism, 𝒞α=0\mathcal{C}_{\alpha}=0 is obtained as the equation of motion for the lapse perturbation α\alpha. When ξ=0\xi=0, i.e., in the case of the DHOST theories, this equation generically fixes α\alpha algebraically. On the other hand, when ξ0\xi\neq 0, the constraint (20) has the form of a Poisson equation, which is a manifestation of the existence of the shadowy mode. In this case, we put the general solution in the form,

α=αhom+αpart,\alpha=\alpha_{\rm hom}+\alpha_{\rm part}, (21)

where αpart\alpha_{\rm part} is a particular solution satisfying 𝒞α0\mathcal{C}_{\alpha}\approx 0. Then, the homogeneous part αhom\alpha_{\rm hom} is obtained as a solution to the following elliptic differential equation:

L^α=[2aξϕ˙6N622a3(Σ+3Θ2MPl2)]α=0,\hat{L}\alpha=\left[2a\xi\frac{\dot{\phi}^{6}}{N^{6}}\partial^{2}-2a^{3}\left(\Sigma+\frac{3\Theta^{2}}{M_{\rm Pl}^{2}}\right)\right]\alpha=0, (22)

which fixes α\alpha once an appropriate boundary condition is imposed. As such, αhom\alpha_{\rm hom} represents the shadowy mode. It is also possible to construct a gauge-invariant quantity that reduces to α\alpha in the unitary gauge,

αGIα1Nt(Nδϕϕ˙),\alpha_{\rm GI}\coloneqq\alpha-\frac{1}{N}\frac{\partial}{\partial t}\left(N\frac{\delta\phi}{\dot{\phi}}\right), (23)

which provides a gauge-independent definition of the shadowy mode at the linear level.

IV Nonlinear definition of the shadowy mode

In this section, we perform a nonlinear Hamiltonian analysis of the U-DHOST theories, which provides a nonlinear definition of the shadowy mode. We assume that the scalar field has a timelike gradient so that we can take the unitary gauge, but the background spacetime remains arbitrary.

In terms of the Arnowitt-Deser-Misner (ADM) variables, the metric takes the form,

gμνdxμdxν=N2dt2+γij(dxi+Nidt)(dxj+Njdt),g_{\mu\nu}{\rm{d}}x^{\mu}{\rm{d}}x^{\nu}=-N^{2}{\rm{d}}t^{2}+\gamma_{ij}({\rm{d}}x^{i}+N^{i}{\rm{d}}t)({\rm{d}}x^{j}+N^{j}{\rm{d}}t), (24)

where NN is the lapse function, NiN^{i} is the shift vector, and γij\gamma_{ij} is the induced metric. We denote the unit vector normal to a constant-tt hypersurface and the projection tensor as

nμNδμ0,hμνgμν+nμnν.n_{\mu}\coloneqq-N\delta^{0}_{\mu},\qquad h_{\mu\nu}\coloneqq g_{\mu\nu}+n_{\mu}n_{\nu}. (25)

The extrinsic curvature and the acceleration vector are defined by

Kμνhμααnν,aμnααnμ,K_{\mu\nu}\coloneqq h_{\mu}{}^{\alpha}\nabla_{\alpha}n_{\nu},\qquad a_{\mu}\coloneqq n^{\alpha}\nabla_{\alpha}n_{\mu}, (26)

which can be written in terms of the ADM variables as

Kij=12N(γ˙ijDiNjDjNi),ai=1NDiN,K_{ij}=\frac{1}{2N}\left(\dot{\gamma}_{ij}-D_{i}N_{j}-D_{j}N_{i}\right),\qquad a_{i}=\frac{1}{N}D_{i}N, (27)

with DiD_{i} being the covariant derivative associated with γij\gamma_{ij}. The following relations are also useful for translating the covariant Lagrangian into the ADM language under the unitary gauge [38]:

ϕμ=ϕ˙Nnμ,ϕμν=ϕ˙N(Kμνnμaνnνaμ)N2ϕ˙(nααX)nμnν.\phi_{\mu}=-\frac{\dot{\phi}}{N}n_{\mu},\qquad\phi_{\mu\nu}=-\frac{\dot{\phi}}{N}(K_{\mu\nu}-n_{\mu}a_{\nu}-n_{\nu}a_{\mu})-\frac{N}{2\dot{\phi}}(n^{\alpha}\partial_{\alpha}X)n_{\mu}n_{\nu}. (28)

Note also that X=ϕμϕμ=ϕ˙2/N2X=\phi_{\mu}\phi^{\mu}=-\dot{\phi}^{2}/N^{2}.

With the above ADM variables, the Lagrangian density in (2) can be written as

=Nγ[MPl22(R(3)+KijKijK2)+P+Q~K+ξϕ˙6N6DiNDiNN2],\mathcal{L}=N\sqrt{\gamma}\left[\frac{M_{\rm Pl}^{2}}{2}\left(R^{(3)}+K_{ij}K^{ij}-K^{2}\right)+P+\tilde{Q}K+\xi\frac{\dot{\phi}^{6}}{N^{6}}\frac{D_{i}ND^{i}N}{N^{2}}\right], (29)

where the function Q~(X)\tilde{Q}(X) is defined so that Q~X=(X)1/2QX\tilde{Q}_{X}=-(-X)^{-1/2}Q_{X}. Note also that XX appearing here should be understood as a function of NN through X=ϕ˙2/N2X=-\dot{\phi}^{2}/N^{2}. The canonical momenta conjugate to NN, NiN^{i}, and γij\gamma_{ij} are denoted as πN\pi_{N}, πi\pi_{i}, and πij\pi^{ij}, respectively. Since the action does not contain time derivatives of NN and NiN^{i}, we have πN=πi=0\pi_{N}=\pi_{i}=0. The canonical momentum πij\pi^{ij} can be computed as follows:

πij=MPl22γ(KijKγij)+12γQ~γij,\pi^{ij}=\frac{M_{\rm Pl}^{2}}{2}\sqrt{\gamma}\left(K^{ij}-K\gamma^{ij}\right)+\frac{1}{2}\sqrt{\gamma}\tilde{Q}\gamma^{ij}, (30)

which can be solved for KijK_{ij} as

Kij=MPl2[1γ(2πijπγij)+Q~2γij],K_{ij}=M_{\rm Pl}^{-2}\left[\frac{1}{\sqrt{\gamma}}(2\pi_{ij}-\pi\gamma_{ij})+\frac{\tilde{Q}}{2}\gamma_{ij}\right], (31)

with πγijπij\pi\coloneqq\gamma_{ij}\pi^{ij}. Then, the total Hamiltonian is given by

HT=H+d3x(uNπN+uiπi),Hd3x(πijγ˙ij).H_{T}=H+\int{\rm{d}}^{3}x\left(u_{N}\pi_{N}+u^{i}\pi_{i}\right),\qquad H\coloneqq\int{\rm{d}}^{3}x\left(\pi^{ij}\dot{\gamma}_{ij}-\mathcal{L}\right). (32)

More explicitly, HH can be written as

H=d3x(N+Niiγξϕ˙6N6DiNDiNN),H=\int{\rm{d}}^{3}x\left(\mathcal{H}_{N}+N^{i}\mathcal{H}_{i}-\sqrt{\gamma}\,\xi\frac{\dot{\phi}^{6}}{N^{6}}\frac{D_{i}ND^{i}N}{N}\right), (33)

where we have defined

NNγ[MPl22(R(3)+KijKijK2)P],i2Djπijγ.\mathcal{H}_{N}\coloneqq N\sqrt{\gamma}\left[\frac{M_{\rm Pl}^{2}}{2}\left(-R^{(3)}+K_{ij}K^{ij}-K^{2}\right)-P\right],\qquad\mathcal{H}_{i}\coloneqq-2D^{j}\frac{\pi_{ij}}{\sqrt{\gamma}}. (34)

Here, KijK_{ij} is regarded as a function of canonical variables through equation (31) and N\mathcal{H}_{N} is no longer linear in NN. Then, the Hamiltonian constraint, which corresponds to the consistency condition for the primary constraint πN0\pi_{N}\approx 0, reads

NN+γξϕ˙6N6(2DiDiNN7DiNDiNN2)0.\frac{\partial\mathcal{H}_{N}}{\partial N}+\sqrt{\gamma}\,\xi\frac{\dot{\phi}^{6}}{N^{6}}\left(2\frac{D_{i}D^{i}N}{N}-7\frac{D_{i}ND^{i}N}{N^{2}}\right)\approx 0. (35)

As a consistency check, one can obtain an equation which is equivalent to this Hamiltonian constraint in the Lagrangian formalism (i.e., equivalent to δ/δN\delta{\mathcal{L}}/\delta N in the unitary gauge). Indeed, without imposing the unitary gauge, on denoting the EOM for the metric as

μν1gδSδgμν=0,\mathcal{E}_{\mu\nu}\coloneqq\frac{1}{\sqrt{-g}}\frac{\delta S}{\delta g^{\mu\nu}}=0, (36)

one can verify that the Hamiltonian constraint is equivalent to nμnνμν=0n^{\mu}n^{\nu}\mathcal{E}_{\mu\nu}=0, which is, once more, nothing but the time-time component of the metric EOM in the unitary gauge. Going back to equation (35), for ξ=0\xi=0, the lapse function NN is fixed algebraically from N/N0\partial\mathcal{H}_{N}/\partial N\approx 0. On the other hand, for ξ0\xi\neq 0, (35) is an elliptic differential equation for NN, and hence an appropriate boundary condition should be imposed to fix NN. Therefore, the lapse function NN contains the shadowy mode. It should be noted that the structure of (35) is similar to that of (20), except that (35) is a nonlinear differential equation. This is as expected, since α\alpha, which we identified as the shadowy mode in the context of cosmological perturbations, is nothing but the perturbation of the lapse function NN.

We can now infer the covariant definition of the shadowy mode. As X=ϕ˙2/N2X=-\dot{\phi}^{2}/N^{2} in the unitary gauge, we expect that XX should contain the shadowy mode. In order to rewrite the Hamiltonian in terms of XX instead of NN, we perform the following canonical transformation:

(N,πN)(X,πX)=(ϕ˙2N2,N32ϕ˙2πN),(N,\pi_{N})\quad\to\quad(X,\pi_{X})=\left(-\frac{\dot{\phi}^{2}}{N^{2}},\frac{N^{3}}{2\dot{\phi}^{2}}\pi_{N}\right), (37)

so that we have {X,πX}P=1\left\{X,\pi_{X}\right\}_{\rm P}=1. Then, the Hamiltonian constraint (35) can be recast as

ξX3(DiDiXX+DiXDiX4X2)+2XPXP3Q~24MPl2+3XQ~Q~XMPl2+Q~2XQ~XMPl2πγ\displaystyle\xi X^{3}\left(\frac{D_{i}D^{i}X}{X}+\frac{D_{i}XD^{i}X}{4X^{2}}\right)+2XP_{X}-P-\frac{3\tilde{Q}^{2}}{4M_{\rm Pl}^{2}}+\frac{3X\tilde{Q}\tilde{Q}_{X}}{M_{\rm Pl}^{2}}+\frac{\tilde{Q}-2X\tilde{Q}_{X}}{M_{\rm Pl}^{2}}\frac{\pi}{\sqrt{\gamma}}\qquad
=MPl22R(3)1MPl22πijπijπ2γ\displaystyle=\frac{M_{\rm Pl}^{2}}{2}R^{(3)}-\frac{1}{M_{\rm Pl}^{2}}\frac{2\pi_{ij}\pi^{ij}-\pi^{2}}{\gamma} . (38)

We further make a canonical transformation to replace XY(X)5/4X\to Y\coloneqq(-X)^{5/4}, which brings the above equation in the form of a nonlinear Poisson equation,

ξΔY+f(Y;γij,πij,R(3))=0,\xi\Delta Y+f(Y;\gamma_{ij},\pi^{ij},R^{(3)})=0, (39)

where we have denoted ΔDiDi\Delta\coloneqq D_{i}D^{i} and

f=54Y7/5[52YPYP3Q~MPl2(Q~54YQ~Y)+1MPl2πγ(Q~52YQ~Y)MPl22R(3)+1MPl22πijπijπ2γ].f=-\frac{5}{4}Y^{-7/5}\left[\frac{5}{2}YP_{Y}-P-\frac{3\tilde{Q}}{M_{\rm Pl}^{2}}\left(\tilde{Q}-\frac{5}{4}Y\tilde{Q}_{Y}\right)+\frac{1}{M_{\rm Pl}^{2}}\frac{\pi}{\sqrt{\gamma}}\left(\tilde{Q}-\frac{5}{2}Y\tilde{Q}_{Y}\right)-\frac{M_{\rm Pl}^{2}}{2}R^{(3)}+\frac{1}{M_{\rm Pl}^{2}}\frac{2\pi_{ij}\pi^{ij}-\pi^{2}}{\gamma}\right]. (40)

Note that, even when matter fields exist, the Hamiltonian constraint takes this form and the energy density of the matter fields appears in a generalization of the function ff. It is also possible to find a four-dimensional covariant expression for the Laplacian operator via the Stückelberg trick as follows:

ξΔY=ξγijDiDjYξhμνμ(hνλλY),\xi\Delta Y=\xi\gamma^{ij}D_{i}D_{j}Y\quad\to\quad\xi h^{{\mu\nu}}\nabla_{\mu}(h_{\nu}{}^{\lambda}\nabla_{\lambda}Y), (41)

where hμνh_{\mu\nu} was defined in (25). Written explicitly,

ξΔYξ[(gμν1Xϕμϕν)μν1X(ϕ12XϕμμX)ϕνν]Y.\xi\Delta Y\quad\to\quad\xi\left[\left(g^{\mu\nu}-\frac{1}{X}\phi^{\mu}\phi^{\nu}\right)\nabla_{\mu}\nabla_{\nu}-\frac{1}{X}\left(\Box\phi-\frac{1}{2X}\phi^{\mu}\nabla_{\mu}X\right)\phi^{\nu}\nabla_{\nu}\right]Y. (42)

Thus, we have obtained the differential equation that the shadowy mode YY satisfies in a general coordinate system. In the equation, the second derivative acting on YY is projected onto the hypersurface which is orthogonal to the timelike vector ϕμ\phi_{\mu}. Hence, this is indeed a three-dimensional elliptic differential operator on the spacelike hypersurface. It is also interesting to mention the relation to the covariant EOM. As mentioned earlier, the Hamiltonian constraint is equivalent to the nμnνn^{\mu}n^{\nu}-component of the metric EOM, nμnνμν=0n^{\mu}n^{\nu}\mathcal{E}_{\mu\nu}=0, where the left-hand side has the form,

nμnνμν=\displaystyle n^{\mu}n^{\nu}\mathcal{E}_{\mu\nu}= ξ[X32(gμνϕμϕνX)μνX+X8(ϕμμX)2+X28μXμXX22ϕμμXϕ]\displaystyle\;\xi\left[\frac{X^{3}}{2}\left(g^{\mu\nu}-\frac{\phi^{\mu}\phi^{\nu}}{X}\right)\nabla_{\mu}\nabla_{\nu}X+\frac{X}{8}(\phi^{\mu}\nabla_{\mu}X)^{2}+\frac{X^{2}}{8}\nabla_{\mu}X\nabla^{\mu}X-\frac{X^{2}}{2}\phi^{\mu}\nabla_{\mu}X\Box\phi\right]
+(terms without ξ).\displaystyle+(\text{terms without $\xi$}). (43)

Here, upon using Y=(X)5/4Y=(-X)^{5/4}, one can verify that the terms with ξ\xi coincide with (42) up to an overall factor.

V How to deal with the shadowy mode

In the last two sections, we saw how the parameter ξ\xi, which characterizes the deviation from the DHOST class, affects the structure of the Hamiltonian constraint at both the linear and nonlinear levels. In particular, in the nonlinear Hamiltonian analysis, the constraint equation has the form of a nonlinear Poisson equation (39), which is more complicated than the linear Poisson equation (20) for cosmological perturbations. In this section, we discuss how to treat the nonlinear Poisson equation.

V.1 Iterative solution

In §IV, we obtained the Hamiltonian constraint in the form of a nonlinear Poisson equation (39), which we reproduce here as

ξΔY+f(Y;γij,πij,R(3),ρm)=0.\xi\Delta Y+f(Y;\gamma_{ij},\pi^{ij},R^{(3)},\rho_{\rm m})=0. (44)

Here, the energy density of matter fields ρm\rho_{\rm m} is taken into account. In the absence of ξ\xi, the equation fixes YY algebraically. On the other hand, for a nonvanishing ξ\xi, it is a nontrivial problem to find a solution for the nonlinear Poisson equation. Nevertheless, so long as the effect of ξ\xi can be considered to be small enough, one can construct a solution in the form of series expansion with respect to ξ\xi, as we shall see below. The solution is assumed to be of the form

Y=Y0+ξY1+ξ2Y2+,Y=Y_{0}+\xi Y_{1}+\xi^{2}Y_{2}+\cdots, (45)

and we find each YnY_{n} (n=0,1,2,n=0,1,2,\cdots) in an iterative manner.

At the zeroth order in ξ\xi, (44) reduces to

f(Y0;γij,πij,R(3),ρm)=0,f(Y_{0};\gamma_{ij},\pi^{ij},R^{(3)},\rho_{\rm m})=0, (46)

which is no longer a differential equation, and hence Y0Y_{0} can be fixed algebraically. Note that this is a nonlinear equation and yields multiple of solutions in general. For a while, we suppress the second and subsequent arguments of ff for notational simplicity. We choose one of the solutions such that fY(Y0)0f_{Y}(Y_{0})\neq 0, which is necessary for the series expansion with respect to ξ\xi to be valid. Let us then proceed to find Y1Y_{1}. Substituting (45) into (44) and taking the terms that are first order in ξ\xi, we have

ΔY0+fY(Y0)Y1=0.\Delta Y_{0}+f_{Y}(Y_{0})Y_{1}=0. (47)

So long as fY(Y0)0f_{Y}(Y_{0})\neq 0, this equation can be solved for Y1Y_{1} to obtain

Y1=ΔY0fY(Y0).Y_{1}=-\frac{\Delta Y_{0}}{f_{Y}(Y_{0})}. (48)

Likewise, one can iteratively construct a solution for YY. For instance, one has

Y2=2ΔY1+Y12fYY(Y0)2fY(Y0),Y3=6ΔY2+6Y1Y2fYY(Y0)+Y13fYYY(Y0)6fY(Y0).\displaystyle Y_{2}=-\frac{2\Delta Y_{1}+Y_{1}^{2}f_{YY}(Y_{0})}{2f_{Y}(Y_{0})},\qquad Y_{3}=-\frac{6\Delta Y_{2}+6Y_{1}Y_{2}f_{YY}(Y_{0})+Y_{1}^{3}f_{YYY}(Y_{0})}{6f_{Y}(Y_{0})}. (49)

Hence, provided that the series expansion with respect to ξ\xi converges, one can uniquely construct YY once Y0Y_{0} is fixed.

V.2 Boundary condition for global solution

In the previous subsection, we studied an iterative solution to the nonlinear Poisson equation (44). However, in reality, there are nonlinear structures in the Universe, and hence the expansion with respect to ξ\xi may break down near such structures. Nevertheless, going far away from the nonlinear structures, one can still employ the iterative solution there, which can be used as a good boundary condition for solving the nonlinear Poisson equation within the enclosed region. The solution constructed in this manner would be unique at least locally in the configuration space, though the global uniqueness does not hold in general. Note that this prescription applies not only when the inhomogeneities have a compact support but when the inhomogeneities exist everywhere. For instance, we can accommodate cosmological perturbations with a sufficiently small amplitude far from the nonlinear structures of interest. Note also that our procedure applies only if the equation for the lapse function NN has a nontrivial physical solution when ξ=0\xi=0 (i.e., in the DHOST limit), which is the case for generic U-DHOST theories. Otherwise, the expansion in the parameter ξ\xi does not work.*4*4*4Therefore, it does not apply to the case of non-projectable Hořava gravity, for which ignoring terms depending on the spatial derivatives of the lapse function in the action results in inconsistencies [39]. On the other hand, in the projectable Hořava gravity, a shadowy mode is not present and instead the scalar graviton acts as “dark matter as integration constant” at low energy [40, 41].

So far, we focused only on the Hamiltonian constraint and discussed how the value of YY is fixed for given configurations of γij\gamma_{ij}, πij\pi^{ij}, and the matter energy density ρm\rho_{\rm m}. In practice, one has to take into account the evolution equations for the spatial metric and the matter fields. For a given set of initial conditions for (γij,πij,ρm)(\gamma_{ij},\pi^{ij},\rho_{\rm m}), the value of YY at the initial surface is fixed from the Hamiltonian constraint by the prescription mentioned above. Then, we can compute the value of (γij,πij,ρm)(\gamma_{ij},\pi^{ij},\rho_{\rm m}) at the next time step from the evolution equations, which allows us to fix the value of YY at this time step again from the Hamiltonian constraint. It should be noted that the Hamiltonian constraint should be solved at each time step to determine the nondynamical variable NN (or XX or YY), as it is no longer a first-class constraint under a fixed gauge. By repeating this procedure, one can in principle compute the time evolution of all the variables in concern within the region enclosed by the boundary.

V.3 ξ0\xi\to 0 limit

We have discussed how to deal with the problem of nonlinear Poisson equation (44) for a given model (7) with fixed ξ\xi. Let us now consider the limit ξ0\xi\to 0. Although it is nontrivial whether the limit ξ0\xi\to 0 is well defined for the global solution, one can safely take the limit at least near the boundary where the iterative solution derived in §V.1 is assumed to be valid. Also, we expect that the region where the iterative solution works would become larger and larger as ξ\xi approaches zero, and ultimately it would cover whole the domain of our interest for |ξ||\xi| below some critical value. If this is the case, the ξ0\xi\to 0 limit of the global solution is well defined. There might be some pathological cases where this expectation does not work, but any physically plausible setup, in the regime of validity of the effective field theory, would accommodate a well-defined limit.

VI Conclusions

In the present paper, we have studied the framework of U-DHOST theories, i.e., higher-order scalar-tensor theories which do not belong to the DHOST class but satisfy the degeneracy condition in the unitary gauge. We have clarified that the apparent Ostrogradsky (actually shadowy) mode satisfies a three-dimensional elliptic differential equation and hence does not propagate. It should be emphasized that we have taken into account also the dynamics of the metric, which was ignored in [19]. For demonstration purposes, we have focused on the model (7), where the deviation from the DHOST class is characterized by the parameter ξ\xi. In §III, we have studied cosmological perturbations in this model to show that the constraint equation associated with the lapse perturbation, which corresponds to the Hamiltonian constraint, has the form of a Poisson equation with its principal part proportional to ξ\xi. If an appropriate boundary condition is imposed, the constraint equation can be solved uniquely to fix the lapse perturbation. In §IV, we have performed a nonlinear Hamiltonian analysis in the unitary gauge without specifying the background metric. We have found that, as was the case for cosmological perturbations, the Hamiltonian constraint can be recast in the form of a nonlinear Poisson equation in the presence of ξ\xi, which can be regarded as an equation that the shadowy mode satisfies. Then in §V, we have discussed how one should deal with the shadowy mode. We have constructed an iterative solution to the nonlinear Poisson equation, which would be useful (at least) as a boundary condition for solving the system of equations within the region enclosed by the boundary. To reiterate, our analysis is based on the specific model (7), but we expect that it applies to generic U-DHOST theories. We believe our prescription would also apply to the shadowy mode in the cuscuton models [26, 27, 28] and minimally modified gravity [29, 30, 31, 32, 33, 34].

Having provided a nonlinear definition of the shadowy mode in the U-DHOST theories, it would be intriguing to investigate their phenomenological aspects. For instance, since the shadowy mode should affect cosmological perturbations as we saw in §III, there would be nontrivial corrections in, e.g., the effective gravitational constant. Another important issue is to study black holes in the U-DHOST theories. To this end, we need to consider black hole solutions with a time-dependent scalar hair (see, e.g., [42, 43, 44, 45, 46, 47, 48]) so that one can take the unitary gauge. Interestingly, there may exist a universal horizon, within which even the instantaneous mode cannot escape to infinity [25]. It would also be interesting to study perturbations about black hole solutions in the U-DHOST theories, following the works [49, 50, 51, 52, 53, 54, 55, 56, 57] in the DHOST theories.

Acknowledgements.
The work of A.D.F. was supported by Japan Society for the Promotion of Science (JSPS) Grants-in-Aid for Scientific Research No. 20K03969. S.M.’s work was supported in part by JSPS Grants-in-Aid for Scientific Research No. 17H02890, No. 17H06359, and by World Premier International Research Center Initiative, MEXT, Japan. The work of K.T. was supported by JSPS KAKENHI Grant No. JP21J00695.

References