This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A data-driven Reconstruction of Horndeski gravity via the Gaussian processes

Reginald Christian Bernardo    Jackson Levi Said
Abstract

We reconstruct the Hubble function from cosmic chronometers, supernovae, and baryon acoustic oscillations compiled data sets via the Gaussian process (GP) method and use it to draw out Horndeski theories that are fully anchored on expansion history data. In particular, we consider three well-established formalisms of Horndeski gravity which single out a potential through the expansion data, namely: quintessence potential, designer Horndeski, and tailoring Horndeski. We discuss each method in detail and complement it with the GP reconstructed Hubble function to obtain predictive constraints on the potentials and the dark energy equation of state.

1 Introduction

Flat Λ\LambdaCDM has dominated as the cosmological concordance model since the discovery of the accelerating expansion of the Universe [1, 2] some decades ago. It mimics numerous observations in a wide variety of cosmological and astrophysical settings [3, 4]. It does this despite the necessity of large portions of the model requiring particle physics beyond the standard model as well as future solutions to foundational problems such as fine-tuning issues, the horizon and coincidence problems, and the cosmological constant problem [5, 6]. In this background, there have been several interesting proposals to replace Λ\LambdaCDM ranging from dynamical dark energy [7, 8, 9], extended gravity [10], to beyond general relativity (GR) [4], and others.

In recent years, observations of some cosmological parameters appear to feature growing discrepancies with early Universe predictions, based on vanilla Λ\LambdaCDM, and late time cosmology-independent measurements, and is fast becoming a central issue in Λ\LambdaCDM cosmology [11, 12, 13, 14]. The value of the Hubble parameter at current times H0H_{0} has encapsulated one part of this cosmological tension with late time measurements taken from Cepheid calibrated type Ia supernovae events [15] and strong lensing by distant quasars [16] resulting in very high values of H0H_{0}, while early Universe predictions based on a Λ\Lambda cosmology produce a much lower value [13, 17]. Some other measurements point to an even larger H0H_{0} tension [18, 19, 20].

Further analysis of potential solutions within Λ\LambdaCDM such as nonflat cosmologies [21, 22], or more exotic contributions from particle physics beyond the standard model, may be done. However, the issue may ultimately require a reexamination of the gravitational contribution to Λ\LambdaCDM, together with the underpinnings of general relativity (GR) [6, 9]. The plethora of modifications to GR form a large landscape of theories on which to build cosmological model. On the other hand, many of these theories have been shown to be dynamically equivalent to a second order gravitational theory, in terms of metric tensor derivatives, provided a scalar field degree of freedom is allowed. In this context, by exploring Horndeski gravity [23], which is the most general second-order theory of gravity that contains only one scalar field, we can investigate a large region on cosmology beyond Λ\LambdaCDM.

Horndeski gravity encompasses many of the popular formulations of gravity that have been studied. For instance, despite being organically fourth order f(R)f(R) gravity [24, 25, 26, 27, 28, 29, 30] can be transformed into a Horndeski class of models through an appropriate mapping [31]. However, the recent measurement of the propagation speed of gravitational waves has severely constrained many of the promising branches of the Horndeski landscape [32, 33]. Some of these theories include quartic and quintic Galileon models [34, 35], de-Sitter Horndeski [36], the Fab Four [37], and purely kinetic coupled models [38], among others. While many Horndeski gravity models continue to be viable, an interesting new avenue of Horndeski gravity has started to emerge in which the curvature associated with the mediation of gravity is replaced with teleparallel torsion [39]. This organically lower order form of gravity has been shown to produce a much larger space of Horndeski theories [40, 41] which can be shown to revive some models within Horndeski gravity [42].

In this work, we reconsider the range of viable Horndeski gravity models using Gaussian process regression (GP) which is a method of reducing noise as well as simulating new intermediary points in a data set. GP functions as a non-parametric reconstruction technique by employing a covariance function whose hyperparameters are fixed in a Bayesian approach so that the Kernel approximates the data set and can produce a smoothed continuous data set with respective uncertainties. Thus the more points in a data set, the more well constrained the hyperparameters will be. The only drawback of this approach is if clustering occurs in a data set then the hyperparameters may be optimized for only part of the data set which may translate into other regions being poorly reconstructed. GP has been widely used in cosmology ranging from reconstructing the value of H0H_{0} as in Refs. [43, 44, 45, 46, 47, 48, 49, 50, 51, 52] where the contentious value of H0H_{0} has been approximated using various compiled sources of expansion data, to the value of fσ8f\sigma_{8} at current times as in Ref. [53], and gravitational wave analysis [54, 55, 56].

More recently, GP has been applied to the inverse problem in extended models of gravity in which arbitrary classes of gravity are constrained through the prism of compiled data sets without assuming particular models a priori which is a core problem in modified gravity. By allowing the gravitational sector Lagrangian to remain largely unprescribed, various studies have produced viable ranges that such models would need to satisfy. Consider Refs. [57, 58, 59] where background expansion data was used to produce restrictions in f(T)f(T) gravity for medium redshift ranges, while in Ref. [60] the same goal was achieved but with growth data. This approach has also been used in the context of interacting models between dark energy and dark matter as in Ref. [61]. Another approach to reducing the space of large classes of gravitational theories is Ref. [62] in which viable paths to a Horndeski model are explored in terms of their predictions on cosmological parameters. The work is interesting because it delves into the classes of models that continue to be viable in the background of the speed of GW constraint.

In the present work, we trace down viable paths of Horndeski gravity in the context of the range of priors on H0H_{0} which forms the base of the so-called Hubble tension. In Sec. 2 we briefly review Horndeski gravity and the classes of models we will study later on. Sec. 3 then expands on some GP background together with an explanation of how various data sets were used with the GP approach to produce the Hubble diagram with the various prior choices. The core work on Horndeski gravity is contained in Sec. 4 where each of the family of viable Horndeski classes is investigated through the GP reconstruction method, and where we also produce the equation of state for the effective dark energy component of the theory. Throughout the work, we assume a (1,+1,+1,+1)(-1,+1,+1,+1) metric signature and use geometric units where c=MPl2=1c=M_{\text{Pl}}^{2}=1 where MPl2=1/(8πG)M_{\text{Pl}}^{2}=1/\left(8\pi G\right), unless otherwise stated.

2 Horndeski theory

Horndeski gravity is one of the most prominent modifications of Λ\LambdaCDM cosmology beyond GR since it is synonymous with many of the extensions or modifications of GR, at least in terms of it dynamical equations. Thus, Horndeski gravity brings together a wide swath of theories of gravity and makes their study much more accessible for general analysis. The base of theory emerges from the Lovelock theorem [63] in which the Einstein-Hilbert action is found to be a unique theory that produces second-order field equations. However, on adding a single scalar field, Horndeski gravity produces a much richer plethora of models that spans the entire range of Lagrangians that produce second-order equations of motion while only containing a single scalar field. It is worthwhile to mention that Horndeski gravity is constructed in a curvature-based context (through the Levi-Civita connection), and that other teleparallel proposals have been made in recent years [42, 40, 41].

Besides avoiding Ostrogradsky instability problems [64], Horndeski gravity also only contains one extra (scalar) degree of freedom which is propagating [65], which may appear as a massive or massless gravitational wave. Horndeski gravity features a number of equivalent representations, which in our case we present through the action [64]

𝒮H=d4x(2+3+4+2)+𝒮mat(ψ,gμν),\mathcal{S}_{\rm H}=\int{\rm d}^{4}x\left(\mathcal{L}_{2}+\mathcal{L}_{3}+\mathcal{L}_{4}+\mathcal{L}_{2}\right)+\mathcal{S}_{\rm mat}(\psi,g_{\mu\nu})\,, (2.1)

where ψ\psi represents the matter fields in the matter action 𝒮mat\mathcal{S}_{\rm mat} and gμνg_{\mu\nu} is the metric tensor, and

2\displaystyle\mathcal{L}_{2} =G2(ϕ,X),\displaystyle=G_{2}\left(\phi,\,X\right)\,, (2.2)
3\displaystyle\mathcal{L}_{3} =G3(ϕ,X)ϕ,\displaystyle=-G_{3}\left(\phi,\,X\right)\Box\phi\,, (2.3)
4\displaystyle\mathcal{L}_{4} =G4(ϕ,X)R+G4,X[(ϕ)2(μνϕ)(μνϕ)],\displaystyle=G_{4}\left(\phi,\,X\right)R+G_{4,X}\left[\left(\Box\phi\right)^{2}-\left(\nabla_{\mu}\nabla_{\nu}\phi\right)\left(\nabla^{\mu}\nabla^{\nu}\phi\right)\right]\,, (2.4)
5\displaystyle\mathcal{L}_{5} =G5(ϕ,X)Gμνμνϕ\displaystyle=G_{5}\left(\phi,\,X\right)G_{\mu\nu}\nabla^{\mu}\nabla^{\nu}\phi
16G5,X[(ϕ)33(ϕ)(μνϕ)(μνϕ)+2(μαϕ)(αβϕ)(βμϕ)],\displaystyle-\frac{1}{6}G_{5,X}\left[\left(\Box\phi\right)^{3}-3\left(\Box\phi\right)\left(\nabla_{\mu}\nabla_{\nu}\phi\right)\left(\nabla^{\mu}\nabla^{\nu}\phi\right)+2\left(\nabla^{\mu}\nabla_{\alpha}\phi\right)\left(\nabla^{\alpha}\nabla_{\beta}\phi\right)\left(\nabla^{\beta}\nabla_{\mu}\phi\right)\right]\,, (2.5)

in which X=12σϕσϕX=-\frac{1}{2}\nabla_{\sigma}\phi\nabla^{\sigma}\phi is the kinetic term associated with the scalar field, ϕ=μμϕ\Box\phi=\nabla_{\mu}\nabla^{\mu}\phi is the d’Alembertian operator, GμνG_{\mu\nu} is the Einstein tensor, and Gi(ϕ,X)G_{i}\left(\phi,\,X\right) are arbitrary functions of the scalar field and its kinetic term.

Of particular importance is the impact of Lovelock’s theorem on this formulation of Horndeski gravity. Notice that only the Ricci scalar appears as a purely gravitational scalar while the Einstein tensor appears as a coupling with the scalar field derivative terms. Thus the Lovelock theorem renders a finite Lagrangian for Horndeski theory. Another interesting feature of Horndeski gravity is its subclasses such as Brans-Dicke theory [66] which occurs for G3=0=G5G_{3}=0=G_{5}, G2=2ωX/ϕG_{2}=2\omega X/\phi and G4=ϕG_{4}=\phi, f(R)f(R) gravity [24] when G3=0=G5G_{3}=0=G_{5}, G2=f(ϕ)ϕf(ϕ)G_{2}=f(\phi)-\phi f^{\prime}(\phi) and G4=f(ϕ)G_{4}=f^{\prime}(\phi), and GR for the choice where G2=0=G3=G5G_{2}=0=G_{3}=G_{5} and G4=1/2G_{4}=1/2.

The recent binary neutron star merger which was recorded in the multimessenger events in which GW170817 [32] together with an electromagnetic counterpart GRB170817A [67] were used to severely constrain the speed of GW propagation to within one part in 101510^{15}. The result of this is a Horndeski theory that is drastically reduced in potential models [68]. This is described by a smaller class of viable actions

𝒮Hc=d4xg(f(ϕ)R+K(ϕ,X)G(ϕ,X)ϕ)+𝒮mat(ψ,gμν)\mathcal{S}_{\rm H-c}=\int d^{4}x\sqrt{-g}\left(f\left(\phi\right)R+K\left(\phi,X\right)-G\left(\phi,X\right)\Box\phi\right)+\mathcal{S}_{\rm mat}(\psi,g_{\mu\nu}) (2.6)

where we refer to ff, KK, and GG as the conformal, kk-essence, and braiding potentials, respectively. This theory continues to encompasses a large swath of alternative gravity theories that have been widely studied, including f(R)f(R) gravity [24, 25, 26], generalized Brans-Dicke theories [66], covariant Galileons [34, 35], quintessence [69], and kinetic gravity braiding [70, 71], among others [72].

In this work, we restrict our attention to particular subclasses of the viable Horndeski gravity models in Eq. (2.6) where we single out particular classes formed by subclasses of the (f,K,G)\left(f,K,G\right) functions. In particular, we focus on the following subclasses:

  1. 1.

    Quintessence [69]:

    f(ϕ)=1/2f\left(\phi\right)=1/2 (2.7)
    K(ϕ,X)=XV(ϕ)K\left(\phi,X\right)=X-V\left(\phi\right) (2.8)
    G(ϕ,X)=CG\left(\phi,X\right)=C (2.9)

    where CC is a constant. We refer to VV as the quintessence potential.

  2. 2.

    Designer Horndeski [73]:

    f(ϕ)=1/2f\left(\phi\right)=1/2 (2.10)
    K(ϕ,X)=K(X)K\left(\phi,X\right)=K\left(X\right) (2.11)
    G(ϕ,X)=G(X).G\left(\phi,X\right)=G\left(X\right). (2.12)
  3. 3.

    Tailoring Horndeski [74]:

    f(ϕ)=1/2f\left(\phi\right)=1/2 (2.13)
    K(ϕ,X)=X2ΛK\left(\phi,X\right)=X-2\Lambda (2.14)
    G(ϕ,X)=G(X)G\left(\phi,X\right)=G\left(X\right) (2.15)

    where Λ\Lambda is a constant.

In addition to the gravitational action (2.6), we consider a perfect fluid of energy density ρ\rho and pressure PP and assume spatial flatness for a homogeneous and isotropic cosmology. This produces Friedmann equations for each subclass which are presented in Sec. 4, where they are inverted so that the ensuing data sets can be used to constrain the values of the Lagrangian contributions.

3 Determining the Hubble diagram using Gaussian process regression

We devote this section to presenting a brief introduction to GP in Sec. 3.1 which is then applied directly to expansion data to obtain the Hubble diagram in Sec. 3.2. In this section, we discuss the assumptions and dependencies that this approach is built on and thus the context in which to interpret the results that follow.

3.1 Gaussian process: A Brief Review

The GP regression is a powerful tool that merges the idea of a kernel and Bayesian analysis to make meaningful predictions of a function HH from a given data set {(z,H(z))}\{\left(z,H(z)\right)\} [75, 76]. Most notably, it is a non-parametric way of learning a function and so is a refreshing change of view in making cosmological predictions usually based on arbitrary parametrizations and Markov chain Monte Carlo (MCMC) methods [43, 44, 45]. In light of the existing tensions between early, i.e., during last scattering, and local cosmological observations, GP has also naturally emerged as a go-to approach in cosmology and is increasingly becoming a popular tool to make cosmological predictions without assuming a particular model of cosmology [44, 52, 46, 43, 45, 53, 54, 55, 56, 57, 58, 60, 61, 62, 77, 47, 78, 79, 80, 48, 49, 81, 82, 51].

Consider an observation {(z,H(z))}\{\left(z,H(z)\right)\} with uncertainties captured by the covariance matrix CC. In order to reconstruct the function H(z)H(z^{*}) at the coordinates zz^{*} via the GP, we first assume a kernel K(z,z~)K\left(z^{*},\tilde{z}^{*}\right) that relates the function values at different coordinates zz^{*} and z~\tilde{z}^{*} in the data set of observations. In terms of this kernel, the mean and the covariance of the GP reconstruction of the nnth derivative of H(z)H(z) at zz^{*} are given by

H(n)=K(n,0)(z,Z)[K(Z,Z)+C]1H(Z),\langle H^{*(n)}\rangle=K^{(n,0)}\left(z^{*},Z\right)\left[K\left(Z,Z\right)+C\right]^{-1}H\left(Z\right)\,, (3.1)

and

cov(H(n))=K(n,n)(z,z)K(n,0)(z,Z)[K(Z,Z)+C]1K(0,n)(Z,z),\text{cov}\left(H^{*(n)}\right)=K^{(n,n)}\left(z^{*},z^{*}\right)-K^{(n,0)}\left(z^{*},Z\right)\left[K\left(Z,Z\right)+C\right]^{-1}K^{(0,n)}\left(Z,z^{*}\right)\,, (3.2)

respectively, where ZZ stands for the union of the redshifts of the measurements and y(n,m)y^{(n,m)} refers to the nnth derivative of a function yy with respect to its first argument and the mmth derivative with respect to the second argument. This kernel depends on hyperparameters that will be trained using a Bayesian implementation in order to fit the observations. However, in contrast with parametric approaches, the hyperparameters do not specify the shape of the function, but rather, they describe its characteristic scales in the zz and HH directions, typically, as a length scale ll and an amplitude AA. In particular, we consider the simplest, infinitely-differentiable, and arguably the most widely-used kernel in machine learning, known as the radial basis function (RBF), also often referred to as the squared exponential kernel,

K(z,z~)=Aexp((zz~)22l2).K\left(z,\tilde{z}\right)=A\exp\left(-\dfrac{\left(z-\tilde{z}\right)^{2}}{2l^{2}}\right)\,. (3.3)

The hyperparameters (l,A)\left(l,A\right) are then selected by optimizing, or more consistently, marginalizing over the marginal likelihood =p(H|Z,l,A)\mathcal{L}=p\left(H|Z,l,A\right). However, marginalization over the hyperparameters usually demands more computational power and can be impractically time consuming. Fortunately, in cosmology, and other applications, where the \mathcal{L}s are at least approximately Gaussian, optimization usually turns out to be a good approximation. In what follows, we will optimize over the hyperparameters to reconstruct the local Hubble function.

Two remarks are in order. First, the GP mean function is kept to zero in our reconstructions. This seems reasonable as we intend to obtain function values in regions encompassing the domain of the data points. Most importantly, this is in line with a data-driven, model-independent approach which would otherwise be compromised with, for example, a Λ\LambdaCDM or another mean function. Second, it is often the case in GP applications that the reconstruction is tested for various kernels which lead to redundant results. For this reason, we pursue this work with only the most natural GP kernel (3.3), and instead investigate in detail the consequences of the choice of the kernel in a different paper [83] where we show that the reconstructions of the Hubble function per kernel can be at most in mild statistical tension with each other.

Our implementation is based on the public codes GaPP [43] and scikit [84] for the GP and cobaya [85] and getdist [86] for calibration of the absolute magnitude MM via MCMC to be described shortly. The codes used in this paper also used numpynumpy [87], scipyscipy [88], seabornseaborn [89], and matplotlibmatplotlib [90]. A friendly implementation of the computations in this work is communicated as a two-part jupyter notebook [91] which can be freely downloaded [92].

3.2 Application to the Hubble expansion

We now apply the GP approach using the RBF kernel in Eq. (3.3) to a number of combined Hubble data sources, which we use to reconstruct the H(z)H(z) data. This is done using three core sources of H(z)H(z) data, namely cosmic chronometers (CC), supernovae of Type Ia (SNe) and baryonic acoustic oscillation (BAO). In addition, we consider three priors on H(z)H(z) that have been reported in the literature, namely the Riess prior which is H0R19=74.03±1.42kms1Mpc1H_{0}^{\rm R19}=74.03\pm 1.42\,{\rm km\,s}^{-1}{\rm Mpc}^{-1} [15], the Carnegie-Chicago Hubble prior H0TRGB=69.8±1.9kms1Mpc1H_{0}^{\rm TRGB}=69.8\pm 1.9\,{\rm km\,s}^{-1}{\rm Mpc}^{-1} [93], and the latest value from the Planck collaboration (P18) H0P18=67.4±0.5kms1Mpc1H_{0}^{\rm P18}=67.4\pm 0.5\,{\rm km\,s}^{-1}{\rm Mpc}^{-1} (Ωm0=0.3153±0.0073\Omega_{m0}=0.3153\pm 0.0073 and ΩΛ=0.6847±0.073\Omega_{\Lambda}=0.6847\pm 0.073) [13]. These H0H_{0} priors clearly represent the current Hubble tension and are considered in this analysis to shed more light to this intriguing puzzle.

Adding some background on these priors, the H0P18H_{0}^{\rm P18} estimates is the result of using (flat) Λ\LambdaCDM as a fiducial model and cosmic microwave background (CMB) data from the Planck mission to predict the late time value of H0H_{0} [13]. This contrasts with the cosmology independent estimates that come from local observations, where H0R19H_{0}^{\rm R19} is the highest of these values and comes from long period observations of Cepheids in the Large Magellanic Cloud using the Hubble Space Telescope which significantly reduces the uncertainty in the measurement of H0H_{0}. We also consider measurements of the tip of the red giant branch (TRGB) where this turning point is used as a standard candle to predict values of H0TRGBH_{0}^{\rm TRGB}. While other prior values exist in the literature, these adequately represent the tension in the value of the Hubble constant.

On the data sets themselves, the CC data comprise the majority of the combined data set points with 29 (our of 40) points within the z2z\lesssim 2 range, and do not rely on any cosmological models [94, 95, 96, 97, 98]. For the SNe data set, we employ both the full Pantheon data set [99] and the compressed Pantheon compilation together with the CANDELS and CLASH Multi-cycle Treasury (MCT) data [100]. This is done using the corresponding covariance matrix, and where only five of the six points in Ref. [100] are used since the sixth point is non-Gaussian. In particular, we incorporate the compressed SNe data in the analysis by promoting the E(z)E(z) data points to H(z)=H0E(z)H(z)=H_{0}E(z) using the H0H_{0} priors and then feeding resulting H(z)H(z) and the corresponding covariance matrix into the GP regression.

CC data depends on a differential ages technique between galaxies while the SNe data is compiled using Cepheid calibrated distance measurements for SNe event. Finally, we consider the BAO data from BOSS and eBOSS in Refs. [101, 102, 103, 104, 105, 106, 107, 108, 109]. While BAO data is not entirely independent from Λ\LambdaCDM (due to the assumption of a fiducial radius of the sound horizon rs=147.78Mpcr_{s}=147.78\,\mathrm{Mpc}), they add value to the analysis as being data that emanates from the growth of large scale structure which has a strong bearing on the value of H0H_{0}. In particular, these measurements are made in terms of the comoving distance dM(z)/rsd_{M}(z)/r_{s} and dH(z)/rsd_{H}(z)/r_{s}, where

dM(z)=(1+z)dA(z),d_{M}(z)=(1+z)d_{A}(z)\,, (3.4)

and

dH(z)=c/H(z),d_{H}(z)=c/H(z)\,, (3.5)

in which the Hubble expansion can be determined. Instead of relying on the sound horizon radius rsr_{s}, which consequently depends on a cosmological model which in this case is Λ\LambdaCDM, we calculate the ratio dM(z)/dH(z)d_{M}(z)/d_{H}(z) from the BAO data set, cancelling out the dependence on rsr_{s}, and compliment this with a GP reconstructed angular-diameter distance dA(z)d_{A}(z) from the compressed Pantheon samples. This step assumes spatial-isotropy and the distance-duality relation, dA(z)=dL(z)/(1+z)2d_{A}(z)=d_{L}(z)/(1+z)^{2}, where dA(z)d_{A}(z) and dL(z)d_{L}(z) are the angular-diameter distance and luminosity distance, respectively, and also relies on a calibrated SNe absolute magnitude. Arguably, both spatial-isotropy and the distance-duality relation can be considered as cosmological assumptions, but neither requires any hard-parametrization like Λ\LambdaCDM and, most importantly, both assumptions can be supported just as well with any metric theories of gravity.

Now, the full Pantheon data set was also considered where a cosmological model is necessary to obtain the Hubble data and thus calibrate the absolute magnitude MM. Nonetheless, this quantity is related to the intrinsic brightness of a luminous object and so should not reflect, or at least, weakly reflect, the dynamics of cosmic expansion. This assertion is strongly supported by the fact that MM can be calibrated, provided an H0H_{0} prior, with subpercent precision even when considering only the z<0.1z<0.1 redshifts (the first 11 of 40 points) in the compressed Pantheon samples [99]. The calibrated MM which will be used later for the GP reconstruction of H(z)H(z) per H0H_{0} prior is shown in Fig. 1.

Refer to caption
Figure 1: The posteriors of the calibrated SNe absolute magnitude MM for each H0H_{0} (in units of kms1Mpc1\,{\rm km\,s}^{-1}{\rm Mpc}^{-1}) prior. The mean and standard deviations are M=19.25±0.05,19.38±0.06M=-19.25\pm 0.05,-19.38\pm 0.06, and 19.45±0.03-19.45\pm 0.03 for the R19, TRGB, and P18 H0H_{0} priors, respectively.

The mean and standard deviations are M=19.25±0.05,19.38±0.06M=-19.25\pm 0.05,-19.38\pm 0.06, and 19.45±0.03-19.45\pm 0.03 for the R19, TRGB, and P18 H0H_{0} priors, respectively. Lastly, the GP reconstructed Pantheon SNe apparent magnitudes is presented in Fig. 2.

Refer to caption
Figure 2: GP reconstruction of the SNe apparent magnitudes as a function of the redshift zz.

We emphasize that the remarkable precision of this GP reconstruction of m(z)m(z), achieved by performing the GP in log(z)\log(z) (due to the drastic variance in the density of points across zz), justifies and completes the outlined sound horizon-free construction of H(z)H(z) from BAO.

In the present study, we perform a number of GP analyses using a combination of prior values. GP is a non-parametric reconstruction implementation but it does depend on the kernel hyperparameters. Given the level of precision in the discrepancy in H0H_{0}, we consider the RBF kernel in order to reduce any fine differences between these estimations of H0H_{0}.

Refer to caption
(a) χR192=26.2\chi^{2}_{\text{R19}}=26.2
Refer to caption
(b) χTRGB2=19.8\chi^{2}_{\text{TRGB}}=19.8
Refer to caption
(c) χP182=25.8\chi^{2}_{\text{P18}}=25.8
Refer to caption
(d) χΛCDM2=28.5\chi^{2}_{\Lambda\text{CDM}}=28.5
Figure 3: (a-c) GP reconstructed Hubble function H(z)H(z) (in units of kms1Mpc1\,{\rm km\,s}^{-1}{\rm Mpc}^{-1}) given the full data set (Pantheon/MCT + BAO + CC) and H0H_{0} prior (R19, TRGB, P18); (d) GP reconstructed Hubble function H(z)H(z) for the for the full data set with different H0H_{0} priors and the Λ\LambdaCDM contour. The filled-hatched regions in (d) show the 2σ2\sigma confidence intervals for each prior. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13].

The reconstructed Hubble parameter is shown for the combined CC, Pantheon/MCT, and BAO data sets for each of the H0H_{0} priors in Fig. 3. For completeness, we also write down the χ2\chi^{2} likelihoods for each of these reconstructions and the one for Λ\LambdaCDM (taking in the P18 constraints [13]) in the subtitles of Fig. 3, and where

χ2=i(Hobs(zi)Hrecon(zi))2σobs(zi)2,\chi^{2}=\sum_{i}\frac{\left(H_{\rm obs}(z_{i})-H_{\rm recon}(z_{i})\right)^{2}}{\sigma_{\rm obs}(z_{i})^{2}}\,, (3.6)

for each data set in the combination. We do this to quantity possible over fitting which is a common pathology of non-parametric reconstruction methods. Indeed, this shows up in the χ2\chi^{2}-statistics (<40<40, number of data points). Nonetheless, it can be seen that the constrained Λ\LambdaCDM model based on the Planck 2018 release also overfits the joint data set. On the other hand, this overfitting by Λ\LambdaCDM can in fact also be viewed as a remarkable coincidence considering that the Planck mission probes the physics of last scattering (z1000z\sim 1000). The relative value of Δχ2=χi2χΛCDM2\Delta\chi^{2}=\chi^{2}_{i}-\chi^{2}_{\Lambda\text{CDM}} may then be taken to provide a rough degree of how well a GP reconstruction fits the data.

In Fig. 3, we also notice that the low redshift behavior of the reconstructed Hubble diagram is not drastically different for the different prior selections. However, the uncertainties at high redshifts are impacted by this choice with R19 giving the largest uncertainties region. In fact, these reconstructions produce small uncertainty regions up to roughly z1.5z\sim 1.5 after which these regions start to grow due to the sparsity of observational data points in that region.

In the next section, the reconstructed H(z)H(z) and its first derivative H(z)H^{\prime}(z) will be used to predict Horndeski Lagrangian potentials. In doing so, it is important to highlight that H(z)H(z) and H(z)H^{\prime}(z) are naturally correlated together because of the GP reconstruction [43, 45]. That is, at each redshift zz^{*}, this is fleshed out by the covariance matrix

cov(H(z),H(z))=K(0,1)(z,z)K(z,Z)[K(Z,Z)+C]1K(0,1)(Z,z).\text{cov}\left(H(z^{*}),H^{\prime}(z^{*})\right)=K^{(0,1)}\left(z^{*},z^{*}\right)-K\left(z^{*},Z\right)\left[K\left(Z,Z\right)+C\right]^{-1}K^{(0,1)}\left(Z,z^{*}\right)\,. (3.7)

To consistently obtain a function f(H(z),H(z),θ)f\left(H(z^{*}),H^{\prime}(z^{*}),\theta\right) where θ\theta stands for any additional parameters, with a variance var(θ)\text{var}\left(\theta\right), we draw a large number of samples, a la Monte Carlo (MC), from the multivariate Gaussian distribution

(HHθ)𝒩[(HHθ),(cov(H,H)cov(H,H)0cov(H,H)cov(H,H)000var(θ))],\left(\begin{matrix}H\\ H^{\prime}\\ \theta\end{matrix}\right)\sim\mathcal{N}\left[\left(\begin{matrix}H\\ H^{\prime}\\ \theta\end{matrix}\right),\left(\begin{matrix}\text{cov}\left(H,H\right)&\text{cov}\left(H,H^{\prime}\right)&0\\ \text{cov}\left(H,H^{\prime}\right)&\text{cov}\left(H^{\prime},H^{\prime}\right)&0\\ 0&0&\text{var}\left(\theta\right)\end{matrix}\right)\right]\,, (3.8)

and then take the mean, standard deviation, and other relevant statistical outputs of the resulting posterior distribution of the function ff.

4 Gaussian Processes Reconstruction of Horndeski Gravity

In this section, we discuss in detail three model-building implementations in Horndeski cosmology and complement this with the GP reconstructed Hubble function. This outputs meaningful predictions, mean and confidence intervals, of the Horndeski potentials and the corresponding dark energy equation of state.

4.1 Quintessence potential

The simplest and perhaps most well-known potential construction method is in quintessence cosmology [69]. In this model, the Friedmann and scalar field equations are given by

3H2=ρ+ϕ˙22+V(ϕ),3H^{2}=\rho+\dfrac{\dot{\phi}^{2}}{2}+V\left(\phi\right)\,, (4.1)
2H˙+3H2=Pϕ˙22+V(ϕ),2\dot{H}+3H^{2}=-P-\dfrac{\dot{\phi}^{2}}{2}+V\left(\phi\right)\,, (4.2)

and

ϕ¨+3Hϕ˙+V(ϕ)=0,\ddot{\phi}+3H\dot{\phi}+V^{\prime}\left(\phi\right)=0\,, (4.3)

respectively. It can of course be shown that Eq. (4.3) follows from Eqs. (4.1) and (4.2) as long as the perfect fluid’s energy is thermodynamically conserved. Using the Friedmann equations, it can then be shown that the potential and kinetic terms of the scalar field are given by

V(ϕ)=H˙+3H2ρP2,V\left(\phi\right)=\dot{H}+3H^{2}-\dfrac{\rho-P}{2}\,, (4.4)

and

ϕ˙2=2H˙(ρ+P).\dot{\phi}^{2}=-2\dot{H}-\left(\rho+P\right)\,. (4.5)

Eq. (4.4) shows that the quintessence potential can be uniquely determined given a Hubble function and matter fields. The kinetic term X=ϕ˙2/2X=\dot{\phi}^{2}/2 (through Eq. (4.5)) can be similarly obtained. We additionally assume non-relativistic matter fields which take on the P18 value for the current matter density parameter [13].

We can now use the GP reconstructed Hubble function to predict the shapes of the VV and ϕ(z)2\phi^{\prime}(z)^{2}, where f˙(t)=(1+z)H(z)f(z)\dot{f}\left(t\right)=-(1+z)H(z)f^{\prime}(z) for an arbitrary function ff of time (or redshift). The results are shown in Fig. 4.

Refer to caption
(a)
Refer to caption
(b)
Figure 4: (a) Reconstructed quintessence potential V(z)V(z) and (b) ϕ(z)2\phi^{\prime}(z)^{2} for varying H0H_{0} prior. The filled-hatched regions show the 2σ2\sigma confidence intervals for each prior. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13].

Note that by expressing the dimensionful quantities, in this case, VV, in units of H0H_{0}, the tensions which exist at z=0z=0 for the different H0H_{0} priors can be alleviated in the construction. For example, in Fig. 4, the mean values of VV and ϕ(z)2\phi^{\prime}(z)^{2} for any one of the priors always lie well within the 1σ1\sigma contours of the two other priors. In this case, it can be seen that the potential and kinetic term traces out a particular shape regardless of the H0H_{0} prior. This then constrains the quintessence potential. In principle, the explicit function V(ϕ)V\left(\phi\right) can also be obtained by integrating ϕ(z)\phi^{\prime}(z) to obtain ϕ\phi up to a constant. However, we find that the mean ϕ(z)2\phi^{\prime}(z)^{2} is mostly-negative throughout the late-time cosmological evolution. Notably, positive values of ϕ(z)2\phi^{\prime}(z)^{2} which happen to occur near the upper 2σ2\sigma contours can be used to predict V(ϕ)V\left(\phi\right). This should of course be taken with caution given that it hovers just around, and some times beyond, the 95%95\% confidence region. Nonetheless, the dark energy equation of state, wϕ=Pϕ/ρϕw_{\phi}=P_{\phi}/\rho_{\phi}, can be obtained which in the quintessence model becomes

wϕ=ϕ˙2/2Vϕ˙2/2+V.w_{\phi}=\dfrac{\dot{\phi}^{2}/2-V}{\dot{\phi}^{2}/2+V}\,. (4.6)

However, instead of wϕw_{\phi} it turns out to be as convenient to additionally sample over the compactified variable arctan(1+wϕ)\arctan\left(1+w_{\phi}\right). We refer to this as a compactified dark energy equation of state. This alternative variable tames down the singularities which otherwise would make the MC procedure terribly unreliable, i.e., MC propagated errors in wϕw_{\phi} diverge at z1z\sim 1. The result of the additional sampling of the compactified dark energy equation of state is shown in Fig. 5.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 5: (a) The compactified dark energy equation of state, arctan(1+wϕ(z))\arctan\left(1+w_{\phi}(z)\right), for quintessence for various H0H_{0} priors as a function of the redshift. The solid, dashed, and dash-dotted lines represent the median of the distribution and the filled-hatched regions show the 34.1%34.1\% of the probability mass surrounding the median from both sides. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13]. Posteriors of the compactified dark energy equation of state at sample redshifts z=0,0.5,1,1.5,2z=0,0.5,1,1.5,2 are presented for the (b) R19, (c) TRGB, and (d) P18 H0H_{0} priors, respectively.

In Fig. 5-(a), the median of the distribution and the 34.1%34.1\% of the probability mass surrounding it on both sides are shown. To see why this is a more consistent statistic, compared with the usual Gaussian-anchored mean ±\pm 1σ1\sigma statistic, we also show the resulting posterior distributions of arctan(1+wϕ(z))\arctan\left(1+w_{\phi}(z)\right) at different redshifts in Figs. 5-(b-d).

Clearly, for low redshifts, z1z\lesssim 1, the resulting distributions can be considered to be normally-distributed. However, at higher redshifts, it can be seen that the distribution not only starts to deviate away from normality, but even becomes bimodal and takes samples at arctan(1+wϕ)±π/2\arctan\left(1+w_{\phi}\right)\sim\pm\pi/2 where wϕw_{\phi} diverges. For this reason, the median and the 34.1%34.1\% probability mass surrounding it from both sides can be considered as a more consistent statistic as it agrees with the mean ±\pm 1σ1\sigma when the distribution is normal, but it tracks the most concentration of probability mass even when the distribution is no longer normal. See Appendix A for a supplementary illustration of this point. In addition, a naive use of the mean ±\pm 1σ1\sigma statistic for a compactified variable may predict results outside of the domain of the compactified variable while the median ±\pm 34%34\% probability mass will always only sample within this domain.

The results shown in Fig. 5 agree that dark energy may as well be vacuum energy, i.e., w1w\approx-1 or arctan(1+wϕ)=0\arctan\left(1+w_{\phi}\right)=0, for low redshifts. On the other hand, the use of the compactified dark energy equation of state lets us understand a clearer picture of dark energy at earlier redshifts when the prediction of wϕw_{\phi} becomes spoiled by diverging uncertainties. For instance, at z1z\gtrsim 1, Figs. 5-(b-d) reveal that the onset of the divergence in the uncertainty of wϕw_{\phi} is alternatively due to the compactified variable starting take values with |arctan(x±π/2)|1|\arctan(x\sim\pm\pi/2)|\gg 1. This sampling procedure on a compactified random will be utilized further to study dark energy in the two following Horndeski models.

To end this section, we recall that Ωm\Omega_{m} is a free parameter which we fixed to its P18 value (anchored on Λ\LambdaCDM) in order to avoid an otherwise ad hoc choice. The above reconstruction is therefore bias to this information. We briefly describe the reconstruction for other values of Ωm\Omega_{m}. Smaller Ωm\Omega_{m} leads to higher V(ϕ)V(\phi) and ϕ(z)2\phi^{\prime}(z)^{2}, while larger Ωm\Omega_{m} leads to lower V(ϕ)V(\phi) and ϕ(z)2\phi^{\prime}(z)^{2}. This can be also be deduced from Eqs. (4.4) and (4.5). In particular, for Ωm0.1\Omega_{m}\lesssim 0.1, the reconstruction of ϕ(z)2\phi^{\prime}(z)^{2} can be positive throughout. This can be taken to mean that quintessence and late-time data favor a smaller Ωm\Omega_{m} than that of Λ\LambdaCDM. We encourage the reader to test this out using our codes [92].

4.2 Designer Horndeski

Another Horndeski construction method has been introduced recently in Ref. [73]. This is referred to as designer Horndeski (HDES) and is built on top of kinetic gravity braiding. To describe this method, we start with the cosmological field equations given by

3H2=ρK(X)+2XKX+3Hϕ˙2GX,3H^{2}=\rho-K(X)+2XK_{X}+3H\dot{\phi}^{2}G_{X}\,, (4.7)
2H˙+3H2=PK(X)+2Xϕ¨GX,2\dot{H}+3H^{2}=-P-K(X)+2X\ddot{\phi}G_{X}\,, (4.8)

and

ϕ¨(ϕ˙(3H(GXXϕ˙2+2GX)+KXXϕ˙)KX)3ϕ˙(GXH˙ϕ˙+3GXH2ϕ˙+HKX)=0,\ddot{\phi}\left(-\dot{\phi}\left(3H\left(G_{XX}\dot{\phi}^{2}+2G_{X}\right)+K_{XX}\dot{\phi}\right)-K_{X}\right)-3\dot{\phi}\left(G_{X}\dot{H}\dot{\phi}+3G_{X}H^{2}\dot{\phi}+HK_{X}\right)=0\,, (4.9)

where a subscript in the potentials K(X)K(X) and G(X)G(X) denote differentiation with respect to XX. These are the Friedmann and scalar field equations. Also, a noteworthy feature in these shift symmetric models is that the scalar field equation can be written in the form

J˙+3HJ=0,\dot{J}+3HJ=0\,, (4.10)

where JJ is the shift current given by

J=ϕ˙KX+3Hϕ˙2GX.J=\dot{\phi}K_{X}+3H\dot{\phi}^{2}G_{X}\,. (4.11)

Therefore, instead of the scalar field equation, one can consider as a surrogate the exact solution of Eq. (4.10), namely

ϕ˙KX+3Hϕ˙2GX=𝒥a3,\dot{\phi}K_{X}+3H\dot{\phi}^{2}G_{X}=\dfrac{\mathcal{J}}{a^{3}}\,, (4.12)

where 𝒥\mathcal{J} is an integration constant which we refer to as a shift charge. The designer approach recognizes the constraint equations (Eqs. (4.7) and (4.12)) as two independent equations of the system but with three unknowns, i.e., (H(a),K(X),G(X))\left(H(a),K(X),G(X)\right). The system is then closed by a priori assuming H(X)H(X). In this case, the exact solution of Eqs. (4.7) and (4.12) can be shown to be

K(X)=3H02ΩΛ+𝒥2XH(X)2H02Ωm0𝒥2XΩΛΩm0,K(X)=-3H_{0}^{2}\Omega_{\Lambda}+\dfrac{\mathcal{J}\sqrt{2X}H(X)^{2}}{H_{0}^{2}\Omega_{m0}}-\dfrac{\mathcal{J}\sqrt{2X}\Omega_{\Lambda}}{\Omega_{m0}}\,, (4.13)

and

GX(X)=2𝒥H(X)3H02Ωm0.G_{X}(X)=-\dfrac{2\mathcal{J}H^{\prime}(X)}{3H_{0}^{2}\Omega_{m0}}\,. (4.14)

Eqs. (4.13) and (4.14) fleshes out the designer approach. By complementing this with the GP, we can then make a predictive analysis of the kk-essence and braiding potentials.

Before proceeding, we clarify that the designer Horndeski approach relies on a mapping between XX and aa which leads to a Horndeski theory (K(X),G(X))\left(K(X),G(X)\right). This then means that the designer trajectory XHDES(a)X_{\text{HDES}}(a) must not have fixed points. However, the resulting theory (K(X),G(X))\left(K(X),G(X)\right) may have other trajectories X(a)X(a) and so in general fixed points are not prohibited in the designer approach.

As with Ref. [73], we take

X=c0H(X)n,X=\dfrac{c_{0}}{H(X)^{n}}\,, (4.15)

where c0c_{0} is a constant with units of H0n+2H_{0}^{n+2}. Using the GP reconstructed Hubble function, we then obtain the predictions for XX and H(X)H^{\prime}(X) shown in Fig. 6 for n=1n=1 and c0=H0n+2c_{0}=H_{0}^{n+2}.

Refer to caption
(a)
Refer to caption
(b)
Figure 6: Reconstructed HDES kinetic density (a) XX and (b) dH/dXdH/dX as a function of the redshift zz for c0=H0n+2c_{0}=H_{0}^{n+2} and n=1n=1 for different H0H_{0} priors. The filled-hatched regions show the 2σ2\sigma confidence intervals for each prior. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13].

By expressing dimensionful physical quantities in units of the Hubble parameter H0H_{0}, we find that the mean z=0z=0 prediction for each H0H_{0} prior is always within the 1σ1\sigma contours of the two other priors. This holds for both XX and H(X)H^{\prime}(X) in Fig. 6 and, as we shall further see, also in the following figures. The precision of the H0H_{0} prior also does play a role. In Fig. 6, the prediction using the Planck prior turns out to be the most stringent for all redshifts. Moving forward, Fig. 7 shows the predicted HDES potentials corresponding to Fig. 6 with c0=H0n+2c_{0}=H_{0}^{n+2}, n=1n=1, and 𝒥/H0=1\mathcal{J}/H_{0}=1. In this analysis, we also assume the P18 cosmological parameter values.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 7: Reconstructed HDES potentials (a) K(z)K\left(z\right), (b) K(X)K(X), (c) GX(z)G_{X}\left(z\right), and (d) GX(X)G_{X}(X) for varying H0H_{0} prior and fixed HDES parameters c0=H0n+2c_{0}=H_{0}^{n+2}, n=1n=1, and 𝒥=H0\mathcal{J}=H_{0}. The filled-hatched regions show the 2σ2\sigma confidence intervals for each prior. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13].

Fig. 7 reveals the shape of the kk-essence and braiding potentials. Most importantly, this shape appears to be consistent regardless of the choice of H0H_{0} prior.

The dark energy equation of state can also be computed. In general, in shift symmetric kinetic gravity braiding, this is given by

wϕ=K+2XX˙GXK2X(KX+32XH(X)GX),w_{\phi}=\dfrac{-K+\sqrt{2X}\dot{X}G_{X}}{K-2X\left(K_{X}+3\sqrt{2X}H(X)G_{X}\right)}\,, (4.16)

where the potentials KK and GG and their derivatives are evaluated at XX. By substituting the HDES solution to Eq. (4.16), we obtain

wϕ=1+𝒥2X(H(z)2H02ΩΛ)3H04Ωm0ΩΛ2𝒥2X(1+z)H(z)H(z)9H04Ωm0ΩΛ.w_{\phi}=-1+\dfrac{\mathcal{J}\sqrt{2X}\left(H(z)^{2}-H_{0}^{2}\Omega_{\Lambda}\right)}{3H_{0}^{4}\Omega_{m0}\Omega_{\Lambda}}-\dfrac{2\mathcal{J}\sqrt{2X}(1+z)H(z)H^{\prime}(z)}{9H_{0}^{4}\Omega_{m0}\Omega_{\Lambda}}\,. (4.17)

Using the reconstructed Hubble function and its first derivative, we can therefore obtain the dark energy equation of state. This is shown in compactified version, like we did with the quintessence case, in Fig. 8.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 8: (a) The compactified HDES’s dark energy equation of state, arctan(1+wϕ(z))\arctan\left(1+w_{\phi}(z)\right), for varying H0H_{0} prior as a function of the redshift. The solid, dashed, and dash-dotted lines represent the median of the distribution and the filled-hatched regions show the 34.1%34.1\% of the probability mass surrounding the median from both sides. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13]. Posteriors of the compactified dark energy equation of state at sample redshifts z=0,0.5,1,1.5,2z=0,0.5,1,1.5,2 for the (b) R19, (c) TRGB, and (d) P18 H0H_{0} priors, respectively.

Interestingly, the compactified dark energy equation of state here retains single-modality for all redshifts, unlike its quintessence counterpart. Above all, this result is showing a strong deviation from Λ\LambdaCDM at higher redshifts. This is shown in Fig. 8(a) and is also revealed more closely in Figs. 8-(b-d). In particular, the posteriors of the probability distribution for the z=1.5z=1.5 and z=2z=2 cases for all H0H_{0} priors are strongly concentrated outside of the Λ\LambdaCDM limit (arctan(1+wϕ)=0\arctan\left(1+w_{\phi}\right)=0). This may have been expected due to the kk-essence and braiding potentials growing at higher redshifts (Figs. 7-(a) and (c)); nonetheless, this is worth highlighting as it points to potential future work with perturbations. Consider as an example the case of structure formation where the braiding potential may significantly affect.

To end the section, we recall that Fig. 8 shows the posterior of arctan(1+wϕ(z))\arctan\left(1+w_{\phi}(z)\right) and not w(z)w(z). So while the error bands do indeed become smaller in arctan(1+wϕ(z))\arctan\left(1+w_{\phi}(z)\right) for high redshifts, the corresponding uncertainty in w(z)w(z) in fact becomes larger. In particular, Figs. 8(b-d) show that the posterior for z2z\sim 2 is nearly localized at arctan(1+w(z))π/2\arctan\left(1+w(z)\right)\sim\pi/2 which corresponds to very large mean values and uncertainties in w(z)w(z). We clarify this further in Appendix A where the posterior of a random variable and its compactified version are displayed side by side.

4.3 Tailoring Horndeski

Tailoring Horndeski [74] is related to both quintessence potential and HDES models and is amenable to this reconstruction method. This can be considered as the J(t)=0J(t)=0 limit of HDES but it works more similar with quintessence in the sense that one does not have to rely on an a priori H(X)H(X) in order to single out a particular potential 111In Ref. [73], it was mentioned that the J(t)=0J(t)=0 limit of HDES is Λ\LambdaCDM. This was based on naively making the shift charge 𝒥\mathcal{J} vanish in Eqs. (4.13) and (4.14). However, this limit should be merely realized to be just an artifact of the particular parametrization of the HDES solution (Eqs. (4.13) and (4.14)). In general, with 𝒥=0\mathcal{J}=0, one should start again with Eqs. (4.7) and (4.12), from which HDES was derived. This will inevitably lead to the tailoring Horndeski approach..

The field equations of the theory are given by

3H2=ρ+2Λ+ϕ˙22+3Hϕ˙3GX,3H^{2}=\rho+2\Lambda+\dfrac{\dot{\phi}^{2}}{2}+3H\dot{\phi}^{3}G_{X}\,, (4.18)
2H˙+3H2=P+2Λϕ˙22+GXϕ˙2ϕ¨,2\dot{H}+3H^{2}=-P+2\Lambda-\dfrac{\dot{\phi}^{2}}{2}+G_{X}\dot{\phi}^{2}\ddot{\phi}\,, (4.19)

and

ϕ¨(3H(GXXϕ˙3+2GXϕ˙)1)3ϕ˙(GXH˙ϕ˙+3GXH2ϕ˙+H)=0.\ddot{\phi}\left(-3H\left(G_{XX}\dot{\phi}^{3}+2G_{X}\dot{\phi}\right)-1\right)-3\dot{\phi}\left(G_{X}\dot{H}\dot{\phi}+3G_{X}H^{2}\dot{\phi}+H\right)=0\,. (4.20)

Note that the above equations can be obtained from Eqs. (4.7), (4.8), and (4.9) by substituting K(X)=X2ΛK(X)=X-2\Lambda. The solution of the system will therefore also lie on the dynamical hypersurface determined by Eq. (4.12). In contrast, the tailoring Horndeski approach recognizes the J(t)=0J(t)=0 as a dynamical attractor and so builds the solution on top of this hypersurface. One can then write down

GX(X)=132XH(X).G_{X}(X)=-\dfrac{1}{3\sqrt{2X}H(X)}\,. (4.21)

Since there is only one potential, i.e., GXG_{X}, Eq. (4.21) can be used to obtain model independent necessary conditions by substituting into Eqs. (4.18) and (4.19). The emerging necessary conditions are given by

X=3(H02Ωm(z)+H02ΩΛH2),X=3\left(H_{0}^{2}\Omega_{m}(z)+H_{0}^{2}\Omega_{\Lambda}-H^{2}\right)\,, (4.22)

and

2H˙+3H2=P+3H02ΩΛXX˙3H,2\dot{H}+3H^{2}=-P+3H_{0}^{2}\Omega_{\Lambda}-X-\dfrac{\dot{X}}{3H}\,, (4.23)

where Ωm(z)=ρ(z)/(3H02)\Omega_{m}(z)=\rho(z)/\left(3H_{0}^{2}\right) and ΩΛ=2Λ/(3H02)\Omega_{\Lambda}=2\Lambda/\left(3H_{0}^{2}\right). The kinetic density can then be uniquely determined for a given Hubble function by using Eq. (4.22). Moreover, by eliminating XX in Eqs. (4.22) and (4.23), one can recover the thermodynamic conservation law

ρ˙+3H(ρ+P)=0\dot{\rho}+3H\left(\rho+P\right)=0\,\ (4.24)

which ensures the consistency of the method. In summary, the braiding potential and the scalar field’s kinetic density can be uniquely assigned to a given Hubble function through Eqs. (4.21) and (4.22).

By complementing the above outlined tailoring Horndeski method with the GP reconstructed Hubble function, we now obtain predictions of the theory. The resulting kinetic density is shown in Fig. 9.

Refer to caption
Figure 9: Tailoring Horndeski’s kinetic density X(z)X\left(z\right) as a function of the redshift for varying H0H_{0} prior. The filled-hatched regions show the 2σ2\sigma confidence intervals for each prior. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13].

Here, we also consider the P18 cosmological parameter values, and so, once more, by expressing dimensionful quantities in units of H0H_{0}, we find consistent shapes of the predicted functions regardless of the choice of H0H_{0} prior. It is noteworthy that the scalar field has spent most of its late-time dynamics with |X/H02|1|X/H_{0}^{2}|\ll 1. This is important as the braiding potential (Eq. (4.21)) possesses a singularity at X=0X=0 and so MC sampling over GXG_{X} can be expected to result to posteriors with very large, consequently unreliable, uncertainties. Fig. 9 also shows that an appreciable size of the samples fall to the region X<0X<0. This is problematic from a computational point of view when sampling GXG_{X} because of the square root in Eq. (4.21). To avoid the above mentioned issues, we proceed by additionally sampling over a compactified, real, random variable, G~X2\tilde{G}_{X}^{2} given by

G~X2=arctan(H0418XH(X)2).\tilde{G}_{X}^{2}=\arctan\left(\dfrac{H_{0}^{4}}{18XH\left(X\right)^{2}}\right)\,. (4.25)

We refer to this as a compactified braiding potential. The results are shown in Fig. 10.

Refer to caption
(a)
Refer to caption
(b)
Figure 10: The compactified braiding potential, G~X2\tilde{G}_{X}^{2} (Eq. (4.25)), in GP tailoring Horndeski as a function of the (a) redshift zz and (b) the kinetic density XX. The solid, dashed, dash-dotted lines show the median of the distribution while the filled-hatched regions show the 34.1%34.1\% of the probability mass lying above and below the median. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13]. The inset of (a) is a closeup of the region z(1,2)z\in\left(1,2\right) and G~X2(0,6×103)\tilde{G}_{X}^{2}\in\left(0,6\times 10^{-3}\right).

Here, it is shown that G~X2\tilde{G}_{X}^{2} may also accept negative values at low redshifts (or low XX), whenever X<0X<0 can be drawn with appreciable probability. In such cases, the braiding potential is going to be undefined, or rather that the model is disfavored by the data. However, at higher redshifts, the distribution tends to become more concentrated towards positive values. The inset of of Fig. 10-(a) proves this. As complementary to Fig. 10, snapshots of the posterior distribution of the distributions of G~X2\tilde{G}_{X}^{2} at redshifts z=0,0.5,1,1.5,2z=0,0.5,1,1.5,2 are shown in Fig. 11.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 11: Snapshots of the posterior of the compactified braiding potential in tailoring Horndeski at sample redshifts z=0,0.5,1,1.5,2z=0,0.5,1,1.5,2 for the (a) R19, (b) TRGB, and (c) P18 H0H_{0} priors. Inset plots separately show the z=1.5z=1.5 and z=2z=2 cases because the posterior for these two cases scale too short compared with the later redshift cases. (d) Posterior distribution at z=2z=2 for each H0H_{0} prior.

This shows that the distribution of G~X2\tilde{G}_{X}^{2} is generally bimodal and supports the use of the median and its 34.1%34.1\% surrounding mass as a reasonable statistic. At z=0z=0, Figs. 11-(a-c) show that G~X2\tilde{G}_{X}^{2} may almost-equally be positive or negative and samples may even be drawn with appreciable probability near the boundaries, G~X2±π/2\tilde{G}_{X}^{2}\sim\pm\pi/2, of the random variable. Interestingly, at higher, or earlier, redshifts, the probability mass at negative G~X2\tilde{G}_{X}^{2} always decreases with an increase in the redshift, i.e., the weight of the samples drawn at negative G~X2\tilde{G}_{X}^{2} becomes smaller for increasing zz. This is a consistent result throughout the H0H_{0} priors and so suggests that the braiding may have played a relevant earlier minor role in cosmic history. Fig. 11-(d) supports this interpretation. Indeed, at z=2z=2, the posteriors of G~X2\tilde{G}_{X}^{2} can already be considered to be practically single-modal and undeniably heavier at G~X2>0\tilde{G}_{X}^{2}>0, where the braiding potential GXG_{X} can be computed. A crude estimate of H02GXH_{0}^{2}G_{X}, based on Fig. 11-(d) and arctan(x1)x\arctan\left(x\ll 1\right)\sim x, leads to H02GX0.1H_{0}^{2}G_{X}\sim 0.1 at z2z\sim 2.

The dark energy equation of state in tailoring Horndeski can also be computed using Eq. (4.16). Substituting K(X)=X2ΛK(X)=X-2\Lambda and then using the tailored solution (Eqs. (4.21) and (4.22)), it is straightforward to obtain

wϕ(z)=H(z)(3H(z)2(1+z)H(z))3(H(z)2+H02Ωm0(1+z)3).w_{\phi}(z)=\dfrac{H(z)\left(3H(z)-2(1+z)H^{\prime}(z)\right)}{3\left(-H(z)^{2}+H_{0}^{2}\Omega_{m0}\left(1+z\right)^{3}\right)}\,. (4.26)

This is notably the exact same expression that can be obtained by assuming that dark energy is a perfect fluid with an equation of state w(z)w(z). The results of the sampling over the compactified dark energy equation of state is shown in Fig. 12.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 12: (a) The compactified dark energy equation of state, arctan(1+wϕ(z))\arctan\left(1+w_{\phi}(z)\right), as a function of the redshift in tailoring Horndeski for varying H0H_{0} prior. The solid, dashed, and dash-dotted lines represent the median of the distribution and the filled-hatched regions show the 34.1%34.1\% of the probability mass surrounding the median from both sides. Hatches used: (:H0R19)\left({}^{\prime}-^{\prime}:H_{0}^{\text{R19}}\right) [15], (|:H0TRGB)\left({}^{\prime}|^{\prime}:H_{0}^{\text{TRGB}}\right) [93], (×:H0P18)\left({}^{\prime}\times^{\prime}:H_{0}^{\text{P18}}\right) [13]. Posteriors of the compactified dark energy equation of state at sample redshifts z=0,0.5,1,1.5,2z=0,0.5,1,1.5,2 for the (b) R19, (c) TRGB, and (d) P18 H0H_{0} priors.

It is most interesting to point out that this looks very similar to Fig. 12 even though the earlier analysis of quintessence did not take into account ΩΛ\Omega_{\Lambda}. Similar conclusions can therefore be drawn. First, at low redshifts, the posterior distributions of the compactified dark energy equation of state can be considered to be approximately Gaussian distributed. However, at higher redshifts, this cannot anymore be taken to be true as the distribution becomes bimodal and an appreciable fraction of samples fall too close to the boundaries ±π/2\sim\pm\pi/2 of the compactified region. In particular, for z=2z=2 in Figs. 12-(b-d), the probability mass appear to be mostly heavier at the positive side, i.e., arctan(1+wϕ)1\arctan\left(1+w_{\phi}\right)\sim 1, suggesting a deviation from Λ\LambdaCDM. The implications of this should be further examined in a future work.

4.4 Constraints on the dark energy equation of state

We now summarize the constraints on the dark energy equation of state at z=0z=0 obtained in this work in Table 1.

Table 1: Constraints on the dark energy equation of state in Horndeski cosmology. The columns H0R19H_{0}^{\text{R19}}, H0TRGBH_{0}^{\text{TRGB}}, and H0P18H_{0}^{\text{P18}} stand for the GP analysis using the corresponding H0H_{0} priors R19, TRGB, and P18. The P18 prior parameter values were assumed in this analysis [13]. For designer Horndeski, c0=H0n+2c_{0}=H_{0}^{n+2}, n=1n=1, and 𝒥=H0\mathcal{J}=H_{0} were additionally assumed.
wDE(z=0)w_{\text{DE}}\,(z=0)
11\dfrac{1}{1} Theory + parameters 11\dfrac{1}{1} H0R19H_{0}^{\text{R19}} H0TRGBH_{0}^{\text{TRGB}} H0P18H_{0}^{\text{P18}}
11\dfrac{1}{1} Quintessence +(Ωm0)+\left(\Omega_{m0}\right) 11\dfrac{1}{1} 1.1±0.1-1.1\pm 0.1 1.1±0.1-1.1\pm 0.1 1.06±0.08-1.06\pm 0.08
11\dfrac{1}{1} Designer Horndeski +(Ωm0,ΩΛ,c0,n,𝒥)+\left(\Omega_{m0},\Omega_{\Lambda},c_{0},n,\mathcal{J}\right) 11\dfrac{1}{1} 0.8±0.2-0.8\pm 0.2 0.9±0.3-0.9\pm 0.3 0.9±0.1-0.9\pm 0.1
11\dfrac{1}{1} Tailoring Horndeski +(Ωm0,ΩΛ)+\left(\Omega_{m0},\Omega_{\Lambda}\right) 11\dfrac{1}{1} 1.1±0.1-1.1\pm 0.1 1.1±0.1-1.1\pm 0.1 1.06±0.08-1.06\pm 0.08
11\dfrac{1}{1} Λ\LambdaCDM 11\dfrac{1}{1} -1
11\dfrac{1}{1} w0w_{0}CDM (Planck + SNe + BAO) 11\dfrac{1}{1} 1.03±0.03-1.03\pm 0.03 [13]
11\dfrac{1}{1} w0waw_{0}w_{a}CDM (Planck + SNe + BAO) 11\dfrac{1}{1} 0.96±0.08-0.96\pm 0.08 [13]

The wDE(z=0)w_{\text{DE}}\,(z=0) constraints from quintessence and tailoring Horndeski differ only in their fifth significant digit which is well within the predictability region that can be reasonably expected from the reconstruction method. For both of these theories, the dark energy equation of state remains consistent with each other and regardless of the H0H_{0} prior. In this background the priors on the Hubble data have little to no impact on the equation of state reconstructions since these fine differences will be suppressed by the precise formula that expresses the quantity. The situation is entirely different for the direct reconstruction of the Hubble diagram where priors not only have a significant impact on the diagrams that are produced but also on the reconstructed values of H0H_{0} which is one of the most important results from this reconstruction method.

On the other hand, the wDE(z=0)w_{\text{DE}}(z=0) constraints coming from designer Horndeski should be taken more carefully as it strongly depends on the assumed shift charge 𝒥\mathcal{J} and so is not completely predictive with just the low-redshift data. Since the shift charge is a constant throughout the evolution, it would have been then highly coincidental if it scales with the Hubble parameter today. The shift charge can instead be constrained with the CMB as done in Ref. [73]. Alternatively, combining the low redshift observations from Hubble data and fσ8f\sigma_{8} can be considered to constrain the shift charge.

It is also interesting to point out the recent work of Ref. [110] which suggests that quintessence is at odds with local determinations of H0H_{0}. Extending this perturbative analysis to broader Horndeski theories may lead to a better understanding of dark energy and the H0H_{0} tension.

5 Conclusion

We first discuss briefly the related work of Ref. [62] which also focused on constraining Horndeski theory. Here, the novelty can be found in the GP reconstruction of H(z)H(z) which exploited the use of observations of the function dp(z)d_{p}(z) and its first derivative dp(z)=1/H(z)d_{p}^{\prime}(z)=1/H(z). The reconstructed Hubble function was then applied to kk-essence and used to constrain a quintessence potential of the restricted form V(ϕ)=Cexp(ϕn)V\left(\phi\right)=C\exp\left(\phi n\right).

In this paper, we combined the results from the GP approach to reconstruction and well-known inversions of the Friedmann equations in Horndeski theory in order to obtain predictions of completely unprescribed Horndeski potentials. We particularly considered a combined data set from cosmic chronometers, supernovae, and baryon acoustic oscillations to construct the Hubble function using GP (Sec. 3.2) and later used this to single out Horndeski theories using three implementations: quintessence potential (Sec. 4.1), designer Horndeski (Sec. 4.2), and tailoring Horndeski (Sec. 4.3). In doing so, we obtained predictions of the potentials that are fully anchored on expansion history data. New constraints on the dark energy equation of state are presented in Table 1.

We also introduced a novel practical way of obtaining predictions of the dark energy equation of state wDE(z)w_{\text{DE}}(z) for all redshifts zz, i.e., by instead sampling over the compactified random variable arctan(1+wDE(z))\arctan\left(1+w_{\text{DE}}(z)\right) (Figs. 5, 8, and 12). Through this, we were able to closely examine regions of the reconstructed function that would otherwise be spoiled by very large uncertainties because of the presence of a nearby singularity. The same technique was used to predict the braiding potential in tailoring Horndeski. We also argued that for a compactified random variable the median surrounded by 34.1%34.1\% of probability mass above and below it is a more reasonable statistic (Appendix A). This is true for the dark energy equation of state which turned out to be generally bimodal. Improvements to this methodology are also interesting for future work.

Several more directions can be considered for future work. First, both quintessence and tailoring Horndeski theories have been completely specified by the Hubble data and prior values of Ωm\Omega_{m} and ΩΛ\Omega_{\Lambda}, i.e., they have no more extra parameters left. Therefore, it is possible to put very tight constraints to both of these theories by comparing their predictions of the growth rate with observations. Similarly, the shift charge in designer Horndeski should be constrainable with additional information such as the existing fσ8f\sigma_{8} data. Second, but not completely unrelated to the first, the perturbations of these theories should be examined for instabilities, e.g., ghost- and Laplace-type. In practice, the stability conditions are usually taken to be reasonable theoretical priors when sampling over parametrized dark energy theories and should also be integrable within GP reconstruction Horndeski implementations. Lastly, it would be interesting to further apply the GP inspired model reconstruction method to dark energy models outside of the Horndeski class such as degenerate higher-order scalar-tensor theories [111] and vector-tensor theories [112, 113].

Acknowledgments

The authors would like to thank Eoin Colgáin for a helpful discussion on quintessence. JLS would like to acknowledge networking support by the COST Action CA18108 and funding support from Cosmology@MALTA which is supported by the University of Malta.

Appendix A Statistics for a compactified random variable

In Fig. 13, we show distributions of a compactified random variable X=arctan(x)X=\arctan(x), where xx is a random variable with domain x(,)x\in\left(-\infty,\infty\right), in cases where it is approximately Gaussian and bimodal.

Refer to caption
(a)
Refer to caption
(b)
Figure 13: Distributions of a compactified random variable XX when it is (a) approximately Gaussian-distributed and (b) bimodal. The red-dashed and blue-dash-dotted lines show the median and the mean, respectively. The red-filled ({}^{\prime}-^{\prime}-hatched) region shows the 34.1% probability mass above and below the median. The blue-filled (×{}^{\prime}\times^{\prime}-hatched) region shows the 1σ1\sigma confidence intervals from the mean, where σ\sigma is the standard deviation.

Consider first Fig. 13-(a). In this case, XX is approximately Gaussian-distributed and so the median and mean of the distribution coincides. In addition, as shown by the colored-hatched regions in Fig. 13-(a), the 1σ1\sigma confidence region around the mean also coincides with the 34.1%34.1\% probability mass above and below the median (50th percentile). On the other hand, Fig. 13-(b) shows an example where the compactified random variable is non-Gaussian, in particular bimodal. In this case, it can be seen that the locations of the median and the mean of the distribution are undisputably different. However, the mean also takes weight from the tails of a non-Gaussian-distribution; in this case, it therefore finds itself in a place where probability mass obviously is not localized. Moreover, a naive interpretation of the 1σ1\sigma confidence intervals (blue-×{}^{\prime}\times^{\prime}-hatched region of Fig. 13-(b)) will predict values supposedly outside the domain of the random variable. On the other hand, the median, by definition, is the place in a distribution where 50%50\% of the mass fall below it. This almost always ends up in a place where the mass is concentrated, as shown in Fig. 13-(b). Moreover, the 34.1%34.1\% of probability mass above and below the median will also always consistently predict samples within the domain of a compactified random variable, regardless of the shape of the distribution.

We clarify one more statistical aspect of a compactified random variable. Fig. 14 shows the GP posteriors of a random variable – the dark energy equation of state (Sec. 4.2) – and its compactified version.

Refer to caption
(a)
Refer to caption
(b)
Figure 14: GP posteriors of (a) a random variable and (b) its compactified version.

Clearly, the posterior of a compactified random variable (Fig. 14(b)) may give the illusion that the error bars are getting smaller at high redshift where the data are sparse. This happens near the boundaries (±π/2\sim\pm\pi/2) of a compactified variable.

References