This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

New Probes of Large Scale Structure

Peikai Li Department of Physics, Carnegie Mellon University, Pittsburgh, PA 15213, USA McWilliams Center for Cosmology, Carnegie Mellon University, Pittsburgh, PA 15213, USA NSF AI Planning Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA    Rupert A. C. Croft Department of Physics, Carnegie Mellon University, Pittsburgh, PA 15213, USA McWilliams Center for Cosmology, Carnegie Mellon University, Pittsburgh, PA 15213, USA NSF AI Planning Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA    Scott Dodelson Department of Physics, Carnegie Mellon University, Pittsburgh, PA 15213, USA McWilliams Center for Cosmology, Carnegie Mellon University, Pittsburgh, PA 15213, USA NSF AI Planning Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
Abstract

This is the second paper in a series where we propose a method of indirectly measuring large scale structure using information from small scale perturbations. The idea is to build a quadratic estimator from small scale modes that provides a map of structure on large scales. We demonstrated in the first paper that the quadratic estimator works well on a dark-matter-only N-body simulation at a snapshot of z=0z=0. Here we generalize the theory to the case of a light cone halo catalog with a non-cubic region taken into consideration. We successfully apply the generalized version of the quadratic estimator to the light cone halo catalog based on an N-body simulation of volume 15.03(h1Gpc)3\sim 15.03\,(h^{-1}\,\rm Gpc)^{3}. The most distant point in the light cone is at a redshift of 1.421.42, indicating the applicability of our method to next generation of galaxy surveys.

I Introduction

Directly measuring the distribution of matter on large scales is extremely difficult due to observational and astrophysical limitations. For example, Modi et al. (2019) point out how large spatial scales in neutral hydrogen surveys are completely contaminated by astrophysical foregrounds. Attempts to use small scale perturbations to get around these limitations and infer large scale information has been frequently discussed in recent years, by Modi et al. (2019), and others: Baldauf et al. (2011)Jeong and Kamionkowski (2012)Li et al. (2014)Zhu et al. (2016)Barreira and Schimidt (2017). In our first work Li et al. (2020), we proposed a method for indirectly measuring large scale structure using the small scale density contrast. Physically, long- and short-wavelength modes are correlated because small scale modes grow differently depending on the large scale structure they reside in. This phenomenon leaves a signature in Fourier space: the two-point statistics of short-wavelength matter density modes will have non-zero off-diagonal terms proportional to long-wavelength modes. This is our starting point for constructing the quadratic estimator for long-wavelength modes. We tested the power of this quadratic estimator using a dark-matter-only catalog from an N-body simulation in our first paper. In the present work, we generalize Ref. Li et al. (2020) to account for two main effects that must be accounted for before applying the techniques to upcoming surveys, e.g., LSST Daek Energy Science Collaboration (2012)WFIRST Science Definition Team (2012)DESI Collaboration (2019): (i) we observe galaxies, not the dark matter field; (ii) we observe a non-cubic light cone rather than a single redshift snapshot. After dealing with these, we should be able to apply our method to real surveys in the near future.

We first need to account for galaxy bias Kravtsov and Klypin (1999)Desjacques et al. (2018). Galaxy bias is a term relating the galaxy number density contrast to the matter density contrast Gil-Marín et al. (2015)Gil-Marín et al. (2017). We use a model of second order bias, as done in recent treatments of galaxy surveys. Meanwhile, analytically the generalization to even higher order biases is straightforward. We adopt the most commonly used second-order galaxy bias model and assume all the bias parameters to be constants even while we are considering a large volume across a wide redshift range.

Observationally a galaxy catalog will be in the form of a light cone Carroll (1997) instead of a single redshift snapshot. The typical treatment is to cut a light cone into several thin redshift bins Chuang et al. (2017) and analyze the properties within each bin. Doing this, though, leads to loss of information on the long-wavelength modes along the line of sight. Thus in this paper we propose a method of considering all the galaxies in a light cone together, using the well-known Feldman-Kaiser-Peacock (FKP) estimator Feldman et al. (1994) to account for the evolution of the galaxy number density. Using an octant volume (the technique can be applied to a even more generalized shape), we test the quadratic estimator for long-wavelength modes using information from non-zero off-diagonal terms as in Li et al. (2020). It should be noticed that the FKP description corresponds to the monopole part of the estimator in redshift space (e.g. the Yamamoto estimator Yamamoto et al. (2006)Bianchi et al. (2015)). Because of this, our formalism will be able to reconstruct the large scale monopole power spectrum which is the main goal when studying the large scale matter distribution of the 3D universe.

We begin with a brief review of the formalism developed in Ref. Li et al. (2020), then present our treatment of the galaxy number density contrast in a light cone and build the quadratic estimator. Finally we apply the estimator to light-cone halo simulations and extract the large scale modes accounting for these effects.

II Review of Quadratic Estimator

We first review the construction of a quadratic estimator of a dark-matter-only catalog Li et al. (2020) before moving to a halo catalog. We start from the perturbative expansion of the matter density contrast in Fourier space up to second order Jain and Bertschinger (1994)Bernardeau et al. (2012):

δm(k;z)=δm(1)(k;z)+δm(2)(k;z)\displaystyle\delta_{\rm m}(\vec{k};z)=\delta^{(1)}_{\rm m}(\vec{k};z)+\delta^{(2)}_{\rm m}(\vec{k};z) (1)
=\displaystyle= δm(1)(k;z)\displaystyle\delta^{(1)}_{\rm m}(\vec{k};z)
+\displaystyle+ d3k(2π)3δm(1)(k;z)δm(1)(kk;z)F2(k,kk)\displaystyle\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}\delta^{(1)}_{m}(\vec{k}^{\prime};z)\delta^{(1)}_{m}(\vec{k}-\vec{k}^{\prime};z){F}_{2}(\vec{k}^{\prime},\vec{k}-\vec{k}^{\prime})

where “m” stands for matter, the superscript i=1,2,i=1,2,\cdots corresponds to the ii-th order term of the expansion, and δm(k;z)\delta_{\rm m}(\vec{k};z) is the full Fourier space matter density contrast in a snapshot at redshift zz. The kernel F2F_{2} is a function particularly insensitive to the choice of cosmological parameters in a dark-energy-dominated universe Takahashi (2008):

F2(k1,k2)=57+27(k1k2)2k12k22+k1k22k1k2[k1k2+k2k1].F_{2}(\vec{k}_{1},\vec{k}_{2})=\frac{5}{7}+\frac{2}{7}\frac{(\vec{k}_{1}\cdot\vec{k}_{2})^{2}}{k_{1}^{2}k_{2}^{2}}+\frac{\vec{k}_{1}\cdot\vec{k}_{2}}{2k_{1}k_{2}}\bigg{[}\frac{k_{1}}{k_{2}}+\frac{k_{2}}{k_{1}}\bigg{]}. (2)

Thus, δm(1)\delta_{\rm m}^{(1)} is the linear density contrast, and the second order term δm(2)\delta_{\rm m}^{(2)} can be written as a convolution-like integral using the first order term.

When evaluating the two-point function of the full density contrast, cross-terms appear. For example, δm(1)(k;z)δm(2)(k;z)\langle\delta_{\rm m}^{(1)}(\vec{k};z)\delta_{\rm m}^{(2)}(\vec{k}^{\prime};z)\rangle is proportional to δm(1)(k+k;z)\delta^{(1)}_{\rm m}(\vec{k}+\vec{k}^{\prime};z) if both k\vec{k} and k\vec{k}^{\prime} correspond to short wavelengths but their sum is small (long wavelength). Explicitly, keeping terms up to second order,

δm(ks;z)δm(ks;z)=f(ks,ks;z)δm(1)(kl;z).\langle{\delta}_{\rm m}(\vec{k}_{s};z){\delta}_{\rm m}(\vec{k}_{s}^{\prime};z)\rangle=f(\vec{k}_{s},\vec{k}_{s}^{\prime};z){\delta}^{(1)}_{\rm m}(\vec{k}_{l};z). (3)

Here ks\vec{k}_{s} and ks\vec{k}_{s}^{\prime} are two short-wavelength modes and kl\vec{k}_{l} is a long-wavelength mode (ks,kskl\vec{k}_{s},\vec{k}_{s}^{\prime}\gg\vec{k}_{l}). They satisfy the squeezed-limit condition, ks+ks=kl\vec{k}_{s}+\vec{k}_{s}^{\prime}=\vec{k}_{l}, and ff is given by:

f(ks,ks;z)\displaystyle f(\vec{k}_{s},\vec{k}_{s}^{\prime};z) =\displaystyle= 2F2(ks,ks+ks)Pm(1)(ks;z)\displaystyle 2F_{2}(-\vec{k}_{s},\vec{k}_{s}+\vec{k}_{s}^{\prime})P_{\rm m}^{(1)}(k_{s};z) (4)
+\displaystyle+ 2F2(ks,ks+ks)Pm(1)(ks;z).\displaystyle 2F_{2}(-\vec{k}_{s}^{\prime},\vec{k}_{s}+\vec{k}_{s}^{\prime})P_{\rm m}^{(1)}(k_{s}^{\prime};z).

Here Pm(1)P_{\rm m}^{(1)} is the linear matter power spectrum. Eq. (3) indicates that we can estimate the long-wavelength modes using small scale information with the following minimum variance quadratic estimator:

δ^m(1)(kl;z)=A(kl;z)d3ks(2π)3g(ks,ks;z)δm(ks;z)δm(ks;z)\displaystyle\hat{\delta}^{(1)}_{\rm m}(\vec{k}_{l};z)=A(\vec{k}_{l};z)\int\frac{d^{3}\vec{k}_{s}}{(2\pi)^{3}}g(\vec{k}_{s},\vec{k}_{s}^{\prime};z){\delta}_{\rm m}(\vec{k}_{s};z){\delta}_{\rm m}(\vec{k}_{s}^{\prime};z)
(5)

with ks=klks\vec{k}_{s}^{\prime}=\vec{k}_{l}-\vec{k}_{s}. The normalization factor AA is defined by requiring that δ^m(1)(kl;z)=δm(1)(kl;z)\langle\hat{\delta}_{\rm m}^{(1)}(\vec{k}_{l};z)\rangle={\delta}_{\rm m}^{(1)}(\vec{k}_{l};z), and the weighting function gg is obtained by minimizing the noise. They can be expressed as:

A(kl;z)\displaystyle A(\vec{k}_{l};z) =\displaystyle= [d3ks(2π)3g(ks,ks;z)f(ks,ks;z)]1\displaystyle\bigg{[}\int\frac{d^{3}\vec{k}_{s}}{(2\pi)^{3}}g(\vec{k}_{s},\vec{k}_{s}^{\prime};z)f(\vec{k}_{s},\vec{k}_{s}^{\prime};z)\bigg{]}^{-1}
g(ks,ks;z)\displaystyle g(\vec{k}_{s},\vec{k}_{s}^{\prime};z) =\displaystyle= f(ks,ks;z)2Pm(ks;z)Pm(ks;z)\displaystyle\frac{f(\vec{k}_{s},\vec{k}_{s}^{\prime};z)}{2P_{\rm m}(k_{s};z)P_{\rm m}(k_{s}^{\prime};z)} (6)

where PmP_{\rm m} is the nonlinear matter power spectrum including shot noise. With this choice of the weighting function gg, the noise on the estimator N(kl;z)=A(kl;z)N(\vec{k}_{l};z)=A(\vec{k}_{l};z) if non-Gaussian terms in the four-point function are neglected. Therefore, the projected detectability of a power spectrum measurement using this quadratic estimator can be written as:

1σ2(kl;z)=Vkl2Δk(2π)2[Pm(1)(kl;z)Pm(1)(kl;z)+A(kl;z)]2,\frac{1}{\sigma^{2}(k_{l};z)}=\frac{Vk_{l}^{2}\Delta k}{(2\pi)^{2}}\bigg{[}\frac{P_{\rm m}^{(1)}(k_{l};z)}{P_{\rm m}^{(1)}(k_{l};z)+A(k_{l};z)}\bigg{]}^{2}, (7)

where VV is the volume of a survey and Δk\Delta k is the width of long-wavelength mode bins. We also take advantage of the fact that A(kl;z)A(\vec{k}_{l};z) does not depend on the direction of the long mode kl\vec{k}_{l}.

III Generalization: Bias Model and FKP estimator

Galaxy bias describes the statistical relation between dark-matter and galaxy distributions. Similar to Eq. (1), we use the most commonly used Eulerian non-linear and non-local galaxy bias model np to second-order first proposed by McDonald and Roy (2009):

δg(k;z)=b1δm(1)(k;z)\displaystyle\delta_{\rm g}(\vec{k};z)=b_{1}\delta^{(1)}_{m}(\vec{k};z) (8)
+\displaystyle+ d3k(2π)3δm(1)(k;z)δm(1)(kk;z)2(𝓀,𝓀𝓀).\displaystyle\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}\delta^{(1)}_{m}(\vec{k}^{\prime};z)\delta^{(1)}_{m}(\vec{k}-\vec{k}^{\prime};z)\mathpzc{F}_{2}(\vec{k}^{\prime},\vec{k}-\vec{k}^{\prime})\;.

here “g\rm g” denotes galaxy, and b1b_{1} is the linear bias parameter relating galaxy and the matter density contrasts. The kernel 2\mathpzc{F}_{2} is given by:

2(𝓀1,𝓀2)=𝒷12(𝓀1,𝓀2)+𝒷22+𝒷𝓈22𝒮2(𝓀1,𝓀2)\mathpzc{F}_{2}(\vec{k}_{1},\vec{k}_{2})=b_{1}F_{2}(\vec{k}_{1},\vec{k}_{2})+\frac{b_{2}}{2}+\frac{b_{s^{2}}}{2}S_{2}(\vec{k}_{1},\vec{k}_{2}) (9)

with S2S_{2} given by:

S2(k1,k2)=(k1k2)2k12k2213.S_{2}(\vec{k}_{1},\vec{k}_{2})=\frac{(\vec{k}_{1}\cdot\vec{k}_{2})^{2}}{k_{1}^{2}k_{2}^{2}}-\frac{1}{3}\;. (10)

Comparing the perturbative expansion of the galaxy density contrast Eq. (8) with that of the matter density contrast Eq. (1), we see that the difference with the first order term is an extra coefficient b1b_{1}. The second order term is also almost the same, with a simple replacement of the kernel function. This implies that we can easily generalize to the case of a galaxy catalog in a snapshot. In the case of a galaxy catalog in a light cone, the Feldman-Kaiser-Peacock (FKP) estimator is usually used to construct a weighted over-density that can be used to obtain the observed galaxy power spectrum Feldman et al. (1994):

F(r)I1/2wFKP(r)[ng(r)αns(r)]F(\vec{r})\equiv I^{-1/2}w_{\rm FKP}(\vec{r})[n_{g}(\vec{r})-\alpha n_{s}(\vec{r})] (11)

with

IVd3rwFKP2(r)ng2(r).I\equiv\int_{V}d^{3}\vec{r}w_{\rm FKP}^{2}(\vec{r})\langle n_{g}\rangle^{2}(\vec{r})\;. (12)

Here ngn_{g} is the observed galaxy number density and nsn_{s} is the corresponding synthetic catalog (a random catalog with the same angular and radial selection function as the observations). The constant α\alpha is the ratio of the observed number density to the synthetic catalog’s number density. The FKP weight wFKP(r)w_{\rm FKP}(\vec{r}) is usually defined as:

wFKP(r)=11+ng(r)P0\displaystyle w_{\rm FKP}(\vec{r})=\frac{1}{1+\langle n_{g}\rangle(\vec{r})P_{0}} (13)

where P0P_{0} is the typical amplitude of the observed power spectrum at the scale where the signal-to-noise of the power spectrum estimation is maximized, usually k0.12hMpc1k\sim 0.12\,h\rm Mpc^{-1}. Note that in real surveys we will have other types of weights Gil-Marín et al. (2015)Gil-Marín et al. (2018), which can be easily included in the formalism of this section. The FKP estimator F(r)F(\vec{r}) is related to the observed galaxy power spectrum Pg,obs(k)P_{g,\rm obs}(\vec{k}) by considering the following expectation value (diagonal elements) in Fourier space:

|F(k)|2\displaystyle\langle|F(\vec{k})|^{2}\rangle =\displaystyle= d3k(2π)3Pg(k;zeff)|W(kk)|2+Pshotnoise\displaystyle\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}P_{g}(k^{\prime};z_{\rm eff})|W(\vec{k}-\vec{k}^{\prime})|^{2}+P_{\rm shot\;noise} (14)
=\displaystyle= |δg,W(k;zeff)|2+PshotnoisePg,obs(k),\displaystyle\langle|\delta_{g,W}(\vec{k};z_{\rm eff})|^{2}\rangle+P_{\rm shot\;noise}\equiv P_{g,\rm obs}(\vec{k}),

where zeffz_{\rm eff} is the effective redshift of the whole light cone. The window function W(k)W(\vec{k}), the shot noise spectrum PshotnoiseP_{\rm shot\;noise} and the windowed galaxy density contrast δg,W(k;z)\delta_{g,W}(\vec{k};z) are given respectively by111An interesting thing to notice here is that in the original paper introducing the FKP estimator Feldman et al. (1994) and in almost every succeeding work, W(k)W(\vec{k}) is defined as the complex conjugate of the quantity we use here. In these other cases, W(k)W(\vec{k}) only appears in the form of |W(k)|2|W(\vec{k})|^{2}, and so this does not make a difference. However in our current work, we demonstrate using simulations that eikre^{-i\vec{k}\cdot\vec{r}} should appear instead of e+ikre^{+i\vec{k}\cdot\vec{r}} in the integrand, the same as for the definition of the Fourier transform.:

W(k)\displaystyle W(\vec{k}) =\displaystyle= I1/2Vd3rng(r)wFKP(r)eikr\displaystyle I^{-1/2}{\int_{V}d^{3}\vec{r}\langle n_{g}\rangle(\vec{r})w_{\rm FKP}(\vec{r})e^{-i\vec{k}\cdot\vec{r}}} (15)
Pshotnoise\displaystyle P_{\rm shot\;noise} =\displaystyle= (1+α)I1Vd3rng(r)wFKP2(r)\displaystyle(1+\alpha)I^{-1}{\int_{V}d^{3}\vec{r}\langle n_{g}\rangle(\vec{r})w_{\rm FKP}^{2}(\vec{r})} (16)
δg,W(k)\displaystyle\delta_{g,W}(\vec{k}) \displaystyle\equiv d3k(2π)3δg(k)W(kk).\displaystyle\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}\delta_{g}(\vec{k}^{\prime})W(\vec{k}-\vec{k}^{\prime})\;. (17)

We want to calculate the off-diagonal term of the FKP estimator F(k)F(\vec{k}) given the fact that F(k)F(\vec{k}) is the observable from a galaxy light cone survey rather than δg,W\delta_{g,W}. Note that the two point functions of ng(r)αns(r)n_{g}(\vec{r})-\alpha n_{s}(\vec{r}) can be written as Feldman et al. (1994):

[ng(r)αns(r)][ng(r)αns(r)]\displaystyle\langle[n_{g}(\vec{r})-\alpha n_{s}(\vec{r})][n_{g}(\vec{r}^{\prime})-\alpha n_{s}(\vec{r}^{\prime})]\rangle
=\displaystyle= ng(r)ng(r)ξg(rr)+(1+α)ng(r)δD(rr)\displaystyle\langle n_{g}\rangle(\vec{r})\langle n_{g}\rangle(\vec{r}^{\prime})\xi_{g}(\vec{r}-\vec{r}^{\prime})+(1+\alpha)\langle n_{g}\rangle(\vec{r})\delta_{\rm D}(\vec{r}-\vec{r}^{\prime})

Assuming the squeezed limit ks+ks=kl\vec{k}_{s}+\vec{k}^{\prime}_{s}=\vec{k}_{l} and using the expression above, we can write the off-diagonal term as:

F(ks)F(ks)\displaystyle\langle F(\vec{k}_{s})F(\vec{k}^{\prime}_{s})\rangle (19)
=\displaystyle= δg,W(ks;zeff)δg,W(ks;zeff)+Qshotnoise(kl),\displaystyle\langle\delta_{g,W}(\vec{k}_{s};z_{\rm eff})\delta_{g,W}(\vec{k}^{\prime}_{s};z_{\rm eff})\rangle+Q_{\rm shot\;noise}(\vec{k}_{l}),

with the“off-diagonal shot noise” Q(kl)Q(\vec{k}_{l}) given by:

Q(kl)=(1+α)I1Vd3rng(r)wFKP2(r)eiklr.Q(\vec{k}_{l})=(1+\alpha)I^{-1}{\int_{V}d^{3}\vec{r}\langle n_{g}\rangle(\vec{r})w_{\rm FKP}^{2}(\vec{r})e^{i\vec{k}_{l}\cdot\vec{r}}}\;. (20)

The two point function δg,W(ks;zeff)δg,W(ks;zeff)\langle\delta_{g,W}(\vec{k}_{s};z_{\rm eff})\delta_{g,W}(\vec{k}^{\prime}_{s};z_{\rm eff})\rangle up to second order can be simply expressed as:

δg,Wδg,W=δg,W(1)δg,W(1)+δg,W(1)δg,W(2)+δg,W(2)δg,W(1)\langle\delta_{g,W}\delta_{g,W}^{\prime}\rangle=\langle\delta^{(1)}_{g,W}\delta^{\prime(1)}_{g,W}\rangle+\langle\delta^{(1)}_{g,W}\delta^{\prime(2)}_{g,W}\rangle+\langle\delta^{(2)}_{g,W}\delta^{\prime(1)}_{g,W}\rangle (21)

by defining δg,Wδg,W(ks;zeff)\delta_{g,W}\equiv\delta_{g,W}(\vec{k}_{s};z_{\rm eff}) and δg,Wδg,W(ks;zeff)\delta_{g,W}^{\prime}\equiv\delta_{g,W}(\vec{k}^{\prime}_{s};z_{\rm eff}). One major difference here is that, for a non-cubical region, the leading order term would also be non-zero unlike the cubic volume in the last section II:

δg,W(1)δg,W(2)\displaystyle\langle\delta^{(1)}_{g,W}\delta^{\prime(2)}_{g,W}\rangle (22)
=\displaystyle= d3k(2π)3d3k(2π)3W(kks)W(kks)\displaystyle\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}W(\vec{k}-\vec{k}_{s})W(\vec{k}^{\prime}-\vec{k}^{\prime}_{s})
×\displaystyle\times b12(2π)3δD(kk)Pm(1)(k;zeff)\displaystyle b_{1}^{2}(2\pi)^{3}\delta_{\rm D}(\vec{k}-\vec{k}^{\prime})P_{m}^{(1)}(k;z_{\rm eff})
=\displaystyle= b12d3k(2π)3W(kks)W(kks)Pm(1)(k;zeff)\displaystyle b_{1}^{2}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}W(\vec{k}-\vec{k}_{s})W(-\vec{k}-\vec{k}^{\prime}_{s})P_{m}^{(1)}(k;z_{\rm eff})

where δD\delta_{\rm D} is the Dirac delta function. This term would vanish since in the case of a cube, W(k)W(\vec{k}) would be close to a Dirac delta function. For a non-cubic region, though, the term is no longer zero. Note that this leading order term can be fully determined numerically.

Using the expressions from Eq. (8) and Eq. (17), we can compute the second order two-point correlation of two short-wavelength modes δg,W(ks;zeff)\delta_{g,W}(\vec{k}_{s};z_{\rm eff}) and δg,W(ks;zeff)\delta_{g,W}(\vec{k}^{\prime}_{s};z_{\rm eff}). Using δg,W(1)δg,W(2)\langle\delta^{(1)}_{g,W}\delta^{\prime(2)}_{g,W}\rangle as an example, we have:

δg,W(1)δg,W(2)\displaystyle\langle\delta^{(1)}_{g,W}\delta^{\prime(2)}_{g,W}\rangle (23)
=\displaystyle= b1d3k(2π)3d3k(2π)3W(ksk)W(ksk)\displaystyle b_{1}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}W(\vec{k}_{s}-\vec{k})W(\vec{k}^{\prime}_{s}-\vec{k}^{\prime})
×\displaystyle\times δm(1)(k;zeff)δg(2)(k;zeff)\displaystyle\langle\delta^{(1)}_{m}(\vec{k};z_{\rm eff})\delta^{(2)}_{g}(\vec{k}^{\prime};z_{\rm eff})\rangle

Notice that we have computed the bracket δm(1)(k;zeff)δg(2)(k;zeff)\langle\delta^{(1)}_{m}(\vec{k};z_{\rm eff})\delta^{(2)}_{g}(\vec{k}^{\prime};z_{\rm eff})\rangle in Ref. Li et al. (2020), with F2F_{2} replaced by 2\mathpzc{F}_{2}. The result is:

δm(1)(k;zeff)δg(2)(k;zeff)\displaystyle\langle\delta^{(1)}_{m}(\vec{k};z_{\rm eff})\delta^{(2)}_{g}(\vec{k}^{\prime};z_{\rm eff})\rangle (24)
=\displaystyle= 22(𝓀,𝓀+𝓀)𝒫𝓂(1)(𝓀;𝓏eff)δ𝓂(1)(𝓀+𝓀;𝓏eff)\displaystyle 2\mathpzc{F}_{2}(-\vec{k},\vec{k}+\vec{k}^{\prime})P_{m}^{(1)}(k;z_{\rm eff})\delta_{m}^{(1)}(\vec{k}+\vec{k}^{\prime};z_{\rm eff})

Thus we can further express the bracket as:

δg,W(1)δg,W(2)\displaystyle\langle\delta^{(1)}_{g,W}\delta^{\prime(2)}_{g,W}\rangle (25)
=\displaystyle= 2b1d3k(2π)3d3k(2π)3W(ksk)W(ksk)\displaystyle 2b_{1}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}W(\vec{k}_{s}-\vec{k})W(\vec{k}^{\prime}_{s}-\vec{k}^{\prime})
×\displaystyle\times 2(𝓀,𝓀+𝓀)𝒫𝓂(1)(𝓀;𝓏eff)δ𝓂(1)(𝓀+𝓀;𝓏eff)\displaystyle\mathpzc{F}_{2}(-\vec{k},\vec{k}+\vec{k}^{\prime})P_{m}^{(1)}(k;z_{\rm eff})\delta_{m}^{(1)}(\vec{k}+\vec{k}^{\prime};z_{\rm eff})

Ideally, We would like to extract a term δg,W(1)(kl;zeff)\delta^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff}) from this, where:

δg,W(1)(kl;zeff)=b1d3k(2π)3δm(1)(k;zeff)W(klk).\displaystyle\delta^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff})=b_{1}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}\delta_{m}^{(1)}(\vec{k};z_{\rm eff})W(\vec{k}_{l}-\vec{k}). (26)

If one of the WW functions were a Dirac delta function, this would follow automatically. Here, WW is not a delta function, but given a large enough volume, W(k)W(\vec{k}) is peaked at k=0\vec{k}=0 and also:

d3k(2π)3W(k)=W(r=0)C\displaystyle\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}W(\vec{k})=W(\vec{r}=0)\equiv C (27)

Thus we have the following approximations by first applying a redefinition of dummy variables:

δg,W(1)(ks;zeff)δg,W(2)(ks;zeff)\displaystyle\langle\delta^{(1)}_{g,W}(\vec{k}_{s};z_{\rm eff})\delta^{(2)}_{g,W}(\vec{k}^{\prime}_{s};z_{\rm eff})\rangle (28)
=\displaystyle= 2b1d3k(2π)3d3k(2π)3W(ksk+k)W(ksk)\displaystyle 2b_{1}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}\int\frac{d^{3}\vec{k}^{\prime}}{(2\pi)^{3}}W(\vec{k}_{s}-\vec{k}+\vec{k}^{\prime})W(\vec{k}^{\prime}_{s}-\vec{k}^{\prime})
×\displaystyle\times 2(𝓀+𝓀,𝓀)𝒫𝓂(1)(|𝓀𝓀|;𝓏eff)δ𝓂(1)(𝓀;𝓏eff)\displaystyle\mathpzc{F}_{2}(-\vec{k}+\vec{k}^{\prime},\vec{k})P_{m}^{(1)}(|\vec{k}-\vec{k}^{\prime}|;z_{\rm eff})\delta_{m}^{(1)}(\vec{k};z_{\rm eff})
\displaystyle\simeq 2C2(𝓀𝓈,𝓀𝓈+𝓀𝓈)𝒫𝓂(1)(𝓀𝓈;𝓏eff)\displaystyle 2C\mathpzc{F}_{2}(-\vec{k}_{s},\vec{k}_{s}+\vec{k}^{\prime}_{s})P_{m}^{(1)}(k_{s};z_{\rm eff})
×\displaystyle\times b1d3k(2π)3δm(1)(k;z)W(ks+ksk)\displaystyle b_{1}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}\delta_{m}^{(1)}(\vec{k};z)W(\vec{k}_{s}+\vec{k}^{\prime}_{s}-\vec{k})
=\displaystyle= 2C2(𝓀𝓈,𝓀𝓈+𝓀𝓈)𝒫𝓂(1)(𝓀𝓈;𝓏eff)δ,𝒲(1)(𝓀𝓁;𝓏eff).\displaystyle 2C\mathpzc{F}_{2}(-\vec{k}_{s},\vec{k}_{s}+\vec{k}^{\prime}_{s})P_{m}^{(1)}(k_{s};z_{\rm eff})\delta^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff}).

With the calculation above, we can then recover the long-wavelength modes from the off-diagonal two-point functions of short-wavelength modes:

F(ks)F(ks)Qshotnoise(kl)\displaystyle\langle F(\vec{k}_{s})F(\vec{k}^{\prime}_{s})\rangle-Q_{\rm shot\;noise}(\vec{k}_{l}) (29)
\displaystyle- b12d3k(2π)3W(kks)W(kks)Pm(1)(k;zeff)\displaystyle b_{1}^{2}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}W(\vec{k}-\vec{k}_{s})W(-\vec{k}-\vec{k}^{\prime}_{s})P^{(1)}_{m}(k;z_{\rm eff})
=\displaystyle= 𝒻(𝓀𝓈,𝓀𝓈;𝓏eff)δ,𝒲(1)(𝓀𝓁;𝓏eff)\displaystyle\mathpzc{f}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff})\delta^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff})

with

𝒻(𝓀𝓈,𝓀𝓈;𝓏eff)\displaystyle\mathpzc{f}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff}) =\displaystyle= 2C2(𝓀𝓈,𝓀𝓈+𝓀𝓈)𝒫𝓂(1)(𝓀𝓈;𝓏eff)\displaystyle 2C\mathpzc{F}_{2}(-\vec{k}_{s},\vec{k}_{s}+\vec{k}^{\prime}_{s})P_{m}^{(1)}(k_{s};z_{\rm eff}) (30)
+\displaystyle+ 2C2(𝓀𝓈,𝓀𝓈+𝓀𝓈)𝒫𝓂(1)(𝓀𝓈;𝓏eff).\displaystyle 2C\mathpzc{F}_{2}(-\vec{k}^{\prime}_{s},\vec{k}_{s}+\vec{k}^{\prime}_{s})P_{m}^{(1)}(k^{\prime}_{s};z_{\rm eff}).

Notice that the 𝒻\mathpzc{f} is almost identical to the ff function in section II, simply with a replacement of the F2F_{2} function and an extra coefficient CC. The quadratic estimator can then be similarly formed, and is:

δ^g,W(1)(kl;zeff)=𝒜(𝓀𝓁;𝓏eff)𝒹3𝓀𝓈(2π)3(𝓀𝓈,𝓀𝓈;𝓏eff)\displaystyle\hat{\delta}^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff})=\mathpzc{A}(\vec{k}_{l};z_{\rm eff})\int\frac{d^{3}\vec{k}_{s}}{(2\pi)^{3}}\mathpzc{g}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff})
×[F(ks)F(ks)Qshotnoise(kl)\displaystyle\times\bigg{[}F(\vec{k}_{s})F(\vec{k}^{\prime}_{s})-Q_{\rm shot\;noise}(\vec{k}_{l})
b12d3k(2π)3W(kks)W(kks)Pm(1)(k;zeff)]\displaystyle-b_{1}^{2}\int\frac{d^{3}\vec{k}}{(2\pi)^{3}}W(\vec{k}-\vec{k}_{s})W(-\vec{k}-\vec{k}^{\prime}_{s})P^{(1)}_{m}(k;z_{\rm eff})\bigg{]} (31)

with ks=klks\vec{k}^{\prime}_{s}=\vec{k}_{l}-\vec{k}_{s} and \mathpzc{g} being the weighting function. Notice here that the only difference is that we subtract off the non-zero leading order terms due to the non-cubical shape of the galaxy survey volume, and these two terms can be calculated numerically. By requiring that δ^g,W(1)(kl;zeff)=δg,W(1)(kl;zeff)\langle\hat{\delta}^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff})\rangle={\delta}^{(1)}_{g,W}(\vec{k}_{l};z_{\rm eff}) we can similarly determine the normalization function 𝒜\mathpzc{A}:

𝒜(𝓀𝓁;𝓏eff)=[𝒹3𝓀𝓈(2π)3(𝓀𝓈,𝓀𝓈;𝓏eff)𝒻(𝓀𝓈,𝓀𝓈;𝓏eff)]1.\displaystyle\mathpzc{A}(\vec{k}_{l};z_{\rm eff})=\bigg{[}\int\frac{d^{3}\vec{k}_{s}}{(2\pi)^{3}}\mathpzc{g}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff})\mathpzc{f}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff})\bigg{]}^{-1}.

Similar to Ref. Li et al. (2020), by minimizing the noise we obtain the expression for the weighting function \mathpzc{g}:

(𝓀𝓈,𝓀𝓈;𝓏eff)=𝒻(𝓀𝓈,𝓀𝓈;𝓏eff)2𝒫,obs(𝓀𝓈)𝒫,obs(𝓀𝓈)\displaystyle\mathpzc{g}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff})=\frac{\mathpzc{f}(\vec{k}_{s},\vec{k}^{\prime}_{s};z_{\rm eff})}{2P_{g,\rm obs}(\vec{k}_{s})P_{g,\rm obs}(\vec{k}^{\prime}_{s})} (33)
=\displaystyle= C[2(𝓀𝓈,𝓀𝓈+𝓀𝓈)𝒫𝓂(1)(𝓀𝓈;𝓏eff)Pg,obs(ks)Pg,obs(ks)\displaystyle C\bigg{[}\frac{\mathpzc{F}_{2}(-\vec{k}_{s},\vec{k}_{s}+\vec{k}^{\prime}_{s})P_{m}^{(1)}(k_{s};z_{\rm eff})}{P_{g,\rm obs}(\vec{k}_{s})P_{g,\rm obs}(\vec{k}^{\prime}_{s})}
+2(𝓀𝓈,𝓀𝓈+𝓀𝓈)𝒫𝓂(1)(𝓀𝓈;𝓏eff)Pg,obs(ks)Pg,obs(ks)].\displaystyle\,+\frac{\mathpzc{F}_{2}(-\vec{k}^{\prime}_{s},\vec{k}_{s}+\vec{k}^{\prime}_{s})P_{m}^{(1)}(k^{\prime}_{s};z_{\rm eff})}{P_{g,\rm obs}(\vec{k}_{s})P_{g,\rm obs}(\vec{k}^{\prime}_{s})}\bigg{]}.

Here Pg,obsP_{g,\rm obs} is the full observed galaxy power spectrum including shot noise. With this choice of \mathpzc{g} the noise term 𝒩\mathpzc{N} is identical to the normalization factor 𝒜\mathpzc{A}. The projected detectability is defined as in Eq. (7):

1σ(kl;zeff)2=Vkl2Δk(2π)2[Pm(1)(kl;zeff)Pm(1)(kl;zeff)+𝒜(𝓀𝓁;𝓏eff)]2.\displaystyle\frac{1}{\sigma(k_{l};z_{\rm eff})^{2}}=\frac{Vk_{l}^{2}\Delta k}{(2\pi)^{2}}\bigg{[}\frac{P_{m}^{(1)}(k_{l};z_{\rm eff})}{P_{m}^{(1)}(k_{l};z_{\rm eff})+\mathpzc{A}(k_{l};z_{\rm eff})}\bigg{]}^{2}.
(34)

Using the quadratic estimator Eq. (31) we can use the entirety of the small scale information from the non-cubical light cone to infer the large scale field of the windowed galaxy density contrast δg,W(r)\delta_{g,W}(\vec{r}).

IV Demonstration with An N-Body Simulation

We use the MICE Grand Challenge light cone N-body simulation (MICE-GC) Fosalba et al. (2015a)Crocce et al. (2015)Fosalba et al. (2015b) to demonstrate the power of the estimator in a light cone. The catalog contains one octant of the full sky up to z=1.42z=1.42 (comoving distance 3062h1Mpc3062\,h^{-1}\,\rm Mpc) without simulation box repetition, as shown in Fig. 1. This simulation used a flat Λ\LambdaCDM model with cosmological parameters Ωm=0.25\Omega_{\rm m}=0.25, σ8=0.8\sigma_{8}=0.8, ns=0.95n_{\rm s}=0.95, Ωb=0.044\Omega_{\rm b}=0.044, ΩΛ=0.75\Omega_{\Lambda}=0.75, h=0.7h=0.7.

Refer to caption
Figure 1: The survey region of the MICE-GC simulation, which is an octant. Note that due to some technical reasons there are no galaxies near the origin OO, so a small octant is removed from the survey region.

We consider the halo catalog in this light cone with halo masses between 2.2×1012h1M<M<1014h1M2.2\times 10^{12}h^{-1}M_{\odot}<M<10^{14}h^{-1}M_{\odot}, a wide mass bin. We obtained similar results using other mass bins as well. The effective redshift of this light cone is zeff=0.76z_{\rm eff}=0.76. We assume the bias parameter b1b_{1} and b2b_{2} to be free parameters of the model, and use FAST-PT McEwen et al. (2016) to determine the bias parameters to be:

b1\displaystyle b_{1} =\displaystyle= 1.88\displaystyle 1.88
b2\displaystyle b_{2} =\displaystyle= 3.13.\displaystyle 3.13\,. (35)

The remaining bias parameter bs2b_{s^{2}} can be constrained by assuming the bias model is local in Lagrangian space Baldauf et al. (2012):

bs2=47(b11)=0.50.b_{s^{2}}=-\frac{4}{7}(b_{1}-1)=-0.50\,. (36)

We use the quadratic estimator, Eq. (31) to obtain the reconstructed Fourier space windowed galaxy density field, and transform it back into real space. Then, we compare this indirectly estimated result with the directly measured (using the FKP estimator) galaxy density in real space in Fig. 2. We use the information from small scale modes up to ks,max=0.48hMpc1k_{s,\rm max}=0.48\,h\,\rm Mpc^{-1}. We also plot in Fig. 3 the directly measured large scale galaxy power spectrum with cosmic variance error bars (corresponding to Eq. (34) with 𝒜=0\mathpzc{A}=0) versus the estimated large scale power spectrum using Eq. (31) with detectability given by Eq. (34). We see that our quadratic estimator gives a good estimation of the linear matter power spectrum and the uncertainty of the estimated result is only slightly larger than cosmic variance.

Note that in Fig. 2 because of the light cone, we cannot have a direct measurement of the δg,W\delta_{g,W} field. Both the field computed from direct measurement of the Fourier modes (with FKP weighting) and the field derived from the quadratic estimator can be seen to encode almost the same large scale information from the light cone catalog. The large scales we are observing correspond to about 103hMpc110^{-3}h\,\rm Mpc^{-1}<kl<102hMpc1<k_{l}<10^{-2}h\,\rm Mpc^{-1}, where the magnitude of the observed power spectrum Fig. 3 is much greater than the shot noise term (in this case Pshotnoise1000(h1Mpc)3P_{\rm shot\;noise}\simeq 1000\,(h^{-1}\rm Mpc)^{3}). From Fig. 2 we see that as our quadratic estimator extracts large scale information, the cells with large over- and under-densities are especially well reconstructed. The difference with the true field becomes slightly larger when we go to higher redshift (corresponding to the panels on the right) and is worst for the very right panel.

Refer to caption
Figure 2: Comparison of the true real space galaxy density field in the MICE-GC simulation (F(r)F(\vec{r}) computed using the directly measured large scale modes and FKP estimator, top row) and the windowed halo density field from the quadratic estimator (δ^h,W(r)\hat{\delta}_{h,W}(\vec{r}), middle row). The bottom row shows their difference. Each cell is (0.44h1Gpc0.44\,h^{-1}\rm Gpc) thick, and panels are arranged so that the mean redshift increases from left to right. The upper limit on ks\vec{k}_{s} input to the quadratic estimator is 0.48hMpc10.48\,h\,\rm Mpc^{-1}.
Refer to caption
Figure 3: The inferred linear matter power spectrum from direct measurement versus indirect estimation using our quadratic estimator, both from the MICE-GC light-cone halo catalog. The error bar for the direct measurement is derived from cosmic variance (see text). The error for the indirect estimate is expressed using Pm(1)(kl)σ(kl)P_{m}^{(1)}(k_{l})\sigma(k_{l}), where σ(kl)\sigma(k_{l}) is from Eq. (34) after scaling.

V Conclusion

V.1 Summary

In prior work Li et al. (2020) we have shown that the amplitude and phase of large scale density fluctuations can be recovered by applying a quadratic estimator to measurements of small scale Fourier modes and their correlations. In this paper we extend that work (which was limited to a matter density field at a single instance in cosmic time) to a light cone galaxy catalog in order to make it applicable to observational data. All extensions are tested on appropriate mock survey datasets derived from N-body simulations.

V.2 Discussion

Our formalism includes the major effects that are relevant for an application to observational data. There are some minor aspects however which will need to be dealt with when this occurs. One is the fact that we have tested on homogeneous mock surveys, when real observations will include masked data (to account for bright stars for example), and a potentially more complex window function. In a spectroscopic survey, the observed distribution of galaxies is distorted and squashed when we use their redshift as an indicator of their radial distance due to galaxies’ peculiar velocity. This effect is known as redshift space distortion Kaiser (1987) and the FKP formalism corresponds to the monopole moment in redshift space. We have left the generalization to include higher order power spectrum multipoles (quadrapole, hexadecapole) to future work.

At present the large scale limitations on direct measurement of galaxy clustering are observational systematics (e.g., Ho et al. (2012)). These include angular variations in obscuration, seeing, sky brightness, colors, extinction and magnitude errors. Because these result in relatively small modulations of the measured galaxy density, they will affect large scale modes most importantly, hence the utility of our indirect measurements of clustering on these scales. Quantification of these effects on the scales for which we do measure clustering will still be needed though. It will be also be instructive to apply large scale low amplitude modulations to our mock surveys in order to test how well the quadratic estimator works with imperfect data. Even small scale issues with clustering, such as fiber collisions Hahn et al. (2017) could affect our reconstruction, depending on how their effects propagate through the quadratic estimator.

Observational datasets exist at present which could be used to carry out measurements using our methods. These include the SDSS surveys BOSS Dawson et al. (2013) and eBOSS Dawson et al. (2016) (both luminous red galaxies and emission line galaxies). Substantial extent in both angular coordinates and redshift are necessary, so that deep but narrow surveys such as VIMOS Le Fèvre et al. (2015) or DEEP2 Coil et al. (2006) would not be suitable. In the near future, the available useful data will increase rapidly with the advent of WEAVE Dalton et al. (2014) and DESI DESI Collaboration (2019). Space based redshift surveys with EUCLID Amiaux et al. (2021) and WFIRST WFIRST Science Definition Team (2012) will expand the redshift range, and SPHEREx Doré et al. (2014), due for launch even earlier will offer maximum sky coverage, and likely the largest volume of all.

In order to model what is expected from all these datasets, the effective range of wavelengths used in the reconstruction of large scale modes should be considered. Surveys covering large volumes but with low galaxy number density will have large shot noise contributions to density fluctuations, and this will limit the range of scales that can be used. For example, in our present work we have successfully tested number densities of 3×103\sim 3\times 10^{-3} galaxies per (Mpc/h)3. Surveys such as the eBOSS quasar redshift survey Ata et al. (2018) with a number density 100\sim 100 times lower will not be useful, for example.

Once an indirect measurement of large scale modes has been made from an observational dataset, there are many different potential applications. We can break these up into two groups, involving the power spectrum itself, and the map (and statistics beyond Pm(k)P_{\rm m}(k)) which can be derived from it.

First, because of the effect of observational systematics mentioned above, and the fact the our indirect estimate of clustering is sensitive to fluctuations beyond the survey boundaries itself, then it is likely that the measurement we propose would correspond to the largest scale estimate of three dimensional matter clustering yet made. This would in itself be an exciting test of theories, for example probing the power spectrum beyond the matter-radiation equality turnover, and allowing access to the Harrison-Zeldovich portion. There has been much work analyzing large scale anomalies in the clustering measured from the CMB Copi et al. (2010)Rassat et al. (2014)Schwarz et al. (2016), and it would be extremely useful to see if anything comparable is seen from galaxy large scale structure data. On smaller scales, one could use the matter-radiation equality turnover as a cosmic ruler Hasenkamp and Kersten (2013), and this would allow comparison to measurements based on BAO Lazkoz et al. (2008).

Second, there will be much information in the reconstructed maps of the large scale densities (such as Fig. 2). One could look at statistics beyond the power spectrum, such as counts-in-cells Yang and Saslaw (2011), or the bispectrum, and see how consistent they are with model expectations. One can also compare to the directly measured density field and obtain information on the large scale systematic effects which are modulating the latter. Cross-correlation of the maps with those of different tracers can also be carried out. For example the large scale potential field inferred can be used in conjunction with CMB observations to constrain the Integrated Sachs Wolfe effectNishizawa (2014).

In general, as we will be looking at large scale fluctuations beyond current limits by perhaps an order of magnitude in scale or more, one may expect to find interesting constraints on new physics. For example evidence for the Λ\LambdaCDM model was seen in the first reliable measurements of large scale galaxy clustering on scales greater than 10h1Mpc10\,h^{-1}\rm Mpc (e.g., Efstathiou et al. (1990)). Moving to wavelengths beyond 2π/(k=0.02)300Mpc2\pi/(k=0.02)\sim 300\rm\,Mpc may yet lead to more surprises.

Acknowledgements.
We thank Jonathan A. Blazek, Duncan Campbell and Rachel Mandelbaum for resourceful discussions. We also thank Enrique Gaztanaga for providing us with the 11 in 700700 matter particles’ positions of Mice-GC simulation for a test. This work is supported by U.S. Dept. of Energy contract DE-SC0019248 and NSF AST-1909193. This work has made use of CosmoHub. CosmoHub has been developed by the Port d’Informació Científica (PIC), maintained through a collaboration of the Institut de Física d’Altes Energies (IFAE) and the Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT), and was partially funded by the “Plan Estatal de Investigación Científica y Técnica y de Innovación” program of the Spanish government.

References